2 options
Buckwalter Arabic morphological analyzer : Version 2.0 / Linguistic Data Consortium.
- Format:
- Datafile
- Language:
- English
- Subjects (All):
- Arabic language--Databases.
- Arabic language.
- Linguistics--Databases.
- Linguistics.
- Genre:
- Databases.
- Physical Description:
- 1 computer optical disc : sound ; 4 3/4 in.
- 4 3/4 in.
- Place of Publication:
- [Philadelphia, Pa.] : Linguistic Data Consortium, 2004.
- System Details:
- digital
- optical
- data file
- Summary:
- "The data consists primarily of three Arabic-English lexicon files: prefixes (548 entries), suffixes (906 entries), and stems (78839 entries representing 40219 lemmas). The lexicons are supplemented by three morphological compatibility tables used for controlling prefix-stem combinations (2435 entries), stem-suffix combinations (1612 entries), and prefix-suffix combinations (1138 entries). The actual code for morphology analysis and POS tagging is contained in a Perl script (AraMorph.pl). Sample input (infile.txt) and corresponding output file (outfile.xml) are provided. The documentation consists of a readme file with a description of the three lexicon files, the three morphological compatibility tables, the morphology analysis algorithm, and a table with the author's Arabic transliteration system."
- Notes:
- Title from disc label.
- "LDC2004L02."
- ISBN:
- 1585633240
- 9781585633241
- OCLC:
- 57446536
- Online:
- LDC catalog entry
- Using LDC Data general information
The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.