1 option
History, features, and typology of language corpora / Niladri Sekhar Dash, S. Arulmozi.
- Format:
- Book
- Author/Creator:
- Dash, Niladri Sekhar, 1967- author.
- Arulmozi, S., author.
- Language:
- English
- Subjects (All):
- Corpora (Linguistics).
- Linguistics.
- Computational linguistics.
- Language and languages--Study and teaching.
- Language and languages.
- Physical Description:
- xxix, 293 pages : illustrations (some color) ; 25 cm
- Place of Publication:
- Singapore : Springer, [2018]
- Summary:
- This book discusses key issues of corpus linguistics like the definition of the corpus, primary features of a corpus, and utilization and limitations of corpora. It presents a unique classification scheme of language corpora to show how they can be studied from the perspective of genre, nature, text type, purpose, and application. A reference to parallel translation corpus is mandatory in the discussion of corpus generation, which the authors thoroughly address here, with a focus on Indian language corpora and English. Web-text corpus, a new development in corpus linguistics, is also discussed with elaborate reference to Indian web text corpora. The book also presents a short history of corpus generation and provides scenarios before and after the advent of computer-generated digital corpora.
- Contents:
- Definition of 'corpus'
- Features of a corpus
- Genre of text
- Nature of data
- Type and purpose of text
- Nature of text application
- Parallel translation corpus
- Web text corpus
- Pre-digital corpora (part 1)
- Pre-digital corpora (part 2)
- Digital text corpora (part 1)
- Digital text corpora (part 2)
- Digital speech corpora
- Utilization of language corpora
- Limitations of language corpora.
- Notes:
- Includes bibliographical references and indexes.
- ISBN:
- 9811074577
- 9789811074578
- OCLC:
- 1009052480
The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.