My Account Log in

3 options

The Open Handbook of Linguistic Data Management / edited by Andrea L. Berez-Kroeker [and three others] ; foreword by Sarah G. Thomason.

DOAB Directory of Open Access Books Available online

View online

MIT Press Direct 2021 Collection Available online

View online

MIT Press Direct OA Available online

View online
Format:
Book
Contributor:
Berez-Kroeker, Andrea L., editor.
Thomason, Sarah Grey, writer of foreword.
Series:
Open Handbooks in Linguistics Series
Language:
English
Subjects (All):
Computational linguistics.
Natural language processing (Computer science).
Data mining.
Physical Description:
1 online resource (xiv, 671 pages) : illustrations.
Edition:
First edition.
Place of Publication:
Cambridge, Massachusetts : The MIT Press, [2021]
Summary:
A guide to principles and methods for the management, archiving, sharing, and citing of linguistic research data, especially digital data.Doing language science depends on collecting, transcribing, annotating, analyzing, storing, and sharing linguistic research data. This volume offers a guide to linguistic data management, engaging with current trends toward the transformation of linguistics into a more data-driven and reproducible scientific endeavor. It offers both principles and methods, presenting the conceptual foundations of linguistic data management and a series of case studies, each of which demonstrates a concrete application of abstract principles in a current practice. In part 1, contributors bring together knowledge from information science, archiving, and data stewardship relevant to linguistic data management. Topics covered include implementation principles, archiving data, finding and using datasets, and the valuation of time and effort involved in data management. Part 2 presents snapshots of practices across various subfields, with each chapter presenting a unique data management project with generalizable guidance for researchers. The Open Handbook of Linguistic Data Management is an essential addition to the toolkit of every linguist, guiding researchers toward making their data FAIR: Findable, Accessible, Interoperable, and Reusable.
Contents:
Intro
Series Page
Title Page
Copyright
Dedication
Table of Contents
Series Foreword
Foreword by Sarah G. Thomason
I. Conceptual Foundations, Principles, and Implementation of Data Management in Linguistics
1. Data, Data Management, and Reproducible Research in Linguistics: On the Need for The Open Handbook of Linguistic Data Management
2. Situating Linguistics in the Social Science Data Movement
3. The Scope of Linguistic Data
4. Indigenous Peoples, Ethics, and Linguistic Data
5. The Linguistic Data Life Cycle, Sustainability of Data, and Principles of Solid Data Management
6. Transforming Data
7. Archiving Research Data
8. Developing a Data Management Plan
9. Copyright and Sharing Linguistic Data
10. Linguistic Data in the Long View
11. Guidance for Citing Linguistic Data
12. Metrics for Evaluating the Impact of Data Sets
13. The Value of Data and Other Non-traditional Scholarly Outputs in Academic Review, Promotion, and Tenure in Canada and the United States
II. Data Management Use Cases
14. Managing Sociolinguistic Data with the Corpus of Regional African American Language (CORAAL)
15. Managing Data for Integrated Speech Corpus Analysis in SPeech Across Dialects of English (SPADE)
16. Data Management at the uOttawa Sociolinguistics Laboratory
17. Managing Legacy Data in a Sociophonetic Study of Vowel Variation and Change
18. Managing Sociophonetic Data in a Study of Regional Variation
19. Data Management Practices in an Ethnographic Study of Language and Migration
20. Managing Conversation Analysis Data
21. Managing Sign Language Data from Fieldwork
22. Managing Data in a Language Documentation Corpus
23. Managing Data for Writing a Reference Grammar.
24. Managing Lexicography Data: A Practical, Principled Approach Using FLEx (FieldWorks Language Explorer)
25. Managing Data from Archival Documentation for Language Reclamation
26. Managing Data for Descriptive and Historical Research
27. Managing Historical Data in the Chirila Database
28. Managing Historical Linguistic Data for Computational Phylogenetics and Computer-Assisted Language Comparison
29. Managing Computational Data for Models of Language Acquisition and Change
30. Managing Sign Language Acquisition Video Data: A Personal Journey in the Organization and Representation of Signed Data
31. Managing Acquisition Data for Developing Large Sesotho, English, and French Corpora for CHILDES
32. Managing Phonological Development Data within PhonBank: The Chisasibi Child Language Acquisition Study
33. Managing Oral and Written Data from an ESL Corpus from Canadian Secondary School Students in a Compulsory, School-Based ESL Program
34. Managing Second Language Acquisition Data with Natural Language Processing Tools
35. Managing Data Workflows for Untrained Forced Alignment: Examples from Costa Rica, Mexico, the Cook Islands, and Vanuatu
36. Managing Transcription Data for Automatic Speech Recognition with Elpis
37. Managing Data and Statistical Code According to the FAIR Principles
38. Managing Synchronic Corpus Data with the British National Corpus (BNC)
39. Managing Data in Sign Language Corpora
40. Managing Sign Language Video Data Collected from the Internet
41. Managing Data from Social Media: The Indigenous Tweets Project
42. Managing Semantic Norms for Cognitive Linguistics, Corpus Linguistics, and Lexicon Studies
43. Managing Treebank Data with the Infrastructure for the Exploration of Syntax and Semantics (INESS).
44. Managing Data in a Formal Syntactic Study of an Underinvestigated Language (Uzbek)
45. Managing Data for Theoretical Syntactic Study of Underdocumented Languages
46. Managing Experimental Data in a Study of Syntax
47. Managing Web Experiments for Psycholinguistics: An Example from Experimental Semantics/Pragmatics
48. Managing, Sharing, and Reusing fMRI Data in Computational Neurolinguistics
49. Managing Phonological Data in a Perception Experiment
50. Managing Speech Perception Data Sets
51. Managing and Analyzing Data with Phonological CorpusTools
52. Managing Phonological Inventory Data in the Development of PHOIBLE
53. Managing Data in a Typological Study
54. Managing Data for Descriptive Morphosemantics of Six Language Varieties
55. Managing Data in TerraLing, a Large-Scale Cross-Linguistic Database of Morphological, Syntactic, and Semantic Patterns
56. Managing AUTOTYP Data: Design Principles and Implementation
Contributors
Index.
Notes:
Description based on print version record.
Includes bibliographical references and index.
ISBN:
0-262-36607-X
0-262-36217-1
OCLC:
1290430329

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

Find

Home Release notes

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Find catalog Using Articles+ Using your account