2014 Workshop on the Use of Computational Methods in the Study of Endangered Languages


Heron Room (4th floor)
Baltimore Marriott Waterfront

26 June 2014

 Paper Session 1: Computational Tools for Endangered Languages Research
9:10–9:30Aikuma: A Mobile App for Collaborative Language Documentation
Steven Bird, Florian R. Hanke, Oliver Adams and Haejoong Lee
9:30–9:50Small Languages, Big Data: Multilingual Computational Tools and Techniques for the Lexicography of Endangered Languages
Martin Benjamin and Paula Radetzky
9:50–10:10LingSync & the Online Linguistic Database: New Models for the Collection and Management of Data for Language Communities, Linguists and Language Learners
Joel Dunham, Gina Cook and Joshua Horner
10:10–10:40Modeling the Noun Morphology of Plains Cree
Conor Snoek, Dorothy Thunder, Kaidi Lõo, Antti Arppe, Jordan Lachler, Sjur Moshagen and Trond Trosterud
10:40–11:00Coffee Break
 Paper Session 2: Applying Computational Methods to Endangered Languages
11:00–11:30Learning Grammar Specifications from IGT: A Case Study of Chintang
Emily M. Bender, Joshua Crowgey, Michael Wayne Goodman and Fei Xia
11:30–12:00Creating Lexical Resources for Endangered Languages
Khang Nhut Lam, Feras Al Tarouti and Jugal Kalita
12:00–12:20Documenting Endangered Languages with the WordsEye Linguistics Tool
Morgan Ulinski, Anusha Balakrishnan, Daniel Bauer, Bob Coyne, Julia Hirschberg and Owen Rambow
2:00–3:00Poster and Tool Demonstration Session
 Estimating Native Vocabulary Size in an Endangered Language
Timofey Arkhangelskiy
 InterlinguaPlus Machine Translation Approach for Local Languages: Ekegusii & Swahili
Edward Ombui, Peter Wagacha and Wanjiku Ng’ang’a
 Building and Evaluating Somali Language Corpora
Abdillahi Nimaan
Additional demonstrations of tools presented in paper session 1
 Paper Session 3: Infrastructure and Community Development for Computational Research on Endangered Languages
3:00–3:30SeedLing: Building and Using a Seed corpus for the Human Language Project
Guy Emerson, Liling Tan, Susanne Fertmann, Alexis Palmer and Michaela Regneri
3:30–4:00Coffee Break
4:00–4:20Short-Term Projects, Long-Term Benefits: Four Student NLP Projects for Low-Resource Languages
Alexis Palmer and Michaela Regneri
4:20–4:50Data Warehouse, Bronze, Gold, STEC, Software
Doug Cooper
4:50–5:20Time to Change the "D" in "DEL"
Stephen Beale
5:20–5:30Concluding Remarks