Downloadable publications

Many of the papers linked to here are prepublication versions only and may have undergone significant revision upon publication.

To appear

African languages and linguistic typology. Imelda Icheji Udoh and Eno-Abasi Essien Urua (eds.), A handbook of contemporary linguistics. Uyo, Nigeria: The Department of Linguistics and Nigerian Languages, University of Uyo.


A (micro-)accretion zone in a remnant zone? Lower Fungom in areal–historical perspective. In Language Typology and Historical Contingency: In honor of Johanna Nichols. Amsterdam: Benjamins. 265–282.


The typology of templates. Language and Linguistics Compass 5:731–747.

Data and language documentation. Peter Austin and Julia Sallabank (eds.), Handbook of Endangered Languages. Cambridge: Cambridge University Press. 212–234.

Valuing technology: Finding the linguist's place in a new technological universe. In Louanna Furbee and Lenore Grenoble (eds.), Language documentation: Practice and values. Amsterdam: Benjamins. 111–131.


A Grand Challenge for Linguistics: Scaling Up and Integrating Models. White paper contributed to NSF's SBE 2020: Future Research in the Social, Behavioral and Economic Sciences initiative. (with Emily M. Bender)

Topic and focus fields in Naki. In Ines Fiedler and Anne Schwarz (eds.), The Expression of Information Structure: A documentation of its diversity across Africa. Amsterdam: Benjamins. 35–68.

Ethical issues in legacy language resources. Language and Communication 30:162–170. (with Carolyn O’Meara)

Modeling and encoding traditional wordlists for machine applications. Proceedings of the 2010 Workshop on NLP and Linguistics: Finding the Common Ground, ACL 2010. Uppsala, Sweden: Association for Computational Linguistics. 1–9. (with Shakthi Poornima)

Modeling wordlists via Semantic Web technologies. In Paul Buitelaar, Philipp Cimiano, and Elena Montiel-Ponsoda (eds.), Proceedings of the First International Workshop on the Multilingual Semantic Web (MSW 2010). Raleigh, North Carolina. (with Shakthi Poornima)


Practical language development: Whose mission? Language 85:619–629. (with Lise Dobrin)

Loanwords in Saramaccan. In Martin Haspelmath and Uri Tadmoor (eds.), Loanwords in the world's languages: A comparative handbook. Berlin: Mouton de Gruyter. 918–943.

Saramaccan vocabulary. In Martin Haspelmath and Uri Tadmor (eds.), World Loanword Database. Munich: Max Planck Digital Library. 1103 word entries.

A twice-mixed creole? Tracing the history of a prosodic split in the Saramaccan lexicon. Studies in Language 33:459–498.

Implementation for discovery: A bipartite lexicon to support morphological and syntactic analysis. Proceedings of Chicago Linguistic Society 41: Panel on Computational Linguistics. 1–16. (with Emily M. Bender)


Introduction. In Jeff Good (ed.) Linguistics Universals and Language Change. Oxford: Oxford University. 1–19.


Slouching towards deponency: A family of mismatches in the Bantu verb stem In Matthew Baerman, Greville G. Corbett, Dunstan Brown, and Andrew Hippisley (eds.), Deponency and morphological mismatches. Oxford: Oxford University. 203–230.

Strong linearity, weak linearity, and the typology of templates. In Matti Miestamo and Bernhard Waelchli (eds.), New challenges in typology: Broadening the horizons and redefining the foundations. Berlin: Mouton. 11–33.

When arguments become adjuncts: Negation and object preposing in Leggbó. In Jonathan E. Cihlar, Amy L. Franklin, David W. Kaiser and Irene Kimbara (eds.), Proceedings of Chicago Linguistic Society 39: Main session. Chicago: Chicago Linguistic Society. 110–129.

The ecology of documentary and descriptive linguistics. In Peter K. Austin (ed.), Language Documentation and Description, Volume 4. London: SOAS.


The phonetics of tone in Saramaccan. In Ana Deumert and Stephanie Durrleman (eds.), Structure and variation in language contact. Amsterdam: Benjamins. 9–28.

Modeling contested categorization in linguistic databases. Proceedings of the EMELD Workshop 2006: Digital Language Documentation: Tools and Standards: The State of the Art. Lansing, Michigan. July 20–22, 2006.


Reconstructing morpheme order in Bantu: The case of causativization and applicativization. Diachronica 22:55–109.

Morphosyntax of two Turkish subject pronominal paradigms. In Lorie Heggie and Francisco Ordóñez (eds.). Clitic and affix combinations: Theoretical perspectives. Amsterdam: Benjamins. 315–341. (with Alan Yu)

Split prosody and creole simplicity: The case of Saramaccan. Journal of Portuguese Linguistics 3:11–30.

Can GOLD ‘cope’ with WALS? Retrofitting an ontology onto the World Atlas of Languages Structures. Proceedings of the E-MELD Workshop 2005: Linguistic Ontologies and Data Categories for Language Resources, July 1–3, 2005, Cambridge, Massachusetts. (with Michael Cysouw, Hans-Jörg Bibiko, and Mihai Albu)


Tone and accent in Saramaccan: Charting a deep split in the phonology of a language. Lingua 114:575–619.

The descriptive grammar as a (meta)database. Proceedings of the E-MELD Workshop 2004: Linguistic Databases and Best Practice, July 15–18, 2004, Detroit, Michigan.

Montage: Leveraging advances in grammar engineering, linguistic ontologies, and mark-up for the documentation of underdescribed languages. Proceedings of the Workshop on First Steps for Language Documentation of Minority Languages: Computational Linguistic Tools for Morphology, Lexicon and Corpus Compilation, LREC 2004. (with Emily M. Bender, Dan Flickinger, and Ivan A. Sag)


The vowel systems of California Hokan. Survey Report #12: Papers from the 50th anniversary conference of the Survey of California and Other Indian Languages.


Creating a database and query-tools for the TELL multi-speaker linguistic corpus. Proceedings from the IRCS Workshop on Linguistic Databases, University of Pennsylvania, December 11–13, 2001. (with Ronald Sprouse)


SGML markup of dictionaries with special reference to comparative and etymological data. Proceedings from the Workshop on Web-Based Language Documentation and Description, University of Pennsylvania, December 12–15, 2000.

Selected handouts, presentations, and drafts


Interoperability for Language Documentation The Role of Semantic Web Tools. (with Tom Myers and Alexander Nakhimovsky)

A sketch of the Ajumbu [muc] noun class system.

A sketch of the Naki [mff] noun class system.


Reassessing Western Beboid. Presented at Bantu 3: Third International Conference on Bantu Languages. Royal Museum for Central Africa, Tervuren, Belgium, March 26. (with Jesse Lovegren)


Information structure, argument linearization, and word order typology in Bantoid. Presented at the annual meeting of the L.S.A., Chicago, January 6, 2008.


The Bantu verbal prefixes and S-Aux-O-V order in Benue-Congo. Presented at the Special Session on the Synchrony and Diachrony of the Bantu Verb at the Bantu Grammar: Description and Theory conference, SOAS. April 22. (with Tom Güldemann)

Rarum begets rarum: A rare clitic and morphosyntactic reduplication in Chechen and Ingush. Presented at the Rara and Rarissima conference, Max Planck Institute for Evolutionary Anthropology, March 29.

Constraining morphosyntactic templates: A case study of Bantu verbal suffixes. Presented at the annual meeting of the LSA, Albuquerque, January 2006.


A sketch of Atsugewi phonology. Presented at the annual winter meeting of SSILA. Boston, January 8–11, 2004.


Reconstructing Achumawi and Atsugewi: Proto-Palaihnihan revisited. Presented at the annual winter meeting of the Society for the Study of the Indigenous Languages of the Americas. Atlanta, January 2–5. (with Mary Paster and Teresa McFarland)


On the origin of a verb phrase template in Chechen and Ingush. Presented at the meeting of the LSA San Francisco, January 3–6, 2002.