Associate Professor,
Department of Computing and Information Systems
University of Melbourne, Australia

Senior Research Associate,
Linguistic Data Consortium, University of Pennsylvania, USA

I study scalable, semi-automatic methods for analyzing spoken and written language, and for preserving endangered languages. This involves a mixture of computational modelling and linguistic fieldwork.

I do training and consulting with the Natural Language Toolkit, Python software for text analysis. Sponsorship supports further development of the software.

Recent publications include: The Human Language Project: Building a universal corpus of the World’s languages, A scalable method for preserving oral literature from small languages, and Fast query for large treebanks.

I’m currently teaching a second year course Language and Computation (description, course website)

I co-lead the Melbourne University Language Technology Group. I occasionally post on LanguageLog.


With Fred Jelinek (18/11/32 – 14/9/10), pioneer of speech recognition and machine translation technology, after presenting him with the ACL Lifetime Achievement Award in 2009.