Home: 412-657-6027
Work: 412-268-6594 Email: drasona@gmail.com
Computational Linguist/Knowledge Engineer
I have 23 years experience in Knowledge Engineering , Natural Language Processing and Computational Linguistics with expertise in morphological processing, grammar writing, designing and maintaining large lexical and semantic databases, annotating data, and designing semantic representations (interlingua) for Machine Translation, Information Extraction and Computer Assisted Language Instruction. I have experience programming in LISP and Perl. I have also trained people in grammar writing and dictionary development.
Computer Skills:
Ø UNIX and Linux Highly Proficient (Decs, HPs, Suns, and PCs.)
§ LISP Highly Proficient (including various dialects)
§ Perl Highly Proficient
§ C++ Familiar (limited experience)
§ EMACS Expert (able to program new functions in LISP)
§ Latex Proficient
§ HTML Familiar
Ø MS Windows Proficient
· Power Point Proficient
· MS Word and Excel Familiar
· Macromedia Director Proficient
· Macromedia Dreamweaver Familiar
Ø Macintosh Familiar
· HyperCard Familiar
· Coral Common Lisp Proficient
Knowledge Engineering Skills:
Expert in grammar writing, designing lexicons and semantic databases and designing interlingua. Experience designing and writing programs to assist in knowledge acquisition.
Professional Experience:
Current: NLP Linguist/Knowledge Engineer (1986-Present)
Research Associate III, Language Technologies Institute,
Carnegie Mellon University, Pittsburgh, Pa.
areas of Machine Translation and Computer Assisted Language Learning.
Ø Worked on numerous MT for spoken and written language and CALL projects.
Ø Headed MT grammar writing team for Nespole!
Ø Headed interlingua design and implementation for international spoken and text machine translation projects: Nespole!, CSTAR, and Estrato.
Ø Implemented Spanish ALICE computer assisted language instruction prototype.
Ø Implemented translation evaluation grading program (Perl and LISP).
Ø Implemented interlingua re-formatter for CMU Nespole! generator.
Ø Implemented inflectional morphology rules for Spanish (for three different projects).
Ø Wrote and presented papers and other talks at various conferences.
1986-Present Knowledge Engineer, Carnegie Mellon University
o Developed grammars and programs to parse and interpret emails for information extraction
o Developed grammars and code to annotate temporal expressions.
o Developed extraction rules for temporal expressions with CMU’s Minorthird information extractor.
o Evaluated performance of temporal modules.
o Headed cross-site interlingua team and CMU grammar writing team.
o Implemented program to maintain and automatically generate the interlingua specification in human and machine readable formats.
o Responsible for grammar writing team and English generation.
o Trained new interlingua users and grammar writers.
o Contributed to various conference papers and presented talks on Nespole! semantic representation.
o Rewrote grading program in Perl.
o Responsible for (but not limited to) writing English grammars and lexicons and training new grammar writers, coordinating evaluations and developing and maintaining interlingua specification for all CSTAR partners.
o Wrote/Co-Wrote and presented conference and journal papers.
o Responsible for Spanish and English analysis and generation grammars.
o Responsible for scheduling task interlingua representation.
o Responsible for system evaluation, wrote grading program in LISP.
o Improved automatic utterance segmentation program.
· Estrato (1991-1993) : Spanish-English machine translation for ISO documents.
o Responsible for training new knowledge engineers.
o Responsible for all Spanish and English knowledge sources.
o Designed and implemented new knowledge acquisition tools.
· ALICE (1990-1991) : Computer Assisted Language Instruction Program
o Implemented proto-type of Spanish ALICE on a Machintosh.
o Created a verb paradigm generator in Common LISP
o Wrote Spanish morphology rules for all inflected words.
o Created ALICE interface in HyperCard-like program.
o Analyzed student essays for errors and created rule patterns for error detection.
o Added videodisc look-up for common Spanish expressions.
· KBMT-89 (1986-1989) : Knowledge Based Machine Translation
o Wrote English grammars, lexicons and lexical semantic mapping rules.
o Wrote programs for testing the translations (LISP).
1985-1986 Graduate Student Researcher for University of Pittsburgh,
Linguistics Department. Pittsburgh, Pa.
o Designed new parser and wrote grammars to catch student errors for an ESL computer assisted language instruction research project.
o Taught Intermediate ESL to spouses of university students.
1985 Student worker on MINA project at University of Pittsburgh, Learning
Research and Development Center. Pittsburgh, Pa.
o Entered lexical database information.
o Wrote grammars and patterns for error detection.
Donna M. Gates
Consulting:
2006 April-August CarnegieSpeech: integrating NLP tools
Languages: Native speaker of English, highly proficient in Spanish, beginner in
Italian and familiar with German and Russian. Excellent presentation,
communication, and writing skills.
Education:
2003-2008 MS, Carnegie Mellon University, Department of Modern Languages and
the Language Technologies Institute’s joint Masters in Computer
Assisted Language Learning program.
1984-1988 University of Pittsburgh, Department of Linguistics. Graduate work in
Linguistics, Spanish Linguistics and Teaching English as a Second Language.
1984 BA, Summa Cum Laude, University of Pittsburgh, Department of Hispanic Languages
and Literature, Dean’s List, Foreign Language National Honor Society (Phi Sigma Iota) Major: Spanish, Minor: Linguistics.
1982 Associates Degree, Community College of Allegheny County,
Awarded the Foreign Languages Departmental Award,
Dean’s List, and Awarded the Annual Pitt Connection Scholarship.
Continuing Education:
Literacy tutor training with Greater Pittsburgh Literacy Council (November 2004)
Workshop on MT Evaluation (Geneva 2001)
WordNet Workshop (ACL 2001)
Beginning Italian (2000 CCAC informal program)
Introduction to Programming in C++ (1999 CMU)
Introduction to Foreign Language Teaching (1992 University of Pittsburgh)
Natural Language Processing I (audit 1991 CMU)
Landmark Education training and development programs (1985-2006)
Membership:
Three Rivers TESOL
Primary Publications:
Alvarez, A., and Lori Levin and Robert Frederking and Simon
Fung and Donna Gates and Jeff Good, “The {MILE} Corpus for Less Commonly Taught Languages”, HLT-NAACL, 2006.
Han, B., Gates, D. and Levin, L., From Language to Time: A Temporal Expression Anchorer. In Proceedings of the 13th International Symposium on Temporal Representation and Reasoning (TIME 2006), 2006.
Han, B., D. Gates, and L. Levin., “Understanding Temporal Expressions in
Emails.” HLT/NAACL-2006.
Han, B., Gates, D. and Levin, L., “Anchoring temporal expressions in
scheduling-related emails.” In G. Katz, J. Pustejovsky, and F.
Schilder, editors, Proceedings of Dagstuhl Seminar on Annotating,
Extracting and Reasoning about Time and Events. 2005.
Donna M. Gates
Miller, K., Gates, D., Underwood, N., Magdalen, J., ``Evaluation of Machine Translation Output for an Unknown Source Language: Report of an ISLE-Based Investigation'' MT Summit VIII, Santiago di Compostela, Spain, September 2001. http://www.eamt.org/summitVIII/papers/miller-2.pdf
Levin, L., D. Gates, A. Lavie, F. Pianesi, D. Wallace, T. Watanabe and M. Woszczyna, "Evaluation of a Practical Interlingua for Task-Oriented Dialogue". In Proceedings of ANLP/NAACL-2000 Workshop on Applied
Interlinguas, Seattle, WA, April 2000.
Gates, D, and Shell, P. "Rule-Based Acquisition and Maintenance of Lexical and Semantic Knowledge" Sixth EACL Conference Proceedings, 1993, Pages 149-157.
Levin, L., D. Gates, A. Lavie and A. Waibel, "An Interlingua Based on Domain Actions for Machine Translation of Task-Oriented Dialogues". In Proceedings of ICSLP-98, Sydney, Australia, November 1998.
Gates, D., A. Lavie, L. Levin, A. Waibel, M. Gavalda, M. Woszczyna and P. Zhan. ``End-to-end Evaluation in JANUS: a Speech-to-speech Translation System''. In Dialogue Processing in Spoken Language Systems: Revised Papers from ECAI-96 Workshop, E. Maier, M. Mast and S. LuperFoy (eds.), LNCS series, Springer Verlag, June 1997.
Lavie, A., D. Gates, N. Coccaro and L. Levin. ``Input Segmentation of Spontaneous Speech in JANUS: a Speech-to-speech Translation System''. In Dialogue Processing in Spoken Language Systems: Revised Papers from ECAI-96 Workshop, E. Maier, M. Mast and S. LuperFoy (eds.), LNCS series, Springer Verlag, June 1997.
Levin, L., Evans, D.A., and Gates, D. "The ALICE System: A workbench for Learning and Using Language." CALICO Journal, Volume 9, No, 1., 1991,Pages 27-56.
Gates, D., McCardell, R., Takeda, K., and Nirenburg, S., "Generation Lexicons." In K. Goodman and S. Nirenburg(eds.) The KBMT Project: A Case study in Knowledge-Based Machine Translation. San Mateo: Morgan Kaufmann. Pages 145-163. 1991
Donna Gates, Koichi Takeda, Teruko Mitamura, Lori Levin and Marion Kee.
"Analysis and Generation Grammars" in Machine Translation 4: 53-66, 1989
Donna Gates, Dawn Haberlach, Todd Kaufmann, Marion Kee, Rita McCardell,
Teruko Mitamura, Ira Monarch, Stephen Morrisson, Sergei Nirenburg,
Eric Nyberg 3rd, Koichi Takeda and Margalit Zabludowski
"Lexicons" in Machine Translation 4: 67-112, 1989
Donna Gates, Lori Levin, "Parsing in Computer Assisted Grammar Checking"
May 1986. Computers and Writing Conference. Learning Research and Development Center, University of Pittsburgh.