Research

I work at the intersection of machine learning and human-computer interaction. Particularly interactive learning systems with applications in natural language processing, biological research, and social computing. For more detail, see a complete list of publications by year.


Active and Interactive Machine Learning

[active learning]

Annotating training data for machine learning is often slow, expensive, and difficult. My research involves systems that learn more economically by playing an active role in learning process, e.g., by asking questions about the learning task. I have written a fairly comprehensive literature survey of this field, and studied several important subtopics.

See also: DUALIST (software)

Natural Language Processing

[NLP]

Language technology is a fascinating application area for machine learning. In particular, I am interested in statistical "machine reading" systems that extract information from large text collections and make use of them in various ways, as well as corpus-based generative models to foster creative thinking in people.

See also: Read the Web project website, @cmunell on Twitter, and The Muse creativity tools

Computational Biology and Bioinformatics

[computational biology]

Biology is an increasingly data-driven (vs. hypothesis-driven) science. Today we can harness intelligent computer systems to help us predict, explain, and explore biological phenomena. I also believe we can exploit the biomedical literature in such systems to aid in biological discovery.

Social Computing

[social computing]

Our modern web-based society creates a lot of data as a byproduct of daily interactions. I study ways of using such arbitrary (often noisy) data to train useful and informative machine learning systems.

See also: FAWM.ORG and TagATune


Research-Related Software

I've also released a few research data sets.