
|
Roni Rosenfeld, Professor 
Office: Wean Hall 5311. Phone 412/268-7678. Fax
412/268-2205.
Secretary: Sharon Cavlovich, Wean
Hall 5315, 412/268-5196.
Mailing address: Carnegie Mellon / Computer Science, 5000 Forbes Ave.,
Pgh PA 15213
|
Note: Starting in Fall
2007, the Machine Learning (15-681) course is being renumbered to 10-601. Here is the web page.
Hi! My research interests are in:
- Project GATTACA: Computational
molecular virology and vaccine design. Retroviruses like HIV and
RNA viruses like Influenza evolve at a much higher rate than DNA life
forms. This is a formidable challenge to vaccine design, but is also
an opportunity to observe evolution as it happens. We use the
fast growing databases of viral sequences to build descriptive and
generative models of viral molecular evolution. We also use them to
infer viral envelope properties and suggest potential antigenic targets
that cannot easily mutate away. In collaboration with
virologists/immunologists, we try to correlate isolate sequence
composition to important biological properties of the isolate, such as pathogenicity, infectivity and neutralizability.
Along they way we design and develop visualization tools for
multiple sequence alignments (MSAs) and
other biological sequence data.
- Computational Molecular Biology
and more specifically Computational
Biolinguistics. Many of the problems in
this area involve statistical modeling of long sequences of
symbols/building blocks (nucleotides or amino acids) and their
relationship to proteins and their function. This is very similar to the
problem of modeling natural language: long sequences of symbols (letters,
words), and their relationship to the deep structure and meaning of
sentences. We are hoping that some of the models and techniques we have
developed in the past decade for language modeling will prove useful in
the biological domain. Current projects include computational molecular
evolution, computational
virology, and multi-species
gene-expression analysis.
- Speech and Language
Technology for Development (SLT4D) is the term we coined for our own
subfield of ICT4D:
finding ways to use speech and language technologies (like automatic
speech recognition and human-machine dialog systems) to help people around
the world help themselves. Our current project, HealthLine, investigates the
use of spoken language interfaces for community health workers across Pakistan
(thank you, Microsoft!).
Current Post-Doc: Andy Walsh
(computational virology).
Past Post-docs: Pierre DuPont (language
modeling), Stan F. Chen
(language modeling), Xiaojin Wang
(machine learning).
My favorite quotes.
