MOTTO:
"Man, I love
science!" – Beakman's World
ABSTRACT:
I am currently a Ph.D. student under the supervision of
William W. Cohen in the Machine Learning Department
within Carnegie Mellon University's
School of Computer Science.
My research is generally concerned with machine learning and data mining, with an underlying interest in
producing interpretable and reusable views of data, features and models. To this end, I am particularly
interested in transfer learning with an emphasis on domain adaptation.
I am currently working on the SLIF system for mining text
and images together. The text (captions, abstracts and main text from biomedical journal articles) directly and
indirectly refers to the images, while the images depict entities (cells, proteins, interactions) that are described in
the text. The combination of these two expressions (text and images) of the same underlying concept (the experiment being
performed) into new features, jointly describing both the text and images, would be closer to representing the actual
object a user would be interested in, rather than disjoint features of text and image alone. A related problem is that of
transfer learning. In this case, we use models and named entity extractors trained on one type of data (abstract
text, for instance) and adapt them to be applied to a related, but distinct type of data (caption text). The intuition is
that it is
easier to learn a certain concept once a related concept has already been mastered.
I have also been lucky to pursue related work outside of school during summer internships. While working with Hang Li and Tie-Yan Liu in the Web
Search and Mining group at Microsoft Research Asia
we developed novel semi-supervised and transfer learning based methods for improving internet search through
query-dependent ranking. The idea behind this work is that, regardless of the specific topic users are interested
in, there are common features linking certain types of queries together. For instance, users searching for either
a
person or company name might both be most interested in the corresponding home page (a navigational query), while
searchers for a disease or country name might be more interested in authoritative sources of information about these
topics
(informational queries). By modeling and leveraging these distributions of types of queries we can better decide
what, exactly, users
want and deliver that to them.
Relatedly, while in the Data
Analytics group at IBM Research Watson, I worked with Naoki Abe and Yan Liu on methods
for learning
causal models from temporally ordered data. We felt that the interpretability offered by a causal model was quite
valuable for the end user in understanding the process being studied. This type of understanding is an essential
component of the scientific process since it leads the researcher to an idea of what experiment to perform next.
An accurate predictive model, without interpretation, provides little insight as to what direction is best to pursue.
This was also the motivation behind my work with Richard
Scheines and Joseph E. Beck on discovering predictive,
semantically and scientifically interpretable high-level features as functions of raw, event level data.
I did my undergraduate work
in the Intrusion
Detection System Group within the Computer
Science Department of Columbia
University, under the supervision of Professor
Salvatore J. Stolfo and Eleazar
Eskin. My work there dealt with applying kernel
methods and support vector machines to the problem of
clustering data (binaries, system calls,
network
packets, etc.) in order to identify possible attacks.
In my spare time I work on applying machine learning
techniques towards opponent modeling for
Texas Hold 'em poker and Tic-Tac-Toe.
PAPERS:
- Andrew Arnold, Ramesh Nallapati and William W. Cohen (2008).
"Exploiting Feature Hierarchy for Transfer Learning in Named
Entity
Recognition."
In proceedings of the 46th Annual Meeting of the Association
for Computational Linguistics: Human Language Technologies (ACL:HLT), June 15-20, 2008, Columbus, OH, to
appear.
- Xiubo Geng, Tie-Yan Liu, Tao Qin, Andrew Arnold, Hang Li and Harry Shum (2008).
"Query Dependent Ranking Using K-Nearest Neighbor."
In proceedings of the 31st Annual International ACM
SIGIR Conference, July 20-24, 2008, Singapore, to appear.
-
Andrew Arnold, Ramesh Nallapati and William W. Cohen (2007).
"A Comparative Study of Methods for Transductive Transfer Learning." In proceedings of the IEEE International Conference on Data Mining (ICDM)
2007 Workshop on Mining and Management of Biological Data, October 28, 2007, Omaha, NE. (Extended version) (Slides)
-
Andrew Arnold, Yan Liu and Naoki Abe (2007).
"Temporal Causal Modeling with Graphical Granger Methods." In proceedings
of the Thirteenth ACM SIGKDD
International Conference on Knowledge Discovery and Data Mining, Aug 12-15, 2007, San Jose, CA. (Slides)
-
Andrew Arnold, Joseph E. Beck and Richard Scheines (2006).
"Feature Discovery in the Context of Educational Data Mining: An Inductive
Approach."
In proceedings of the AAAI2006
Workshop on Educational
Data Mining,
Boston, MA, 7-13.
-
Andrew Arnold, Richard Scheines, Joseph E. Beck and Bill Jerome (2005).
"Time and Attention: Students, Sessions, and Tasks."
In proceedings of the AAAI2005
Workshop on Educational
Data Mining,
Pittsburgh, PA, 62-66.
- Eleazar Eskin, Andrew Arnold, Michael Prerau, Leonid Portnoy and
Salvatore Stolfo (2002). "A Geometric Framework for Unsupervised Anomaly
Detection: Detecting Intrusions in Unlabeled Data."
In Daniel Barbara and Sushil Jajodia (editors), Applications of Data Mining
in
Computer Security, Kluwer.
INVITED ARTICLES:
REPORTS:
TALKS:
- "A Comparative Study of Methods for Transductive Transfer Learning." IEEE International Conference on Data Mining (ICDM)
2007 Workshop on Mining and Management of Biological Data, Omaha, NE (October 28,
2007).
- "Temporal Causal Modeling with Graphical Granger Methods." Thirteenth ACM SIGKDD
International Conference on Knowledge Discovery and Data Mining, San Jose, CA (August 13,
2007).
- "A Comparison of Methods for Transductive Transfer Learning." Information Retrieval and Mining Seminar. Microsoft
Research Asia, Beijing, China (May 30, 2007).
- "Feature Discovery in the
Context of Educational Data Mining: An Inductive
Approach." IBM Mathematical Sciences Department Seminar. IBM Watson Research, Yorktown Heights, NY (July 6,
2006).
- ""Causal Modeling for Anomaly
Detection." IBM Mathematical Sciences
Department 2006 Summer Student Seminar Series. IBM Watson Research, Yorktown Heights, NY (June 23, 2006).
|