Paul Ogilvie

In August of 2010 I began work at LinkedIn as a member of the Recommendations Engine team, where I have been building the article ranking infrastructure and algorithms used to power LinkedIn Today.

Graduate Student at LTI from August, 2000 through June 2010.
Principal Scientist at mSpoke from September, 2007 through July, 2010. (Acquired by LinkedIn)
Advisor - Jamie Callan

Email - pto@cs.cmu.edu
Office - 3612-A Newell Simon Hall

CV
Read my blog


School

Carnegie Mellon
School of Computer Science
Language Technologies Institute


Research and Work

Language Modeling in IR (Since 1999)
  • Modeling annotations and relationships between annotations
  • Hierarchical language models
  • Combining document representations

Design and implementation of utilities for the Lemur Toolkit (since 2001)
  • Overlapping hierarchies of annotations
  • XML
  • Distributed information retrieval utilities
    • Indexing of collection selection databases
    • Query-based sampling of text databases
  • Document tokenizer of early versions Lemur

Distributed Information Retrieval (2000 - 2001)
  • In-depth analysis of query-based sampling techniques
  • Mixing speech and text databases in distributed IR
  • Smoothing and language models in collection selection
  • Query expansion in distributed IR

Participant in the NRRC Reliable Information Access Workshop studying (pseudo-) relevance feedback (Summer 2003)

Professional Service
  • Reviewing:
    • ACM Transactions on Information Systems
    • ACM Transactions on the Web
    • Information Processing & Management
    • Journal of Digital Libraries
    • Journal of Information Retrieval
    • CIKM 2008, 2009
    • EMNLP-CoNLL 2007
    • ECIR 2008, 2009
    • ECIR Posters 2008
    • ICTIR 2007, 2009
    • HLT/NAACL Student Workshop 2003, 2004
    • SIGIR 2006 - 2009
    • SIGIR Demos 2008
    • SIGIR Posters 2005 - 2008
    • SPIRE 2006
    • WWW 2008, 2009
  • SIGIR 2003 online paper review system manager

Teaching Assistant

IR Discussion Series


Honors

  • SIGIR 04 Doctoral Consortium Award
  • Northrop Grumman Fellowship, 2004-2005 School Year


Publications

Journal

Ogilvie, Paul, Ellen Voorhees, and Jamie Callan (2009) On the Number of Terms for Automatic Query Expansion. Information Retrieval, Volume 12(6), pages 666-679. doi: 10.1007/s10791-009-9104-1

Conference

Bilotti, Matthew, Paul Ogilvie, Jamie Callan, and Eric Nyberg (2007) Structured Retrieval for Question Answering. In the Proceedings of the Thirtieth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2007). pdf

Ogilvie, Paul and Mounia Lalmas (2006) Investigating the Exhaustivity Dimension in Content-Oriented XML Element Retrieval Evaluation. In the Proceedings of the Fifteenth International Conference on Information Knowledge Management (CIKM 2006). pdf

Ogilvie, Paul and Jamie Callan (2005) Experiments with Language Models for Known-Item Finding of E-mail Messages. In the Proceedings of the Fourteenth Text Retrieval Conference (TREC-14). pdf

Collins-Thompson, Kevyn, Paul Ogilvie, and Jamie Callan (2004) Initial Results with Structured Queries and Language Models on Half a Terabyte of Text. In the Proceedings of the Thirteenth Text Retreival Conference (TREC-13). pdf

Ogilvie, Paul, and Jamie Callan (2003) Combining Document Representations for Known Item Search. In the Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2003), pp. 143-150. pdf ps

Ogilvie, Paul and Jamie Callan (2003) Combining Structural Information and the Use of Priors in Mixed Named-Page and Homepage Finding. In the Proceedings of the Twelfth Text Retrieval Conference (TREC-12). pdf ps

Si, Luo, Rong Jin, Jamie Callan, and Paul Ogilvie (2002) Language Model Framework for Resource Selection and Results Merging. In the Proceedings of the Eleventh International Conference on Information and Knowledge Management (CIKM 2002), pp. 391-397. pdf ps

Collins-Thomson, Kevyn, Paul Ogilvie, Yi Zhang, and Jamie Callan (2002) Information Filtering, Novelty Detection, and Named-Page Finding. In the Proceedings of the Eleventh Text Retrieval Conference (TREC-11), pp. 107-118. pdf ps

Ogilvie, Paul, and Jamie Callan (2001) The Effectiveness of Query Expansion for Distributed Information Retrieval. In the Proceedings of the Tenth International Conference on Information Knowledge Management (CIKM 2001), pp. 183-190. pdf ps

Ogilvie, Paul and Jamie Callan (2001) Experiments Using the Lemur Toolkit. In the Proceedings of the Tenth Text Retrieval Conference, TREC 2001. NIST Special Publication 500-250, pp. 103-108. pdf ps

Lavrenko, Victor, Matt Schmill, Dawn Lawrie, Paul Ogilvie, David Jensen, and James Allan (2000) Mining of Concurent Text and Time Series. In the Proceedings of The Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2000), pp. 37-44. pdf ps

Lavrenko, Victor, Matt Schmill, Dawn Lawrie, Paul Ogilvie, David Jensen, and James Allan (2000) Language Models for Financial News Recommendation. In the Proceedings of the Workshop on Text Mining at the Ninth International Conference on Information Knowledge Management (CIKM 2000), pp. 389-396. pdf ps

Larkey, Leah, Paul Ogilvie, M. Andrew Price, and Brenden Tamilio (2000) Acrophile: An Automated Acronym Extractor and Server. In the Proceedings of the Fifth ACM Conference on Digital Libraries (DL '00), pp. 205-214. pdf ps

Workshop

Ogilvie, Paul and Jamie Callan (2005) Parameter Estimation for a Simple Hierarchical Generative Model for XML Retrieval. In the Proceedings of the Initiative for the Evaluation of XML Retreival Workshop (INEX 2005), pp. 211-224. pdf

Ogilvie, Paul and Jamie Callan (2004) Hierarchical Language Models for Retrieval of XML Components. In the Proceedings of the Initiative for the Evaluation of XML Retrieval Workshop (INEX 2004). pdf

Ogilvie, Paul (2004) Retrieval Using Structure for Question Answering. In the Proceedings of the First Twente Data Management Workshop (TDM`04). pdf

Ogilvie, Paul and Jamie Callan (2003) Using Language Models for Flat Text Queries in XML Retrieval. In the Proceedings of the Initiative for the Evaluation of XML Retrieval Workshop (INEX 2003). pdf ps

Ogilvie, Paul and Jamie Callan (2002) Language Models and Structured Document Retrieval. In the Proceedings of the Initiative for the Evaluation of XML Retrieval Workshop (INEX 2002). pdf ps

Other

Ogilvie, Paul (2004) Understanding Combination of Evidence using Generative Probabilistic Models for Information Retrieval. Prepared for the Doctoral Consortium at the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2004). Winner of the Doctoral Consortium Award. pdf

Ogilvie, Paul (2000) Extracting and Using Relationships Found in Text for Topic Tracking. Undergraduate Honors Thesis, CIIR Technical Report IR-209. pdf ps


Some Presentations

Lemur Toolkit Tutorial. Presented with Trevor Strohman at the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2006), Seattle, USA, August 6, 2006. MS PowerPoint

Combining Document Representations for Known Item Search. Presented at the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2003), Toronto, Canada, July 30, 2003. pdf ps

Language Models and Structured Document Retrieval. Presented at the Initiative for the Evaluation of XML Retrieval Workshop (INEX 2002), Schloss Dagstuhl, Germany, December 9, 2002. pdf ps

The Effectiveness of Query Expansion in Distributed Information Retrieval. Presented at the Tenth Annual Conference on Information and Knowledge Management (CIKM 2001), Atlanta, Georgia, November 7, 2001. pdf ps