In September 2015, I joined the Allen Institute for Artificial Intelligence as a Research Scientist. I received my Ph.D. from the Language Technologies Institute, School of Computer Science, Carnegie Mellon University, in August 2015. During my Ph.D. (August 2009 to August 2015), I was co-advised by Prof. William Cohen and Prof. Jamie Callan in the areas of Information Extraction and Information Retrieval.
I completed my Master's at IIT Bombay in 2007 under the guidance of Prof. S. Sudarshan. I worked at Google India as a Software Engineer from 2007 to 2009.

Research Interests

  • Information Extraction
  • Machine Learning
  • Data Mining
  • Information Retrieval

Thesis Topic

Thesis document [PDF]
My thesis proposes a general framework, "Exploratory Learning", that tackles learning scenarios that are a superset of semi-supervised learning, in the sense that some of the classes might be unknown to the algorithm and can be discovered from the unlabeled data. More information: link.


Journal Papers

  1. Keyword search on external memory data graphs, Bhavana Bharat Dalvi, Meghana Kshirsagar and S. Sudarshan, Proceedings of the VLDB Endowment, VLDB 2008 (Acceptance rate: 16%) [PDF]

Conference Papers

  1. Hierarchical Semi-supervised Classification with Incomplete Class Hierarchies, Bhavana Dalvi Mishra, Aditya Mishra and William W. Cohen, WSDM 2016 (Acceptance rate: 18.2%) [PDF], [Slides], [Poster], [Dataset: link]
  2. Automatic Gloss Finding for a Knowledge Base using Ontological Constraints, Bhavana Dalvi Mishra, Einat Minkov, Partha Pratim Talukdar, and William W. Cohen, WSDM 2015 (Acceptance rate: 16.8%)
    [PDF], [Code: zipped folder], [Slides] (presented at LTI SRS 2014), [Dataset: link]
  3. Never-Ending Learning, Tom Mitchell, William Cohen, Estevam Hruschka, Partha Talukdar, Justin Betteridge, Andrew Carlson, Bhavana Dalvi, Matt Gardner, Bryan Kisiel, Jayant Krishnamurthy, Ni Lao, Kathryn Mazaitis, Tahir Mohammad, Ndapa Nakashole, Emmanouil Antonios Platanios, Alan Ritter, Mehdi Samadi, Burr Settles, Richard Wang, Derry Wijaya, Abhinav Gupta, Xinlei Chen, Abulhair Saparov, Malcolm Greaves and Joel Welling, AAAI 2015
  4. Integrating Energy Storage in Electricity Distribution Networks, Aditya Mishra, Ramesh Sitaraman, David Irwin, Ting Zhu, Prashant Shenoy, Bhavana Dalvi Mishra, and Stephen Lee, Proceedings of the 6th ACM Intl. Conference on Future Energy Systems (ACM e-Energy) 2015 [PDF]
  5. Multi-View Hierarchical Semi-supervised Learning by Optimal Assignment of Sets of Labels to Instances, Bhavana Dalvi Mishra and William W. Cohen, In preparation.
    [Draft], [Dataset: link]
  6. Exploratory Learning, Bhavana Dalvi Mishra, William W. Cohen and Jamie Callan, in Proceedings of the European Conference on Machine Learning, ECML/PKDD 2013
    [PDF], [Slides], [Poster], [Code: link]
  7. From Topic Models to Semi-Supervised Learning: Biasing Mixed-membership Models to Exploit Topic-Indicative Features in Entity Clustering, Ramnath Balasubramanyan, Bhavana Dalvi Mishra and William W. Cohen, in Proceedings of the European Conference on Machine Learning, ECML/PKDD 2013 [PDF]
  8. Very Fast Similarity Queries on Semi-Structured Data from the Web, Bhavana Dalvi and William W. Cohen, Proceedings of the SIAM International Conference on Data Mining, SDM 2013
    [PDF], [Poster]
  9. WebSets: Extracting Sets of Entities from the Web Using Unsupervised Information Extraction, Bhavana Dalvi, William W. Cohen and Jamie Callan, Proceedings of the Fifth ACM International Conference on Web Search and Data Mining, WSDM 2012 (Acceptance rate: 20.7%)
    [PDF], [Slides], [Poster], [Datasets and Evaluation: link]
  10. Entity List Completion Using Set Expansion Techniques, Bhavana Dalvi, Jamie Callan and William Cohen, Proceedings of the Nineteenth Text REtrieval Conference, TREC 2010 [PDF]

Publications from Workshops/Challenges

  1. IKE - An Interactive Tool for Knowledge Extraction, Bhavana Dalvi, Sumithra Bhakthavatsalam, Chris Clark, Peter Clark, Oren Etzioni, Anthony Fader, and Dirk Groeneveld, in Proceedings of AKBC 2016, 5th Knowledge Extraction workshop at NAACL 2016.
    [PDF], [Project page] (demo and code coming soon)
  2. Multi-view Exploratory Learning for AKBC Problems, Bhavana Dalvi Mishra and William W. Cohen, in Proceedings of AKBC 2014, 4th Knowledge Extraction workshop at NIPS 2014.
    [PDF], [Poster]
  3. A Tale of Two Entity Linking and Discovery Systems in KBP-TAC 2014, Kathryn Mazaitis, Richard C. Wang, Frank Lin, Bhavana Dalvi, Jakob Bauer, and William W. Cohen, in Proceedings of the Knowledge Base Population (KBP) workshop, KBP-TAC 2014.
  4. A Language Modeling Approach to Entity Recognition and Disambiguation for Search Queries, Bhavana Dalvi Mishra, Chenyan Xiong, and Jamie Callan, in Proceedings of ERD 2014, Entity Recognition and Disambiguation Challenge at SIGIR 2014. [PDF]
  5. Classifying Entities into an Incomplete Ontology, Bhavana Dalvi Mishra, William W. Cohen and Jamie Callan, in Proceedings of AKBC 2013, 3rd Knowledge Extraction workshop at CIKM 2013.
    [PDF], [Slides], [Poster]
  6. Collectively Representing Semi-Structured Data from the Web, Bhavana Dalvi, William W. Cohen and Jamie Callan, Proceedings of the NAACL HLT 2012 Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction, AKBC-WEKEX 2012
    [PDF], [Slides], [Poster], (Best paper runner-up)
  7. Structure, Tie Persistence and Event Detection in Large Phone and SMS Networks, Leman Akoglu and Bhavana Dalvi, Proceedings of the Eighth Workshop on Mining and Learning with Graphs, KDD 2010 [PDF]

