Welcome To My Website
Starting September 2015, I have joined Allen Institute for Artificial Intelligence as a Research Scientist. My new webpage!
I received Ph.D. from Language Technologies Institute, School of Computer Science,
Carnegie Mellon University in August 2015. During Ph.D. (August 2009-15) I was co-advised by Prof. William Cohen and Prof. Jamie Callan, in the area of Information Extraction and Information Retrieval.
I have completed my Masters in 2007 from IIT Bombay under the guidance of Prof. S. Sudarshan. I worked in Google India from 2007-2009 as a Software Engineer.
Research Interests
- Information Extraction
- Machine Learning
- Data Mining
- Information Retrieval
Thesis Topic
Thesis document [PDF]My thesis porposes a general framework ``Exploratory Learning'' that tackles learning scenarios which are superset of semi-supervised learning in the sense that some of the classes might be unknown to the algorithm, and can be discovered from the unlabeled data. More information: link.
Publications
Google Scholar ProfileJournal Papers
- Keyword search on external memory data graphs, Bhavana Bharat Dalvi, Meghana Kshirsagar and S. Sudarshan, Proceedings of the VLDB Endowment, VLDB 2008 (Acceptance rate: 16%) [PDF]
Conference Papers
- Hierarchical Semi-supervised Classification with Incomplete Class Hierarchies, Bhavana Dalvi Mishra, Aditya Mishra and William W. Cohen, WSDM 2016 (Acceptance rate: 18.2%) [PDF],
[Slides],
[Poster],
[Dataset:link ]
- Automatic Gloss Finding for a Knowledge Base using Ontological Constraints, Bhavana Dalvi Mishra, Einat Minkov, Partha Pratim Talukdar, and William W. Cohen, WSDM 2015 (Acceptance rate: 16.8%)
[PDF],
[Code: zipped folder],
[Slides](presented in LTI SRS 2014),
[Dataset:link ]
- Never-Ending Learning, Tom Mitchell, William Cohen, Estevam Hruschka, Partha Talukdar, Justin Betteridge, Andrew Carlson, Bhavana Dalvi, Matt Gardner, Bryan Kisiel, Jayant Krishnmurthy, Ni Lao, Kathryn Mazaitis, Tahir Mohammad, Ndapa Nakashole, Emmanouil Antonios Platanios, Alan Ritter, Mehdi Samadi, Burr Settles, Richard Wang, Derry Wijaya, Abhinav Gupta, Xinlei Chen, Abulhair Saparov, Malcolm Greaves and Joel Welling, AAAI 2015
[PDF]
- Integrating Energy Storage in Electricity Distribution Networks., Aditya Mishra, Ramesh Sitaraman, David Irwin, Ting Zhu, Prashant Shenoy, Bhavana Dalvi Mishra, and Stephen Lee, Proceedings of the 6th ACM Intl. Conference on Future Energy Systems (ACM e-Energy) 2015 [PDF]
- Multi-View Hierarchical Semi-supervised Learning by Optimal Assignment of Sets of Labels to Instances, Bhavana Dalvi Mishra, and William W. Cohen, In preparation.
[Draft],
[Dataset:link]
- Exploratory Learning , Bhavana Dalvi Mishra, William W. Cohen and Jamie Callan, in Proceedings of European Conference on “Machine Learning” ECML/PKDD 2013
[PDF],
[Slides],
[Poster],
[Code: link]
- From Topic Models to Semi-Supervised Learning: Biasing Mixed-membership Models to Exploit Topic-Indicative Features in Entity Clustering , Ramnath Balasubramanyan, Bhavana Dalvi Mishra and William W. Cohen, in Proceedings of European Conference on “Machine Learning” ECML/PKDD 2013 [PDF]
- Very Fast Similarity Queries on Semi-Structured Data from the Web , Bhavana Dalvi and William W. Cohen, Proceedings of the SIAM International Conference on Data Mining SDM 2013
[PDF],
[Poster]
- WebSets: Extracting Sets of Entities from the Web Using
Unsupervised Information Extraction, Bhavana Dalvi, William W. Cohen and Jamie Callan, Proceedings of the The Fifth ACM International Conference on Web Search and Data Mining, WSDM 2012 (Acceptance rate: 20.7%)
[PDF],
[Slides],
[Poster],
[Datasets and Evaluation : link]
- Entity List Completion Using Set Expansion Techniques, Bhavana Dalvi, Jamie Callan and William Cohen, Proceedings of the The Nineteenth Text REtrieval Conference, TREC 2010 [PDF]
Publications from Workshops/Challenges
- IKE - An Interactive Tool for Knowledge Extraction, Bhavana Dalvi, Sumithra Bhakthavatsalam, Chris Clark, Peter Clark, Oren Etzioni, Anthony Fader, and Dirk Groeneveld, in Proceedings of AKBC 2016, 5th Knowledge Extraction workshop at NAACL 2016.
[PDF],
[Project page] Demo and code coming up soon...
- Multi-view Exploratory Learning for AKBC Problems, Bhavana Dalvi Mishra and William W. Cohen, in Proceedings of AKBC 2014, 4th Knowledge Extraction workshop at NIPS 2014.
[PDF],
[Poster]
- A Tale of Two Entity Linking and Discovery Systems in KBP-TAC 2014, Kathryn Mazaitis, Richard C. Wang, Frank Lin, Bhavana Dalvi, Jakob Bauer, William W. Cohen, in Proceedings of, Knowledge Base Population (KBP) workshop, KBP-TAC 2014.
[PDF],
- A Language Modeling Approach to Entity Recognition and Disambiguation for Search Queries, Bhavana Dalvi Mishra, Chenyan Xiong, and Jamie Callan, in Proceedings of ERD 2014, Entity Recognition and Disambiguation Challenge at SIGIR 2014. [PDF]
- Classifying Entities into an Incomplete Ontology , Bhavana Dalvi Mishra, William W. Cohen and Jamie Callan, in Proceedings of AKBC 2013, 3rd Knowledge Extraction workshop at CIKM 2013.
[PDF],
[Slides],
[Poster]
- Collectively Representing Semi-Structured Data from the Web , Bhavana Dalvi, William W. Cohen and Jamie Callan, Proceedings of the NAACL HLT 2012 Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction AKBC-WEKEX 2012
[PDF],
[Slides],
[Poster],
(Best paper runner-up)
- Structure, Tie Persistence and Event Detection in Large Phone and SMS Networks, Leman Akoglu and Bhavana Dalvi, Proceedings of the Eighth Workshop on Mining and Learning with Graphs, KDD 2010 [PDF]
Recent Activities
- Apr 2016: A workshop paper accepted in Automated Knowledge Base Construction Workshop AKBC 2016, at NAACL 2016.
- Mar 2016: Program committee member for IJCAI 2016 25th International Joint Conference on Artificial Intelligence IJCAI-16.
- Jan 2016: Gave a research talk at UWash CSE Women's Research Day link
- Oct 2015: A Full paper accepted in WSDM 2016!!
- Oct 2015: Reviewing for TKDE Journal, IEEE Transactions on Knowledge and Data Engineering.
- Jan 2015: Program committee member for WebDB 2015, at SIGMOD 2015.
- Nov 2014: A workshop paper accepted in Automated Knowledge Base Construction Workshop AKBC 2014, at NIPS 2014.
- October 2014: A full paper accepted in WSDM 2015!!
- October 2014: Gave a guest lecture in CS-601 Machine Learning class at CMU. Slides in [PPT], [PDF]
- October 2014: Program committee member for AKBC 2014 4th Workshop on Automated Knowledge Base Construction at NIPS 2014
- August 2014: Awarded Honorable mention for presentation at the 2014 LTI Student Research Symposium (SRS 2014) held in CMU on 21st Aug'14.
- June 2014: A workshop paper published in Entity Recognition and Disambiguation Challenge ERD 2014, at SIGIR 2014.
- Mar 2014: Program committee member for EMNLP 2014 (Information Extraction track)
- Dec 2013: Presented Thesis proposal. Committee members: Prof. William Cohen(CMU), Prof. Jamie Callan(CMU), Prof. Tom Mitchell(CMU), and Dr. Alon Halevy(Google Research).
- August 2013: Awarded Honorable mention for presentation at the 2013 LTI Student Research Symposium (SRS 2013) held in CMU on 21st Aug'13.
- June 2013: Received 2013 Google U.S./Canada Fellowship in Information Extraction : (List of recipients)
- April 2013: Student travel award for SDM 2013
- Jan 2013: Got married to an awesome person Aditya Mishra :)
- Summer 2012: Intern at Google research, Mountain View. Host : Anish Das Sarma, Team lead by : Alon Halevy
- Program committee member for EMNLP 2012, NAACL HLT 2013, AKBC 2013, AKBC 2014, EMNLP 2014, WebDB 2015, IJCAI 2016, AKBC 2016
- Reviewer for Journal of Internet and Information Systems 2011, WWW Journal 2011, ECML/PKDD 2013 Journal track, WSDM 2014 (secondary reviewer), VLDB 2014 (secondary reviewer), TKDE Journal 2015 (IEEE Transactions on Knowledge and Data Engineering).
- Fall 2012, Spring 2011: Teaching assistant for Analysis of Social Media 10-802
Contact
Email: bhavana DOT dalvi AT gmail DOT com
Resume
[PDF] [Last updated: Spring 2015]
Hobbies
- Listening to music
- Reading motivational books
- Watching animation movies
- Hierarchical Semi-supervised Classification with Incomplete Class Hierarchies, Bhavana Dalvi Mishra, Aditya Mishra and William W. Cohen, WSDM 2016 (Acceptance rate: 18.2%) [PDF], [Slides], [Poster], [Dataset:link ]
- Automatic Gloss Finding for a Knowledge Base using Ontological Constraints, Bhavana Dalvi Mishra, Einat Minkov, Partha Pratim Talukdar, and William W. Cohen, WSDM 2015 (Acceptance rate: 16.8%)
[PDF], [Code: zipped folder], [Slides](presented in LTI SRS 2014), [Dataset:link ] - Never-Ending Learning, Tom Mitchell, William Cohen, Estevam Hruschka, Partha Talukdar, Justin Betteridge, Andrew Carlson, Bhavana Dalvi, Matt Gardner, Bryan Kisiel, Jayant Krishnmurthy, Ni Lao, Kathryn Mazaitis, Tahir Mohammad, Ndapa Nakashole, Emmanouil Antonios Platanios, Alan Ritter, Mehdi Samadi, Burr Settles, Richard Wang, Derry Wijaya, Abhinav Gupta, Xinlei Chen, Abulhair Saparov, Malcolm Greaves and Joel Welling, AAAI 2015
[PDF] - Integrating Energy Storage in Electricity Distribution Networks., Aditya Mishra, Ramesh Sitaraman, David Irwin, Ting Zhu, Prashant Shenoy, Bhavana Dalvi Mishra, and Stephen Lee, Proceedings of the 6th ACM Intl. Conference on Future Energy Systems (ACM e-Energy) 2015 [PDF]
- Multi-View Hierarchical Semi-supervised Learning by Optimal Assignment of Sets of Labels to Instances, Bhavana Dalvi Mishra, and William W. Cohen, In preparation.
[Draft], [Dataset:link] - Exploratory Learning , Bhavana Dalvi Mishra, William W. Cohen and Jamie Callan, in Proceedings of European Conference on “Machine Learning” ECML/PKDD 2013
[PDF], [Slides], [Poster], [Code: link] - From Topic Models to Semi-Supervised Learning: Biasing Mixed-membership Models to Exploit Topic-Indicative Features in Entity Clustering , Ramnath Balasubramanyan, Bhavana Dalvi Mishra and William W. Cohen, in Proceedings of European Conference on “Machine Learning” ECML/PKDD 2013 [PDF]
- Very Fast Similarity Queries on Semi-Structured Data from the Web , Bhavana Dalvi and William W. Cohen, Proceedings of the SIAM International Conference on Data Mining SDM 2013
[PDF], [Poster] - WebSets: Extracting Sets of Entities from the Web Using
Unsupervised Information Extraction, Bhavana Dalvi, William W. Cohen and Jamie Callan, Proceedings of the The Fifth ACM International Conference on Web Search and Data Mining, WSDM 2012 (Acceptance rate: 20.7%)
[PDF], [Slides], [Poster], [Datasets and Evaluation : link] - Entity List Completion Using Set Expansion Techniques, Bhavana Dalvi, Jamie Callan and William Cohen, Proceedings of the The Nineteenth Text REtrieval Conference, TREC 2010 [PDF]
Publications from Workshops/Challenges
- IKE - An Interactive Tool for Knowledge Extraction, Bhavana Dalvi, Sumithra Bhakthavatsalam, Chris Clark, Peter Clark, Oren Etzioni, Anthony Fader, and Dirk Groeneveld, in Proceedings of AKBC 2016, 5th Knowledge Extraction workshop at NAACL 2016.
[PDF],
[Project page] Demo and code coming up soon...
- Multi-view Exploratory Learning for AKBC Problems, Bhavana Dalvi Mishra and William W. Cohen, in Proceedings of AKBC 2014, 4th Knowledge Extraction workshop at NIPS 2014.
[PDF],
[Poster]
- A Tale of Two Entity Linking and Discovery Systems in KBP-TAC 2014, Kathryn Mazaitis, Richard C. Wang, Frank Lin, Bhavana Dalvi, Jakob Bauer, William W. Cohen, in Proceedings of, Knowledge Base Population (KBP) workshop, KBP-TAC 2014.
[PDF],
- A Language Modeling Approach to Entity Recognition and Disambiguation for Search Queries, Bhavana Dalvi Mishra, Chenyan Xiong, and Jamie Callan, in Proceedings of ERD 2014, Entity Recognition and Disambiguation Challenge at SIGIR 2014. [PDF]
- Classifying Entities into an Incomplete Ontology , Bhavana Dalvi Mishra, William W. Cohen and Jamie Callan, in Proceedings of AKBC 2013, 3rd Knowledge Extraction workshop at CIKM 2013.
[PDF],
[Slides],
[Poster]
- Collectively Representing Semi-Structured Data from the Web , Bhavana Dalvi, William W. Cohen and Jamie Callan, Proceedings of the NAACL HLT 2012 Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction AKBC-WEKEX 2012
[PDF],
[Slides],
[Poster],
(Best paper runner-up)
- Structure, Tie Persistence and Event Detection in Large Phone and SMS Networks, Leman Akoglu and Bhavana Dalvi, Proceedings of the Eighth Workshop on Mining and Learning with Graphs, KDD 2010 [PDF]
Recent Activities
- Apr 2016: A workshop paper accepted in Automated Knowledge Base Construction Workshop AKBC 2016, at NAACL 2016.
- Mar 2016: Program committee member for IJCAI 2016 25th International Joint Conference on Artificial Intelligence IJCAI-16.
- Jan 2016: Gave a research talk at UWash CSE Women's Research Day link
- Oct 2015: A Full paper accepted in WSDM 2016!!
- Oct 2015: Reviewing for TKDE Journal, IEEE Transactions on Knowledge and Data Engineering.
- Jan 2015: Program committee member for WebDB 2015, at SIGMOD 2015.
- Nov 2014: A workshop paper accepted in Automated Knowledge Base Construction Workshop AKBC 2014, at NIPS 2014.
- October 2014: A full paper accepted in WSDM 2015!!
- October 2014: Gave a guest lecture in CS-601 Machine Learning class at CMU. Slides in [PPT], [PDF]
- October 2014: Program committee member for AKBC 2014 4th Workshop on Automated Knowledge Base Construction at NIPS 2014
- August 2014: Awarded Honorable mention for presentation at the 2014 LTI Student Research Symposium (SRS 2014) held in CMU on 21st Aug'14.
- June 2014: A workshop paper published in Entity Recognition and Disambiguation Challenge ERD 2014, at SIGIR 2014.
- Mar 2014: Program committee member for EMNLP 2014 (Information Extraction track)
- Dec 2013: Presented Thesis proposal. Committee members: Prof. William Cohen(CMU), Prof. Jamie Callan(CMU), Prof. Tom Mitchell(CMU), and Dr. Alon Halevy(Google Research).
- August 2013: Awarded Honorable mention for presentation at the 2013 LTI Student Research Symposium (SRS 2013) held in CMU on 21st Aug'13.
- June 2013: Received 2013 Google U.S./Canada Fellowship in Information Extraction : (List of recipients)
- April 2013: Student travel award for SDM 2013
- Jan 2013: Got married to an awesome person Aditya Mishra :)
- Summer 2012: Intern at Google research, Mountain View. Host : Anish Das Sarma, Team lead by : Alon Halevy
- Program committee member for EMNLP 2012, NAACL HLT 2013, AKBC 2013, AKBC 2014, EMNLP 2014, WebDB 2015, IJCAI 2016, AKBC 2016
- Reviewer for Journal of Internet and Information Systems 2011, WWW Journal 2011, ECML/PKDD 2013 Journal track, WSDM 2014 (secondary reviewer), VLDB 2014 (secondary reviewer), TKDE Journal 2015 (IEEE Transactions on Knowledge and Data Engineering).
- Fall 2012, Spring 2011: Teaching assistant for Analysis of Social Media 10-802
Contact
Email: bhavana DOT dalvi AT gmail DOT com
Resume
[PDF] [Last updated: Spring 2015]
Hobbies
- Listening to music
- Reading motivational books
- Watching animation movies
[PDF], [Project page] Demo and code coming up soon...
[PDF], [Poster]
[PDF],
[PDF], [Slides], [Poster]
[PDF], [Slides], [Poster], (Best paper runner-up)