Sanket Vaibhav Mehta [SVM]

I'm a PhD candidate at the Language Technologies Institute (LTI) at School of Computer Science, Carnegie Mellon University and I'm advised by Emma Strubell.

Previously, I obtained a Master's degree (2019) from the LTI where I was advised by Jaime Carbonell and Barnabás Póczos.

Before joining CMU, I worked as a member of the research staff at Big Data Research Lab, Adobe Research (2015-17) where I worked on designing algorithms for identifying data-driven geo-fences to assist Adobe’s digital marketing offering.

I graduated from Indian Institute of Technology Roorkee with a B.Tech in Computer Science (2011-15) and a President's Gold Medal.

Email  /  CV  /  Google Scholar  /  X  /  GitHub

profile photo

I'm interested in machine learning, natural language processing and optimization with a specific focus on learning from limited labeled data, multiple tasks, non-stationary data distributions (Lifelong Learning, Transfer Learning, Meta-Learning, Multi-Task Learning).

Latest News
clean-usnob An Empirical Investigation of the Role of Pre-training in Lifelong Learning
Sanket Vaibhav Mehta, Darshan Patil, Sarath Chandar, Emma Strubell
The Journal of Machine Learning Research, 2023
bibtex / code
clean-usnob DSI++: Updating Transformer Memory with New Documents
Sanket Vaibhav Mehta, Jai Gupta, Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Jinfeng Rao, Marc Najork, Emma Strubell, Donald Metzler
arXiv, 2022
bibtex / tweet
clean-usnob Train Flat, Then Compress: Sharpness-Aware Minimization Learns More Compressible Models
Clara Na, Sanket Vaibhav Mehta, Emma Strubell
EMNLP Findings, 2022
bibtex / code / tweet
clean-usnob An Introduction to Lifelong Supervised Learning
Shagun Sodhani, Mojtaba Faramarzi, Sanket Vaibhav Mehta, Pranshu Malviya, Mohamed Abdelsalam, Janarthanan Janarthanan, Sarath Chandar
arXiv, 2022
bibtex / tweet
clean-usnob Improving Compositional Generalization with Self-Training for Data-to-Text Generation
Sanket Vaibhav Mehta, Jinfeng Rao, Yi Tay, Mihir Kale, Ankur Parikh, Emma Strubell
ACL, 2022
bibtex / code / poster
clean-usnob ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning
Vamsi Aribandi, Yi Tay, Tal Schuster, Jinfeng Rao, Huaixiu Steven Zheng, Sanket Vaibhav Mehta, Honglei Zhuang, Vinh Q. Tran, Dara Bahri, Jianmo Ni, Jai Gupta, Kai Hui, Sebastian Ruder, Donald Metzler
ICLR, 2022
bibtex / press / tweet
clean-usnob Efficient Meta Lifelong-Learning with Limited Memory
Sanket Vaibhav Mehta*, Zirui Wang*, Barnabás Póczos, Jaime Carbonell
EMNLP, 2020
clean-usnob Learning Rhyming Constraints using Structured Adversaries
Harsh Jhamtani, Sanket Vaibhav Mehta, Jaime Carbonell, Taylor Berg-Kirkpatrick
EMNLP, 2019
bibtex / code / poster
clean-usnob Gradient-Based Inference for Networks with Output Constraints
Jay-Yoon Lee, Sanket Vaibhav Mehta, Michael Wick, Jean-Baptiste Tristan, Jaime Carbonell
AAAI, 2019
bibtex / code
clean-usnob Towards Semi-Supervised Learning for Deep Semantic Role Labeling
Sanket Vaibhav Mehta*, Jay-Yoon Lee*, Jaime Carbonell
EMNLP, 2018
bibtex / code / poster
clean-usnob An LSTM Based System for Prediction of Human Activities with Durations
Kundan Krishna, Deepali Jain, Sanket Vaibhav Mehta, Sunav Choudhary
IMWUT, 2017
clean-usnob Preventing Inadvertent Information Disclosures via Automatic Security Policies
Tanya Goyal, Sanket Vaibhav Mehta, Balaji Vasan Srinivasan
PAKDD, 2017
Issued Patents
1. Generating data-driven geo-fences (US 9,838,843)
2. Propagation of changes in master content to variant content (US 10,102,191)
3. Digital document update (US 10,489,498)
4. Tagging documents with security policies (US 10,783,262)
5. Digital document update using static and transient tags (US 10,846,466)
6. Tenant-side detection, classification, and mitigation of noisy-neighbor-induced performance degradation (US 11,086,646)

Based on Jon Barron's website.