
Abhinav Gupta


  • Chen Sun, Abhinav Shrivastava, Saurabh Singh, Abhinav Gupta. Revisiting Unreasonable Effectiveness of Data in Deep Learning Era. ICCV 2017. (Spotlight)
  • Xiaolong Wang, Kaiming He, Abhinav Gupta. Transitive Invariance for Self-Supervised Visual Representation Learning. ICCV 2017.
  • Xinlei Chen, Abhinav Gupta. Spatial Memory for Context Reasoning in Object Detection. ICCV 2017.
  • Gunnar A. Sigurdsson, Olga Russakovsky, Abhinav Gupta. What Actions Are Needed for Understanding Human Actions in Videos? ICCV 2017.
  • Jacob Walker, Kenneth Marino, Abhinav Gupta, Martial Hebert. The Pose Knows: Video Forecasting by Generating Pose Futures. ICCV 2017.
  • Yuan Yuan, Xiaodan Liang, Xiaolong Wang, Dit-Yan Yeung, Abhinav Gupta. Temporal Dynamic Graph LSTM for Action-Driven Video Object Detection. ICCV 2017
  • Yuke Zhu*, Daniel Gordon*, Eric Kolve, Dieter Fox, Li Fei-Fei, Abhinav Gupta, Roozbeh Mottaghi, Ali Farhadi. Visual Semantic Planning using Deep Successor Representations. ICCV 2017
  • Dhiraj Gandhi, Lerrel Pinto, Abhinav Gupta. Learning to fly by crashing. IROS 2017.
  • Lerrel Pinto, James Davidson, Rahul Sukthankar, Abhinav Gupta. Robust Adversarial Reinforcement Learning. ICML 2017.
  • Ishan Misra, Abhinav Gupta, Martial Hebert. From Red Wine to Red Tomato: Composition With Context. CVPR 2017. (Oral Presentation)
  • Kenneth Marino, Ruslan Salakhutdinov, Abhinav Gupta. The More You Know: Using Knowledge Graphs for Image Classification. CVPR 2017. (Spotlight)
  • Xiaolong Wang, Rohit Girdhar, Abhinav Gupta. Binge Watching: Scaling Affordance Learning From Sitcoms. CVPR 2017. (Spotlight)
  • Siddha Ganju, Olga Russakovsky, Abhinav Gupta. What's in a Question: Using Visual Questions as a Form of Supervision. CVPR 2017 (Spotlight).
  • Andreas Veit, Neil Alldrin, Gal Chechik, Ivan Krasin, Abhinav Gupta, Serge Belongie. Learning From Noisy Large-Scale Datasets With Minimal Supervision. CVPR 2017 (Spotlight).
  • Xiaolong Wang, Abhinav Shrivastava, Abhinav Gupta. A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection. CVPR 2017.
  • Gunnar A. Sigurdsson, Santosh Divvala, Ali Farhadi, Abhinav Gupta. Asynchronous Temporal Fields for Action Recognition. CVPR 2017
  • Rohit Girdhar, Deva Ramanan, Abhinav Gupta, Josef Sivic, Bryan Russell. ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification. CVPR 2017.
  • Lerrel Pinto, James Davidson and Abhinav Gupta. Supervision via Competition: Robot Adversaries for Learning Tasks. ICRA 2017
  • Lerrel Pinto and Abhinav Gupta. Learning to Push by Grasping: Using multiple tasks for effective learning. ICRA 2017
  • Yuke Zhu, Roozbeh Mottaghi, Eric Kolve, Joseph J. Lim, Abhinav Gupta, Li Fei-Fei, Ali Farhadi. Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement Learning. ICRA 2017


  • Lerrel Pinto, Dhiraj Gandhi, Yuanfeng Han, Yong-Lae Park and Abhinav Gupta. The Curious Robot: Learning Visual Representations via Physical Interactions. ECCV 2016. (Spotlight)
  • R. Girdhar, D. Fouhey, M. Rodriguez, A. Gupta. Learning a Predictable and Generative Vector Representation for Objects. ECCV 2016. (Spotlight)
  • Xiaolong Wang and Abhinav Gupta. Generative Image Modeling using Style and Structure Adversarial Networks. ECCV 2016.
  • Abhinav Shrivastava, Abhinav Gupta. Contextual Priming and Feedback for Faster R-CNN. ECCV 2016.
  • Gunnar A. Sigurdsson, Gül Varol, Xiaolong Wang, Ivan Laptev, Ali Farhadi, Abhinav Gupta. Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding. ECCV 2016.
  • Roozbeh Mottaghi, Mohammad Rastegari, Abhinav Gupta, Ali Farhadi. What happens if... Learning to Predict the Effect of Forces in Images. ECCV 2016.
  • Gunnar A. Sigurdsson, Xinlei Chen, Abhinav Gupta, Learning Visual Storylines with Skipping Recurrent Neural Networks, ECCV 2016.
  • Lerrel Pinto and Abhinav Gupta. Supersizing Self-supervision: Learning to Grasp from 50K Tries and 700 Robot Hours. In ICRA 2016. arXiv:1509.06825. (Best Student Paper Award).
  • Abhinav Shrivastava, Abhinav Gupta, Ross Girshick. Training Region-based Object Detectors with Online Hard Example Mining. In CVPR 2016. (Oral Presentation)
  • D. Fouhey, A. Gupta, A. Zisserman. 3D Shape Attributes. In CVPR 2016. (Oral Presentation).
  • Xiaolong Wang and Ali Farhadi and Abhinav Gupta. Actions ~ Transformations. In CVPR 2016.
  • Ishan Misra, Abhinav Shrivastava, Abhinav Gupta and Martial Hebert. Cross-stitch Networks for Multi-Task Learning. In CVPR 2016.
  • Aayush Bansal, Bryan Russell, Abhinav Gupta. Marr Revisited: 2D-3D Alignment via Surface Normal Prediction. In CVPR 2016.


  • Carl Doersch, Abhinav Gupta, and Alexei A. Efros. Unsupervised Visual Representation Learning by Context Prediction. In IEEE International Conference on Computer Vision (ICCV), 2015. (Oral Presentation)
  • Xiaolong Wang and Abhinav Gupta. Unsupervised Learning of Visual Representations using Videos. In IEEE International Conference on Computer Vision (ICCV), 2015.
  • Xinlei Chen, Abhinav Gupta. Learning of Convolutional Networks using Web Data. In IEEE International Conference on Computer Vision (ICCV), 2015 (Oral Presentation)
  • David Fouhey, Abhinav Gupta, and Martial Hebert. Single Image 3D Without a Single 3D Image. In IEEE International Conference on Computer Vision (ICCV), 2015.
  • Jacob Walker, Abhinav Gupta, and Martial Hebert. Dense Optical Flow Prediction from a Static Image. In IEEE International Conference on Computer Vision (ICCV), 2015.
  • Xiaolong Wang, David F. Fouhey, and Abhinav Gupta. Designing Deep Networks for Surface Normal Estimation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
  • Xinlei Chen, Alan Ritter, Abhinav Gupta, Tom Mitchell. Sense Discovery via Co-Clustering on Images and Text. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
  • T. Mitchell, W. Cohen, E. Hruschka, P. Talukdar, J. Betteridge, A. Carlson, B. Dalvi, M. Gardner, B. Kisiel, J. Krishnamurthy, N. Lao, K. Mazaitis, T. Mohamed, N. Nakashole, E. Platanios, A. Ritter, M. Samadi, B. Settles, R. Wang, D. Wijaya, A. Gupta, X. Chen, A. Saparov, M. Greaves, J. Welling. Never-Ending Learning. In Proceedings of the Conference on Artificial Intelligence (AAAI), 2015.
  • EM Aminoff, M Toneva, A Shrivastava, X Chen, I Misra, A Gupta, MJ Tarr, Applying artificial vision models to human scene understanding, In Frontiers in Computational Neuroscience, 9 (8), 2015.


  • David Fouhey, Abhinav Gupta and Martial Hebert. Unfolding an Indoor Origami World. In European Conference on Computer Vision (ECCV), 2014. (Oral Presentation)
  • Carl Doersch, Abhinav Gupta and Alexei Efros. Context as Supervisory Signal: Discovering Objects with Predictable Context. In European Conference on Computer Vision (ECCV), 2014.
  • Jacob Walker, Abhinav Gupta and Martial Hebert. Patch to the Future: Unsupervised Visual Prediction. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014. (Oral Presentation)
  • Xinlei Chen, Abhinav Shrivastava and Abhinav Gupta. Enriching Visual Knowledge Bases via Object Discovery and Segmentation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014.


  • Xinlei Chen, Abhinav Shrivastava, Abhinav Gupta. NEIL: Extracting Visual Knowledge from Web Data. In IEEE International Conference on Computer Vision (ICCV), 2013. (Oral Presentation)
  • Abhinav Shrivastava, Abhinav Gupta. Building Parts-based Object Detectors via 3D Geometry. In IEEE International Conference on Computer Vision (ICCV), 2013.
  • David Fouhey, Abhinav Gupta, Martial Hebert. Data-Driven 3D Primitives for Single Image Understanding. In IEEE International Conference on Computer Vision (ICCV), 2013.
  • Carl Doersch, Abhinav Gupta, Alexei Efros. Mid-Level Visual Element Discovery as Discriminative Mode Seeking. In Neural Information Processing Systems (NIPS), 2013.
  • Arpit Jain, Abhinav Gupta, Mikel Rodriguez, Larry S. Davis. Representing Videos using Mid-level Discriminative Patches. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013.


  • David Fouhey, Vincent Delaitre, Abhinav Gupta, Alexei A. Efros, Ivan Laptev, Josef Sivic. People Watching: Human Actions as a Cue for Single View Geometry. In ECCV 2012. (Oral)
  • Abhinav Shrivastava, Saurabh Singh, Abhinav Gupta. Constrained Semi-Supervised Learning Using Attributes and Comparative Attributes. In ECCV 2012. (Oral)
  • Saurabh Singh, Abhinav Gupta, Alexei A. Efros. Unsupervised Discovery of Mid-Level Discriminative Patches. In ECCV 2012.
  • Vincent Delaitre, David Fouhey, Ivan Laptev, Josef Sivic, Abhinav Gupta, Alexei A. Efros. Scene Semantics from Long-term Observation of People. In ECCV 2012.
  • Carl Doersch, Saurabh Singh, Abhinav Gupta, Josef Sivic, Alexei A. Efros. What makes Paris look like Paris? In SIGGRAPH 2012.


  • Abhinav Shrivastava, Tomasz Malisiewicz, Abhinav Gupta, Alexei A. Efros, Data-driven Visual Similarity for Cross-domain Image Matching, In SIGGRAPH Asia 2011.
  • Tomasz Malisiewicz, Abhinav Gupta, Alexei A. Efros, Ensemble of Exemplar-SVMs for Object Detection and Beyond, In ICCV 2011.
  • Abhinav Gupta, Scott Satkin, Alexei A. Efros and Martial Hebert, From 3D Scene Geometry to Human Workspace. In CVPR 2011. (Oral)
  • Xi Chen, Arpit Jain, Abhinav Gupta and Larry S Davis, Piecing Together the Segmentation Jigsaw using Context, In CVPR 2011.


  • David C. Lee, Abhinav Gupta, Martial Hebert, and Takeo Kanade, Estimating Spatial Layout of Rooms using Volumetric Reasoning about Objects and Surfaces, In NIPS 2010.
  • Abhinav Gupta, Alexei Efros and Martial Hebert, Blocks World Revisited: Image Understanding using Qualitative Geometry and Mechanics, In ECCV 2010. (Oral) (Best Paper Runner Up Award)
  • Arpit Jain, Abhinav Gupta and Larry S. Davis, Learning What and How of Contextual Models for Scene Labeling, In ECCV 2010.
  • Behjat Siddiquie and Abhinav Gupta, Beyond Active Noun Tagging: Modeling Contextual Interactions for Multi-Class Active Learning, In CVPR 2010. (Oral)


  • Abhinav Gupta, Praveen Srinivasan, Jianbo Shi and Larry S. Davis, Understanding Videos, Constructing Plots: Learning a Visually Grounded Storyline Model from Annotated Videos, In CVPR 2009. (Oral)
  • Abhinav Gupta, Aniruddha Kembhavi and Larry S. Davis, Observing Human-Object Interactions: Using Spatial and Functional Compatibility for Recognition. To Appear in IEEE Transactions on Pattern Analysis and Machine Intelligence (Special Issue on Probabilistic Graphical Models).


  • Abhinav Gupta, Jianbo Shi and Larry S. Davis, A "Shape Aware" Model for semi-supervised Learning of Objects and its Context, In NIPS 2008. (Spotlight Poster)
  • Abhinav Gupta and Larry S. Davis, Beyond nouns: Exploiting prepositions and comparative adjectives for learning visual classifiers, In ECCV 2008. (Oral)
  • Abhinav Gupta, Anurag Mittal and Larry S. Davis, Constraint Integration for Efficient Multiview Pose Estimation with Self-Occlusions, IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(3), March 2008, pp. 493-506.
  • Abhinav Gupta, Trista Chen, Francine Chen, Don Kimber and Larry S. Davis, Context and Observation Driven Latent Variable Model for Human Pose Estimation, In CVPR 2008.


  • Abhinav Gupta, Anurag Mittal and Larry S. Davis, COST*: An Approach for Camera Selection and Multi-Object Inference Ordering in Dynamic Scenes, In ICCV 2007.
  • Abhinav Gupta and Larry S. Davis, Objects in Action: An Approach for Combining Action Understanding and Object Perception, In CVPR 2007.
  • Abhinav Gupta, Anurag Mittal and Larry S. Davis, Constraint Integration for Multiview Pose Estimation of Humans with Self-Occlusions, 3DPVT 2006.
  • Abhinav Gupta, V. Shiv Naga Prasad and Larry S. Davis, Extracting Regions of Symmetry, ICIP 2005, Genova, Italy, pp. 133-136.
  • Ashutosh Saxena, Abhinav Gupta and Amitabha Mukerjee, Non-Linear Dimensionality Reduction by Locally Linear Isomaps, ICONIP 2004, Published in Lecture Notes in Computer Science.