Abhinav Gupta

 
 

I am a post-doctoral fellow at Robotics Institute in School of Computer Science at Carnegie Mellon University. I am currently working with Alyosha Efros and Martial Hebert. Before coming to Pittsburgh, I spent wonderful years at University of Maryland where I got my Phd under the supervision of Larry Davis. I also spent four months as a visiting scholar in the GRASP lab (UPenn) working with Jianbo Shi.

 
News and Announcements
Software Watches Baseball and Calls the Plays
CVPR 2009 paper is covered in the IEEE Spectrum and Discovery News (See Links)

Human Object Interaction Dataset is Now Available 
PAMI 2009 paper dataset is now available in downloads sectionhttp://www.spectrum.ieee.org/computing/software/software-watches-baseball-and-calls-the-playshttp://www.spectrum.ieee.org/computing/software/software-watches-baseball-and-calls-the-playshttp://dsc.discovery.com/news/2009/07/10/baseball-software.htmlDownloads.htmlshapeimage_1_link_0shapeimage_1_link_1shapeimage_1_link_2shapeimage_1_link_3

About Me

Postdoctoral Fellow

Robotics Institute
Carnegie Mellon University

Office:  NSH 4207
Phone: (412) 268-1234
Email:   abhinavg [at] cs

 

Research Interests

 

Selected Projects

Understanding Videos, Constructing Plots - Learning a Visually Grounded Storyline Model from Annotated Videos

Abhinav Gupta, Praveen Srinivasan, Jianbo Shi and Larry S. Davis
In CVPR 2009 (Oral) (PPT)


Featured in an IEEE Spectrum and Discovery article. Also covered in Ethiopian Review and SiliconIndia.

 

My research interest include:


  1. Bullet Semantic Models: How do we represent link between various concepts ? or How do we represent and store relationships ? I am interested in exploring semantic models for visual inference. These semantic models store relationships between different concepts (nouns  and verbs). I have explored storyline models which represents possible storylines for given video(CVPR 2009). Using this model, instead of inferring only concepts we simultaneously infer concepts and causal links between the concepts. I have also looked in contextual models which are based on language itself (ECCV2008). It has been argued that language is  an instrument of thinking and therefore can represent all the relationships required for visual reasoning. In this work, we represented relationships between nouns using prepositions and comparative adjectives.


  1. Bullet  Exploring synergy between language and vision for learning semantics: What role does language play in helping us learn semantics ? I am interested in exploring how declarative information and other linguistic information can be harnessed to efficiently learn the semantic models. I am also interesting in exploring how we can obtain such linguistic information.


  1. Bullet Computer Vision: Activity Analysis, Object Recognition, Motion and Tracking, Camera Networks, Combining different visual cues for effective recognition.

Beyond Nouns: Exploiting prepositions and comparative adjectives for learning visual classifiers
Abhinav Gupta and Larry S. Davis
In ECCV 2008 (Oral)



 
Observing Human-Object Interactions: Using Spatial and Functional Compatibility for Recognition
Abhinav Gupta, Aniruddha Kembhavi and Larry S. Davis
In Trans. on PAMI (Special Issue on Probabilistic Graphical Models)

Downloads Available: Dataset

Objects in Action: An Approach for Combining Action Understanding and Object Perception

Abhinav Gupta and Larry S. Davis

In CVPR 2007


Downloads Available: Dataset

 
 

A “Shape Aware” Model for semi-supervised Learning of Objects and its Context
Abhinav Gupta, Jianbo Shi and Larry S. Davis
In NIPS 2008 (Spotlight Poster)