Abhinav Gupta
Abhinav Gupta
I am a post-doctoral fellow at Robotics Institute in School of Computer Science at Carnegie Mellon University. I am currently working with Alyosha Efros and Martial Hebert. Before coming to Pittsburgh, I spent wonderful years at University of Maryland where I got my Phd under the supervision of Larry Davis. I also spent four months as a visiting scholar in the GRASP lab (UPenn) working with Jianbo Shi.




About Me
Postdoctoral Fellow
Robotics Institute
Carnegie Mellon University
Office: NSH 4207
Phone: (412) 268-1234
Email: abhinavg [at] cs
Research Interests
Selected Projects
Understanding Videos, Constructing Plots - Learning a Visually Grounded Storyline Model from Annotated Videos
Abhinav Gupta, Praveen Srinivasan, Jianbo Shi and Larry S. Davis
In CVPR 2009 (Oral) (PPT)
Featured in an IEEE Spectrum and Discovery article. Also covered in Ethiopian Review and SiliconIndia.
My research interest include:
Semantic Models: How do we represent link between various concepts ? or How do we represent and store relationships ? I am interested in exploring semantic models for visual inference. These semantic models store relationships between different concepts (nouns and verbs). I have explored storyline models which represents possible storylines for given video(CVPR 2009). Using this model, instead of inferring only concepts we simultaneously infer concepts and causal links between the concepts. I have also looked in contextual models which are based on language itself (ECCV2008). It has been argued that language is an instrument of thinking and therefore can represent all the relationships required for visual reasoning. In this work, we represented relationships between nouns using prepositions and comparative adjectives.
Exploring synergy between language and vision for learning semantics: What role does language play in helping us learn semantics ? I am interested in exploring how declarative information and other linguistic information can be harnessed to efficiently learn the semantic models. I am also interesting in exploring how we can obtain such linguistic information.
Computer Vision: Activity Analysis, Object Recognition, Motion and Tracking, Camera Networks, Combining different visual cues for effective recognition.
Downloads Available: Dataset
Objects in Action: An Approach for Combining Action Understanding and Object Perception
Abhinav Gupta and Larry S. Davis
In CVPR 2007
Downloads Available: Dataset