self

Shoou-I Yu, 余守壹

Ph.D. Student
Language Technologies Institute
School of Computer Science
Carnegie Mellon University
Pittsburgh, PA 15213

Office: 5705 Gates-Hillman Complex
Email: iyu at cs dot cmu dot edu


Hi! I am Shoou-I Yu. I have completed the LTI Ph.D. program under the supervision of Dr. Alexander Hauptmann. My research focus is on Multimedia Event Detection and Multi-Object Tracking.
I have worked on the Multimedia Event Detection (MED) task in the TRECVID competition from 2011 to 2014.
I also work on multi-object tracking in multi-camera environments for surveillance scenarios.

Here is my CV.

News

2016/7/18: Joined Oculus Research @ Pittsburgh as a research scientist.

2016/5/13: Defended! My thesis is here. Slides are here.

2015/5/18 - 2015/8/7: Internship at Google Research Machine Perception group. Mentor: Paul Natsev, Balakrishnan Varadarajan.

2015/4/30: Successfully proposed (my thesis).

2015 Spring: A report from the Pittsburgh Supercomputing Center featuring our work.

2014/11/19: Presented our Multimedia Event Detection GPU work at the NVIDIA GPU Technology Theater @ Supercomputing 2014 (SC '14). [Recording]

2014/11/11: Presented our Multimedia Event Detection work at TRECVID 2014. Slides are here.

2014/11/5: Presented our Instructional Video work at ACM Multimedia 2014.

2014/9/10: Presented our Pose Estimation work at ECCV 2014.

2013/12/19: The Marauder's Map paper was elected as the 13 Incredible Tech Inventions You Won't Believe You Missed In 2013 by Huffington Post!

2013/02/25: The Marauder's Map Multi-Camera Multi-Object Tracking paper accepted at CVPR! [Project Page].

2012/12/07: We got 1st place in pre-specifed event detection task for TRECVID MED 2012! Details are here.

Education

Ph.D. in Language and Information Technologies, Carnegie Mellon University, Pittsburgh, PA (2012 ~ 2016)

M.S. in Language Technologies, Carnegie Mellon University, Pittsburgh, PA (2010 ~ 2012)

B.S. in Compute Science and Information Engineering, National Taiwan University, Taipei, Taiwan (2005 ~ 2009)

Publications

  1. The Solution Path Algorithm for Identity-Aware Multi-Object Tracking [pdf][code and data]
    Shoou-I Yu, Deyu Meng, Wangmeng Zuo, Alexander G. Hauptmann
    CVPR 2016.
  2. Strategies for Searching Video Content with Text Queries or Video Examples [pdf]
    Shoou-I Yu, Yi Yang, Zhongwen Xu, Shicheng Xu, Deyu Meng, Zexi Mao, Zhigang Ma, Ming Lin, Xuanchong Li, Huan Li, Zhenzhong Lan, Lu Jiang, Alexander G. Hauptmann, Chuang Gan, Xingzhong Du, Xiaojun Chang
    ITE Transactions on Media Technology and Applications 4.3 (2016): 227-238.
  3. Long-Term Identity-Aware Multi-Person Tracking for Surveillance Video Summarization [pdf]
    Shoou-I Yu, Yi Yang, Xuanchong Li, Alexander G. Hauptmann
    arXiv 1604.07468.
  4. Content-Based Video Search over 1 Million Videos with 1 Core in 1 Second [pdf]
    Shoou-I Yu, Lu Jiang, Zhongwen Xu, Yi Yang, Alexander G. Hauptmann
    ICMR 2015.
  5. Bridging the Ultimate Semantic Gap: A Semantic Search Engine for Internet Videos [pdf]
    Lu Jiang, Shoou-I Yu, Deyu Meng, Teruko Mitamura, Alexander G. Hauptmann
    ICMR 2015.
  6. Fast and Accurate Content-based Semantic Search in 100M Internet Videos. [pdf] [supplementary material] [project page]
    Lu Jiang, Shoou-I Yu, Deyu Meng, Yi Yang, Teruko Mitamura, Alexander G. Hauptmann
    ACM MM 2015.
  7. Informedia@TRECVID 2014 MED and MER [pdf] [slides]
    Shoou-I Yu, Lu Jiang, Zhongwen Xu, Zhenzhong Lan, Shicheng Xu, Xiaojun Chang, Xuanchong Li, Zexi Mao, Chuang Gan, Yajie Miao, Xingzhong Du, Yang Cai, Lara Martin, Nikolas Wolfe, Anurag Kumar, Huan Li, Ming Lin, Zhigang Ma, Yi Yang, Deyu Meng, Shiguang Shan, Pinar Duygulu Sahin, Susanne Burger, Florian Metze, Rita Singh, Bhiksha Raj, Teruko Mitamura, Richard Stern and Alexander Hauptmann
    TRECVID Video Retrieval Evaluation Workshop, NIST, Gaithersburg, MD, November 2014.
  8. Instructional Videos for Unsupervised Harvesting and Learning of Action Examples [pdf] [poster] [Project Page]
    Shoou-I Yu, Lu Jiang, Alexander Hauptmann.
    In ACM MM 2014.
  9. Unsupervised Video Adaptation for Parsing Human Motion [pdf] [Project Page]
    Haoquan Shen, Shoou-I Yu, Yi Yang, Deyu Meng, Alexander Hauptmann.
    In ECCV 2014.
  10. Zero-Example Event Search using MultiModal Pseudo Relevance Feedback [pdf]
    Lu Jiang, Teruko Mitamura, Shoou-I Yu, Alexander G. Hauptmann.
    In ICMR 2014.
  11. Self-paced Learning with Diversity [pdf] [Supplementary Material]
    Lu Jiang, Deyu Meng, Shoou-I Yu, Zhen-Zhong Lan, Shiguang Shan, Alexander Hauptmann.
    In NIPS 2014.
  12. Resource Constrained Multimedia Event Detection [pdf]
    Zhen-zhong Lan, Yi Yang, Nicolas Ballas, Shoou-I Yu, Alexander Hauptmann.
    In MMM'14, 20th Intl. Conf. on Multimedia Modeling 2014.
  13. Harry Potter's Marauder's Map: Localizing and Tracking Multiple Persons-of-Interest by Nonnegative Discretization [pdf] [Demo Page]
    Shoou-I Yu, Yi Yang, Alexander Hauptmann.
    In IEEE CVPR, 2013.
    2014 Nov. IEEE Signal Processing Magazine
  14. Informedia@TRECVID 2013 [pdf] [slides]
    Zhenzhong Lan, Lu Jiang, Shoou-I Yu, Chenqiang Gao, Shourabh Rawat, Yang Cai, Shicheng Xu, Haoquan Shen, Xuanchong Li, Yipei Wang, Waito Sze, Yan Yan, Zhigang Ma, Nicolas Ballas, Deyu Meng, Wei Tong, Yi Yang, Susanne Burger, Florian Metze, Rita Singh, Bhiksha Raj, Richard Stern, Teruko Mitamura, Eric Nyberg, and Alexander Hauptmann.
    TRECVID Video Retrieval Evaluation Workshop, NIST, Gaithersburg, MD, November 2013.
  15. E-LAMP: integration of innovative ideas for multimedia event detection [pdf]
    Wei Tong, Yi Yang, Lu Jiang, Shoou-I Yu, Lan Zhen-Zhong, Zhigang Ma, Waito Sze, Ehsan Younessian, Alexander Hauptmann.
    Journal of Machine Vision and Applications, 2013.
  16. Multimedia Classification and Event Detection using Double Fusion [pdf]
    Zhen-zhong Lan, Lei Bao, Shoou-I Yu, Wei Liu, Alexander Hauptmann.
    Journal of Multimedia Tools and Applications, 2013.
  17. Informedia E-Lamp @ TRECVID 2012, Multimedia Event Detection and Recounting [pdf]
    Shoou-I Yu, Zhongwen Xu, Duo Ding, Waito Sze, Francisco Vicente, Zhenzhong Lan, Yang Cai, Shourabh Rawat, Peter Schulam, Nisarga Markandaiah, Sohail Bahmani, Antonio Juarez, Wei Tong, Yi Yang, Susanne Burger, Florian Metze, Rita Singh, Bhiksha Raj, Richard Stern, Teruko Mitamura, Eric Nyberg and Alexander Hauptmann.
    TRECVID Video Retrieval Evaluation Workshop, NIST, Gaithersburg, MD, November 2012.
  18. Double Fusion for Multimedia Event Detection [pdf]
    Zhenzhong Lan, Lei Bao, Shoou-I Yu, Wei Liu, Alexander Hauptmann
    In MMM'12, 18th Intl. Conf. on Multimedia Modeling, 2012.
  19. Informedia @ TRECVID 2011, Multimedia Event Detection and Semantic Indexing [pdf]
    Lei Bao, Shoou-I Yu, Zhen-zhong Lan, Arnold Overwijk, Qin Jin, Brian Langner, Michael Garbus, Susanne Burger, Florian Metze, Alexander Hauptmann
    TRECVID Video Retrieval Evaluation Workshop, NIST, Gaithersburg, MD, December 2011.
  20. Informedia @ TRECVID 2010 [pdf]
    Huan Li, Lei Bao, Zan Gao, Arnold Overwijk, Wei Liu, Long-fei Zhang, Shoou-I Yu, Ming-yu Chen, Florian Metze and Alexander Hauptmann.
    TRECVID Video Retrieval Evaluation Workshop, NIST, Gaithersburg, MD, December 2010.
  21. A Content-Based Method to Enhance Tag Recommendation [pdf]
    Yu-Ta Lu, Shoou-I Yu, Tsung-Chieh Chang, Jane Yung-jen Hsu
    In IJCAI ‘09: Proceedings of the Twenty-first International Joint Conference on Artificial Intelligence, pages 2064 – 2069, 2009.
  22. Improved Factoring of RSA Modulus
    Jiun-Ming Chen, Shoou-I Yu, Yi Ou-Yang, Po-Han Wang, Chi-Hung Lin, Po-Yi Huang, Bo-Yin Yang, Chi-Sung Laih
    In Proceedings of the 25th Workshop on Combinatorial Mathematics and Computation Theory, Chung Hua University, Hsinchu Hsien, Taiwan, 2008

Others

Here is a link to my undergrad webpage.

Last Updated: 2016/8/7