Po-Yao (Bernie) Huang

Researcher, FAIR

Greetings! I am Po-Yao (Bernie) Huang. I am a research scientist at Facebook AI Research (FAIR) Labs. I obtained my Ph.D. degree from the Language Technologies Institute (LTI) of the School of Computer Science (SCS) at Carnegie Mellon University (CMU). My research interest is multimodal machine learning. I am particularly interested in bridging computer vision and natural language processing for the tasks of multimodal machine translation, cross-modal search and retrieval, and large-scale multimodal data mining and analysis.

Contact: berniebear_at_gmail.com
Google Scholar: https://scholar.google.com/citations?user=E8K25LIAAAAJ


Work Experience

Facebook

Senior Research Scientist (FAIR Labs) Aug 2022 - present
Research Scientist (FAIR Labs) Aug 2021 - Aug 2022
Research Intern May 2020 - May 2021

Microsoft

Research Intern (Microsoft Research) Jun 2017 - Aug 2017

MediaTek

Senior Software Engineer Jun 2012 - Jun 2014
Software Engineer Sep 2010 - May 2012

Education

Carnegie Mellon University

Ph.D. in Computer Science - Language and Information Technologies Aug 2016 - Jul 2021

GPA: 4.33/4.33

M.S. in Computer Science - Language Technologies Aug 2014 - Jul 2016

GPA: 4.21/4.33

National Taiwan University

M.S. in Computer Engineering Aug 2007 - Jul 2009

GPA: 4.00/4.00

B.S. in Electrical Engineering Sep 2003 - Jul 2007

GPA: 3.78/4.00


Preprints

  • MAViL: Masked Audio-Video Learners
    Po-Yao Huang, Vasu Sharma, Hu Xu, Chaitanya Ryali, Haoqi Fan, Yanghao Li, Shang-Wen Li, Gargi Ghosh, Jitendra Malik, Christoph Feichtenhofer
  • DINOv2: Learning Robust Visual Features without Supervision
    Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy Vo, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, Mahmoud Assran, Nicolas Ballas, Wojciech Galuba, Russell Howes, Po-Yao Huang, Shang-Wen Li, Ishan Misra, Michael Rabbat, Vasu Sharma, Gabriel Synnaeve, Hu Xu, Hervé Jegou, Julien Mairal, Patrick Labatut, Armand Joulin, Piotr Bojanowski

Publication

  • Masked Autoencoders that Listen
    Po-Yao Huang, Hu Xu, Juncheng Li, Alexei Baevski, Michael Auli, Wojciech Galuba, Florian Metze, Christoph Feichtenhofer
    NeurIPS, 2022.

Awards


Scholarship

  • ACL, ICMR, AAAI, ACM MM, CVPR travel awards/grants, 2015-2019
  • NSF travel awards, 2015-2019
  • CMU Research Fellowship, 2014-2019
  • Taiwan's Study Abroad Scholarship, 2016
  • Siebel Scholarship, 2016
  • Din-Jing Memorial Scholarship, 2009