On June 11th, 2020, we launched the
Petuum ML
open source consortium that brings our research and development at Petuum Inc. and CMU Sailing Lab on Distributed ML (e.g.,
AutoDist,
AdaptDL),
Automated ML (e.g.,
Dragonfly,
ProBO),
and Composable ML (e.g.,
Texar,
Forte)
implemented across PyTorch and TensorFlow under a unified umbrella.

On December 25th, 2013, we made an initial
open-source release of Petuum,
a new framework for distributed machine learning with massive data, big
models, and a wide spectrum of algorithms. Updates on Petuum are released every
three months. The latest release (version 1.1) was made in July 2015.

Teaching:

I have been teaching Probabilistic Graphical Models
(10708), an advanced graduate course on the theory, algorithms, and applications of multivariate modeling, inference, and deep learning, at CMU since 2005. For all the past versions, please see here.

Video lectures of Probabilistic Graphical Models (10708):
2014,
2019,
2020.

I regularly teach
Graduate Machine Learning (10701), which is a
general Ph.D.-level introduction to ML for CMU students from all majors.

Sabbatical and Leave:

I was on sabbatical from 2018
to 2019 as the CEO and Chief Scientist of Petuum Inc. Currently I serve as the Executive Chairman of its Board.

I was on sabbatical from 2010
to 2011 as a visiting professor at the Department of Statistics, Stanford University.

I was also a visiting professor at Facebook during 2010-2011, working on a variety of projects on social media.

Talks and Tutorials:

A Blueprint of Standardized and Composable Machine Learning,
[slides]
[video],
Institute for Advanced Study, Princeton, 2020.

Compositionality in Machine Learning,
[slides]
[video],
Open Data Science Conference (ODSC) West 2019.

A Civil Engineering Perspective on Artificial Intelligence From Petuum
[slides],
Distinguished Lectures in Computational Innovation, Columbia University, 2018.

A Statistical Machine Learning Perspective of Deep Learning: Algorithm, Theory, and Scalable Computing
[slides],
tutorial at the International Summer School on Deep Learning, Genova, Italy, 2018.

Standardized Tests as benchmarks for Artificial Intelligence
[slides],
tutorial at EMNLP, Melbourne, Australia, 2018.

PetuumMed: algorithms and system for EHR-based medical decision support
[slides], MIT, 2018.

System and Algorithm Co-Design, Theory and Practice, for Distributed Machine Learning
[slides],
[video],
at the Simons Institute for the Theory of Computing, Berkeley, 2017.

Strategies & Principles for Distributed Machine Learning
[slides],
[video],
Allen Institute for AI, 2016.

The Machine Learning Behind Reading and Comprehension
[slides],
Summit of Language and AI, China, 2016.

A New Look at the System, Algorithm and Theory Foundations of Distributed Machine Learning
[slides],
tutorial with Dr. Qirong Ho at the
21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2015).

Big ML Software for Modern ML Algorithms
[slides],
tutorial with Dr. Qirong Ho at the
2014 IEEE International Conference on Big Data (IEEE BigData 2014).

Topic Models, Latent Space Models, Sparse Coding, and All That: A systematic understanding of probabilistic semantic extraction in large corpus
[slides], tutorial at the
50th Annual Meeting of the Association for Computational Linguistics (ACL 2012).

Modern Statistical Methods for Genetic Association Study: Structured
Genome-Transcriptome-Phenome Association Analysis
[slides],
tutorial with Dr. Seyoung Kim, at the
Nineteenth International
Conference on Intelligent Systems for Molecular Biology
(ISMB 2011).

Services:

I am a member of the DARPA Information Science and Technology (ISAT) Advisory Group.

I also serve on the NIH BioData Management and Analysis (BDMA) Study Section.