Home--Research

Home
Research
Papers
Code
Data

CV
People
Links

Nonparametric Graphical Models via Kernel Embeddings

Probabilistic graphical models have become a key tool for representing structured dependencies between random variables in challenging tasks in social networks, natural language processing, computer vision, and beyond. Unfortunately, most successful applications of graphical models rely on situations where each random variable can take on only a relatively small number of values, or, in continuous domains, where the joint distributions are Gaussians. I developed a novel nonparametric representation for graphical models based on the concept of kernel embeddings of distributions. This new representation allows us to conduct learning and inference in highly non-Gaussian continuous settings, and in discrete settings where variables can take on a large number of assignments. Nonparametric graphical models have been applied to various learning problems, such as predicting the categories of papers in the ACM digital library based on abstracts and citations, cross-language document retrieval, estimating depth from a single image, classification and forecast for dynamical system models of video, speech and sensor time series. In these applications, this new method outperforms state-of-the-art techniques.

Modeling, Analyzing and Visualizing Networks

Much of the world's information has a relational structure and can be modelled mathematically as networks and graphs. Examples include biological networks, webgraphs and social networks. Many of these large and complex networks exhibit rich spatial and temporal phenomena. Traditional graph modeling, analysis and visualization algorithms are not able to capture this complex spatial and temporal behavior. We designed new modeling, analyzing and visualizing tools to better understand complex networks.

Learning via Hilbert Space Embedding of Distributions

What is interesting about a distribution is often its behavior when taking expectations. In this work, distributions are embedded into Hilbert spaces via the averages or mean maps, and then all subsequent operations on distributions are carried out in the Hilbert space. This allows us to compute distances between distributions in terms of distances between averages. We have developed a framework of learning based on this which includes density estimation, clustering, feature selection, two sample tests, independence tests, nonparametric sorting, and dimensional reduction. A large number of existing methods appear in this framework as special cases. Furthermore, this often leads to algorithms which are simpler and more effective than information theoretic methods in a broad range of applications.

Biomedical Signal Processing

Brain-computer interface (BCI) is a communication system that relies on the brain rather than the body for control and feedback. My research employs a novel type of features based explicitly on the neurophysiology of EEG signals for classification. Basically, EEG signals are considered as the outputs of a networked dynamical system. The nodes of this system consist of cortical patches, while the links correspond to neural fibers. A large and complex system like this often generates interesting collective dynamics, such as synchronization in the activities of the nodes, and they result in the change of EEG patterns measured on the scalp. These features from the collective dynamics of the system are employed for classification.