Mining Large Time-evolving Data Using Matrix and Tensor Tools
SIGMOD 20007 tutorial, Beijing, China

Christos Faloutsos, CMU
Tamara G. Kolda, Sandia National Labs
Jimeng Sun, CMU


How can we find patterns in sensor streams (eg., a sequence of temperatures, water-pollutant measurements, or machine room measurements)? How can we mine Internet traffic graph over time? Further, how can we make the process incremental? We review the state of the art in four related fields: (a) numerical analysis and linear algebra (b) multi-linear/tensor analysis (c) graph mining and (d) stream mining. We will present both theoretical results and algorithms as well as case studies on several real applications. Our emphasis is on the intuition behind each method, and on guidelines for the practitioner.


Part1, Part2, Part3, Part4 (in pdf)



Researchers who want to get up to speed with the major tools in stream mining, graph mining. Also, practitioners who want a concise, intuitive overview of the state of the art.


This work was partially supported by NSF Grants IIS-0326322, IIS-0534205, by PITA (Pennsylvania Infrastructure Technology Alliance), and by DOE (LLNL) contract No.W-7405-ENG-48.


SDM 2007 (3 hour version)


christos photo Christos Faloutsos is a Professor at Carnegie Mellon University. He has received the Presidential Young Investigator Award by the National Science Foundation (1989), seven ``best paper'' awards, and several teaching awards. He has served as a member of the executive committee of SIGKDD; he has published over 140 refereed articles, one monograph, and holds five patents. His research interests include data mining for streams and networks, fractals, indexing for multimedia and bio-informatics data bases, and performance.
Tamara G.Kolda - photo Tamara G. Kolda is a researcher at Sandia National Laboratories in Livermore, California and has received the Presidential Early Career Award for Scientists and Engineers (2003). She has published over 25 refereed articles and released several software packages including the MATLAB Tensor Toolbox. She is an associate editor for the SIAM Journal on Scientific Computing. Her research interests include multilinear algebra and tensor decompositions, data mining, optimization, nonlinear solvers, graph algorithms, parallel computing and the design of scientific software.
Jimeng Sun - photo
Jimeng Sun is a PhD candidate in Computer Science Department at Carnegie Mellon University. His research interests include data mining on streams, graphs and tensors, anomaly detection.

Last updated: June 13, 2007