Analyzing time series expression data: From individual gene expression to genetic regulatory networks.

I am working on analyzing time series gene expression and other high throughput biological data. Expression profiling allows biologists to investigate thousands of genes simultaneously. Of particular interest are time series expression datasets, which allow us to view not only a snapshot of the activity in the cell, but also the temporal relationships between different genes. We have developed a number of algorithms which assist in analyzing the results of these experiments. Our algorithms address issues ranging from low level analysis (such as missing values, alignment etc.) via pattern recognition (such as clustering) to high level analysis (combining different data sources and temporal genetic regulatory networks, such as the cell cycle network shown to the right). See the papers below for complete details.
Constructing the temporal cell cycle network using high throughput expression and location data. Science, 2002.

Related publications

  • I. Simon, Z. Siegfried, J. Ernst and Z. Bar-Joseph.
    Combined Static and Dynamic Analysis for Determining the Quality of Time-Series Expression Profiles
    Nature Biotechnology, 23(12), pp 1503-1508, 2005.
    Supporting website
    Supporting Methods
    Supporting Resurlts
  • Y. Qi, Z. Bar-Joseph and J. Klein-Seetharaman.
    Comprehensive comparison of approaches for predicting protein-protein interactions from multiple data sources
    Proteins: Structure, Function, and Bioinformatics,, in press, 2005
  • N. Kaminski and Z. Bar-Joseph
    A patient-gene model for temporal expression profiles in clinical studies
    Proceedings of The Tenth Annual International Conference on Research in Computational Molecular Biology (RECOMB), to appear, 2006
  • Y. Qi, J. Klein-Seetharaman and Z. Bar-Joseph
    A mixture of experts approach for protein-protein interaction prediction.
    Proceedings of NIPS workshop on Computational Biology and the Analysis of Heterogeneous Data 2005
  • R Singh, N. Palmer, D. Gifford, B. Berger and Z. Bar-Joseph
    Active Learning for Sampling in Time-Series Experiments With Application to Gene Expression Analysis.
    In Proceedings of the 22 nd International Conference on Machine Learning (ICML), to appear, 2005
  • Supporting website

  • J. Ernst, G. Nau, Z. Bar-Joseph
    Clustering Short Time Series Gene Expression Data.
    Bioinformatics (Proceedings of ISMB 2005), 21 Suppl 1, pp. I159-I168, 2005
  • Supporting website

  • Y. Qi, J. Klein-Seetharaman and Z. Bar-Joseph
    Random Forest Similarity for Protein-Protein Interaction Prediction from Multiple Sources
    Proceedings of the Pacific Symposium on Biocomputing (PSB), 10, 2005.
  • Z. Bar-Joseph, S. Farkash, D.K.Gifford, I Simon, R. Rosenfeld
    Deconvolving cell cycle expression data with complementary information
    Bioinformatics (Proceedings of ISMB 2004), 20 Suppl 1, pp. I23-I30, 2004
  • Supporting website

  • Z. Bar-Joseph
    Analyzing time series gene expression data
    Bioinformatics, 20(16), pp 2493-2503, 2004
  • Z. Bar-Joseph*, G. Gerber*, T. Lee*, N. Rinaldi, J. Yoo, F. Robert, B. Gordon, E. Fraenkel, T. Jaakkola, R. Young, and D. Gifford
    Computational discovery of gene modules and regulatory networks.
    Nature Biotechnology, 21(11) pp. 1337-42, 2003
    * Equal contributing author
  • Supporting website

  • Z. Bar-Joseph, G. Gerber, I. Simon, D. Gifford and T. Jaakkola.
    Comparing the continuous representation of time-series expression profiles to identify differentially expressed genes.
    Proceedings of the National Academy of Science (PNAS), 100(18) pp 10146-51
    Supporting website
    Appendix
  • Z. Bar-Joseph, G. Gerber, D. Gifford, T. Jaakkola and I. Simon.
    Continuous Representations of Time Series Gene Expression Data.
    Journal of Computational Biology, 10(3-4) pp 241-256
  • Ziv Bar-Joseph, Erik D. Demaine, David K. Gifford, Angèle M. Hamel, Tommy S. Jaakkola and Nathan Srebro.
    K-ary Clustering with Optimal Leaf Ordering for Gene Expression Data.
    Bioinformatics, 19(9) pp 1070-8.

  • T.I. Lee, N. J. Rinaldi, F. Robert, D. T. Odom, Z. Bar-Joseph, G. K. Gerber, ... D. K. Gifford and R. A. Young
    Transcriptional Regulatory Networks in Saccharomyces cerevisiae
    Science, 798, 2002 pp 799-804.
  • Supporting website

  • Ziv Bar-Joseph, Erik D. Demaine, David K. Gifford, Angèle M. Hamel, Tommi S. Jaakkola and Nathan Srebro.
    K-ary Clustering with Optimal Leaf Ordering for Gene Expression Data.
    Proceedings of the 2nd Workshop on Algorithms in Bioinformatics (WABI 2002) LNCS 2452, pp 506-520.
  • Z. Bar-Joseph, G. Gerber, D. Gifford, T. Jaakkola and I. Simon.
    A new approach to analyzing gene expression time series data.
    In Proceedings of The Sixth Annual International Conference on Research in Computational Molecular Biology (RECOMB), 2002, pp 39-48.
  • Z. Bar-Joseph, D. Gifford, and T. Jaakkola.
    Fast optimal leaf ordering for hierarchical clustering.
    Bioinformatics (Proceedings of ISMB 2001),, 17(S1), 2001, pp 22-29
  • Supporting website
    Software