Research code

Now that I'm in the end stages of the Ph.D., I'm finally getting organized and making some effort to post some of my research code. I've attempted to remove deadline-induced sloppiness and add documentation, but it has not gone through processes of code review or other niceties for production-level code.

If you do end up using any of this software, please add the appropriate citations (listed in each section). I welcome feedback, though may not be able to respond to all support and feature requests.

-Mary McGlohon, mmcgloho@cs.cmu.edu. 2/15/2010
Cascades code

Cascades, which are described in our SDM 2007 paper, are subgraphs (weakly connected components) of conversations, built from links. Since 2008 I have been building a package for processing them. I have cleaned up part of it and included that part here, in case someone wants to find isomorphic cascades and generate some cool-looking graphviz's from social media link data.

Citation
  • J. Leskovec, J, M. McGlohon, C. Faloutsos, N. Glance, and M. Hurst. Patterns of Cascading Behavior in Large Blog Graphs. Society of Industrial and Applied Mathematics- Data Mining. Minneapolis, Minn., April 2007. [pdf]

  • Topology code

    (Coming soon)

    Citation
  • M. McGlohon, L. Akoglu, and C. Faloutsos. Weighted Graphs and Disconnected Components: Patterns and a Generator. SIG-KDD Las Vegas, Nev., August 2008. [pdf]

  • SNARE

    (Coming soon)

    Citation
  • M. McGlohon, S. Bay, M. Anderle, D. Steier, and C. Faloutsos. SNARE: A Link Analytic System for Graph Labeling and Risk Detection SIG-KDD Paris, France. June 2009. [pdf]