Additive Groves code: TreeExtra package

TreeExtra is a set of tools implementing the following algorithms:

Additive Groves is an ensemble of regression trees developed by Daria Sorokina, Rich Caruana and Mirek Riedewald.

Feature evaluation technique referred as "multiple counts" is developed by Art Munson and all of the above.

All code is written by Daria Sorokina unless stated otherwise. The code is available under BSD license and is free to use for any purpose.

Contact: Daria Sorokina (daria AT cs.cmu.edu)

Please e-mail me any comments, suggestions, bug reports or feature requests. I am interested in how my algorithm is doing: if you have successfully (or unsuccessfully) applied Additive Groves to your data, I'd be happy to hear about your experience with it.

Updates Manuals Downloads Research papers

Updates

Manuals

Downloads

TreeExtra 2.0

Additional code

History of TreeExtra versions

Research papers

Daria Sorokina, Rich Caruana, Mirek Riedewald, Wes Hochachka, Steve Kelling.
Detecting and Interpreting Variable Interactions in Observational Ornithology Data.
To appear in the International Workshop on Domain Driven Data Mining (DDDM'09).

Daria Sorokina.
Application of Additive Groves Ensemble with Multiple Counts Feature Evaluation to KDD Cup'09 Small Data Set.
In proceedings of the KDD Cup 2009 workshop.

Daria Sorokina, Rich Caruana, Mirek Riedewald, Daniel Fink.
Detecting Statistical Interactions with Additive Groves of Trees.
In proceedings of the 25th International Conference on Machine Learning (ICML'08).
Video of ICML presentation
Slides (.ppt)

Daria Sorokina, Rich Caruana, Mirek Riedewald.
Additive Groves of Regression Trees.
In proceedings of the 18th European Conference on Machine Learning (ECML'07) (Best Student Paper award.)
Video of ECML presentation
Slides (.ppt)

R. Caruana, M. Elhawary, A. Munson, M. Riedewald, D. Sorokina, D. Fink, W. Hochachka, S. Kelling.
Mining Citizen Science Data to Predict Prevalence of Wild Bird Species. (Feature evaluation methods described here)
In proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'06).