分布式决策树集成学习框架:Brushfire
Brushfire是一个框架用于决策树集成模型的分布式监督学习。</span> Brushfire 当前支持:
- binary and multi-class classifiers
- numeric features (discrete and continuous)
- categorical features (including those with very high cardinality)
- k-fold cross validation and random forests
- chi-squared test as a measure of split quality
- feature importance and brier scores
- Scalding/Hadoop as a distributed computing platform
将来打算支持
- regression trees
- CHAID-like multi-way splits
- error-based pruning
- many more ways to evaluate splits and trees
- Spark and single-node in-memory platforms
项目主页:http://www.open-open.com/lib/view/home/1416747994570