Brushfire: A Distributed Learning Framework for Decision Tree Ensembles

jopen, 11 years ago

Brushfire is a framework for distributed supervised learning of decision tree ensemble models.

Brushfire currently supports:

  • binary and multi-class classifiers
  • numeric features (discrete and continuous)
  • categorical features (including those with very high cardinality)
  • k-fold cross validation and random forests
  • chi-squared test as a measure of split quality
  • feature importance and Brier scores
  • Scalding/Hadoop as a distributed computing platform
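
Two items in the list above, the chi-squared test for split quality and the Brier score, are easy to illustrate in isolation. The following is a minimal Python sketch of the general techniques, not Brushfire's own code or API: the function names `chi_squared` and `brier_score` and the dictionary-based inputs are assumptions made for this example. It computes the Pearson chi-squared statistic over the class counts on the two sides of a binary split, and the multi-class form of the Brier score.

```python
def chi_squared(left_counts, right_counts):
    """Pearson chi-squared statistic for a binary split.

    Each argument maps a class label to the number of training
    instances with that label on one side of the split
    (hypothetical input format, chosen for this sketch).
    """
    left_total = sum(left_counts.values())
    right_total = sum(right_counts.values())
    total = left_total + right_total
    # A split that sends everything to one side carries no information.
    if left_total == 0 or right_total == 0:
        return 0.0

    stat = 0.0
    for label in set(left_counts) | set(right_counts):
        left = left_counts.get(label, 0)
        right = right_counts.get(label, 0)
        label_total = left + right
        for observed, side_total in ((left, left_total), (right, right_total)):
            # Expected count if the label were independent of the side.
            expected = side_total * label_total / total
            if expected > 0:
                stat += (observed - expected) ** 2 / expected
    return stat


def brier_score(predicted, actual):
    """Multi-class Brier score, averaged over examples.

    `predicted` is a list of dicts mapping label -> probability;
    `actual` is the list of true labels. Lower is better.
    """
    total = 0.0
    for probs, label in zip(predicted, actual):
        classes = set(probs) | {label}
        total += sum(
            (probs.get(c, 0.0) - (1.0 if c == label else 0.0)) ** 2
            for c in classes
        )
    return total / len(predicted)
```

A larger chi-squared value means the class distribution differs more between the two sides, so a candidate split with a higher statistic separates the classes better. Note that the Brier score here sums the squared error over all classes; the common binary convention that scores only the positive-class probability gives half this value on two-class problems.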

Planned for the future:

  • regression trees
  • CHAID-like multi-way splits
  • error-based pruning
  • many more ways to evaluate splits and trees
  • Spark and single-node in-memory platforms

Project homepage: http://www.open-open.com/lib/view/home/1416747994570