On the Trade-off between the Number of Nodes and the Number of Trees in a Random Forest
CoRR(2023)
摘要
In this paper, we focus on the prediction phase of a random forest and study
the problem of representing a bag of decision trees using a smaller bag of
decision trees, where we only consider binary decision problems on the binary
domain and simple decision trees in which an internal node is limited to
querying the Boolean value of a single variable. As a main result, we show that
the majority function of n variables can be represented by a bag of T (<
n) decision trees each with polynomial size if n-T is a constant, where n
and T must be odd (in order to avoid the tie break). We also show that a bag
of n decision trees can be represented by a bag of T decision trees each
with polynomial size if n-T is a constant and a small classification error is
allowed. A related result on the k-out-of-n functions is presented too.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要