Paper ID: 2406.15500

Hidden Variables unseen by Random Forests

Ricardo Blum, Munir Hiabu, Enno Mammen, Joseph Theo Meyer

Random Forests are widely claimed to capture interactions well. However, some simple examples suggest that they perform poorly in the presence of certain pure interactions, which the conventional CART criterion struggles to capture during tree construction. We argue that simple alternative partitioning schemes used in the tree-growing procedure can improve the identification of these interactions. In a simulation study, we compare these variants to conventional Random Forests and Extremely Randomized Trees. Our results confirm that the modifications considered improve the model's fitting ability in scenarios where pure interactions play a crucial role.
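
The difficulty with pure interactions can be illustrated with a minimal sketch. It is not taken from the paper's simulation study. It assumes a hypothetical regression function Y = sign(X1) * sign(X2) with independent uniform covariates, and a hypothetical helper `best_split_gain` that computes the largest reduction in mean squared error achievable by one axis-aligned split, which is the quantity the CART criterion maximizes. At the root, neither variable offers a noticeable gain, so a greedy split has nothing to guide it. Once the data are split on X1 at zero, a split on X2 explains almost all of the remaining variance.

```python
import numpy as np


def best_split_gain(x, y):
    """Largest reduction in mean squared error from one split on x.

    Hypothetical helper for illustration: it scans every split point
    between consecutive sorted values of x, as the CART criterion does.
    """
    order = np.argsort(x)
    y_sorted = y[order]
    n = y_sorted.size
    left_sum = np.cumsum(y_sorted)[:-1]   # sum of y in the left child
    n_left = np.arange(1, n)              # size of the left child
    n_right = n - n_left                  # size of the right child
    total = y_sorted.sum()
    # SSE(parent) - SSE(left) - SSE(right), written with child sums
    gain = left_sum**2 / n_left + (total - left_sum)**2 / n_right - total**2 / n
    return gain.max() / n


rng = np.random.default_rng(0)
n = 4000
X = rng.uniform(-1.0, 1.0, size=(n, 2))
# Assumed pure interaction: no main effect of X1 or X2 on its own
y = np.sign(X[:, 0]) * np.sign(X[:, 1])

root_gain_x1 = best_split_gain(X[:, 0], y)
root_gain_x2 = best_split_gain(X[:, 1], y)

# Condition on X1 < 0, as if a first split at zero had been made
left = X[:, 0] < 0
child_gain_x2 = best_split_gain(X[left, 1], y[left])

print("best gain at root, split on X1:", root_gain_x1)
print("best gain at root, split on X2:", root_gain_x2)
print("best gain in child X1 < 0, split on X2:", child_gain_x2)
```

The response has variance close to 1, so the gains can be read as the fraction of variance explained by a single split. The two root gains are pure sampling noise, while the gain inside the child node is close to 1.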

Submitted: Jun 19, 2024