On the Power of Decision Trees in Auto-Regressive Language Modeling

Originally proposed for handling time series data, Auto-regressive Decision Trees (ARDTs) have not yet been explored for language modeling. This paper delves into both the theoretical and practical applications of ARDTs in this new context. We theoretically demonstrate that ARDTs can compute complex...

Full description

Bibliographic Details
Main Authors: Gan, Yulu, Galanti, Tomer, Poggio, Tomaso, Malach, Eran
Format: Article
Published: Center for Brains, Minds and Machines (CBMM) 2024
Online Access:https://hdl.handle.net/1721.1/157074