Learning on tabular data
November 30, 2020 — June 21, 2021
approximation
Bayes
clustering
high d
linear algebra
networks
optimization
probabilistic algorithms
probability
sparser than thou
statistics
Learning for tabular data, i.e. the stuff you generally store in spreadsheets and relational databases.
Popular in many areas, notably recommender systems.
(Cheng et al. 2016): jrzaurin/pytorch-widedeep: A flexible package to combine tabular data with text and images using Wide and Deep models in Pytorch. pytorch-widedeep, deep learning for tabular data IV: Deep Learning vs LightGBM
Note that the author of that package advises using gradient boosting machines to get this job done.
1 References
Arik, and Pfister. 2019. “TabNet: Attentive Interpretable Tabular Learning.”
Cheng, Koc, Harmsen, et al. 2016. “Wide & Deep Learning for Recommender Systems.” arXiv:1606.07792 [Cs, Stat].
Gorishniy, Rubachev, Khrulkov, et al. 2023. “Revisiting Deep Learning Models for Tabular Data.”
Grinsztajn, Oyallon, and Varoquaux. 2022. “Why Do Tree-Based Models Still Outperform Deep Learning on Tabular Data?”
Montanari, and Weiner. 2023. “Compressing Tabular Data via Latent Variable Estimation.”
Richetti, Diakogianis, Bender, et al. 2023. “A Methods Guideline for Deep Learning for Tabular Data in Agriculture with a Case Study to Forecast Cereal Yield.” Computers and Electronics in Agriculture.
Shwartz-Ziv, and Armon. 2021. “Tabular Data: Deep Learning Is Not All You Need.” arXiv:2106.03253 [Cs].