# Learning on tabular data

November 30, 2020 — June 21, 2021

approximation

Bayes

clustering

high d

linear algebra

networks

optimization

probabilistic algorithms

probability

sparser than thou

statistics

Learning for tabular data, i.e. the stuff you generally store in spreadsheets and relational databases.

Popular in many areas, notably recommender systems.

(Cheng et al. 2016): jrzaurin/pytorch-widedeep: A flexible package to combine tabular data with text and images using Wide and Deep models in Pytorch. pytorch-widedeep, deep learning for tabular data IV: Deep Learning vs LightGBM

Note that the author of that package advises using gradient boosting machines to get this job done.

## 1 References

Arik, and Pfister. 2019. “TabNet: Attentive Interpretable Tabular Learning.”

Cheng, Koc, Harmsen, et al. 2016. “Wide & Deep Learning for Recommender Systems.”

*arXiv:1606.07792 [Cs, Stat]*.
Gorishniy, Rubachev, Khrulkov, et al. 2023. “Revisiting Deep Learning Models for Tabular Data.”

Grinsztajn, Oyallon, and Varoquaux. 2022. “Why Do Tree-Based Models Still Outperform Deep Learning on Tabular Data?”

Montanari, and Weiner. 2023. “Compressing Tabular Data via Latent Variable Estimation.”

Richetti, Diakogianis, Bender, et al. 2023. “A Methods Guideline for Deep Learning for Tabular Data in Agriculture with a Case Study to Forecast Cereal Yield.”

*Computers and Electronics in Agriculture*.
Shwartz-Ziv, and Armon. 2021. “Tabular Data: Deep Learning Is Not All You Need.”

*arXiv:2106.03253 [Cs]*.