2021
DOI: 10.48550/arxiv.2106.01342
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SAINT: Improved Neural Networks for Tabular Data via Row Attention and Contrastive Pre-Training

Gowthami Somepalli,
Micah Goldblum,
Avi Schwarzschild
et al.

Abstract: Tabular data underpins numerous high-impact applications of machine learning from fraud detection to genomics and healthcare. Classical approaches to solving tabular problems, such as gradient boosting and random forests, are widely used by practitioners. However, recent deep learning methods have achieved a degree of performance competitive with popular techniques. We devise a hybrid deep learning approach to solving tabular data problems. Our method, SAINT, performs attention over both rows and columns, and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
79
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 46 publications
(79 citation statements)
references
References 32 publications
0
79
0
Order By: Relevance
“…4.2.1) and transformers-based (Sec. 4.2.2) groups exhibit superior predictive performance compared to plain deep neural networks on various data sets (Gorishniy et al, 2021;Ke et al, 2018Ke et al, , 2019Somepalli et al, 2021). This underlines the importance of special-purpose architectures for tabular data.…”
Section: Summary and Trendsmentioning
confidence: 96%
See 4 more Smart Citations
“…4.2.1) and transformers-based (Sec. 4.2.2) groups exhibit superior predictive performance compared to plain deep neural networks on various data sets (Gorishniy et al, 2021;Ke et al, 2018Ke et al, , 2019Somepalli et al, 2021). This underlines the importance of special-purpose architectures for tabular data.…”
Section: Summary and Trendsmentioning
confidence: 96%
“…For heterogeneous tabular data, these techniques are often difficult to apply. However, some frameworks for learning with tabular data, such as VIME (Yoon et al, 2020) and SAINT (Somepalli et al, 2021), use data augmentation strategies in the embedding space.…”
Section: Missing or Complex Irregular Spatial Dependenciesmentioning
confidence: 99%
See 3 more Smart Citations