2022
DOI: 10.1109/access.2022.3221138
Transformers Meet Small Datasets

Abstract: The research and application areas of transformers have been extensively enlarged due to the success of vision transformers (ViTs). However, due to their lack of local content acquisition capabilities, pure transformer architectures cannot be trained directly on small datasets. In this work, we first propose a new hybrid model combining the transformer and a convolutional neural network (CNN). The proposed model improves classification performance on small datasets. This is accomplished by introducing more co…

Cited by 14 publications (2 citation statements)
References 38 publications
“…After the last T-Block, which focuses more on spectral attention, we further enhance the spatial features using the spatial-spectral domain learning (SDL) module [37], whose output is the desired deblurred multispectral image $\tilde{Y} = f_B^{-1}(Y)$. It is quite interesting to notice that numerous recent articles have proposed and successfully demonstrated the training of the Transformer with just small data [38]–[41]. The CODE addresses the challenge of small-data learning using a completely different philosophy.…”
Section: CODE-Based Small-Data Learning Theory
confidence: 99%
“…The CODE addresses the challenge of small-data learning using a completely different philosophy. Simply speaking, typical techniques [38]–[41] have to force the deep network to return a good deep solution (as the final solution), while CODE simply accepts the weak DE solution. CODE assumes that although the small scale of the data results in such a weak solution, the solution itself still contains useful information.…”
Section: CODE-Based Small-Data Learning Theory
confidence: 99%
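The philosophy quoted above, accepting a weak deep-network estimate and then extracting the useful information it carries rather than forcing the network itself to produce the final answer, can be sketched in a minimal way. The sketch below is an illustration under our own assumptions, not the actual CODE algorithm: the weak estimate `q` (standing in for a deep-network output trained on small data) is kept as a prior that anchors a simple regularized least-squares refinement.

```python
import numpy as np

# Illustrative sketch only (NOT the CODE algorithm): a "weak" estimate q is
# not discarded; it regularizes a data-fit problem
#     minimize_x  ||A x - y||^2 + lam * ||x - q||^2
# whose closed-form solution is  x = (A^T A + lam I)^{-1} (A^T y + lam q).

def refine_with_weak_prior(A, y, q, lam=0.1):
    """Refine a weak estimate q by anchoring a least-squares fit to it."""
    n = A.shape[1]
    return np.linalg.solve(A.T @ A + lam * np.eye(n), A.T @ y + lam * q)

rng = np.random.default_rng(0)
A = rng.standard_normal((20, 5))          # hypothetical measurement operator
x_true = rng.standard_normal(5)
y = A @ x_true                            # observed data
q = x_true + 0.5 * rng.standard_normal(5) # noisy "weak" solution
x = refine_with_weak_prior(A, y, q)
```

Because the data term dominates for small `lam`, the refined `x` lands closer to the ground truth than the weak estimate `q` alone, which is the point of keeping rather than discarding the weak solution.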