20Soybean (Glycine max [L.] Merr.) is a major crop in animal feed and human nutrition, 21 mainly for its rich protein and oil contents. The remarkable rise in soybean transcriptome 22 studies over the past five years generated an enormous amount of RNA-seq data, 23 encompassing various tissues, developmental conditions, and genotypes. In this study, 24we have collected data from 1,298 publicly available soybean transcriptome samples, 25processed the raw sequencing reads, and mapped them to the soybean reference 26 genome in a systematic fashion. We found that 94% of the annotated genes 27(52,737/56,044) had detectable expression in at least one sample. Unsupervised 28 clustering revealed three major groups, comprising samples from aerial, underground, 29and seed/seed-related parts. We found 452 genes with uniform and constant expression 30 levels, supporting their roles as housekeeping genes. On the other hand, 1,349 genes 31 showed heavily biased expression patterns towards particular tissues. A transcript-level 32 analysis revealed that 95% (70,963/74,490) of the known transcripts overlap with those 33 reported here, whereas 3,256 assembled transcripts represent potentially novel splicing 34isoforms. The dataset compiled here constitute a new resource for the community, which 35can be downloaded or accessed through a user-friendly web interface at 36