“…A total of 28,676 genes were defined as high-confidence genes, with 14,991, 20,878, 15,276, 5,924, 29,095, 20,325, and 28,310 annotated genes in the KOG, PFAM, GO, KEGG, Nr, SwissProt and TrEMBL databases, respectively ( Table 1 ) . The NGD also houses the sequences of 150,589 unique transcript isoforms based on RNA-seq and PacBio SMRT methods from our previous studies 14 , 17 . It also contains the sequences of 1,517 lotus transcription factors (TFs), which are classified into 56 TF (sub)families.…”