Tropical rainforests in Southeast Asia are enriched by multifarious biota dominated by Dipterocarpaceae. In this family,
Shorea robusta
is an ecologically sensitive and economically important timber species whose genomic diversity and phylogeny remain understudied due to lack of datasets on genetic resources. Smattering availability of molecular markers impedes population genetic studies indicating a necessity to develop genomic databases and species-specific markers in
S. robusta
. Accordingly, the present study focused on fostering de novo low-depth genome sequencing, identification of reliable microsatellites markers, and their validation in various populations of
S. robusta
in Uttarakhand Himalayas. With 69.88 million raw reads assembled into 1,97,489 contigs (read mapped to 93.2%) and a genome size of 357.11 Mb (29 × coverage), Illumina paired-end sequencing technology arranged a library of sequence data of ~ 10 gigabases (Gb). From 57,702 microsatellite repeats, a total of 35,049 simple sequence repeat (SSR) primer pairs were developed. Afterward, among randomly selected 60 primer pairs, 50 showed successful amplification and 24 were found as polymorphic. Out of which, nine polymorphic loci were further used for genetic analysis in 16 genotypes each from three different geographical locations of Uttarakhand (India). Prominently, the average number of alleles per locus (
N
a), observed heterozygosity (
H
o), expected heterozygosity (
H
e), and the polymorphism information content (PIC) were recorded as 2.44, 0.324, 0.277 and 0.252, respectively. The accessibility of sequence information and novel SSR markers potentially enriches the current knowledge of the genomic background for
S. robusta
and to be utilized in various genetic studies in species under tribe Shoreae.
Supplementary Information
The online version contains supplementary material available at 10.1007/s10142-023-00975-8.