“…In geoscience, many types of geological data collections have accumulated over a long period because of the diversity of technical methods and research directions. In terms of composition, massive geological data repositories contain large amounts of both structured and unstructured data, especially textual data and geological map data (Ma et al., 2021; Qiu, Xie, & Wu, 2018; Qiu, Xie, Wu, & Li, 2018; Wu et al., 2017). Geological investigations have also produced a large number of geological reports; each report contains information on a particular geological area, such as its rocks, minerals, or hydrology, and the contents of these reports are typically saved in a variety of formats, including doc, pdf, jpg, tiff, and spatial data files (Qiu, Xie, Wu, & Li, 2019; Qiu, Xie, Wu, Tao, & Li, 2019; Wang et al., 2021, 2022).…”