“…Traditionally, the function and use of image‐related textual descriptions in academic publications (especially in the area of biomedicine) have been analyzed from the perspective of improving the efficiency and satisfaction of the image retrieval process (Divoli, Wooldridge, & Hearst, ); as an element of the figure in the hybrid (text and image) biomedical document retrieval process (Apostolova et al., ; Christiansen, Lee, & Chang, ; You et al., ); and for summarizing the image content (Agarwal & Yu, ; Bhatia, Lahiri, & Mitra, ; Neveol, Deserno, Darmoni, Guld, & Aronson, ; Yu & Lee, ). Caption extraction from PDF documents in the domains of chemistry, computer science, physics, and astronomy has been studied by Choudhury et al.…”