“…Their characteristics, including number of rows (n), columns (m), nonzeros (nnz), and mean row/column length (μ r /μ c ), are detailed in [40] database, which includes a large set of chemical compounds automatically extracted from text, images, and attachments of patent documents. SC-5M, SC-1M, SC-500K, and SC-100K are random subsets of 5E+6, 1E+6, 5E+5, and 1E+5 compounds, respectively, from the SC-11.5M dataset.…”