Proceedings of the ACM Web Conference 2022 2022
DOI: 10.1145/3485447.3512140
|View full text |Cite
|
Sign up to set email alerts
|

What’s in an Index: Extracting Domain-specific Knowledge Graphs from Textbooks

Abstract: A typical index at the end of a textbook contains a manuallyprovided vocabulary of terms related to the content of the textbook. In this paper, we extend our previous work on extraction of knowledge models from digital textbooks. We are taking a more critical look at the content of a textbook index and present a mechanism for classifying index terms according to their domain specificity: a core domain concept, an in-domain concept, a concept from a related domain, and a concept from a foreign domain. We link t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(15 citation statements)
references
References 47 publications
0
11
0
Order By: Relevance
“…For example, the term cox's theorem 14 is a statistical theorem, it is not part of any of the ten textbooks, but it belongs to the target domain. Using Google Books, this term was found in three textbooks with particular topics: uncertainty theory 15 , statistical evidence measurement 16 , and universal artificial intelligence 17 . All three books mention the term, but it appears only in the index section of the last one, which illustrates the rarity of the term.…”
Section: Discussionmentioning
confidence: 99%
See 4 more Smart Citations
“…For example, the term cox's theorem 14 is a statistical theorem, it is not part of any of the ten textbooks, but it belongs to the target domain. Using Google Books, this term was found in three textbooks with particular topics: uncertainty theory 15 , statistical evidence measurement 16 , and universal artificial intelligence 17 . All three books mention the term, but it appears only in the index section of the last one, which illustrates the rarity of the term.…”
Section: Discussionmentioning
confidence: 99%
“…A set of 40 textbooks on university-level Statistics written in English, available in both PDF and EPUB versions, and containing Tables of Content and Index sections have been selected. Additionally, to validate the approach in other domains, five textbooks have been added to the test set for each of the following domains: computer science, history, and literature 9 . This experiment has focused on three different outcomes from the first three stages of the approach (the last stage is the construction of the textbook knowledge model itself).…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations