Enhancing social tagging with automated keywords from the Dewey Decimal Classification

Golub, Koraljka; Lykke, Marianne; Tudhope, Douglas

doi:10.1108/jd-05-2013-0056

Cited by 20 publications

(19 citation statements)

References 9 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…The work discussed here has evolved out of the document classification approach. Library classification schemes, such as the Dewey Decimal Classification (DDC) have underpinned efforts in automatic subject metadata generation for many years and continue to be employed (e.g., Thompson, Shafer, Vizine‐Goetz, ; Golub, Lykke, Tudhope, ). This article reveals the role of information extraction for purposes of semantic indexing, as opposed to automated subject classification, via ontology‐based approaches rather than traditional library classifications.…”

Section: Introductionmentioning

confidence: 99%

A knowledge‐based approach to Information Extraction for semantic interoperability in the archaeology domain

Vlachidis

Tudhope

2015

Asso for Info Science & Tech

Self Cite

View full text Add to dashboard Cite

The article presents a method for automatic semantic indexing of archaeological grey‐literature reports using empirical (rule‐based) Information Extraction techniques in combination with domain‐specific knowledge organization systems. The semantic annotation system (OPTIMA) performs the tasks of Named Entity Recognition, Relation Extraction, Negation Detection, and Word‐Sense Disambiguation using hand‐crafted rules and terminological resources for associating contextual abstractions with classes of the standard ontology CIDOC Conceptual Reference Model (CRM) for cultural heritage and its archaeological extension, CRM‐EH. Relation Extraction (RE) performance benefits from a syntactic‐based definition of RE patterns derived from domain oriented corpus analysis. The evaluation also shows clear benefit in the use of assistive natural language processing (NLP) modules relating to Word‐Sense Disambiguation, Negation Detection, and Noun Phrase Validation, together with controlled thesaurus expansion. The semantic indexing results demonstrate the capacity of rule‐based Information Extraction techniques to deliver interoperable semantic abstractions (semantic annotations) with respect to the CIDOC CRM and archaeological thesauri. Major contributions include recognition of relevant entities using shallow parsing NLP techniques driven by a complimentary use of ontological and terminological domain resources and empirical derivation of context‐driven RE rules for the recognition of semantic relationships from phrases of unstructured text.

show abstract

Section: Introductionmentioning

confidence: 99%

A knowledge‐based approach to Information Extraction for semantic interoperability in the archaeology domain

Vlachidis

Tudhope

2015

Asso for Info Science & Tech

Self Cite

View full text Add to dashboard Cite

show abstract

“…A related study explored the use of Dewey Decimal Classification (DDC) with mappings to LCSH and showed that the KOS helped find focus for tagging, strengthened consistency and led to increase of access points in retrieval; 36% of additional resources could be found using end-user tags, and a bit more so when using index terms derived from the DDC, compared to the original manual indexing (Golub, Lykke, & Tudhope, 2014). Also, three times as many search terms were found in end-user index terms as in manually assigned controlled terms.…”

Section: End User Index Termsmentioning

confidence: 99%

Potential and Challenges of Subject Access in Libraries Today on the Example of Swedish Libraries

Golub

2016

International Information & Library Review

Self Cite

View full text Add to dashboard Cite

“…Literature has suggested provision of an existing subject indexing or classification system from which the end users or authors could choose, in order to at least address the language‐control issue. For example, one study explored the use of Dewey Decimal Classification (DDC) with mappings to Library of Congress Subject Headings (LCSH) and showed that the DDC and LCSH helped the taggers find focus for tagging, strengthened consistency and led to increase of access points in retrieval. Also, three times as many search terms were found in end‐user index terms as in manually assigned controlled terms.…”

Section: Possible Solutions To Subject Searchingmentioning

confidence: 99%

Some Thoughts on Preserving Functions of Library Catalogs in Networked Environments

Golub¹

2016

Bull of the Asso Info Sci

Self Cite

View full text Add to dashboard Cite

EDITOR'S SUMMARY Classification and subject indexing systems have long been the mainstay of established information providers to deliver content precisely on topic. Logical semantic hierarchies and rich interconnections of related terms and synonyms enable accurate retrieval and browsing of similar resources and ideally should be available in online environments. But the cost of features may not be sustainable with massively growing resources. Efforts to merge databases and map disparate subject terminology require considerable human intervention. A possible solution combines controlled and uncontrolled terms from three sources: authoritative professional indexing, automated term suggestion and uncontrolled keywords proposed by authors or end users' social tags. Research is required to investigate the effectiveness, cost and applicability of combining controlled and uncontrolled terms for information retrieval.

show abstract

Enhancing social tagging with automated keywords from the Dewey Decimal Classification

Cited by 20 publications

References 9 publications

A knowledge‐based approach to Information Extraction for semantic interoperability in the archaeology domain

A knowledge‐based approach to Information Extraction for semantic interoperability in the archaeology domain

Potential and Challenges of Subject Access in Libraries Today on the Example of Swedish Libraries

Some Thoughts on Preserving Functions of Library Catalogs in Networked Environments

Contact Info

Product

Resources

About