2016
DOI: 10.2196/medinform.5748
|View full text |Cite
|
Sign up to set email alerts
|

Consumers’ Use of UMLS Concepts on Social Media: Diabetes-Related Textual Data Analysis in Blog and Social Q&A Sites

Abstract: BackgroundThe widely known terminology gap between health professionals and health consumers hinders effective information seeking for consumers.ObjectiveThe aim of this study was to better understand consumers’ usage of medical concepts by evaluating the coverage of concepts and semantic types of the Unified Medical Language System (UMLS) on diabetes-related postings in 2 types of social media: blogs and social question and answer (Q&A).MethodsWe collected 2 types of social media data: (1) a total of 3711 blo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
33
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
3
3
1

Relationship

2
5

Authors

Journals

citations
Cited by 33 publications
(33 citation statements)
references
References 34 publications
(44 reference statements)
0
33
0
Order By: Relevance
“…Answers , we collected a total of 58,422 questions and their corresponding best answers (one for each question) in the diabetes category, and 81,433 questions and answers in the cancer category. Park et al [23] recently assessed the UMLS terminology coverage in the same dataset but did not find a significant difference between the questions and the answers with respect to the frequently-used terms and semantic types. Therefore, we combined each question with its corresponding best answer as one document in the corpus.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…Answers , we collected a total of 58,422 questions and their corresponding best answers (one for each question) in the diabetes category, and 81,433 questions and answers in the cancer category. Park et al [23] recently assessed the UMLS terminology coverage in the same dataset but did not find a significant difference between the questions and the answers with respect to the frequently-used terms and semantic types. Therefore, we combined each question with its corresponding best answer as one document in the corpus.…”
Section: Methodsmentioning
confidence: 99%
“…According to a recent work of ours [23], we used the 12 most frequent semantic types to construct 12 semantic context sub-features that can cover more than 80% of the UMLS terms in our dataset: “Amino Acid, Peptide, or Protein,” “Body Part, Organ, or Organ Component,” “Disease or Syndrome,” “Finding,” “Medical Device,” “Organic Chemical,” “Pharmacologic Substance,” “Sign or Symptom,” “Therapeutic or Preventive Procedure.” “Finding,” “Pharmacologic Substance,” and “Disease or Syndrome.” The method for calculating these sub-features is similar to that for the feature POS context features. We give a concrete example of the semantic context feature in Table S2 in the Supplementary Material.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…In our previous work [18], we employed a well-known technique called n-gram to extract terms from a word sentence for UMLS concept matching. Specifically, after sentence detection, tokenization, and POS tagging, we applied a fuzzy matching method to match an n-gram to a UMLS term.…”
Section: Methodsmentioning
confidence: 99%
“…Patients could contribute computable phenotype data themselves; however, the ‘terminology gap’ between medical professionals and patients is a hindrance. The terminology gap has also limited patient participation both in research studies and in clinical phenotyping 9 . Current patient vocabularies (such as the Consumer Health Vocabulary Initiative; see URLs) provide broad consumer equivalents for clinical findings, medical procedures and equipment, but are not well integrated with research terminologies; thus, they provide neither the structure nor the coverage required for translational research and diagnostic tools.…”
mentioning
confidence: 99%