2009
DOI: 10.1186/1471-2105-10-14
|View full text |Cite
|
Sign up to set email alerts
|

MBA: a literature mining system for extracting biomedical abbreviations

Abstract: Background: The exploding growth of the biomedical literature presents many challenges for biological researchers. One such challenge is from the use of a great deal of abbreviations. Extracting abbreviations and their definitions accurately is very helpful to biologists and also facilitates biomedical text analysis. Existing approaches fall into four broad categories: rule based, machine learning based, text alignment based and statistically based. State of the art methods either focus exclusively on acronym-… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2009
2009
2018
2018

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 20 publications
(18 citation statements)
references
References 14 publications
0
18
0
Order By: Relevance
“…Nevertheless, they can complement acronym-type ABRs recognition techniques by further processing the remaining "candidate" ABRs, as e.g. in the MDA system [17].…”
Section: Related Workmentioning
confidence: 99%
“…Nevertheless, they can complement acronym-type ABRs recognition techniques by further processing the remaining "candidate" ABRs, as e.g. in the MDA system [17].…”
Section: Related Workmentioning
confidence: 99%
“…The second type of post-processing in our hybrid approach is the abbreviation resolution (Sohn et al, 2008;Xu et al, 2009). Abbreviations are widely and commonly used in biomedical text (Sohn et al, 2008).…”
Section: Abbreviation Identification Algorithmmentioning
confidence: 99%
“…For example, BANNER left the abbreviation resolution as its future work (Leaman and Gonzalez, 2008). Various approaches have been proposed to automatically identify the abbreviation and its definition in a biomedical abstracts (Xu et al, 2009;Schwartz and Hearst, 2003;Park and Byrd 2001). We introduce a hybrid abbreviation identification algorithm by combining the features of two existing algorithms (Schwartz and Hearst, 2003;Park and Byrd 2001).…”
Section: Abbreviation Identification Algorithmmentioning
confidence: 99%
“…Also see Wren et al for a review of four methods [67]: ARGH, the Stanford Biomedical Abbreviation Server, AcroMed and SaRAD [67,68,213–216]. A recent paper by Xu et al describes MBA, a system that achieves similarly high performance [69]. It specializes in identifying nonacronym abbreviations such as ‘Fas’ used as an abbreviation for the gene ‘ CD95 ’, using a statistical method in which they count the number of articles that contain both the candidate definition and the abbreviation and then use this in scoring each candidate definition/abbreviation pair.…”
Section: Identification Of Information Within the Documents: Informatmentioning
confidence: 99%