2011
DOI: 10.1186/1758-2946-3-17

ChemicalTagger: A tool for semantic text-mining in chemistry

Abstract: Background: The primary method for scientific communication is in the form of published scientific articles and theses which use natural language combined with domain-specific terminology. As such, they contain free-flowing unstructured text. Given the usefulness of data extraction from unstructured literature, we aim to show how this can be achieved for the discipline of chemistry. The highly formulaic style of writing most chemists adopt makes their contributions well suited to high-throughput Natural Language Pr…

Cited by 146 publications
(152 citation statements)
References 15 publications
“…ChemEx employs ChemicalTagger [24], which uses machine learning approach called Maximum Entropy Markov Model Recogniser [25], to (i) recognize chemical names, reaction names, enzymes, and chemistry-related terms such as experimental action verbs or units and (ii) tag general English word classes, such as a noun or a verb, which will be used in the phase parser. ChemEx uses all information from ChemicalTagger.…”
Section: Methods; citation type: mentioning
confidence: 99%
“…ChemicalTagger [24] also parses and identifies a sentence. Phase parser receives tagged token stream and builds grammatical structure based on predefined grammars.…”
Section: Methods; citation type: mentioning
confidence: 99%
“…Return hits only if named entity is resolved to a structure A combination of rule-based chemical text and formal grammar parser has been developed known as ChemicalTAgger [91]. It is a freely available open-source Java-based software which uses both OSCAR and open NLP programs (Fig.…”
Section: Chemically Intelligent Text-mining Tools; citation type: mentioning
confidence: 99%
“…Databases usually contain many tables. All the tables can be linked by a common identifier such as a primary key within the database or through foreign key association [91]. 1.22 Databases Some of the most familiar terms used in databases are:…”
Section: Databases; citation type: mentioning
confidence: 99%
“…[15][16][17][18][19][20][21][22] There is, consequently, a pressing need for analogous virtual screening of inorganic materials syntheses to complement the growing volume of predicted and screened compounds. 23,24 Such synthesis screening approaches have indeed found recent success in organic chemistry, where a wealth of tabulated reaction data is available, [25][26][27][28][29][30][31][32][33][34][35] and synthesis parameter screening, driven by machine learning, has also been explored for the specific case of organically templated metal vanadium selenites. 20 These efforts have laid the groundwork for analogous large-scale inorganic synthesis screening.…”
Section: Introduction; citation type: mentioning
confidence: 99%