Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing 2015
DOI: 10.18653/v1/d15-1057
|View full text |Cite
|
Sign up to set email alerts
|

Extraction and generalisation of variables from scientific publications

Abstract: Scientific theories and models in Earth science typically involve changing variables and their complex interactions, including correlations, causal relations and chains of positive/negative feedback loops. Variables tend to be complex rather than atomic entities and expressed as noun phrases containing multiple modifiers, e.g. oxygen depletion in the upper 500 m of the ocean or timing and magnitude of surface temperature evolution in the Southern Hemisphere in deglacial proxy records. Text mining from Earth sc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2016
2016
2018
2018

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 9 publications
(7 citation statements)
references
References 36 publications
0
7
0
Order By: Relevance
“…In addition to methods such as the HBIN model (Sebastian et al. , 2017), a work such as an automatic extraction of variable terms from non-biomedical literatures (Marsi &Öztürk, 2015) is a good example of the initial steps toward this direction.…”
Section: Future Research Areasmentioning
confidence: 99%
See 1 more Smart Citation
“…In addition to methods such as the HBIN model (Sebastian et al. , 2017), a work such as an automatic extraction of variable terms from non-biomedical literatures (Marsi &Öztürk, 2015) is a good example of the initial steps toward this direction.…”
Section: Future Research Areasmentioning
confidence: 99%
“…DARPA's Big Mechanism initiative (Cohen, 2015) encourages a greater usage of LBD systems for uncovering complex scientific mechanisms from diverse literatures. In addition to methods such as the HBIN model (Sebastian et al, 2017), a work such as an automatic extraction of variable terms from non-biomedical literatures (Marsi &Öztürk, 2015) is a good example of the initial steps toward this direction.…”
Section: Encouraging Domain Independencementioning
confidence: 99%
“…Maximal sequences and page ranking have been combined to discover latent keyphrases within scientific articles (Ortiz et al, 2010). Noun phrases containing multiple modifiers have been extracted from earth science publications and generalized by matching tree patterns to the syntax trees of the sources texts (Marsi and Öztürk, 2015). Keyphrase boundary classification has been regarded as a multi-task learning problem using deep recurrent neural network (Augenstein and Søgaard, 2017).…”
Section: Introductionmentioning
confidence: 99%
“…The major challenges we face are the lack of resources like task specific annotated corpora, indexed literature databases covering the entire field, domain dependent NLP tools with good accuracies and knowledge resources (ontologies) as our target domain is almost unexplored. The resources and tools developed in biomedicine domain are not directly usable due to domain difference as shown in [24,25]. Therefore, to meet this goal, a constant effort is being employed to develop resources and tools.…”
Section: Introductionmentioning
confidence: 99%
“…In [25], authors describe an annotation scheme to annotate quantitative variables, their change events, correlations and causal relations among change events, and feedback loops from the abstracts and full-text journal papers collected from the nature publication. In [24], authors automatically identify and extract variables and their direction of changes using a tree pattern matching technique and generalise these variables by progressive pruning of syntax tree using tree transformation operations.…”
Section: Introductionmentioning
confidence: 99%