Determiner-Established Deixis to Communicative Artifacts in Pedagogical Text

Wilson, Shomir; Oberlander, Jon

doi:10.3115/v1/p14-2067

Cited by 2 publications

(2 citation statements)

References 12 publications

Section: Related Work (mentioning)

confidence: 99%

“…Deixis present in a text can also be considered metadata and detection of deixis helps in structuring the flow of information. Wilson and Oberlander (2014) attempted to capture word senses related to deixis. Topic classification is the problem of segregating a document into different topics, and argumentative zoning (Teufel et al, 1999) was an early effort that shares some goals with the present work, as it addressed the detection of the main thematic areas in research articles.…”

Supervised and Unsupervised Methods for Robust Separation of Section Titles and Prose Text in Web Documents

Gopinath, Wilson, Sadeh

2018

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing

Self Cite

The text in many web documents is organized into a hierarchy of section titles and corresponding prose content, a structure which provides potentially exploitable information on discourse structure and topicality. However, this organization is generally discarded during text collection, and collecting it is not straightforward: the same visual organization can be implemented in a myriad of different ways in the underlying HTML. To remedy this, we present a flexible system for automatically extracting the hierarchical section titles and prose organization of web documents irrespective of differences in HTML representation. This system uses features from syntax, semantics, discourse and markup to build two models which classify HTML text into section titles and prose text. When tested on three different domains of web text, our domain-independent system achieves an overall precision of 0.82 and a recall of 0.98. The domain-dependent variation produces very high precision (0.99) at the expense of recall (0.75). These results exhibit a robust level of accuracy suitable for enhancing question answering, information extraction, and summarization.

Text mining in education

Mello, Andre, Pinheiro, et al.

2019

WIREs Data Min & Knowl

The explosive growth of online education environments is generating a massive volume of data, especially in text format, from forums, chats, social networks, assessments, and essays, among others. This raises exciting challenges about how to mine text data in order to find useful knowledge for educational stakeholders. Despite the increasing number of educational applications of text mining published recently, we have not found any paper surveying them. Accordingly, this work presents a systematic overview of the current status of the Educational Text Mining field. Our final goal is to answer three main research questions: Which text mining techniques are most used in educational environments? Which educational resources are most used? And which are the main applications or educational goals? Finally, we outline the conclusions and the most interesting future trends. This article is categorized under: Application Areas > Education and Learning; Ensemble Methods > Text Mining
