Over the last 20 years, research groups focused on automating the process of extracting valuable information from Natural Language text in order to discover data and process models. In this context, several tools and approaches have been proposed. The overall objective of this survey is to examine existing literature works that transform textual specifications into visual models. This paper aims to give a comprehensive account of the existing tools meant to discover data and process models from natural language text. Our analysis focuses on approaches of these tools in the model extraction process and highlight issues of each proposed approach. In the case of object oriented software modelling of data models extraction we analyze the degree of automation, efficiency and completeness of the transformation process. Regarding process models extraction, the study is not limited only to business process discovery, but it also provides case studies from several fields such as medical or archaeological. Even if not all the tools developed are clearly depicting a Natural Language Processing technique, a review of each approach is presented.
In the Big Data era, the volume of streaming data generated from heterogeneous data sources such as social media, smart devices or sensor networks is overwhelming but enables human users and applications to better understand the world around us. Understanding the context, situation or the environment where the data has been produced is a challenging task due the dynamic and fast nature of the streaming data. Modelling the big data using RDF model and applying continuous SPARQL queries over it using Semantic Web tools helps create machine-understandable meanings and valuable insights of the underlying data. This paper provides an overview of the Big Data theoretical foundation, challenges and stages that need to be taken into consideration when building a Big Data pipeline architecture. Also, the research efforts that have been made in the past years using Semantic Stream Processing techniques are described.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.