Abstract. Based on state of the art machine learning techniques, GRO-BID (GeneRation Of BIbliographic Data) performs reliable bibliographic data extractions from scholar articles combined with multi-level term extractions. These two types of extraction present synergies and correspond to complementary descriptions of an article. This tool is viewed as a component for enhancing the existing and the future large repositories of technical and scientific publications. ObjectivesThe purpose of this demonstration is to show to the digital library community a practical example of the accuracy of current state of the art machine learning techniques applied to information extraction in scholarship articles. The demonstration is based on the web application at the following addresse: http://grobid.no-ip.org. Bibliographical Data ExtractionAfter the selection of a PDF document, GROBID extracts the bibliographical data corresponding to the header information (title, authors, abstract, etc.) and to each reference (title, authors, journal title, issue, number, etc.). The references are associated to their respective citation contexts. The result of the citation extraction can be exported as a whole or per reference following different formats (BibTeX and TEI) and as COInS 1 . The automatic extraction of bibliographical data is a challenging task because of the high variability of the bibliographical formats and presentations. We have applied Conditional Random Fields to this task following the approach of [1] implemented with the Mallet toolkit [2], based on approx. 1000 training examples for header information, and 1200 training examples for cited references. An evaluation with the reference CORA dataset showed a reliable level of accuracy of 98,6% per header field and 74.9% per complete header instance, 95,7% per citation field and 78.9% per citation instance.
The motion of a thin viscous layer of fluid on a horizontal solid surface bounded laterally by a dry spot and a vertical solid wall is considered. A lubrication model with contact line motion is studied. We find that for a container of fixed length the axisymmetric equilibrium solutions with small dry spots are unstable to axisymmetric disturbances. As the size of the dry spot increases, the equilibrium solutions become unstable to nonaxisymmetric disturbances. In addition, we present numerical solutions of the nonlinear evolution equations in the axisymmetric and nonaxisymmetric cases for different values of the parameters. The axisymmetric results show good agreement with existing experimental results.
A thin layer of liquid advancing over a dry, heated, inclined plate is studied. A lubrication model with contact line motion is derived. The plate is at constant temperature, and the surface Biot number is specified. The steady-state solution is obtained numerically. In addition, the steady-state solution is studied analytically in the neighbourhood of the contact line. A linear stability analysis about the steady state is then performed. The effects of gravity, thermocapillarity and contact line motion are discussed. In particular, we determine a band of unstable wavenumbers, and the maximum growth rate as a function of these parameters.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.