Digital document analysis is one where software analysts review documents for assessing an appraisal theme. Digital document analysis can be utilized for obtaining available documents in order to extract relevant data. Most of the research work focuses on a semi-supervised based framework for better parsing performance and traditional statistical setting. However, an inappropriate selection during digital documents analysis may lead to entire process being falsified there by reducing the overall accuracy. To address this issue, in our work, a novel method called, Weighted Score Convolutional Network and Arc-factored Graph-based Dependency Parsing (WSCN-AGDP) is proposed. WSCN-AGDP is split into two sections. First section is concerned with the extraction of relevant features (i.e., words from sentences) by employing Stouffer’s Weighted Score-based Convolutional Neural Network model. In the second section, using the extracted features, Graph-based Dependency Parsing is performed by utilizing Spearman Correlated Arc-Factored model. Four indices were calculated namely, digital document parsing time, parsing overhead, false positive rate and precision are being used to quantitatively assess and rate the algorithms. Different document sizes acquired from Reuters-21578 dataset are considered. Experiments have been conducted to analyze the methods.
Digital document analysis is one where software analysts review documents for assessing an appraisal theme. Digital document analysis can be utilized for obtaining available documents in order to extract relevant data. Most of the research work focuses on a semi-supervised based framework for better parsing performance and traditional statistical setting. However, an inappropriate selection during digital documents analysis may lead to entire process being falsified there by reducing the overall accuracy. To address this issue, in our work, a novel method called, Weighted Score Convolutional Network and Arc-factored Graph-based Dependency Parsing (WSCN-AGDP) is proposed. WSCN-AGDP is split into two sections. First section is concerned with the extraction of relevant features (i.e., words from sentences) by employing Stouffer’s Weighted Score-based Convolutional Neural Network model. In the second section, using the extracted features, Graph-based Dependency Parsing is performed by utilizing Spearman Correlated Arc-Factored model. Four indices were calculated namely, digital document parsing time, parsing overhead, false positive rate and precision are being used to quantitatively assess and rate the algorithms. Different document sizes acquired from Reuters-21578 dataset are considered. Experiments have been conducted to analyze the methods.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.