Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2019)
DOI: 10.18653/v1/n19-1408

Rethinking Complex Neural Network Architectures for Document Classification

Abstract: Neural network models for many NLP tasks have grown increasingly complex in recent years, making training and deployment more difficult. A number of recent papers have questioned the necessity of such architectures and found that well-executed, simpler models are quite effective. We show that this is also the case for document classification: in a large-scale reproducibility study of several recent neural models, we find that a simple BiLSTM architecture with appropriate regularization yields accuracy and F1 …
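
For context, the simple architecture the abstract refers to is, at a high level, a bidirectional LSTM over word embeddings followed by pooling and a linear classifier. Below is a minimal sketch assuming a PyTorch-style setup; the layer sizes, dropout placement, and max-pooling choice are illustrative assumptions rather than the authors' exact configuration, and the paper's "appropriate regularization" goes beyond the plain dropout shown here.

# Minimal sketch of a regularized BiLSTM document classifier (PyTorch).
# Hyperparameters and the pooling choice are illustrative assumptions.
import torch
import torch.nn as nn

class BiLSTMClassifier(nn.Module):
    def __init__(self, vocab_size, num_classes, embed_dim=300,
                 hidden_dim=256, dropout=0.5):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True,
                            bidirectional=True)
        self.dropout = nn.Dropout(dropout)  # simple regularization
        self.fc = nn.Linear(2 * hidden_dim, num_classes)

    def forward(self, token_ids):
        # token_ids: (batch, seq_len) integer word indices
        embedded = self.dropout(self.embedding(token_ids))
        outputs, _ = self.lstm(embedded)      # (batch, seq_len, 2 * hidden_dim)
        pooled, _ = outputs.max(dim=1)        # max-pool over time
        return self.fc(self.dropout(pooled))  # class logits

A forward pass on a padded batch of word indices returns one logit per class; cross-entropy loss and a standard optimizer complete the training loop.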

Cited by 85 publications (57 citation statements)
References 16 publications

“…For our experiments, we used the IMDB dataset (135,669 documents) [28], the Yelp-hotel dataset (34,961 documents) [29], the Yelp-rest dataset (178,239 documents) [29], and the Amazon dataset (83,159 documents) [29]. The IMDB dataset is a movie review dataset annotated with 10-scale polarities.…”
Section: Results | Citation type: mentioning | Confidence: 99%
“…In Table 4, Kim-CNN [8] is a sentence classification model that shows good performance despite using simple CNNs. Adhikari-logistic regression [28] and Adhikari-support vector machine [28] are text classification models based on logistic regression and a support vector machine, respectively, in which term frequency and inverse document frequency scores are used as features. HAN [32] extracts meaningful features by modeling the hierarchical structure of a document and classifies the document into predefined classes using two levels of attention mechanisms: word-level attention and sentence-level attention.…”
Section: Results | Citation type: mentioning | Confidence: 99%
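
As a concrete point of reference for the TF-IDF baselines described in the excerpt above, the following is a minimal sketch of such a pipeline using scikit-learn; the toy corpus and hyperparameters are illustrative assumptions, and swapping LogisticRegression for sklearn.svm.LinearSVC yields the support vector machine variant.

# Minimal sketch of a TF-IDF + logistic regression document classifier
# (scikit-learn). The toy corpus and settings are illustrative assumptions.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

docs = ["the movie was wonderful", "terrible plot and weak acting"]  # toy corpus
labels = [1, 0]                                                      # toy labels

model = make_pipeline(
    TfidfVectorizer(ngram_range=(1, 2), min_df=1),  # TF-IDF features
    LogisticRegression(max_iter=1000),              # linear classifier
)
model.fit(docs, labels)
print(model.predict(["a wonderful storyline"]))
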
“…By nature, a movie is hard to categorize cleanly, due to its length, complex storyline and plot turns, and the lack of evaluative criteria. Prior work in document classification (Yang et al., 2016; Liu et al., 2017; Adhikari et al., 2019; Johnson and Zhang, 2015) evaluated on datasets with small document sizes (Reuters, IMDB, Yelp, etc.). However, our documents are on average at least 65 times longer, which may be challenging for NN-based models to train on due to long sequences and the associated computational burden.…”
Section: Related Work | Citation type: mentioning | Confidence: 99%