Overcrowding in hospitals in Vietnam hampers the reception and treatment of patients. In particular, the intake and diagnosis procedures that route patients to the appropriate treatment departments take up considerable time. This study proposes text-based disease diagnosis from symptom descriptions, combining text processing techniques (such as Bag of Words, Term Frequency-Inverse Document Frequency, and Tokenizer) with classifiers (such as Random Forests (RF), Multi-Layer Perceptron (MLP), Embeddings, and Bidirectional Long Short-Term Memory (LSTM)). The results show that a deep Bidirectional LSTM reaches an AUC of 0.982 when classifying 10 diseases on 230,457 pre-diagnosis samples collected from hospitals in Vietnam and used for training and testing. The proposed approach is expected to provide a way to automate patient flow in hospitals and improve healthcare in the future.
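To illustrate the text processing step named above, the following minimal sketch computes Bag of Words counts and Term Frequency-Inverse Document Frequency weights in plain Python. The symptom notes, the function names, the whitespace tokenizer, and the tf × log(N/df) weighting variant are all assumptions chosen for illustration; they are not the study's data or its actual preprocessing.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def bag_of_words(docs):
    """Return one term-count Counter per document."""
    return [Counter(tokenize(d)) for d in docs]


def tf_idf(docs):
    """Return one {term: weight} dict per document, using tf * log(N / df)."""
    counts = bag_of_words(docs)
    n_docs = len(docs)
    # Document frequency: the number of documents that contain each term.
    df = Counter(term for c in counts for term in c)
    vectors = []
    for c in counts:
        total = sum(c.values())
        vectors.append(
            {t: (n / total) * math.log(n_docs / df[t]) for t, n in c.items()}
        )
    return vectors


# Hypothetical symptom notes, invented for illustration only.
notes = [
    "fever cough sore throat",
    "chest pain shortness of breath",
    "fever headache muscle pain",
]
weights = tf_idf(notes)
```

In this sketch, a term that appears in fewer notes (such as "cough") receives a higher weight than one shared across notes (such as "fever"). Vectors of this kind could then be passed to a classifier such as a Random Forest or an MLP.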