2021
DOI: 10.1155/2021/6653508
|View full text |Cite
|
Sign up to set email alerts
|

Breast Cancer Identification from Patients’ Tweet Streaming Using Machine Learning Solution on Spark

Abstract: Twitter integrates with streaming data technologies and machine learning to add new value to healthcare. This paper presented a real-time system to predict breast cancer based on streaming patient’s health data from Twitter. The proposed system consists of two major components: developing an offline building model and an online prediction pipeline. For the first component, we made a correlation between the features to determine the correlation between features and reduce the number of features from the Breast … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
1

Relationship

2
4

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 31 publications
0
3
0
Order By: Relevance
“…In regular ML approach, five ML algorithms, such as decision tree (DT) [ 29 ], support vector machine (SVM) [ 30 ], K -nearest neighbor algorithm (KNN) [ 31 ], random forest (RF) [ 32 ], and naive Bayes (NB) [ 33 ] were used to compare with the optimized deep RNN. Grid search with cross-validation is used to optimize ML algorithms and improve ML algorithms performance.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…In regular ML approach, five ML algorithms, such as decision tree (DT) [ 29 ], support vector machine (SVM) [ 30 ], K -nearest neighbor algorithm (KNN) [ 31 ], random forest (RF) [ 32 ], and naive Bayes (NB) [ 33 ] were used to compare with the optimized deep RNN. Grid search with cross-validation is used to optimize ML algorithms and improve ML algorithms performance.…”
Section: Methodsmentioning
confidence: 99%
“…CV separates the data set into k subsets in order to train ML algorithms on k −1 subsets (the training set). The remainder is used to test ML algorithms [ 29 ].…”
Section: Methodsmentioning
confidence: 99%
“…As a result of the research direction in content analysis, the research organizations start raising funding to provide novel solutions to combat COVID-19 in terms of analyzing the misleading information about the COVID-19 pandemic [ 5 – 10 ]. Recently, machine learning and deep learning are playing a vital role in different areas such as sentiment analysis [ 11 , 12 ]; Alzheimer detection [ 13 ]; prediction cancer [ 14 ], and others [ 15 , 16 ]. The researchers have utilized the collected datasets related to COVID-19 through social media to evaluate their proposed approaches [ 17 ].…”
Section: Introductionmentioning
confidence: 99%