Abstract: To address the poor data quality and low application rate of existing media corpora, this paper proposes a big-data-based approach to constructing and applying a media corpus. Media corpus data are collected and divided into four categories. A heuristic data item column sorting algorithm is introduced to sort all collection processes, and the minimum value of the data item collection rate is determined. On this basis, the maximum value of quantity in the media corpus is dete…
“…This article has been retracted by Hindawi following an investigation undertaken by the publisher [1]. This investigation has uncovered evidence of one or more of the following indicators of systematic manipulation of the publication process:…”