2019
DOI: 10.5815/ijitcs.2019.07.03
|View full text |Cite
|
Sign up to set email alerts
|

A hybrid Technique for Cleaning Missing and Misspelling Arabic Data in Data Warehouse

Abstract: Real-World datasets accumulated over a number of years tend to be incomplete, inconsistent and contain noisy data, this, in turn, will cause an inconsistency of data warehouses. Data owners are having hundred-millions to billions of records written in different languages, hence continuously increases the need for comprehensive, efficient techniques to maintain data consistency and increase its quality. It is known that the data cleaning is a very complex and difficult task, especially for the data written in A… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 29 publications
0
1
0
Order By: Relevance
“…In this step, misspelled words are detected. Many techniques, such as dictionary search and morphology analysis, are used to detect errors in Arabic languages [25], [26]. Dictionary lookup is the most common and the fastest method due to the size of the dictionary (corpus).…”
Section: Error Detectionmentioning
confidence: 99%
“…In this step, misspelled words are detected. Many techniques, such as dictionary search and morphology analysis, are used to detect errors in Arabic languages [25], [26]. Dictionary lookup is the most common and the fastest method due to the size of the dictionary (corpus).…”
Section: Error Detectionmentioning
confidence: 99%