2014 International Conference on Computer and Communication Engineering 2014
DOI: 10.1109/iccce.2014.48

Complexity Algorithm Analysis for Edit Distance

Abstract: Natural Language Processing (NLP) is a family of methods for processing natural language. Some NLP algorithms are based on edit distance analysis, a process in which statistical calculations between two words or sentences are analyzed. Edit distances used in NLP include Levenshtein, Jaro-Winkler, Soundex, N-grams, and Mahalanobis.

Cited by 6 publications (3 citation statements)
References 4 publications
“…The concept of Levenshtein distance is to find the minimum number of point mutations required to change one string into another. These point mutations are insertion, substitution, and deletion [13], [14].…”
Section: B Levenshtein Distance
type: unclassified
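
The definition quoted above (the minimum number of insertions, substitutions, and deletions needed to turn one string into another) can be illustrated with the standard dynamic-programming formulation. The following is a minimal sketch, not code from the cited papers; the function name `levenshtein` and its parameter names are chosen here for illustration only.

```python
def levenshtein(source: str, target: str) -> int:
    """Return the minimum number of single-character insertions,
    deletions, and substitutions needed to turn source into target."""
    # previous[j] is the distance between the processed prefix of source
    # and the first j characters of target. For the empty prefix, the
    # only option is j insertions.
    previous = list(range(len(target) + 1))
    for i, s_char in enumerate(source, start=1):
        # Turning the first i characters of source into "" takes i deletions.
        current = [i]
        for j, t_char in enumerate(target, start=1):
            substitution = previous[j - 1] + (s_char != t_char)  # free if equal
            deletion = previous[j] + 1      # drop s_char from source
            insertion = current[j - 1] + 1  # add t_char to source
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1]


print(levenshtein("kitten", "sitting"))  # 3
```

Only two rows of the table are kept at a time, so memory grows with the length of `target` alone, while the running time still depends on both lengths.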
“…The driving force towards incorporating Levenshtein distance as the formula to normalise words stems from Maarif et al., who aimed to determine the algorithmic complexity of each of the sub-algorithms that branch from the edit distance tree, such as the Levenshtein Distance (LD), the Jaro-Winkler Distance (JWD), the Mahalanobis Distance, the Soundex Distance and the N-Gram Distance [3]. The importance of the study was to find out which edit distance was best suited to comparing longer sentences when correcting grammar in a Sign Language Synthesizer, as proposed by the study.…”
Section: Levenshtein Edit Distance
type: mentioning
confidence: 99%
“…It is a string metric for measuring the difference between two sequences. Other popular measures of edit distance, which are calculated using different sets of allowable edit operations, are: 1) the Damerau-Levenshtein (DL) distance, which allows insertion, deletion, substitution, and the transposition of two adjacent characters [7]; 2) the Longest Common Subsequence (LCS) distance, which allows only insertion and deletion, not substitution [8]; 3) the Hamming Distance (HD), which allows only substitution and hence applies only to strings of the same length [9]; and 4) the Jaro distance, which allows only transposition [10]. These edit distances can also be computed between two longer strings, but the cost of computing them, which is roughly proportional to the product of the two string lengths, makes this impractical.…”
Section: Modifications and Enhancements Related To Damerau-Levenshtein Distance
type: mentioning
confidence: 99%
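
To make the differences between operation sets concrete, here is a small sketch of two of the variants named in the statement above: the Hamming distance (substitution only) and the LCS distance (insertion and deletion only). The function names `hamming` and `lcs_distance` are illustrative assumptions, not taken from the cited work. The nested loop in `lcs_distance` visits one table cell per pair of characters, which is where the cost proportional to the product of the two string lengths comes from.

```python
def hamming(a: str, b: str) -> int:
    """Substitution-only distance; defined only for equal-length strings."""
    if len(a) != len(b):
        raise ValueError("Hamming distance requires strings of equal length")
    return sum(x != y for x, y in zip(a, b))


def lcs_distance(a: str, b: str) -> int:
    """Insertion/deletion-only distance, computed as
    len(a) + len(b) - 2 * (length of the longest common subsequence)."""
    # previous[j] is the LCS length of the processed prefix of a
    # and the first j characters of b.
    previous = [0] * (len(b) + 1)
    for x in a:
        current = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                current.append(previous[j - 1] + 1)
            else:
                current.append(max(previous[j], current[j - 1]))
        previous = current
    # Every character outside the common subsequence costs one deletion
    # (if it is in a) or one insertion (if it is in b).
    return len(a) + len(b) - 2 * previous[-1]


print(hamming("karolin", "kathrin"))      # 3
print(lcs_distance("kitten", "sitting"))  # 5
```

Because substitution is not allowed, the LCS distance for "kitten" and "sitting" is larger than the Levenshtein distance for the same pair: each mismatched character must be deleted and its replacement inserted separately.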