Proceedings of the 28th International Conference on Computational Linguistics: Industry Track 2020
DOI: 10.18653/v1/2020.coling-industry.18
|View full text |Cite
|
Sign up to set email alerts
|

Learning Domain Terms - Empirical Methods to Enhance Enterprise Text Analytics Performance

Abstract: Performance of standard text analytics algorithms are known to be substantially degraded on consumer generated data, which are often very noisy. These algorithms also do not work well on enterprise data which has a very different nature from News repositories, storybooks or Wikipedia data. Text cleaning is a mandatory step which aims at noise removal and correction to improve performance. However, enterprise data need special cleaning methods since it contains many domain terms which appear to be noise against… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 21 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?