Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2015
DOI: 10.1145/2783258.2788575
|View full text |Cite
|
Sign up to set email alerts
|

Transfer Learning for Bilingual Content Classification

Abstract: LinkedIn Groups provide a platform on which professionals with similar background, target and specialities can share content, take part in discussions and establish opinions on industry topics. As in most online social communities, spam content in LinkedIn Groups poses great challenges to the user experience and could eventually lead to substantial loss of active users. Building an intelligent and scalable spam detection system is highly desirable but faces difficulties such as lack of labeled training data, p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 21 publications
0
2
0
Order By: Relevance
“…In [51] the authors developed a spam classification system for Spanish messages exchanged by LinkedIn users. The goal is to classify messages exchanged by social network users, messages are not necessarily job postings, e.g., they may be unsolicited advertisements.…”
Section: The Micro Perspectivementioning
confidence: 99%
“…In [51] the authors developed a spam classification system for Spanish messages exchanged by LinkedIn users. The goal is to classify messages exchanged by social network users, messages are not necessarily job postings, e.g., they may be unsolicited advertisements.…”
Section: The Micro Perspectivementioning
confidence: 99%
“…Every time the KMM is applied, it will utilise both component data and target data (whether they are labelled or not) to build a potential transferring bridge. In general, there is no way to infer a good estimator based on two different joint distributions in source component Prcfalse(x,yfalse) and target project Pnormalrtfalse(x,yfalse) because of the two distributions could be arbitrarily far apart [30]. Due to this unsolvable estimation problem, KMM assumes that the conditional probability of these two distributions are fixed, meaning that Pnormalrcfalse(yfalsefalse|xfalse) = Pnormalrtfalse(yfalsefalse|xfalse).…”
Section: Proposed Approachmentioning
confidence: 99%