2022
DOI: 10.1007/s10489-022-03281-1
|View full text |Cite
|
Sign up to set email alerts
|

Word-level human interpretable scoring mechanism for novel text detection using Tsetlin Machines

Abstract: Recent research in novelty detection focuses mainly on document-level classification, employing deep neural networks (DNN). However, the black-box nature of DNNs makes it difficult to extract an exact explanation of why a document is considered novel. In addition, dealing with novelty at the word level is crucial to provide a more fine-grained analysis than what is available at the document level. In this work, we propose a Tsetlin Machine (TM)-based architecture for scoring individual words according to their… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
3
1

Relationship

3
4

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 48 publications
0
4
0
Order By: Relevance
“…Generating an episode (a path from the starting point to the terminal), using the -greedy policy with = 1 (taking random actions at each state) as the behaviour policy. Equation (5) shows the updating formula for the value iteration algorithm. Here, α, s and s , stand for the learning rate, the current state, and the next state after taking action a, respectively.…”
Section: Off-policy Learning With the Tsetlin Machinementioning
confidence: 99%
See 1 more Smart Citation
“…Generating an episode (a path from the starting point to the terminal), using the -greedy policy with = 1 (taking random actions at each state) as the behaviour policy. Equation (5) shows the updating formula for the value iteration algorithm. Here, α, s and s , stand for the learning rate, the current state, and the next state after taking action a, respectively.…”
Section: Off-policy Learning With the Tsetlin Machinementioning
confidence: 99%
“…Several researchers have lately explored various TMbased natural language processing models, including text classification [4,21], novelty detection [5], semantic relation analysis [22], and aspect-based sentiment analysis [27], using conjunctive clauses to capture textual patterns. Other application areas are network attack detection [11], keyword spotting [12], biomedical systems design [16], and game playing [9].…”
Section: Introductionmentioning
confidence: 99%
“…This clause score provides the weightage of each input on the model. It has been employed in many applications such as word scoring mechanism [Bhattarai et al, 2022] and novelty detection [Bhattarai et al, 2020].…”
Section: Global Interpretation Of Tm On Spurious Correlationmentioning
confidence: 99%
“…In [18], a novel variant of the TM that randomly drops clauses has been proposed, from which we observe up to +10% increase in accuracy and 2× to 4× faster learning compared with vanilla TM. For natural language processing, the TM has also achieved competitive results in text classification [19], [20], word sense disambiguation [21], novelty detection [22], [23], fake news detection [24], semantic relation analysis [25], aspect-based sentiment analysis [26], and robustness towards counterfactual data [27]. Lately, more advanced architectures have appeared, such as the relational TM [3] and the coalesced TM [28].…”
Section: Introductionmentioning
confidence: 99%