2019
DOI: 10.34028/iajit/17/1/5
|View full text |Cite
|
Sign up to set email alerts
|

Designing Punjabi Poetry Classifiers Using Machine Learning and Different Textual Features

Abstract: Analysis of poetic text is very challenging from computational linguistic perspective. Computational analysis of literary arts, especially poetry, is very difficult task for classification. For library recommendation system, poetries can be classified on various metrics such as poet, time period, sentiments and subject matter. In this work, content-based Punjabi poetry classifier was developed using Weka toolset. Four different categories were manually populated with 2034 poems Nature and Festival (NAFE), Ling… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
2
2

Relationship

0
10

Authors

Journals

citations
Cited by 19 publications
(8 citation statements)
references
References 12 publications
0
8
0
Order By: Relevance
“…Processing (NLP) (Kaur and Jatinderkumar, 2019). TF-IDF is consisted of Term Frequency (TF) and Inverse Document Frequency (IDF).…”
Section: Polypeptide Frequency Of Word Frequencymentioning
confidence: 99%
“…Processing (NLP) (Kaur and Jatinderkumar, 2019). TF-IDF is consisted of Term Frequency (TF) and Inverse Document Frequency (IDF).…”
Section: Polypeptide Frequency Of Word Frequencymentioning
confidence: 99%
“…A lightweight stemmer is proposed for Hindi, which conflates terms by providing suffix list. The stemmer has been evaluated by computing under stemming and over stemming figures for a corpus of documents [12][13][14].…”
Section: Bakgroundmentioning
confidence: 99%
“…Different Indian languages [20] such as Hindi [27][28] [29] are explored by different researchers and NLP elements explored for each language are stated. A new framework that is bag of synset is proposed for multilingual document classification using synset document matrix and BabelNet knowledge base [15] Poetry corpus creation along with preprocessing of the corpus is achieved by Punjabi corpus and classifiers are executed [25]. Diacritic extraction methods are used for the Gujarati language along with information retrieval, stop word identification and classification and machine translation.…”
Section: Different Types Of Stemming Techniques For Indian Andmentioning
confidence: 99%