Film Censorship and Identity in Kenya

Ndanyi, Samson Kaunga

doi:10.5070/f742253948

Cited by 3 publications

(1 citation statement)

References 5 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Native speakers and linguistic experts participated in the translation. As supplementary material, dictionaries (Aswani (1995); Marlo et al, 2008;Ndanyi (2005); Odaga (1997); Parker (1998); Sibuor (2013); TUKI, 2013) were used as references. The translations done were for Dholuo-Kiswahili and Luhya-Kiswahili language pairs.…”

Section: Translationmentioning

confidence: 99%

Kencorpus: A Kenyan Language Corpus of Swahili, Dholuo and Luhya for Natural Language Processing Tasks

Wanjawa

Wanzare

Indede

et al. 2023

JLCL

View full text Add to dashboard Cite

Indigenous African languages are categorized as under-served in Natural Language Processing. They therefore experience poor digital inclusivity and information access. The processing challenge with such languages has been how to use machine learning and deep learning models without the requisite data. The Kencorpus project intends to bridge this gap by collecting and storing text and speech data that is good enough for data-driven solutions in applications such as machine translation, question answering and transcription in multilingual communities. The Kencorpus dataset is a text and speech corpus for three languages predominantly spoken in Kenya: Swahili, Dholuo and Luhya (three dialects of Lumarachi, Lulogooli and Lubukusu). Data collection was done by researchers who were deployed to the various data collection sources such as communities, schools, media, and publishers. The Kencorpus' dataset has a collection of 5,594 items, being 4,442 texts of 5.6 million words and 1,152 speech files worth 177 hours. Based on this data, other datasets were also developed such as Part of Speech tagging sets for Dholuo and the Luhya dialects of 50,000 and 93,000 words tagged respectively. We developed 7,537 Question-Answer pairs from 1,445 Swahili texts and also created a text translation set of 13,400 sentences from Dholuo and Luhya into Swahili. The datasets are useful for downstream machine learning tasks such as model training and translation. Additionally, we developed two proof of concept systems: for Kiswahili speech-to-text and a machine learning system for Question Answering task. These proofs provided results of a performance of 18.87% word error rate for the former, and 80% Exact Match (EM) for the latter system. These initial results give great promise to the usability of Kencorpus to the machine learning community. Kencorpus is one of few public domain corpora for these three low resource languages and forms a basis of learning and sharing experiences for similar works especially for low resource languages. Challenges in developing the corpus included deficiencies in the data sources, data cleaning challenges, relatively short project timelines and the Coronavirus disease (COVID-19) pandemic that restricted movement and hence the ability to get the data in a timely manner.

show abstract

Section: Translationmentioning

confidence: 99%

Kencorpus: A Kenyan Language Corpus of Swahili, Dholuo and Luhya for Natural Language Processing Tasks

Wanjawa

Wanzare

Indede

et al. 2023

JLCL

View full text Add to dashboard Cite

show abstract

LGBTQ+ Books on Library Shelves: The Predicaments of Libraries in Africa

Bouaamri,

Otike

2023

The Reference Librarian

View full text Add to dashboard Cite

Exploring Intellectual Freedom and Collection Development Policies in School Libraries: Perspectives from US and Kenya.

Munyao

2024

PAAC

View full text Add to dashboard Cite

Information is one of the essentials for decision making and intellectual development, thus it is difficult for democratic societies to exist without access to diverse information (Knox, 2011). Within school libraries, access to information resources should be enhanced through the selection, evaluation, and acquisition of diverse resources in different formats to cater to the different needs of students, support the curriculum and promote their intellectual freedom. The fight for intellectual freedom in schools is not a recent development and the number of challenges and book bans is escalating and anticipated to continue rising in the years ahead (ALA,2022). While issues regarding intellectual freedom and censorship are global affairs, much is known about the US, but this study also focuses on Kenya, two countries with varying approaches on different issues thus addressing a significant research gap on intellectual freedom beyond the US. Employing a mixed methods approach, the study involves semi structured interviews among school librarians (10 from the US and 10 from Kenya) and an analysis of the school’s collection development policies. A qualitative thematic analysis of the data is being carried out to identify school librarians’ perspectives on censorship and advocacy for intellectual freedom through policies. The study aims at creating an awareness of the social reality regarding intellectual freedom between the two contexts. School librarians, administrators and policy makers will be able to restructure existing policies to enhance intellectual freedom for students.

show abstract

Film Censorship and Identity in Kenya

Cited by 3 publications

References 5 publications

Kencorpus: A Kenyan Language Corpus of Swahili, Dholuo and Luhya for Natural Language Processing Tasks

Kencorpus: A Kenyan Language Corpus of Swahili, Dholuo and Luhya for Natural Language Processing Tasks

LGBTQ+ Books on Library Shelves: The Predicaments of Libraries in Africa

Exploring Intellectual Freedom and Collection Development Policies in School Libraries: Perspectives from US and Kenya.

Contact Info

Product

Resources

About