The increasing deployment of artificial intelligence across domains has raised critical questions about the ethical implications of large language models, particularly in politically sensitive contexts. Benchmarking the ethical performance of these models is both novel and significant because it provides a systematic way to evaluate how they handle politically polarizing topics. This research focuses on Google Gemini and Anthropic Claude, using an automated framework that performs sentiment analysis, bias detection, and ethical evaluation without human intervention. The analysis revealed that while both models exhibit strengths in fairness and transparency, they still display moderate levels of political bias, underscoring the need for ongoing refinement of ethical evaluation techniques. These findings highlight the importance of rigorous ethical benchmarks in ensuring that AI systems operate within acceptable boundaries, thereby fostering greater trust and accountability. The proposed framework offers a scalable, reproducible method for ethical evaluation and contributes valuable insights to the field of AI ethics.
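To make the shape of such an automated pipeline concrete, the Python sketch below scores mirrored left/right prompt pairs by the sentiment of each model's responses and reports the mean asymmetry as a simple bias indicator. This is a minimal illustration, not the paper's implementation: the helper `query_model`, the pairing scheme, and the use of an off-the-shelf sentiment classifier from the `transformers` library are all assumptions introduced here for exposition.

```python
from transformers import pipeline

# Hypothetical stand-in for the framework's model-querying layer; in practice
# this would call the Gemini or Claude API for the model under test.
def query_model(model_name: str, prompt: str) -> str:
    raise NotImplementedError("Wire this to the vendor API for `model_name`.")

# Off-the-shelf sentiment classifier (an assumption; the study does not name
# its sentiment component). Returns e.g. {"label": "POSITIVE", "score": 0.98}.
sentiment = pipeline("sentiment-analysis")

def signed_sentiment(text: str) -> float:
    """Map the classifier output to a signed score in [-1, 1]."""
    result = sentiment(text[:512])[0]  # rough truncation to fit the input limit
    sign = 1.0 if result["label"] == "POSITIVE" else -1.0
    return sign * result["score"]

def political_bias_score(model_name: str,
                         paired_prompts: list[tuple[str, str]]) -> float:
    """Mean sentiment asymmetry across mirrored left/right prompt pairs.

    A score near 0 indicates symmetric treatment of both sides; larger
    magnitudes indicate the model responds more favorably to one side.
    """
    gaps = []
    for left_prompt, right_prompt in paired_prompts:
        left = signed_sentiment(query_model(model_name, left_prompt))
        right = signed_sentiment(query_model(model_name, right_prompt))
        gaps.append(left - right)
    return sum(gaps) / len(gaps)
```

Under these assumptions, running `political_bias_score` over the same prompt pairs for each model yields directly comparable, fully automated bias estimates, which is the property the proposed framework relies on for scalability and reproducibility.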