Vijit Malik scite author profile

Vijit Malik

5Publications

63Citation Statements Received

65Citation Statements Given

How they've been cited

How they cite others

Affiliations

Indian Institute of Technology Kanpur

Publications

Order By: Most citations

ILDC for CJPE: Indian Legal Documents Corpus for Court Judgment Prediction and Explanation

Malik¹,

Sanjay²,

Nigam³

et al. 2021

View full text Add to dashboard Cite

An automated system that could assist a judge in predicting the outcome of a case would help expedite the judicial process. For such a system to be practically useful, predictions by the system should be explainable. To promote research in developing such a system, we introduce ILDC (Indian Legal Documents Corpus). ILDC is a large corpus of 35k Indian Supreme Court cases annotated with original court decisions. A portion of the corpus (a separate test set) is annotated with gold standard explanations by legal experts. Based on ILDC, we propose the task of Court Judgment Prediction and Explanation (CJPE). The task requires an automated system to predict an explainable outcome of a case. We experiment with a battery of baseline models for case predictions and propose a hierarchical occlusion based model for explainability. Our best prediction model has an accuracy of 78% versus 94% for human legal experts, pointing towards the complexity of the prediction task. The analysis of explanations by the proposed algorithm reveals a significant difference in the point of view of the algorithm and legal experts for explaining the judgments, pointing towards scope for future research.

show abstract

Socially Aware Bias Measurements for Hindi Language Representations

Malik¹,

Dev²,

Nishi³

et al. 2022

View full text Add to dashboard Cite

Trigger warning: This paper contains examples of stereotypes and other harms that could be offensive and triggering to individuals.Language representations are efficient tools used across NLP applications, but they are strife with encoded societal biases. These biases are studied extensively, but with a primary focus on English language representations and biases common in the context of Western society. In this work, we investigate biases present in Hindi language representations with focuses on caste and religion-associated biases. We demonstrate how biases are unique to specific language representations based on the history and culture of the region they are widely spoken in, and how the same societal bias (such as binary gender-associated biases) is encoded by different words and text spans across languages. The discoveries of our work highlight the necessity of culture awareness and linguistic artifacts when modeling language representations, in order to better understand the encoded biases.

show abstract

ILDC for CJPE: Indian Legal Documents Corpus for Court Judgment Prediction and Explanation

Malik¹,

Sanjay²,

Nigam³

et al. 2021

Preprint

View full text Add to dashboard Cite

show abstract

Semantic Segmentation of Legal Documents via Rhetorical Roles

Malik¹,

Sanjay²,

Guha³

et al. 2021

Preprint

View full text Add to dashboard Cite

Legal documents are unstructured, use legal jargon, and have considerable length, making it difficult to process automatically via conventional text processing techniques. A legal document processing system would benefit substantially if the documents could be semantically segmented into coherent units of information. This paper proposes a Rhetorical Roles (RR) system for segmenting a legal document into semantically coherent units: facts, arguments, statute, issue, precedent, ruling, and ratio. With the help of legal experts, we propose a set of 13 fine-grained rhetorical role labels and create a new corpus of legal documents annotated with the proposed RR. We develop a system for segmenting a document into rhetorical role units. In particular, we develop a multitask learning-based deep learning model with document rhetorical role label shift as an auxiliary task for segmenting a legal document. We experiment extensively with various deep learning models for predicting rhetorical roles in a document, and the proposed model shows superior performance over the existing models. Further, we apply RR for predicting the judgment of legal cases and show that the use of RR enhances the prediction compared to the transformer-based models.

show abstract

Socially Aware Bias Measurements for Hindi Language Representations

Malik¹,

Dev²,

Nishi³

et al. 2021

Preprint

View full text Add to dashboard Cite

Language representations are an efficient tool used across NLP, but they are strife with encoded societal biases. These biases are studied extensively, but with a primary focus on English language representations and biases common in the context of Western society. In this work, we investigate the biases present in Hindi language representations such as caste and religion associated biases. We demonstrate how biases are unique to specific language representations based on the history and culture of the region they are widely spoken in, and also how the same societal bias (such as binary gender associated biases) when investigated across languages is encoded by different words and text spans. With this work, we emphasize on the necessity of social-awareness along with linguistic and grammatical artefacts when modeling language representations, in order to understand the biases encoded.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Vijit Malik

ILDC for CJPE: Indian Legal Documents Corpus for Court Judgment Prediction and Explanation

Socially Aware Bias Measurements for Hindi Language Representations

ILDC for CJPE: Indian Legal Documents Corpus for Court Judgment Prediction and Explanation

Semantic Segmentation of Legal Documents via Rhetorical Roles

Socially Aware Bias Measurements for Hindi Language Representations

Contact Info

Product

Resources

About