X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Models

Held, Jan; Itani, Hani; Cioppa, Anthony; Giancola, Silvio; Ghanem, Bernard; Van Droogenbroeck, Marc

doi:10.1109/cvprw63382.2024.00332

2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2024

DOI: 10.1109/cvprw63382.2024.00332

|View full text |Cite

X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Models

Jan Held,

Hani Itani,

Anthony Cioppa

et al.

Abstract: The rapid advancement of artificial intelligence has led to significant improvements in automated decision-making. However, the increased performance of models often comes at the cost of explainability and transparency of their decision-making processes. In this paper, we investigate the capabilities of large language models to explain decisions, using football refereeing as a testing ground, given its decision complexity and subjectivity. We introduce the EXplainable Video Assistant Referee System, X-VARS, a … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2024

Publication Types

Select...

Other3

Relationship

Self Cite0

Independent3

Authors

Journals

Cited by 3 publications

References 45 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

SoccerNet-Depth: a Scalable Dataset for Monocular Depth Estimation in Sports Videos

Leduc,

Cioppa,

Giancola

et al. 2024

2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

View full text Add to dashboard Cite

Monocular Depth Estimation (MDE) is fundamental in sports video understanding, enhancing augmented graphics, scene understanding, and game state reconstruction. Despite remarkable progress in autonomous driving and indoor scene understanding, there is currently a lack of MDE datasets tailored for sports. Furthermore, most existing datasets only focus on single images, disregarding the temporal aspect. In this work, we introduce the first video dataset for MDE in sports, SoccerNet-Depth, focusing on football and basketball videos. In particular, we leverage the graphic engine from video games to automatically extract video sequences and their associated depth maps, making our dataset easily scalable. Furthermore, we benchmark and fine-tune several state-of-the-art MDE methods on our dataset. Our analysis shows that MDE in sports is far from being solved, making our dataset a perfect playground for future research. Dataset and codes: https://github.com/SoccerNet/sn-depth.

show abstract

SoccerNet-Depth: a Scalable Dataset for Monocular Depth Estimation in Sports Videos

Leduc,

Cioppa,

Giancola

et al. 2024

2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

View full text Add to dashboard Cite

show abstract

A Universal Protocol to Benchmark Camera Calibration for Sports

Magera,

Hoyoux,

Barnich

et al. 2024

2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

View full text Add to dashboard Cite

Beyond the Premier: Assessing Action Spotting Transfer Capability Across Diverse Domains

Cabado,

Cioppa,

Giancola

et al. 2024

2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

View full text Add to dashboard Cite

Football stands as one of the most successful sports in history thanks to the plethora of professional leagues broadcasted worldwide followed by avid fans, further fueled by the abundance of amateur and grassroots leagues across nearly every country, encompassing countless players who devote their time to the sport. Despite the tremendous amount of visual data available worldwide for developing automatic systems to extract game events, most efforts focus on the few professional league matches. However, the recording quality and broadcasts editing vary considerably across leagues, creating a disparity in the analytical capabilities of deep learning models. This paper delves into an analysis of how action spotting models transfer to diverse domains, analyzing the performance gap between various types of broadcasts. In particular, we investigate the transfer capability of state-of-the-art action spotting models across leagues, from amateur to professional, and broadcast quality, from AI-piloted camera to professional broadcast editing. Our analysis shows that transferring across leagues is challenging, with the most impactful feature being broadcasting editing quality. This analysis paper therefore seeks to spotlight this pressing issue and catalyze future research endeavors in the field of domain adaptation for action spotting methods.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Models

Cited by 3 publications

References 45 publications

SoccerNet-Depth: a Scalable Dataset for Monocular Depth Estimation in Sports Videos

SoccerNet-Depth: a Scalable Dataset for Monocular Depth Estimation in Sports Videos

A Universal Protocol to Benchmark Camera Calibration for Sports

Beyond the Premier: Assessing Action Spotting Transfer Capability Across Diverse Domains

Contact Info

Product

Resources

About