In this paper, we aim to develop a method for creating personalized, high-presence multi-channel content for a sports game through real-time curation of the various media streams captured or created by spectators. We use the live TV broadcast as ground truth and construct a machine learning-based model that automatically curates multiple videos captured by spectators from different angles and at different zoom levels. The live TV broadcast of a baseball game follows curation rules that select a specific camera angle for specific scenes (e.g., a pitcher throwing a ball). As inputs to the model, we use metadata: image feature data (e.g., whether a pitcher is on the screen) for each fixed interval of the baseball videos, and game progress data (e.g., the inning number and the batting order). The output is the camera ID (among the spectators' cameras) at each point in time. For evaluation, we targeted Spring-Selection high-school baseball games. As training data, we used image features, game progress data, and the camera position at each point in time in the TV broadcast. We used videos of a baseball game captured from 7 different points in Hanshin Koshien Stadium with handheld video cameras and generated a sample data set by dividing the videos into fixed-interval segments. We divided the sample data set into a training set and a test set and evaluated our method with two validation methods: (1) 10-fold cross-validation and (2) hold-out validation (e.g., training on the first and second innings and testing on the third inning). As a result, our method predicted the camera switching timings with an accuracy (F-measure) of 72.53% on weighted average for the base camera work and 92.1% for the fixed camera work.
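The weighted-average F-measure reported above can be illustrated with a minimal sketch. It assumes that "weighted average" means the per-class F-measure weighted by the number of ground-truth segments of each camera, which is a common convention but is not stated in the abstract. The function name `weighted_f_measure` and the camera ID sequences are hypothetical and are not taken from the paper.

```python
from collections import Counter


def weighted_f_measure(y_true, y_pred):
    """Support-weighted average of per-class F-measures (assumed definition)."""
    if len(y_true) != len(y_pred):
        raise ValueError("label sequences must have the same length")
    support = Counter(y_true)  # number of ground-truth segments per camera ID
    total = len(y_true)
    score = 0.0
    for label, n_true in support.items():
        # Segments where both the broadcast and the model chose this camera.
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
        n_pred = sum(1 for p in y_pred if p == label)
        precision = tp / n_pred if n_pred else 0.0
        recall = tp / n_true
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        score += f1 * n_true / total
    return score


# Hypothetical camera IDs for six fixed-interval segments.
broadcast = [1, 1, 2, 2, 3, 1]  # camera chosen in the TV broadcast (ground truth)
predicted = [1, 1, 2, 3, 3, 1]  # camera chosen by the model
print(round(weighted_f_measure(broadcast, predicted), 3))
```

Under this definition, cameras that the broadcast uses more often contribute more to the final score, so a model that handles the dominant camera well can score highly even if rarely used cameras are predicted poorly.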
Emerging Internet of Things (IoT) technologies will allow spectators at a sports game to produce video streams from various angles. With existing technologies, however, it is difficult to process massive and varied data streams into multi-channel content in real time. To solve this problem, we aim to construct a software agent (called "Curator") that compiles video content automatically according to each viewer's values. In this paper, we propose a system that uses a Random Forests classifier to automatically switch between multiple video streams taken by general sports spectators. Metadata such as image feature data and game progress data is extracted for each video scene and used as the input to the classifier. For evaluation, we constructed a model for estimating camera switching timing using live TV broadcast data of some baseball games. A video of another baseball game was then curated with the constructed model. As a result, our system predicted the camera switching timing with an accuracy (F-measure) of 85.3% on weighted average for the base camera work and 99.7% for the fixed camera work.
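One way to read "camera switching timing" is as the points where the camera ID predicted for consecutive video segments changes. The sketch below shows that reading only. It assumes the classifier outputs one camera ID per fixed-length segment, and the function name `switch_timings`, the segment length, and the sample sequence are all hypothetical rather than taken from the paper.

```python
def switch_timings(camera_ids, interval_sec=1.0):
    """Return (time_sec, from_camera, to_camera) for each change in the sequence.

    camera_ids: predicted camera ID for each consecutive fixed-length segment.
    interval_sec: assumed segment length in seconds (not given in the abstract).
    """
    switches = []
    for i in range(1, len(camera_ids)):
        if camera_ids[i] != camera_ids[i - 1]:
            # The switch happens at the start of segment i.
            switches.append((i * interval_sec, camera_ids[i - 1], camera_ids[i]))
    return switches


# Hypothetical classifier output for eight 2-second segments.
predicted = [3, 3, 3, 5, 5, 1, 1, 1]
for t, src, dst in switch_timings(predicted, interval_sec=2.0):
    print(f"{t:.1f}s: camera {src} -> camera {dst}")
```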