The media industry is currently being pulled in the often-opposing\ud
directions of increased realism (high resolution, stereoscopic,\ud
large screen) and personalization (selection and control of\ud
content, availability on many devices). We investigate the\ud
feasibility of an end-to-end format-agnostic approach to support\ud
both these trends. In this paper, different aspects of a format-\ud
agnostic capture, production, delivery and rendering system are\ud
discussed. At the capture stage, the concept of layered scene\ud
representation is introduced, including panoramic video and 3D\ud
audio capture. At the analysis stage, a virtual director component\ud
is discussed that allows for automatic execution of\ud
cinematographic principles, using feature tracking and saliency\ud
detection. At the delivery stage, resolution-independent\ud
audiovisual transport mechanisms for both managed and\ud
unmanaged networks are treated. In the rendering stage, a\ud
rendering process that includes the manipulation of audiovisual\ud
content to match the connected display and loudspeaker properties\ud
is introduced. Different parts of the complete system are revisited\ud
demonstrating the requirements and the potential of this advanced\ud
concept.Peer ReviewedPostprint (published version