Viewport Prediction
Viewport prediction aims to anticipate a user's viewing area within immersive video content (e.g., 360° or volumetric video) to optimize bandwidth usage and enhance streaming quality. Current research focuses on improving prediction accuracy using various techniques, including transformer networks that leverage multimodal data (video content and user viewing history) and reinforcement learning algorithms to dynamically adapt bitrate allocation. These advancements are crucial for enabling efficient and high-quality streaming of increasingly prevalent immersive video formats, particularly in bandwidth-constrained environments like mobile VR/AR applications.
Papers
May 13, 2024
November 28, 2023
September 26, 2023