OVBench: How Far is Your Video-LLMs from Real-World Online Video Understanding?
Published in arXiv preprint, 2024
A benchmark for online visual understanding, with tasks organized into three categories: Backward Tracing, Real-Time Visual Perception, and Forward Active Responding.
Recommended citation: (2024). "OVBench: How Far is Your Video-LLMs from Real-World Online Video Understanding?" arXiv preprint.
Download Paper