Skyra achieves state-of-the-art performance on the proposed ViF-Bench and the external GenVideo benchmark.

| Method | Type | Acc (%) | F1 (%) |
|---|---|---|---|
| DeMamba | Binary | 64.29 | 73.00 |
| GPT-4.1-mini | MLLM | 54.08 | 24.21 |
| Gemini-2.5-flash | MLLM | 53.36 | 57.48 |
| BusterX++ | MLLM-based | 56.90 | 21.94 |
| Skyra (Ours-RL) | MLLM-based | 91.02 | 90.27 |

Comparison on ViF-Bench (Mean). Skyra significantly outperforms both binary detectors and generic MLLMs, reaching 91.02% accuracy and an F1 of 90.27 against a best baseline accuracy of 64.29% (DeMamba).
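
Several baselines in the table pair near-chance accuracy with a much lower F1 (for example GPT-4.1-mini at 54.08 versus 24.21). One plausible reading is that such a detector rarely predicts the positive class, so recall collapses while accuracy stays near 50% on a roughly balanced set. The sketch below illustrates that effect. It assumes "fake" is the positive class and uses hypothetical confusion counts chosen for illustration; neither the counts nor the helper name `accuracy_and_f1` come from the paper.

```python
def accuracy_and_f1(tp: int, fp: int, tn: int, fn: int) -> tuple[float, float]:
    """Return (accuracy, F1) in percent from binary confusion counts."""
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total if total else 0.0
    # F1 = 2*TP / (2*TP + FP + FN), the harmonic mean of precision and recall.
    denom = 2 * tp + fp + fn
    f1 = 2 * tp / denom if denom else 0.0
    return 100 * accuracy, 100 * f1


# Hypothetical balanced set: 100 fake videos (positive), 100 real (negative).
# The detector flags only 20 videos as fake, 15 of them correctly.
acc, f1 = accuracy_and_f1(tp=15, fp=5, tn=95, fn=85)
print(f"accuracy = {acc:.2f}%, F1 = {f1:.2f}%")
```

With these illustrative counts, accuracy is 55% while F1 is 25%, which is why the table reports both metrics rather than accuracy alone.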