TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition
Paper
• 2512.01248 • Published
• 12
None defined yet.
Are Video Reasoning Models Ready to Go Outside?
EntroPE: Entropy-Guided Dynamic Patch Encoder for Time Series Forecasting