shi-labs/vpt_OLA-VLM-CLIP-ConvNeXT-Llama3-8b
Image-Text-to-Text • 9B
Computer Vision, AI, Machine Learning
PAI-Bench: A Comprehensive Benchmark For Physical AI
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance