Mobile Perception Systems Lab

university

https://www.tue-mps.org/

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

nielsr submitted a paper 3 days ago

Gemini Embedding 2: A Native Multimodal Embedding Model from Gemini

orsveri updated a model 8 days ago

tue-mps/towards-video-image-frozen

nielsr submitted a paper 9 days ago

View all activity

Papers

PMT: Plain Mask Transformer for Image and Video Segmentation with Frozen Vision Encoders

VidEoMT: Your ViT is Secretly Also a Video Segmentation Model

View all Papers

submitted a paper to Daily Papers 3 days ago

Gemini Embedding 2: A Native Multimodal Embedding Model from Gemini

Paper • 2605.27295 • Published 4 days ago • 17

updated a model 8 days ago

tue-mps/towards-video-image-frozen

Updated 8 days ago • 1

submitted a paper to Daily Papers 9 days ago

Stable Audio 3

Paper • 2605.17991 • Published 12 days ago • 18

published a model 12 days ago

tue-mps/towards-video-image-frozen

Updated 8 days ago • 1

submitted a paper to Daily Papers about 1 month ago

Scaling Test-Time Compute for Agentic Coding

Paper • 2604.16529 • Published Apr 16 • 12

authored a paper about 1 month ago

A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens

Paper • 2604.04913 • Published Apr 6 • 12

authored a paper about 1 month ago

A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens

Paper • 2604.04913 • Published Apr 6 • 12

submitted a paper to Daily Papers about 1 month ago

Geometric Context Transformer for Streaming 3D Reconstruction

Paper • 2604.14141 • Published Apr 15 • 21

submitted a paper to Daily Papers about 2 months ago

A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens

Paper • 2604.04913 • Published Apr 6 • 12

in tue-mps/coco_panoptic_pmt_large_640_dinov3 about 2 months ago

Adding `safetensors` variant of this model

#1 opened 2 months ago by

in tue-mps/coco_panoptic_pmt_small_640_dinov3 about 2 months ago

Adding `safetensors` variant of this model

#1 opened 2 months ago by

in tue-mps/coco_panoptic_pmt_base_640_dinov3 about 2 months ago

Adding `safetensors` variant of this model

#1 opened 2 months ago by

submitted a paper to Daily Papers about 2 months ago

MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios

Paper • 2603.28130 • Published Mar 30 • 11

authored a paper 2 months ago

PMT: Plain Mask Transformer for Image and Video Segmentation with Frozen Vision Encoders

Paper • 2603.25398 • Published Mar 26 • 3

updated 6 models 2 months ago

tue-mps/ade20k_semantic_pmt_large_512_dinov3

Image Segmentation • Updated Mar 27

tue-mps/coco_instance_pmt_large_640_dinov3

Image Segmentation • Updated Mar 27

tue-mps/coco_panoptic_pmt_large_1280_dinov3

Image Segmentation • Updated Mar 27

tue-mps/coco_instance_pmt_large_1280_dinov3

Image Segmentation • Updated Mar 27

tue-mps/coco_panoptic_pmt_small_640_dinov3

Image Segmentation • 7.7M • Updated Apr 6

tue-mps/coco_panoptic_pmt_base_640_dinov3

Image Segmentation • 30.4M • Updated Apr 6