Alex Open Research Wiki

Tag: vision-language

13 items with this tag.

Jul 15, 2026
VL-JEPA
Jul 15, 2026
VLWM
Jul 15, 2026
VL-JEPA: Joint Embedding Predictive Architecture For Vision-Language
Jul 15, 2026
Planning with Reasoning using Vision Language World Model
Jul 15, 2026
Vision-Language Models
- vision-language
- multimodal
Jul 03, 2026
LeVLJEPA
Jul 03, 2026
LeVLJEPA: End-to-End Vision-Language Pretraining Without Negatives
Jun 26, 2026
Revisiting the Platonic Representation Hypothesis: An Aristotelian View
May 22, 2026
Florence-2
May 22, 2026
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
May 18, 2026
Perception Encoder
May 18, 2026
Perception Encoder
May 16, 2026
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
