Alex Open Research Wiki

Tag: vision

21 items with this tag.

Jul 19, 2026
DINOv3
- entity
- vision
Jul 19, 2026
DINOv3
- vision
- self-supervised-learning
Jul 19, 2026
Flow Matching in Feature Space for Stochastic World Modeling
Jul 19, 2026
Vision Foundation Models
- vision
- foundation-models
Jul 15, 2026
VISReg
Jul 15, 2026
VISReg: Variance-Invariance-Sketching Regularization for JEPA training
Jun 20, 2026
The Thinking Pixel / Recursive Sparse Reasoning
Jun 20, 2026
The Thinking Pixel: Recursive Sparse Reasoning in Multimodal Diffusion Latents
Jun 15, 2026
S4L: Self-Supervised Semi-Supervised Learning
Jun 04, 2026
Gemma 4 12B Encoder-Free Multimodal Release
May 25, 2026
leAutoencoder
May 25, 2026
Self-Teaching Autoencoder
May 22, 2026
Florence-2
May 22, 2026
RAEv2
May 22, 2026
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
May 22, 2026
Improved Baselines with Representation Autoencoders
May 18, 2026
Perception Encoder
May 18, 2026
Next-Embedding Prediction Makes Strong Vision Learners
May 18, 2026
Perception Encoder
May 18, 2026
The Prism Hypothesis: Harmonizing Semantic And Pixel Representations Via Unified Autoencoding
May 18, 2026
Tuna-2: Pixel Embeddings Beat Vision Encoders For Multimodal Understanding And Generation
