Alex Open Research Wiki

Tag: vision-language

8 items with this tag.

  • Jun 04, 2026

    Vision-Language Models

    • vision-language
    • multimodal
  • May 22, 2026

    Florence-2

    • entity
    • vision
    • vision-language
    • dataset-bootstrapping
  • May 22, 2026

    Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

    • vision
    • vision-language
    • dataset-bootstrapping
    • data-engine
    • unified-representation
  • May 18, 2026

    Perception Encoder

    • entity
    • vision
    • vision-language
  • May 18, 2026

    VL-JEPA

    • entity
    • vision-language
    • jepa
  • May 18, 2026

    Perception Encoder

    • vision
    • vision-language
    • representation-learning
    • intermediate-layers
  • May 18, 2026

    VL-JEPA: Joint Embedding Predictive Architecture For Vision-Language

    • jepa
    • vision-language
    • selective-decoding
  • May 16, 2026

    Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models

    • vision-language
    • multimodal
    • open-weights
    • datasets
