Alex Open Research Wiki
Search
Search
Dark mode
Light mode
Explorer
Tag: robotics
39 items with this tag.
Jun 04, 2026
World Model for Robot Learning: A Comprehensive Survey
world-models
robotics
vision-language-action
simulation
survey
Jun 04, 2026
Fast/Slow Thinking For Robotics And Time Series
robotics
time-series
world-models
control
systems
hierarchy
May 31, 2026
Robotics Text Conditioning
robotics
language-conditioning
vision-language-action
planning
multimodal
May 31, 2026
Robotics Time-Series Modeling
robotics
time-series
world-models
actions
multimodal
May 28, 2026
stable-worldmodel: A Platform for Reproducible World Modeling Research and Evaluation
world-models
jepa
planning
benchmarks
robotics
reproducibility
May 25, 2026
Genie
entity
world-models
video
latent-actions
robotics
May 25, 2026
Genie: Generative Interactive Environments
world-models
video
latent-actions
robotics
generative-ai
May 18, 2026
ACT
entity
robotics
imitation-learning
action-chunks
May 18, 2026
Diffusion Policy
entity
robotics
diffusion
control-inputs
May 18, 2026
FAST
entity
robotics
action-tokenization
vision-language-action
May 18, 2026
Gemini Robotics 1.5
entity
robotics
vision-language-action
embodied-reasoning
motion-transfer
May 18, 2026
GR00T N1
entity
robotics
humanoids
vision-language-action
flow-matching
May 18, 2026
Helix 02
entity
robotics
humanoids
vision-language-action
loco-manipulation
May 18, 2026
Helix
entity
robotics
humanoids
vision-language-action
control
May 18, 2026
Octo
entity
robotics
generalist-robot-policy
diffusion
May 18, 2026
OpenVLA
entity
robotics
vision-language-action
action-tokens
May 18, 2026
π0
entity
robotics
vision-language-action
flow-matching
control-inputs
May 18, 2026
π0.7
entity
robotics
vision-language-action
flow-matching
prompting
metadata
May 18, 2026
RDT-1B
entity
robotics
bimanual-manipulation
diffusion
action-chunks
May 18, 2026
RT-2
entity
robotics
vision-language-action
action-tokens
May 18, 2026
Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware
robotics
imitation-learning
action-chunks
bimanual-manipulation
continuous-control
May 18, 2026
BridgeData V2: A Dataset for Robot Learning at Scale
robotics
datasets
manipulation
trajectories
actions
May 18, 2026
CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning
causal
robotics
interventions
actions
May 18, 2026
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
robotics
imitation-learning
diffusion
action-trajectories
control-inputs
May 18, 2026
DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
robotics
datasets
manipulation
trajectories
multimodal
May 18, 2026
FAST: Efficient Action Tokenization for Vision-Language-Action Models
robotics
vision-language-action
action-tokenization
control-inputs
trajectories
May 18, 2026
Gemini Robotics 1.5: Pushing the Frontier of Generalist Robots with Advanced Embodied Reasoning, Thinking, and Motion Transfer
robotics
vision-language-action
embodied-reasoning
motion-transfer
control-inputs
multimodal
May 18, 2026
GR00T N1: An Open Foundation Model for Generalist Humanoid Robots
robotics
humanoids
vision-language-action
diffusion
flow-matching
trajectories
actions
May 18, 2026
Introducing Helix 02: Full-Body Autonomy
robotics
humanoids
vision-language-action
loco-manipulation
trajectories
multimodal
May 18, 2026
Helix: A Vision-Language-Action Model for Generalist Humanoid Control
robotics
humanoids
vision-language-action
visuomotor-control
trajectories
actions
May 18, 2026
Octo: An Open-Source Generalist Robot Policy
robotics
generalist-robot-policy
diffusion
transformers
actions
multimodal
May 18, 2026
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
robotics
datasets
vision-language-action
actions
embodiment
May 18, 2026
OpenVLA: An Open-Source Vision-Language-Action Model
robotics
vision-language-action
action-tokens
open-source
multimodal
May 18, 2026
π0: A Vision-Language-Action Flow Model for General Robot Control
robotics
vision-language-action
flow-matching
action-chunks
control-inputs
embodiment
May 18, 2026
π0.7: a Steerable Generalist Robotic Foundation Model with Emergent Capabilities
robotics
vision-language-action
flow-matching
prompting
metadata
action-chunks
world-models
May 18, 2026
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
robotics
bimanual-manipulation
diffusion
vision-language-action
action-chunks
control-inputs
May 18, 2026
Reconstruction Or Semantics? What Makes A Latent Space Useful For Robotic World Models
world-models
robotics
latent-space
May 18, 2026
RoboTurk: A Crowdsourcing Platform for Robotic Skill Learning through Imitation
robotics
datasets
teleoperation
imitation-learning
actions
May 18, 2026
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
robotics
vision-language-action
action-tokens
language-conditioning
trajectories