Entity Pages
Files
act-2023.md - Action Chunking with Transformers method for continuous robot action chunks.
anomod-2026.md - Multimodal microservice anomaly-detection and root-cause-analysis dataset.
armt-2024.md - Associative Recurrent Memory Transformer method extending RMT with layerwise associative memory.
atlas-2025.md - ATLAS test-time memory module and DeepTransformers family.
boom-2025.md - Datadog observability metrics forecasting benchmark.
bolmo-2025.md - Fully open byte-level LM family produced by byteifying subword LMs.
bittokens-2025.md - Bit-level single-token number encoding method based on IEEE 754 features.
charm-2025.md - Channel-aware JEPA embedding model for multivariate time series.
chatts-2024.md - Time-series multimodal LLM trained from synthetic time-series/text data.
chronograph-2025.md - Graph-native multivariate microservice time-series benchmark with temporal node and edge features.
compute-optimal-tokenization-2026.md - Tokenization-aware scaling-law study centered on bytes per parameter and compute-optimal compression rate.
context-is-key-2024.md - ServiceNow benchmark for probabilistic forecasting with essential natural-language context.
cwm-2025.md - Meta FAIR Code World Model for code generation through computational-environment action-observation trajectories.
diffusionblocks-2026.md - Block-wise training framework that reinterprets residual-network depth as diffusion-style denoising blocks.
diffusion-policy-2023.md - Visuomotor diffusion policy over future continuous action trajectories.
dinov3-2025.md - Self-supervised vision foundation model suite.
dragon-hatchling-2025.md - Pathway BDH / Dragon Hatchling recurrent attention/state-space architecture.
ebt-2025.md - Energy-Based Transformer method for candidate-prediction scoring, gradient-based refinement, and dynamic compute.
eggroll-2025.md - Low-rank perturbation method for hyperscale evolution strategies.
fade-2026.md - Adaptive per-parameter weight-decay method for controlled forgetting in continual learning.
eidos-2026.md - Time-series foundation model family based on latent-space predictive learning.
elt-2026.md - Elastic Looped Transformer architecture for any-time visual generation.
flowstate-2025.md - SSM-based time-series foundation model with continuous functional-basis decoding.
florence-2-2023.md - Microsoft prompt-based vision foundation model built around the FLD-5B iterative data engine.
fone-2025.md - Fourier Number Embedding method for single-token number representations.
gaia-micross-2021.md - GAIA AIOps collection and MicroSS microservice telemetry subset.
fast-2025.md - Frequency-space tokenizer for robot action chunks.
gemma-4-12b-2026.md - Google DeepMind encoder-free multimodal 12B open-weight model.
gemini-robotics-1-5-2025.md - Google DeepMind embodied-reasoning and VLA robot model family.
genie-2024.md - Google DeepMind generative interactive environment model with learned latent actions from unlabeled video.
gift-eval-2024.md - General-purpose time-series forecasting benchmark and leaderboard.
gr00t-n1-2025.md - NVIDIA open humanoid VLA/action model.
h-net-2025.md - Hierarchical sequence model with learned dynamic chunking.
helix-2025.md - Figure AI upper-body humanoid VLA with fast/slow control.
helix-02-2026.md - Figure AI full-body humanoid VLA/controller stack.
hierarchical-reasoning-model-2025.md - HRM fast/slow recurrent reasoning architecture.
hyperloop-transformers-2026.md - Looped language-model architecture using loop-level hyper-connections.
huginn-2025.md - Recurrent-depth language model using latent-space loops as test-time compute.
latent-thoughts-2025.md - Looped Transformer framing for latent thought steps.
lemma-rca-2024.md - Large multi-modal multi-domain root-cause-analysis dataset collection.
llm-sleep-2026.md - Sleep-time memory-consolidation method for SSM-attention hybrid language models.
lejepa-2025.md - JEPA objective combining predictive alignment with SIGReg.
leautoencoder-2026.md - Self-teaching autoencoder prototype using transformed latent consistency instead of direct image-space reconstruction loss.
loopformer-2026.md - Elastic-depth looped Transformer for budget-conditioned latent reasoning.
mamba-2023.md - Selective state space model architecture for efficient recurrent sequence modeling.
mamba-2-2024.md - Structured state space duality architecture and SSD algorithm.
mamba-3-2026.md - Mamba-family architecture with richer discretization, complex state, and MIMO inference updates.
mantis-2025.md - Time-series classification foundation-model lineage covering Mantis, MantisV2, and UTICA.
mesanet-2025.md - Sequence model with locally optimal test-time training.
mhc-2025.md - Manifold-Constrained Hyper-Connections method for stable matrix-valued residual streams.
miras-2025.md - Associative-memory framework for attentional bias, retention, and online optimization.
moda-2026.md - Mixture-of-Depths Attention method for content-based inter-layer depth retrieval.
moirai-2024.md - Salesforce Uni2TS forecasting family covering Moirai 1.x, Moirai-MoE, and Moirai 2.0.
octo-2024.md - Open-source generalist robot policy.
openvla-2024.md - Open 7B VLA model using discretized action tokens.
openrca-2025.md - LLM-agent root-cause-analysis benchmark over large software telemetry.
ops-lite-2026.md - Compact RCA evaluation set with per-case causal graph ground truth.
pdr-rtv-2026.md - Agentic-coding test-time scaling recipe using structured summaries, recursive tournament voting, and refinement.
parallel-samplers-recurrent-depth-2025.md - Parallel inference method for recurrent-depth models.
pararnn-2025.md - Parallel nonlinear RNN training framework from Apple.
parcae-2026.md - Stable looped language-model architecture with scaling-law analysis.
perception-encoder-2025.md - Meta vision-encoder family whose strongest general features are often internal before alignment tuning.
pi0-2024.md - Physical Intelligence VLA flow model for general robot control.
pi0-7-2026.md - Steerable generalist VLA model with metadata, subgoal images, and flow action expert.
raev2-2026.md - Multi-layer representation-autoencoder recipe for generation and navigation world-model rollouts.
rcaeval-2025.md - Microservice root-cause-analysis benchmark and evaluation framework.
rdt-1b-2024.md - Robotics Diffusion Transformer for bimanual manipulation.
rate-2023.md - Recurrent Action Transformer with Memory offline RL policy architecture.
recurrent-transformer-2026.md - Transformer variant with layerwise recurrent memory.
rmt-2022.md - Recurrent Memory Transformer segment-level memory-token method.
reinpatch-2026.md - Learned adaptive patching method for time-series forecasting.
rt-2-2023.md - Vision-language-action model using action-as-language tokens.
rwkv-ts-2024.md - RWKV-style recurrent sequence model adapted to time-series tasks.
simmtm-2023.md - Masked time-series modeling framework based on multi-neighbor reconstruction.
sparse-layers-looped-language-models-2026.md - Sparse MoE looped language-model branch.
stable-worldmodel-2026.md - Reproducible world-model research platform with trajectory data handling, planning solvers, baselines, and factor-of-variation evaluation.
sundial-2025.md - THUML flow-matching time-series forecasting foundation-model family.
t2s-2025.md - Text-to-time-series generation model with LA-VAE and flow-matching Diffusion Transformer.
tabm-2024.md - MLP-based tabular deep-learning model with parameter-efficient ensembling.
telecomts-2025.md - Multimodal 5G observability benchmark for anomaly detection, root-cause analysis, forecasting, and time-series/text Q&A.
time-2026.md - Contamination-resistant zero-shot forecasting benchmark.
time-hd-2025.md - High-dimensional time-series forecasting benchmark introduced with U-Cast.
time-series-library-2024.md - Time-series benchmark collection and LSF/LTSF dataset handle.
timeomni-1-2026.md - Time-series reasoning model and TSR-Suite.
timeomni-vl-2026.md - Vision-centric time-series understanding/generation framework.
timesfm-2023.md - Google decoder-only time-series forecasting foundation model.
tiny-recursive-model-2025.md - TRM tiny recursive reasoning model.
titans-2025.md - Neural long-term memory sequence-model family.
titans-revisited-2025.md - Titans reimplementation and critical-analysis object.
tiny-time-mixers-2024.md - IBM Granite compact pretrained mixer-style forecasting model family.
toto-2025.md - Datadog observability-oriented forecasting family covering Toto 1.0 and Toto 2.0.
ts2vec-2021.md - Hierarchical contrastive time-series representation model.
tsmixer-2023.md - All-MLP time-series forecasting architecture.
turboquant-2025.md - Online vector quantization method for KV-cache and vector-search state, with vLLM caveats around FP8, latency, throughput, and memory pressure.
tuna-2-2026.md - Pixel-space unified multimodal model.
units-2024.md - Unified multi-task time-series model.
universal-reasoning-model-2025.md - UT-derived recursive reasoning model.
universal-transformers-2018.md - Root recurrent-depth self-attention model.
universal-transformers-need-memory-2026.md - Study of UT memory tokens and adaptive-depth tradeoffs.
vl-jepa-2025.md - Vision-language JEPA system.
100 items under this folder.