-
Tsinghua University
-
18:20
(UTC +08:00) - https://blog.algorithmpark.xyz/
Highlights
- Pro
Stars
[CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs
Synchronize Codex session provider metadata across rollout files and SQLite state.
[CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
A CLI tool to switch and manage Codex accounts
[CVPR 2026] Official code of the paper "Meta-CoT: Enhancing Granularity and Generalization in Image Editing"
Export and Share your ChatGPT conversation history
A unified framework for easy reinforcement learning in Flow-Matching models
ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…
🔬 Harness Vibe Research with Self-evolving AI Scientists
A curated list of papers on reinforcement learning for video generation
Overleaf git bridge, fork of https://gitlab.com/axkibe/olgitbridge with support for the Overleaf v4 frontend
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Qwen3.6 is the large language model series developed by Qwen team, Alibaba Group.
OpenClaw中国插件:支持飞书,钉钉,QQ,企业微信,微信
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
A framework for training world models with virtual environments, complete with annotated environment dataset (RetroAct), exploration agent (AutoExplore Agent), and GenieRedux-G - an implementation …
[CVPR 2026] "E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training" official implementation.
Minimal implementation of scalable rectified flow transformers, based on SD3's approach
[ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
Wan: Open and Advanced Large-Scale Video Generative Models
Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"
Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)