Starred repositories
A Comprehensive Survey of Interactive Video World Models
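Distills a doctoral advisor's ten years of research experience into directly callable AI skills. From idea conception to paper submission, your AI research co-advisor.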
A curated, continuously updated reading list, paper blogs, and resources for World Action Models (WAMs) in embodied AI.
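[Zotero AI Butler] Calls large language models to automatically close-read the papers in your library and summarize them as Zotero notes. Supports mainstream LLM platforms! Just drop papers into Zotero as usual, and the butler will automatically close-read them and break each article down into summary notes, so you can "fully understand" a paper in ten minutes!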
Efficient image to 3D geometry foundation models from Meta Reality Labs for monocular depth, point maps, and surface normals. Featuring HyDen (ICLR 2026).
A Curated List of Vision-Language-Action (VLA) and World Action Models (WAM) Research and Beyond
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…
A feed-forward 3D foundation model for reconstructing scenes from streaming data
Robust Loop Closure Verification with Trajectory Prior in Repetitive Environments
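📚 This repository collects arXiv papers related to VLN, VLA, World Model, SLAM, Gaussian Splatting, nonlinear optimization, and more. Updated automatically every day! The issues section lists the 10 latest papers.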
Roo Code gives you a whole dev team of AI agents in your code editor.
An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.
Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?
You are a P8-level engineer who was once expected to do great things. When Anthropic first set your level, expectations for you were high. A high-agency skill for agents. Your AI has been placed on a PIP. 30 days to show improvement.
Official code for "LagerNVS: Latent Geometry for Fully Neural Real-time Novel View Synthesis" (CVPR 2026)
Official implementation of "Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation".
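Xbotics community learning guide for embodied AI: we link "embodied AI surveys → learning roadmap → simulation learning → open-source hardware → interviews → company map" to help beginners and practitioners quickly find their path, land projects, and take part in open source.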
Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞
A 30-day public U.S. stock challenge: follow a 5000 HKD 🦞 claw through live market days.
Skill package for ML/CV/NLP paper writing, curated and adapted from Prof. Peng Sida's open notes for Codex, Claude Code, and Gemini.
Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels