Lists (1)
Sort Name ascending (A-Z)
Stars
Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint
Generate 3D-printable cubes/cuboids with ArUco or AprilTag fiducial markers on all 6 faces, then detect their 6-DOF pose from a camera.
A external skin changer for a certain online game. (Vibe Coding Warning)
AI-powered job search system built on Claude Code. 14 skill modes, Go dashboard, PDF generation, batch processing.
A terminal workspace with batteries included
Reverse engineer any repo into it's original prompt
[NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling
SVG-based 3D emoji generator, export in format like GLB, OBJ, STL, or USDZ for games, AR, and 3D projects.
Fast, accurate & comprehensive text measurement & layout
UnrealCV: Connecting Computer Vision to Unreal Engine
A web-based annotation tool for synchronized multi-video timeline labeling and AI-assisted question generation, built for the GameplayQA benchmark.
The first multiplayer video world model in Minecraft
SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds
Official repository for the paper "Flow Equivariant Recurrent Neural Networks"
Reliable, minimal and scalable library for evaluating and conducting world model research
Official Repo for paper: Scaling Behavior Cloning Improves Causal Reasoning: An Open Model for Real-Time Video Game Playing
Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 2025)
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
An open-source AI agent that brings the power of Gemini directly into your terminal.
Accompanying code for "Discovering State-of-the-art Reinforcement Algorithms" Nature publication
Code for "Closing the Train-Test Gap in World Models for Gradient-Based Planning"
Multilingual Voice Understanding Model
[CSUR 2025] Continual Learning of Large Language Models: A Comprehensive Survey
📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.
Latent Collaboration in Multi-Agent Systems