Lists (1)
Sort Name ascending (A-Z)
Starred repositories
ConsDreamer: Advancing Multi-View Consistency for Zero-Shot Text-to-3D Generation
[AAAI 2026] The code repository for "ReaSon: Reinforced Causal Search with Information Bottleneck for Video Understanding" in PyTorch.
[AAAI 2026] The code repository for "Debiasing Diffusion Priors via 3D Attention for Consistent Gaussian Splatting" in PyTorch.
[Actively Maintained🔥] A list of Embodied AI papers accepted by top conferences (ICLR, NeurIPS, ICML, RSS, CoRL, ICRA, IROS, CVPR, ICCV, ECCV).
100+ AI Agent & RAG apps you can actually run — clone, customize, ship.
Understanding R1-Zero-Like Training: A Critical Perspective
A book for Learning the Foundations of LLMs
Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai
[Embodied-AI-Survey-2025] Paper List and Resource Repository for Embodied AI
Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)
For paper: Test-Time Personalization with Meta Prompt for Gaze Estimation
A collection of project, papers, and source code for Meta AI's Segment Anything Model (SAM) and related studies.
Cross-platform, customizable ML solutions for live and streaming media.
Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025)
A collection of projects designed to help developers quickly get started with building deployable applications using the Claude API
A repository of 60 useful data science prompts for ChatGPT
[CVPR 2024] HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention
real time face swap and one-click video deepfake with only a single image
Official inference repo for FLUX.1 models
[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
Official implementation of FouriScale (ECCV2024)
Lumina-T2X is a unified framework for Text to Any Modality Generation
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”