Lists (1)
Sort Name ascending (A-Z)
Starred repositories
[Actively Maintained🔥] A list of Embodied AI papers accepted by top conferences (ICLR, NeurIPS, ICML, RSS, CoRL, ICRA, IROS, CVPR, ICCV, ECCV).
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
Understanding R1-Zero-Like Training: A Critical Perspective
A book for Learning the Foundations of LLMs
Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai
[Embodied-AI-Survey-2025] Paper List and Resource Repository for Embodied AI
Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)
For paper: Test-Time Personalization with Meta Prompt for Gaze Estimation
A collection of project, papers, and source code for Meta AI's Segment Anything Model (SAM) and related studies.
Cross-platform, customizable ML solutions for live and streaming media.
Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025)
A collection of projects designed to help developers quickly get started with building deployable applications using the Claude API
A repository of 60 useful data science prompts for ChatGPT
[CVPR 2024] HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention
real time face swap and one-click video deepfake with only a single image
Official inference repo for FLUX.1 models
[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
Official implementation of FouriScale (ECCV2024)
Lumina-T2X is a unified framework for Text to Any Modality Generation
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
Official PyTorch Implementation of EDGE (CVPR 2023)
Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation
A generative speech model for daily dialogue.