Lists (1)
Sort Name ascending (A-Z)
Stars
SWE-bench: Can Language Models Resolve Real-world Github Issues?
Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, datasets, and full end-to-end reference examples to build with Nemotron models
Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞
KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)
Autonomous GPU Kernel Generation & Optimization via Deep Agents
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Open Source smart glasses designed to be 1. All day wearable 2. Immediately useful 3. Extendable for makers, startups, and everyone else.
Real-time Facial Emotion Detection using deep learning
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
FORA introduces simple yet effective caching mechanism in Diffusion Transformer Architecture for faster inference sampling.
Quickly hashing all subexpressions of a program modulo alpha-renaming
verl: Volcano Engine Reinforcement Learning for LLMs
a pytorch implementation of https://arxiv.org/abs/1704.03477
OpenCUA: Open Foundations for Computer-Use Agents
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
A curated collection of resources, tools, and frameworks for developing GUI Agents.
Home page for Microsoft Phi-Ground tech-report
A library for efficient similarity search and clustering of dense vectors.
Agent S: an open agentic framework that uses computers like a human
Open-source resources on agents for computer use.
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
Pioneering Automated GUI Interaction with Native Agents
[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments