Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,779 599 Updated Oct 24, 2025

amaha7984 / ExpDWT-VAE

Discrete Wavelet Transform as a Facilitator for Expressive Latent Space Representation in Variational Autoencoders in Satellite Imagery

Python 1 Updated Apr 6, 2025

merveenoyan / smol-vision

Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜

Jupyter Notebook 1,775 136 Updated Oct 27, 2025

Tencent-Hunyuan / HunyuanWorld-Mirror

Fast and Universal 3D reconstruction model for versatile tasks

Python 631 42 Updated Nov 3, 2025

apple / pico-banana-400k

Python 1,510 65 Updated Oct 28, 2025

mit-han-lab / streaming-vlm

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Python 650 40 Updated Oct 15, 2025

sayakpaul / nanoDiT

Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.

Python 132 16 Updated May 29, 2025

IDEA-Research / Rex-Omni

Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)

Jupyter Notebook 682 41 Updated Nov 4, 2025

rbalestr-lab / llm-jepa

Python 121 9 Updated Sep 28, 2025

marcoslucianops / DeepStream-Yolo

NVIDIA DeepStream SDK 8.0 / 7.1 / 7.0 / 6.4 / 6.3 / 6.2 / 6.1.1 / 6.1 / 6.0.1 / 6.0 / 5.1 implementation for YOLO models

C++ 1,830 418 Updated Oct 2, 2025

KellerJordan / modded-nanogpt

NanoGPT (124M) in 3 minutes

Python 3,767 489 Updated Oct 29, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 35,743 4,106 Updated Nov 5, 2025

thu-pacman / chitu

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 1,325 88 Updated Nov 5, 2025

obra / superpowers

Claude Code superpowers: core skills library

JavaScript 6,044 440 Updated Oct 31, 2025

NVlabs / QeRL

QeRL enables RL for 32B LLMs on a single H100 GPU.

Python 410 34 Updated Oct 16, 2025

OpenGVLab / OmniQuant

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Ghulam Jilani Raza gj-raza

Starred repositories

computer-use

high-performance

action-recognition

skeleton-based-action-recognition

simd-optimizations

simd-parallelism

libjpeg

embedded-systems

Machine learning

video-super-resolution