Skip to content
View anhncs's full-sized avatar

Block or report anhncs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 6,272 727 Updated Dec 21, 2025

Introduction to Machine Learning Systems

JavaScript 11,056 1,241 Updated Dec 21, 2025

Multilingual Document Layout Parsing in a Single Vision-Language Model

Python 5,910 578 Updated Oct 31, 2025

Contexts Optical Compression

Python 21,514 1,925 Updated Oct 25, 2025

Agentic LLM System for Practicing System Design and other technical Interviews.

10 6 Updated Sep 11, 2025

the original ideas from Isaac

Python 6 Updated Jul 11, 2025

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,435 479 Updated Aug 7, 2024

The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.

Python 7,631 1,451 Updated Dec 19, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 6,390 803 Updated Dec 21, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,427 1,689 Updated Sep 24, 2025

Main reference implementation for NLWeb, implemented in Python.

Python 6,113 683 Updated Dec 18, 2025

Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling

Python 463 22 Updated May 17, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,985 1,651 Updated Nov 19, 2025

Implementation of all RL algorithms in a simpler way

Jupyter Notebook 1,332 232 Updated Aug 29, 2025

Official repository for "AM-RADIO: Reduce All Domains Into One"

Python 1,419 51 Updated Dec 15, 2025

SOTA search powered LLM

Python 3,745 343 Updated Apr 4, 2025

DSPy: The framework for programming—not prompting—language models

Python 30,932 2,488 Updated Dec 21, 2025

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 27,734 2,512 Updated Sep 30, 2025

Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)

TypeScript 5,026 459 Updated Dec 13, 2025

[CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generation

Python 1,637 182 Updated Sep 12, 2025

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 73,985 8,858 Updated Dec 20, 2025

LLM inference in C/C++

C++ 91,713 14,179 Updated Dec 21, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,096 2,671 Updated Nov 3, 2025

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 18,995 1,300 Updated Oct 21, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,927 918 Updated Dec 15, 2025

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,395 285 Updated Jul 17, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,503 1,530 Updated Apr 24, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,634 838 Updated Dec 18, 2025

Simple RL training for reasoning

Python 3,811 281 Updated Aug 3, 2025
Next