Starred repositories

Ghostty-based macOS terminal with vertical tabs and notifications for AI coding agents

Swift 15,646 1,156 Updated Apr 28, 2026

maximal update parametrization (µP)

Jupyter Notebook 1,703 104 Updated Jul 17, 2024

An open-source, extensible AI agent that goes beyond code suggestions: install, execute, edit, and test with any LLM

Rust 43,436 4,430 Updated Apr 28, 2026

A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architectures

Python 134 14 Updated Jan 31, 2026

Post-training with Tinker

Python 3,171 400 Updated Apr 28, 2026

A collection of MCP servers.

85,822 9,585 Updated Apr 27, 2026

"Context engineering is the delicate art and science of filling the context window with just the right information for the next step." — Andrej Karpathy. A frontier, first-principles handbook inspi…

Python 8,784 975 Updated Feb 27, 2026

Awesome Deep Learning papers for industrial Search, Recommendation and Advertisement. They focus on Embedding, Matching, Pre-Ranking, Ranking, Post Ranking, Relevance, LLM and RL. Please cite our p…

Python 2,461 287 Updated Apr 25, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,110 275 Updated Apr 28, 2026

Awesome Unified Multimodal Models

1,218 39 Updated Mar 24, 2026

Official repository for the paper "ReasonIR: Training Retrievers for Reasoning Tasks".

Python 227 24 Updated Apr 8, 2026

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 1,144 67 Updated Mar 20, 2025

An easy-to-use framework for large scale recommendation algorithms.

Python 370 66 Updated Apr 27, 2026

The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.

Python 5,110 485 Updated Apr 28, 2026

Solve Visual Understanding with Reinforced VLMs

Python 5,948 377 Updated Mar 12, 2026

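ReLE Evaluation: capability evaluation of Chinese AI large models (continuously updated). Currently covers 374 large models, including commercial models such as chatgpt, gpt-5.4, Google gemini-3.1-pro, Claude-4.6, Wenxin ERNIE-X1.1, ERNIE-5.0, qwen3.6-max, qwen3.6-plus, Baichuan, iFlytek Spark, and SenseTime senseChat, as well as step3.5-flash, kimi-k2.6, ernie4.5, Mini…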

5,933 242 Updated Apr 26, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …

Python 13,942 1,384 Updated Apr 28, 2026

[CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant

Python 181 11 Updated Jul 7, 2025

A PyTorch Library for Multi-Task Learning

Python 2,547 232 Updated May 14, 2025

A playbook for effectively prompting post-trained LLMs

899 38 Updated Jan 21, 2025

OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning

Python 157 3 Updated Dec 24, 2024

[ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 2,134 82 Updated Dec 12, 2025

🍃 MINT-1T: A one trillion token multimodal interleaved dataset.

832 19 Updated Jul 31, 2024

Training MLPs on Graphs without Supervision, WSDM 25

Python 10 1 Updated Feb 1, 2025

[ICLR 2025] The First Multimodal Search Engine Pipeline and Benchmark for LMMs

Python 494 33 Updated Apr 5, 2026

TextGrad: Automatic "Differentiation" via Text, using large language models to backpropagate textual gradients. Published in Nature.

Python 3,518 287 Updated Jul 25, 2025

Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)

Python 49 4 Updated Jan 21, 2025

OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)

Python 36 2 Updated Jun 16, 2025

Eagle: Frontier Vision-Language Models with Data-Centric Strategies

Python 943 50 Updated Oct 25, 2025

🚀 Efficient implementations for emerging model architectures

Python 5,001 515 Updated Apr 27, 2026