Starred repositories

Ghostty-based macOS terminal with vertical tabs and notifications for AI coding agents

Swift 13,233 943 Updated Apr 9, 2026

maximal update parametrization (µP)

Jupyter Notebook 1,694 105 Updated Jul 17, 2024

an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM

Rust 39,936 3,929 Updated Apr 8, 2026

A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architectures

Python 134 14 Updated Jan 31, 2026

Post-training with Tinker

Python 3,034 368 Updated Apr 8, 2026

A collection of MCP servers.

84,408 8,880 Updated Apr 5, 2026

"Context engineering is the delicate art and science of filling the context window with just the right information for the next step." — Andrej Karpathy. A frontier, first-principles handbook inspi…

Python 8,692 970 Updated Feb 27, 2026

Awesome Deep Learning papers for industrial Search, Recommendation and Advertisement. They focus on Embedding, Matching, Pre-Ranking, Ranking (CTR/CVR prediction), Post Ranking, Relevance, LLM, Rei…

Python 2,426 286 Updated Apr 5, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,059 264 Updated Apr 8, 2026

Awesome Unified Multimodal Models

1,178 38 Updated Mar 24, 2026

Official repository for the paper "ReasonIR: Training Retrievers for Reasoning Tasks".

Python 225 24 Updated Apr 8, 2026

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 1,139 67 Updated Mar 20, 2025

An easy-to-use framework for large scale recommendation algorithms.

Python 359 63 Updated Apr 8, 2026

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 5,005 454 Updated Apr 8, 2026

Solve Visual Understanding with Reinforced VLMs

Python 5,929 378 Updated Mar 12, 2026

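ReLE Evaluation: a capability benchmark for Chinese-language AI large models (continuously updated). It currently covers 359 large models, including commercial models such as chatgpt, gpt-5.2, o4-mini, Google gemini-3-pro, Claude-4.6, Wenxin ERNIE-X1.1, ERNIE-5.0, qwen3-max, qwen3.5-plus, Baichuan, iFlytek Spark, and SenseTime senseChat, as well as step3.5-flash, kimi-k2.5, ernie4.5, …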

5,831 234 Updated Apr 8, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,599 1,334 Updated Apr 8, 2026

[CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant

Python 179 11 Updated Jul 7, 2025

A PyTorch Library for Multi-Task Learning

Python 2,534 232 Updated May 14, 2025

A playbook for effectively prompting post-trained LLMs

900 38 Updated Jan 21, 2025

OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning

Python 157 3 Updated Dec 24, 2024

[ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 2,131 82 Updated Dec 12, 2025

🍃 MINT-1T: A one trillion token multimodal interleaved dataset.

830 18 Updated Jul 31, 2024

Training MLPs on Graphs without Supervision, WSDM 25

Python 10 1 Updated Feb 1, 2025

[ICLR 2025] The First Multimodal Search Engine Pipeline and Benchmark for LMMs

Python 493 33 Updated Apr 5, 2026

TextGrad: Automatic "Differentiation" via Text, using large language models to backpropagate textual gradients. Published in Nature.

Python 3,473 282 Updated Jul 25, 2025

Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)

Python 49 4 Updated Jan 21, 2025

OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)

Python 36 2 Updated Jun 16, 2025

Eagle: Frontier Vision-Language Models with Data-Centric Strategies

Python 938 50 Updated Oct 25, 2025

🚀 Efficient implementations for emerging model architectures

Python 4,832 484 Updated Apr 8, 2026