- Alibaba
- Beijing, China
- https://sites.google.com/site/jhdubjtu/home
Starred repositories
Ghostty-based macOS terminal with vertical tabs and notifications for AI coding agents
An open-source, extensible AI agent that goes beyond code suggestions: install, execute, edit, and test with any LLM
A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architectures
Post-training with Tinker
"Context engineering is the delicate art and science of filling the context window with just the right information for the next step." — Andrej Karpathy. A frontier, first-principles handbook inspi…
Awesome Deep Learning papers for industrial Search, Recommendation and Advertisement, focusing on Embedding, Matching, Pre-Ranking, Ranking (CTR/CVR prediction), Post-Ranking, Relevance, LLM, Rei…
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
Awesome Unified Multimodal Models
Official repository for the paper "ReasonIR: Training Retrievers for Reasoning Tasks".
This repo contains the code for a 1D tokenizer and generator
An easy-to-use framework for large scale recommendation algorithms.
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
Solve Visual Understanding with Reinforced VLMs
ReLE Evaluation: a capability benchmark for Chinese AI large models (continuously updated). It currently covers 359 large models, including commercial models such as chatgpt, gpt-5.2, o4-mini, Google gemini-3-pro, Claude-4.6, ERNIE-X1.1, ERNIE-5.0, qwen3-max, qwen3.5-plus, Baichuan, iFlytek Spark, SenseTime senseChat, as well as step3.5-flash, kimi-k2.5, ernie4.5, …
Use PEFT or full-parameter training for CPT/SFT/DPO/GRPO on 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…
[CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
A PyTorch Library for Multi-Task Learning
A playbook for effectively prompting post-trained LLMs
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning
[ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
🍃 MINT-1T: A one trillion token multimodal interleaved dataset.
Training MLPs on Graphs without Supervision, WSDM 25
[ICLR 2025] The first multimodal search engine pipeline and benchmark for LMMs
TextGrad: Automatic "Differentiation" via Text — using large language models to backpropagate textual gradients. Published in Nature.
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)
OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)
Eagle: Frontier Vision-Language Models with Data-Centric Strategies
🚀 Efficient implementations for emerging model architectures