wangshanyw's starred repositories

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,080 268 Updated Apr 14, 2026

LeetCode solutions

Java 499 286 Updated May 10, 2024

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,780 5,339 Updated Apr 14, 2026

📕 Xiaohongshu creator MCP toolkit - a content creation and publishing tool that supports integration with AI clients

Python 1,236 167 Updated Jul 10, 2025

Democratizing Reinforcement Learning for LLMs

Python 5,423 541 Updated Apr 14, 2026

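🥢 Cook like Laoxiangji 🐔. The main part was completed in 2024; this is not an official Laoxiangji repository. The text comes from the "Laoxiangji Dish Traceability Report" and has been summarized, edited, and organized. CookLikeHOC.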

JavaScript 23,413 2,347 Updated Oct 17, 2025

Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient

Python 66 8 Updated Aug 3, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,087 8,569 Updated Apr 12, 2026

Writing reviews of academic papers

517 105 Updated Aug 12, 2015

[PyTorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"

Python 770 109 Updated Apr 1, 2026
Python 1,504 111 Updated May 12, 2023

Truly flash T5 realization!

Python 73 5 Updated Jan 26, 2026

Curated list of datasets and tools for post-training.

4,427 359 Updated Mar 9, 2026

Replication code for semantic ID generation in “Transformer Memory as a Differentiable Search Index”.

Python 5 1 Updated Feb 9, 2023

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,909 303 Updated Jan 16, 2024

Implementation of Soft MoE, proposed by Brain's Vision team, in PyTorch

Python 345 10 Updated Apr 2, 2025

Integrate the DeepSeek API into popular software

36,259 4,015 Updated Feb 23, 2026

Language Models as Semantic Indexers (ICML 2024)

Python 41 2 Updated May 2, 2024

Residual Quantization with Implicit Neural Codebooks

Python 115 7 Updated Oct 7, 2025

This repository includes all the interview preparation questions for the Amazon SDE role

C++ 1,444 316 Updated Feb 14, 2024

🎮 A curated list of awesome game datasets and tools for artificial intelligence in games

1,036 76 Updated Mar 11, 2026

Sample codes for my CUDA programming book

Cuda 2,036 383 Updated Dec 14, 2025

CUDA Library Samples

C++ 2,368 455 Updated Apr 9, 2026

Samples for CUDA developers that demonstrate features in the CUDA Toolkit

C 9,074 2,314 Updated Mar 30, 2026

Let your Claude be able to think

TypeScript 16,995 1,977 Updated Apr 7, 2026
Python 50 5 Updated Jun 7, 2025

This project aims to collect the latest "call for reviewers" links from various top CS/ML/AI conferences/journals

1,125 46 Updated Feb 6, 2026

A collection of AIGC-interview/CV-interview/LLMs-interview questions and answers, also including new ideas, new problems, new resources, and new projects from work and research

2,821 248 Updated Mar 5, 2026

A curated list of previously asked interview questions at big companies and startups 🤲 🏆

1,753 357 Updated Aug 18, 2022