-
Awesome-LLM-Strawberry Public
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
-
flashinfer Public
Forked from flashinfer-ai/flashinferFlashInfer: Kernel Library for LLM Serving
Cuda Apache License 2.0 UpdatedAug 28, 2025 -
ring-flash-attention Public
Forked from zhuzilin/ring-flash-attentionRing attention implementation with flash attention
-
verl Public
Forked from volcengine/verlverl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedJul 23, 2025 -
vllm-project.github.io Public
Forked from vllm-project/vllm-project.github.io -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedMar 23, 2025 -
awesome-RLHF Public
Forked from opendilab/awesome-RLHFA curated list of reinforcement learning with human feedback resources (continually updated)
-
Awesome-LLM-Long-Context-Modeling Public
Forked from Xnhyacinth/Awesome-LLM-Long-Context-Modeling📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
-
-
-
-
Awesome-LLM-Inference Public
Forked from xlite-dev/Awesome-LLM-Inference📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
-
-
pymarl2 Public
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
-
noisy-mappo Public
Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)
-
-
alpha-zero-gomoku Public
A Multi-threaded Implementation of AlphaZero (C++)
-
cuda-neural-network Public
Convolutional Neural Network with CUDA (MNIST 99.23%)
-
NTU-Thesis-LaTeX-Template Public
Forked from Hsins/NTU-Thesis-LaTeX-Template🎓 Unofficial LaTeX templates for your graduate thesis (both master's theses and doctoral dissertations) at National Taiwan University. 國立臺灣大學碩博士學位論文 LaTeX 模板
TeX MIT License UpdatedJan 11, 2021 -
-
mame-street-fighter-3-ai Public
Reinforcement Learning for Street Fighter III: 3rd Strike
-
termux-jupyter Public
Termux init script
-
Trading Robot based on LSTM-PPO
-
-
Deep Reinforcement Learning Notes
-
Reinforcement Learning for WeChat Jump
-
-
mini-interpreter Public
A Simple Scripting Language
-
-
web-server Public
A Web Server designed with Reactor I/O Model