hijkzzz

hijkzzz

RLer + MLSyser / 2 + NLPer / 2

640 followers · 52 following

Achievements

x3 x4

Achievements

x3 x4

Awesome-LLM-Strawberry Public

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

reinforcement-learning mathematics coding mcts strawberry llm chain-of-thought

6,838 372 Apache License 2.0 Updated Oct 17, 2025
flashinfer Public
Forked from flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda Apache License 2.0 Updated Aug 28, 2025
ring-flash-attention Public
Forked from zhuzilin/ring-flash-attention

Ring attention implementation with flash attention

Python 1 MIT License Updated Jul 27, 2025
verl Public
Forked from volcengine/verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python Apache License 2.0 Updated Jul 23, 2025
vllm-project.github.io Public
Forked from vllm-project/vllm-project.github.io

HTML 1 Updated Apr 19, 2025
vllm Public
Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python Apache License 2.0 Updated Mar 23, 2025
awesome-RLHF Public
Forked from opendilab/awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

4 Apache License 2.0 Updated Jan 10, 2025
Awesome-LLM-Long-Context-Modeling Public
Forked from Xnhyacinth/Awesome-LLM-Long-Context-Modeling

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

2 MIT License Updated Dec 30, 2024
hijkzzz Public

Updated Dec 2, 2024
hijkzzz.github.io Public

Homepage

HTML 3 Updated Nov 27, 2024
2025 Public
Forked from iclr-blogposts/2025

HTML 1 MIT License Updated Nov 23, 2024
Awesome-LLM-Inference Public
Forked from xlite-dev/Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

3 GNU General Public License v3.0 Updated Oct 3, 2024
llamafia.github.io Public
Forked from LLaMafia/llamafia.github.io

HTML Updated Jul 17, 2024
pymarl2 Public

Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

reinforcement-learning starcraft sota smac marl

Python 691 132 Apache License 2.0 Updated May 18, 2024
noisy-mappo Public

Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)

sota smac multi-agent-reinforcement-learning mappo

Python 70 6 MIT License Updated Jun 9, 2023
staging Public
Forked from iclr-blogposts/staging

HTML MIT License Updated Apr 27, 2023
alpha-zero-gomoku Public

A Multi-threaded Implementation of AlphaZero (C++)

cpp multithreading gomoku-game alphazero libtorch

Python 383 49 Updated Jan 7, 2023
cuda-neural-network Public

Convolutional Neural Network with CUDA (MNIST 99.23%)

neural-network cpp cuda cnn mnist

C++ 195 40 Updated Apr 4, 2022
NTU-Thesis-LaTeX-Template Public
Forked from Hsins/NTU-Thesis-LaTeX-Template

🎓 Unofficial LaTeX templates for your graduate thesis (both master's theses and doctoral dissertations) at National Taiwan University. 國立臺灣大學碩博士學位論文 LaTeX 模板

TeX MIT License Updated Jan 11, 2021
deep-learning-notes Public

Deep Learning Notes

deep-learning notes paper

51 1 Updated Jun 29, 2020
mame-street-fighter-3-ai Public

Reinforcement Learning for Street Fighter III: 3rd Strike

reinforcement-learning ai macro street-fighter

Python 1 Updated Jun 20, 2020
termux-jupyter Public

Termux init script

jupyter numpy sklearn pandas termux

Shell 1 Updated Feb 16, 2020
reinforcement-learning-trading-robot Public

Trading Robot based on LSTM-PPO

Python 28 5 Updated Dec 27, 2019
reinforcement-learning.pytorch Public

Reinforcement Learning Library

Python 1 Updated Dec 24, 2019
deep-reinforcement-learning-notes Public

Deep Reinforcement Learning Notes

notes deep-reinforcement-learning papers game-ai

119 6 Updated May 30, 2019
reinforcement-learning-wechat-jump Public

Reinforcement Learning for WeChat Jump

reinforcement-learning ai ddpg wechat-jump

Python 92 2 Updated May 7, 2019
prisma Public

Prisma

deep-learning style-transfer

Python 71 3 Updated Apr 11, 2019
mini-interpreter Public

A Simple Scripting Language

golang interpreter compiler stone

Go 80 5 Updated Mar 21, 2019
mini-os-kernel Public

A mini Unix-Like OS kernel

unix-like mini-kernel

C 99 6 Updated Mar 20, 2019
web-server Public

A Web Server designed with Reactor I/O Model

multi-threading cpp http-server reactor

C++ 64 1 Updated Mar 20, 2019

hijkzzz

Achievements

Achievements

Awesome-LLM-Strawberry Public

Uh oh!

flashinfer Public

Uh oh!

ring-flash-attention Public

Uh oh!

verl Public

Uh oh!

vllm-project.github.io Public

Uh oh!

vllm Public

Uh oh!

awesome-RLHF Public

Uh oh!

Awesome-LLM-Long-Context-Modeling Public

Uh oh!

hijkzzz Public

Uh oh!

hijkzzz.github.io Public

Uh oh!

2025 Public

Uh oh!

Awesome-LLM-Inference Public

Uh oh!

llamafia.github.io Public

Uh oh!

pymarl2 Public

Uh oh!

noisy-mappo Public

Uh oh!

staging Public

Uh oh!

alpha-zero-gomoku Public

Uh oh!

cuda-neural-network Public

Uh oh!

NTU-Thesis-LaTeX-Template Public

Uh oh!

deep-learning-notes Public

Uh oh!

mame-street-fighter-3-ai Public

Uh oh!

termux-jupyter Public

Uh oh!

reinforcement-learning-trading-robot Public

Uh oh!

reinforcement-learning.pytorch Public

Uh oh!

deep-reinforcement-learning-notes Public

Uh oh!

reinforcement-learning-wechat-jump Public

Uh oh!

prisma Public

Uh oh!

mini-interpreter Public

Uh oh!

mini-os-kernel Public

Uh oh!

web-server Public

Uh oh!