jiefisher

jiefisher

16 followers · 20 following

Achievements

Stars

ruilisi / lsbot

Lean & Secure Bot

Go 409 57 Updated Apr 27, 2026

NJUNLP / MoE-LPR

Python 22 5 Updated Dec 11, 2024

NVIDIA-NeMo / RL

Scalable toolkit for efficient model reinforcement

Python 1,640 389 Updated May 20, 2026

arcee-ai / NeMo-RL

Scalable toolkit for efficient model reinforcement

Python 12 3 Updated Jan 27, 2026

facebookresearch / deepconf

DeepConf: Deep Think with Confidence

Python 397 59 Updated May 6, 2026

TsinghuaC3I / Unify-Post-Training

Towards a Unified View of Large Language Model Post-Training

Python 211 10 Updated Sep 8, 2025

mirage-project / mirage

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

Cuda 2,270 210 Updated May 20, 2026

kevin85421 / RayCG-ChatLearn

Python 5 Updated Oct 20, 2024

lsdefine / simple_GRPO

A very simple GRPO implement for reproducing r1-like LLM thinking.

Python 1,675 132 Updated Nov 21, 2025

verl-project / verl

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,435 3,909 Updated May 20, 2026

BBuf / how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

Cuda 2,998 276 Updated May 20, 2026

thu-pacman / chitu

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 3,137 263 Updated May 18, 2026

zhangtianhong-1998 / LLM_infra_from_scratch

这是一个基于C++实现的从零开始的大模型推理框架

C++ 10 1 Updated Nov 18, 2024

srbhr / Ollama-function-calling

Ollama Function Calling with Search API

Python 11 1 Updated Apr 28, 2025

xlite-dev / LeetCUDA

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 11,045 1,113 Updated May 17, 2026

harleyszhang / llm_note

LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.

Python 881 87 Updated May 10, 2026

gabrielolympie / moe-pruner

A repository aimed at pruning DeepSeek V3, R1 and R1-zero to a usable size

Python 87 9 Updated Sep 5, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,449 165 Updated Mar 20, 2025

QwenLM / AutoIF

Python 331 32 Updated Jul 25, 2024

YuxiXie / MCTS-DPO

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 330 37 Updated Jan 29, 2026

MARIO-Math-Reasoning / Super_MARIO

Python 341 22 Updated Jun 5, 2025

ezelikman / quiet-star

Code for Quiet-STaR

Python 740 92 Updated Aug 21, 2024

FranxYao / Long-Context-Data-Engineering

Implementation of paper Data Engineering for Scaling Language Models to 128K Context

Python 496 31 Updated Mar 19, 2024

alibaba / rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

Cuda 1,119 191 Updated May 20, 2026

Tekh-ops / picocad

C 2 1 Updated Oct 30, 2021

wojciech-bilicki / TetrisTutorial

GDScript 36 20 Updated Aug 30, 2023

arcee-ai / mergekit

Tools for merging pretrained large language models.

Python 7,089 715 Updated May 6, 2026

alibaba / ChatLearn

A flexible and efficient training framework for large-scale alignment tasks

Python 452 39 Updated Oct 23, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,527 944 Updated May 15, 2026

Sunt-ing / stick

😇 A PyTorch-like deep learning framework. Just for fun.

Python 157 7 Updated Oct 9, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

jiefisher

Achievements

Achievements

Block or report jiefisher

Stars

ruilisi / lsbot

NJUNLP / MoE-LPR

NVIDIA-NeMo / RL

arcee-ai / NeMo-RL

facebookresearch / deepconf

TsinghuaC3I / Unify-Post-Training

mirage-project / mirage

kevin85421 / RayCG-ChatLearn

lsdefine / simple_GRPO

verl-project / verl

BBuf / how-to-optim-algorithm-in-cuda

thu-pacman / chitu

zhangtianhong-1998 / LLM_infra_from_scratch

srbhr / Ollama-function-calling

xlite-dev / LeetCUDA

harleyszhang / llm_note

gabrielolympie / moe-pruner

Unakar / Logic-RL

QwenLM / AutoIF

YuxiXie / MCTS-DPO

MARIO-Math-Reasoning / Super_MARIO

ezelikman / quiet-star

FranxYao / Long-Context-Data-Engineering

alibaba / rtp-llm

Tekh-ops / picocad

wojciech-bilicki / TetrisTutorial

arcee-ai / mergekit

alibaba / ChatLearn

OpenRLHF / OpenRLHF

Sunt-ing / stick