Skip to content
View hitercs's full-sized avatar
  • Harbin Institute of Technology
  • Haidian, Beijing

Block or report hitercs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Build RL environments for LLM training

Python 823 115 Updated Apr 14, 2026

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 17,075 3,401 Updated Apr 14, 2026

A Comprehensive Survey on Long Context Language Modeling

238 18 Updated Nov 24, 2025

Curated list of technical blogs on machine learning · AI/ML/DL/CV/NLP/MLOps

308 37 Updated Sep 16, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 13,861 1,364 Updated Apr 30, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,679 3,653 Updated Apr 14, 2026

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 15,311 1,430 Updated Mar 26, 2026

Mamba SSM architecture

Python 17,969 1,688 Updated Apr 13, 2026

Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]

Python 182 8 Updated Jul 8, 2025

Simple RL training for reasoning

Python 3,845 289 Updated Dec 23, 2025

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 23,996 2,760 Updated Mar 12, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,342 917 Updated Apr 14, 2026
Python 7 Updated Nov 27, 2024

A bibliography and survey of the papers surrounding o1

TeX 1,213 51 Updated Nov 16, 2024

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,965 83 Updated Apr 7, 2026

Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"

Python 250 13 Updated Sep 12, 2025

unified embedding model

Python 876 72 Updated Sep 1, 2023

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5,669 506 Updated Jul 18, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,949 1,858 Updated Jul 15, 2025

A guidance language for controlling large language models.

Jupyter Notebook 21,387 1,154 Updated Apr 10, 2026

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,115 126 Updated Jun 1, 2023

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Python 1,835 82 Updated Jul 27, 2025

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins

Python 2,779 248 Updated Dec 5, 2023

LangChain 的中文入门教程

8,927 707 Updated Apr 19, 2025

Desktop application of new Bing's AI-powered chat (Windows, macOS and Linux)

JavaScript 9,008 676 Updated Feb 8, 2024

An open-source tool-augmented conversational language model from Fudan University

Python 12,088 1,134 Updated Jul 13, 2024

Prompt programming with FMs.

Python 443 45 Updated Jul 22, 2024
Next