zhangzhiqiangccm
🤡
I may be slow to respond.
  • CUC
  • Beijing

Highlights

  • Pro

353 results for source starred repositories

E-book and LaTeX source code for 《人妻约会指南》

TeX 41 4 Updated Nov 11, 2025

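Train an LLM from scratch with DeepSpeed, through the pretraining and SFT stages, to verify the LLM's ability to learn knowledge, understand language, and answer questions.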

Python 161 20 Updated Oct 13, 2025

llm & rl

Jupyter Notebook 271 27 Updated Oct 24, 2025

Reproductions of algorithms related to large models, plus some study notes.

Python 2,950 391 Updated Feb 7, 2026

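"Build a Large Language Model (From Scratch)" is an e-book that explores the principles and implementation of large language models in depth. It suits learners who want a thorough understanding of the architecture, training process, and application development of GPT-style models. To make this valuable textbook accessible to more Chinese readers, I decided to translate it into Chinese and share it as open source on GitHub.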

HTML 3,190 556 Updated Sep 7, 2025

Train your own BitBrain (a mini LLM) 🧠 with as little as an RTX 3090 (work in progress).

Python 38 2 Updated Jun 29, 2025

Jupyter Notebook 105 3 Updated Dec 8, 2025

Awesome papers involving LLMs in Social Science.

585 46 Updated Nov 19, 2025

📚 A from-scratch tutorial on the principles and practice of large language models.

Jupyter Notebook 25,637 2,377 Updated Jan 29, 2026

Drawing Bayesian networks, graphical models, tensors, technical frameworks, and illustrations in LaTeX.

TeX 1,964 189 Updated May 26, 2025

Chinese Political Hate Speech Detection Trained with Flair NLP

6 Updated Jun 9, 2024

MS-Agent: Lightweight Framework for Empowering Agents with Autonomous Exploration in Complex Task Scenarios

Python 3,949 458 Updated Feb 7, 2026

Retriever-0.1B

Python 96 17 Updated Jun 6, 2024

《大语言模型》 by Zhao Xin, Li Junyi, Zhou Kun, Tang Tianyi, and Wen Jirong

Python 4,249 316 Updated Sep 2, 2025

🚀🚀 Large models: train a 26M-parameter GPT completely from scratch in just 2h! 🌏

Python 38,926 4,691 Updated Feb 6, 2026

PyTorch AMP / Activation Checkpointing / Gradient Accumulation

Jupyter Notebook 7 1 Updated May 4, 2025

Network communication information intervention based on reinforcement learning

Python 3 Updated Apr 20, 2025

A multi-agent framework for online information dissemination

Python 3 Updated Apr 22, 2025

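A record of competitions I have entered, including but not limited to NLP competitions hosted on platforms such as Kaggle, Alibaba Tianchi, and iFLYTEK.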

Jupyter Notebook 5 Updated Apr 25, 2025

Tong Fafa's large-model learning journey

HTML 134 10 Updated Aug 9, 2025

[ICLR 2025 Oral] This is the official repo for the paper "LLM-SR" on Scientific Equation Discovery and Symbolic Regression with Large Language Models

Python 209 43 Updated Jul 31, 2025

A Comprehensive Library for Memory of LLM-based Agents.

Python 100 8 Updated May 13, 2025

From a nobody to a large-model (LLM) hero~ Stay tuned for more!!!

Jupyter Notebook 2,001 136 Updated Nov 22, 2025

Notes on learning reinforcement learning through animations

65 2 Updated Feb 17, 2025

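Interview questions (with answers) for large-model algorithm positions: explanations of common questions and concepts. "Large-model interview questions", "algorithm position interviews", "common interview questions", "large-model algorithm interviews", "large-model application basics"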

Jupyter Notebook 1,602 116 Updated Feb 4, 2026

[ICLR 2025] A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents

Python 90 3 Updated Feb 2, 2026

Cognitive agents and social evolution simulator

Python 163 7 Updated May 7, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python 51,731 4,272 Updated Feb 7, 2026

Reproduce R1 Zero on Logic Puzzle

Python 2,432 164 Updated Mar 20, 2025