Starred repositories
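LLM knowledge sharing that everyone can understand: a must-read before LLM interviews in the spring/autumn campus recruitment seasons, so you can talk confidently with your interviewer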
zhaozhengcoder / Protobuf-VS-FlatBuffers-VS-Json
Forked from xzhTheo/Protobuf-VS-FlatBuffers-VS-Json
Serialization comparison: Protobuf vs. FlatBuffers vs. JSON
Zhuofeng-Li / Qwen-Agent
Forked from QwenLM/Qwen-Agent
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
jiayus-nvidia / FBGEMM
Forked from pytorch/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
QDelta / tch-rs
Forked from LaurentMazare/tch-rs
Rust bindings for the C++ API of PyTorch.
ahxt / Megatron-Bridge
Forked from NVIDIA-NeMo/Megatron-Bridge
HuggingFace conversion and training library for Megatron-based models
LambdaLabsML / SkyThought
Forked from NovaSky-AI/SkyThought
Sky-T1: Train your own O1 preview model within $450
Example models using DeepSpeed
Group project for PolyU COMP5517: an LLM-based job-finding assistant
genius2787 / SAC-Lagrangian
Forked from ammarhydr/SAC-Lagrangian
PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm
genius2787 / auction-gym
Forked from amazon-science/auction-gym
AuctionGym is a simulation environment that enables reproducible evaluation of bandit and reinforcement learning methods for online advertising auctions.
marcomq / rust-xgboost
Forked from postgresml/rust-xgboost
Rust bindings for XGBoost.
A GPU-accelerated graph learning library for PyTorch, facilitating the scaling of GNN training and inference.
pyemma / LeetCUDA
Forked from xlite-dev/LeetCUDA
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
A Survey of Reinforcement Learning for Large Reasoning Models
naver-ai / vllm
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
X-yang03 / mcp-course
Forked from huggingface/mcp-course
Study of MCP course
zhouh / nanoGPT
Forked from karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Qihoo360 / 360-LLaMA-Factory
Forked from hiyouga/LlamaFactory
Adds Sequence Parallelism into LLaMA-Factory
bilibili / rocksdb
Forked from facebook/rocksdb
A library that provides an embeddable, persistent key-value store for fast storage.
Efficient Triton implementation of Native Sparse Attention.
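"AI R&D Efficiency Research: Train Your Own LoRA", covering LoRA training for the Llama (Alpaca LoRA) and ChatGLM (ChatGLM Tuning) models. Training tasks: user story generation, test code generation, code assistance generation, text-to-SQL, text-to-code...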
Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
jiaqizhai / rails_staging
Forked from bailuding/rails
Retrieval with Learned Similarities
Robust recipes to align language models with human and AI preferences