Stars
antimatter15 / alpaca.cpp
Forked from ggml-org/llama.cppLocally run an Instruction-Tuned Chat-Style LLM
deepspeedai / Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LMOngoing research training transformer language models at scale, including: BERT & GPT-2
sanjeevanahilan / nanoChatGPT
Forked from karpathy/nanoGPTA crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick
RUCAIBox / CIKM2020-S3Rec
Forked from aHuiWang/CIKM2020-S3RecCode for CIKM2020 "S3-Rec: Self-Supervised Learning for Sequential Recommendation with Mutual Information Maximization"
mitmath / 18337
Forked from SciML/SciMLBook18.337 - Parallel Computing and Scientific Machine Learning
bespokelabsai / verifiers
Forked from PrimeIntellect-ai/verifiersVerifiers for LLM Reinforcement Learning
hamishivi / EasyLM
Forked from young-geng/EasyLMLarge language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
UnrealTracking / mate
Forked from XuehaiPan/mateMATE: the Multi-Agent Tracking Environment.
anair13 / mj_envs
Forked from vikashplus/robohiveA collection of MuJoCo based environments.