-
coding @xai-org
- Singapore
-
05:26
(UTC +08:00) - http://siviltaram.github.io/
- @sivil_taram
-
-
verl Public
Forked from volcengine/verlverl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedDec 4, 2025 -
Precision-RL Public
Forked from sail-sg/Precision-RLDefeating the Training-Inference Mismatch via FP16
Python MIT License UpdatedOct 31, 2025 -
dl4c.github.io-1 Public
Forked from dl4c/dl4c.github.io✨ Build a beautiful and simple website in literally minutes. Demo at https://beautifuljekyll.com
HTML MIT License UpdatedAug 26, 2025 -
-
OctoThinker Public
Forked from GAIR-NLP/OctoThinkerRevisiting Mid-training in the Era of Reinforcement Learning Scaling
-
verl-pipeline Public
Forked from agentica-project/verl-pipelineAsync pipelined version of Verl
Python Apache License 2.0 UpdatedApr 8, 2025 -
simpleRL-reason Public
Forked from hkust-nlp/simpleRL-reasonThis is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
-
-
CHASE Public
Forked from McGill-NLP/CHASESynthetic Data Generation for Evaluation
-
TinyLlama Public
Forked from jzhang38/TinyLlamaThe TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Python Apache License 2.0 UpdatedFeb 2, 2025 -
-
dl4c.github.io Public
Forked from dl4c/dl4c2023.github.ioDeep Learning for Code Website
HTML Apache License 2.0 UpdatedJan 19, 2025 -
vllm Public
Forked from vivian0429/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedDec 3, 2024 -
oat Public
Forked from sail-sg/oat🌾 OAT: Online AlignmenT for LLMs
Python Apache License 2.0 UpdatedDec 1, 2024 -
axolotl Public
Forked from axolotl-ai-cloud/axolotlGo ahead and axolotl questions
Python Apache License 2.0 UpdatedJul 23, 2024 -
dclm Public
Forked from mlfoundations/dclmDataComp for Language Models
HTML MIT License UpdatedJun 18, 2024 -
extract-expert Public
Forked from QuixiAI/extract-expertExtract a single expert from an MoE model of Mixtral architecture, using slerp
Python Apache License 2.0 UpdatedMay 22, 2024 -
sailcraft Public
Forked from sail-sg/sailcraftData Toolkit for Sailor Language Models
Python UpdatedApr 30, 2024 -
bytepiece Public
Forked from bojone/bytepiece更纯粹、更高压缩率的Tokenizer
Python Apache License 2.0 UpdatedApr 19, 2024 -
code-html-to-markdown Public
A lightweight script for processing HTML page to markdown format with support for code blocks
-
catwalk Public
Forked from allenai/catwalkThis project studies the performance and robustness of language models and task-adaptation methods.
Python Apache License 2.0 UpdatedApr 4, 2024 -
Triton-Puzzles Public
Forked from srush/Triton-PuzzlesPuzzles for learning Triton
-
surya Public
Forked from datalab-to/suryaAccurate line-level text detection and recognition (OCR) in any language
Python GNU General Public License v3.0 UpdatedFeb 2, 2024 -
mergekit Public
Forked from arcee-ai/mergekitTools for merging pretrained large language models.
Python GNU Lesser General Public License v3.0 UpdatedJan 21, 2024 -
Persona-Dialogue-Generation Public
The code of ACL 2020 paper "You Impress Me: Dialogue Generation via Mutual Persona Perception"
-
OpenAgents Public
Forked from xlang-ai/OpenAgentsOpenAgents: An Open Platform for Language Agents in the Wild
Python Apache License 2.0 UpdatedOct 20, 2023 -
peft Public
Forked from huggingface/peft🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
-
Megatron-LLM Public
Forked from epfLLM/Megatron-LLMdistributed trainer for LLMs
Python Other UpdatedSep 4, 2023 -
santacoder-finetuning-commit Public
Forked from loubnabnl/santacoder-finetuningFine-tune SantaCoder for Code/Text Generation.