Skip to content
View winglian's full-sized avatar

Sponsors

@narrative-io

Highlights

  • Pro

Block or report winglian

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Ludic – an LLM-RL library for the era of experience

Python 37 4 Updated Dec 20, 2025
81 9 Updated Dec 16, 2025

Official Implementation of Dynamic erf (Derf).

Python 78 9 Updated Dec 12, 2025

Serving multiple LoRA finetuned LLM as one

Python 1,126 56 Updated May 8, 2024

An Efficient "Factory" to Build Multiple LoRA Adapters

Python 360 65 Updated Feb 13, 2025

Using GRPO and a modified compositional reward function to train an opensource model on the 1890 Dakota Dictionary

HTML 8 Updated Dec 19, 2025
Python 611 56 Updated Dec 19, 2025

Streamline on-policy/off-policy distillation workflows in a few lines of code

Python 81 4 Updated Dec 19, 2025
Python 92 9 Updated Nov 6, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,502 180 Updated Dec 19, 2025

Throughput-oriented multi-turn inference engine for KernelBench [ICML '25]

Python 18 8 Updated May 27, 2025

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 745 70 Updated Nov 28, 2025

Automating analysis from trace files

Python 50 5 Updated Dec 19, 2025

Advanced quantization toolkit for LLMs and VLMs. Support for WOQ, MXFP4, NVFP4, GGUF, Adaptive Schemes and seamless integration with Transformers, vLLM, SGLang, and llm-compressor

Python 772 64 Updated Dec 19, 2025

Triton-based Symmetric Memory operators and examples

Python 67 11 Updated Oct 17, 2025

Provide with pre-build flash-attention package wheels on Linux and Windows platforms using GitHub Actions

Python 603 45 Updated Dec 19, 2025

Deep learning at the speed of light.

Rust 2,652 178 Updated Dec 19, 2025

SimKO: Simple Pass@K Policy Optimization

Python 23 3 Updated Oct 24, 2025
Python 23 1 Updated Oct 8, 2025

The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models

Python 713 48 Updated Oct 28, 2025

QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning

C++ 148 13 Updated Nov 11, 2025
Python 88 12 Updated Nov 16, 2025

Supporting code for the blog post on modular manifolds.

Python 107 13 Updated Sep 26, 2025
Python 122 11 Updated Nov 24, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,657 1,355 Updated Dec 17, 2025

Simplifying reinforcement learning for complex game environments

C 4,639 344 Updated Dec 19, 2025

Efficient non-uniform quantization with GPTQ for GGUF

Python 57 4 Updated Sep 17, 2025

Fast low-bit matmul kernels in Triton

Python 410 30 Updated Dec 18, 2025
Next