winglian

Wing Lian winglian

364 followers · 17 following

Annapolis, MD
https://twitter.com/winglian

Achievements

x4 x4 x4

Achievements

x4 x4 x4

Highlights

Lists (2)

Sort

CUDA

1 repository

Jet brains ai

3 repositories

Stars

jkz-338 / Robust-ToM-RL

This repo contains the code for our paper "From Shortcuts to Reasoning: Robust Post-Training of Theory of Mind with Reinforcement Learning"

2 Updated May 27, 2026

Snowflake-AI-Research / fastkernels

Python 3 Updated Jul 29, 2026

burtenshaw / rlm-general-harness

Python 9 Updated Jul 23, 2026

greghavens / moonshiner

Unified model-distillation harness for code abilities: pi / claude-code / codex teachers, configurable judge, verified agentic traces

Python 10 1 Updated Jul 29, 2026

tilde-research / one-layer-deeper

Python 54 13 Updated Jul 24, 2026

1nisharg / TernaryLM-Memory-Efficient-Language-Modeling

Jupyter Notebook 1 Updated Feb 20, 2026

XIANGLONGYAN / PT2-LLM

Python 15 1 Updated Jul 9, 2026

chaithanyasai18 / LLMs-finetuning

This repository consists of python scripts for LLM finetuning (SFT, LoRA, QLoRA) and LLM synthetic data generation scripts.

Python 4 1 Updated Aug 11, 2025

cysecbench / dataset

Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models

Python 45 10 Updated Jan 14, 2025

chrisliu298 / awesome-on-policy-distillation

A curated collection of papers, technical reports, frameworks, and tools for on-policy distillation (OPD) of large language models

583 20 Updated Jul 29, 2026

llm-as-a-verifier / llm-as-a-verifier

LLM-as-a-Verifier is a general-purpose framework that provides fine-grained feedback for any agent without requiring additional training. It achieves SOTA performance across coding, robotics, and m…

Python 607 56 Updated Jul 7, 2026

Liquid4All / antidoom

Python 334 32 Updated Jul 7, 2026

cx0 / llm-typos

Impact of typos and common misspellings on LLM task performance.

Python 25 2 Updated Mar 22, 2024

smonsays / sparsely-gated-linear

Official code for the paper "Sparsely gated tiny linear experts"

Python 7 1 Updated Jul 6, 2026

xupy2003 / ContextAwareRL

Python 5 1 Updated Jun 8, 2026

sileod / reasoning-core

Procedural data generators for verifiable reasoning, synthetic pretraining, post-training, evaluation, and RL.

Python 45 4 Updated Jul 24, 2026

hao-ai-lab / JetSpec

JetSpec: Breaking the Scaling Ceiling of Speculative Decoding with Causal Parallel Tree Drafting

Python 171 6 Updated Jun 27, 2026

Trampoline-AI / fractal

the recursive language model (RLM) CLI agent

Python 199 12 Updated Jul 16, 2026

furiosa-ai / EfficientRollout

EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts

Python 16 Updated Jul 27, 2026

tile-ai / TileRT

Tile-Based Runtime for Ultra-Low-Latency LLM Inference

Python 1,600 112 Updated Jul 14, 2026

zhang-liyi / llm-inductive

Python 4 2 Updated Jul 29, 2026

sdc17 / SwiReasoning

[ICLR 2026] SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs

Python 138 16 Updated May 20, 2026

Sphere-AI-Lab / orbit

Stable and Efficient Reinforcement Learning for Trillion-Parameter LLMs

Python 150 9 Updated Jun 28, 2026

rishabh-1086 / distIL

Python 14 1 Updated May 24, 2026

Dogacel / auto-gpu-kernel

Winner 🏆 (Agent-only) MLSys 2026 - FlashInfer AI Kernel Generation Contest for the DeepSeek Sparse Attention (DSA) track with an average speedup of 34.93x

Python 148 10 Updated Jun 10, 2026

KONAKONA666 / fastgrpo

Python 26 1 Updated May 25, 2026

open-lm-engine / coda-kernels

CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs

Python 235 24 Updated Jul 30, 2026

guy120494 / SUMO

Python 15 7 Updated Feb 5, 2026

sapientinc / HRM-Text

HRM-Text is a 1B text generation model based on the HRM architecture, strengthened by task completion and latent space reasoning.

Python 1,759 167 Updated Jun 17, 2026

affaan-m / ECC

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

JavaScript 235,586 35,879 Updated Jul 29, 2026