Skip to content
View dlwh's full-sized avatar

Sponsoring

@patrick-kidger

Organizations

@scalanlp @stanford-crfm @Open-Athena @marin-community

Block or report dlwh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The HELMET Benchmark

Jupyter Notebook 195 36 Updated Dec 4, 2025

A simple, performant and scalable Jax LLM!

Python 1 1 Updated Dec 21, 2025
Python 659 69 Updated Dec 21, 2025

A framework for the large scale analysis of programming language usage.

Jupyter Notebook 30 6 Updated Jun 27, 2023

TPU inference for vLLM, with unified JAX and PyTorch support.

Python 199 62 Updated Dec 22, 2025
Python 4 Updated Sep 16, 2025
Python 27 Updated Aug 27, 2025
Python 3 Updated Aug 27, 2025

C API for MLX

C++ 156 29 Updated Dec 18, 2025

A Lightweight LLM Post-Training Library

Python 2,041 204 Updated Dec 21, 2025

Our library for RL environments + evals

Python 3,654 454 Updated Dec 22, 2025

A benchmark for LLMs on complicated tasks in the terminal

Python 1,238 438 Updated Dec 20, 2025

Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make practical in Fast and Simplex, Roy et al. (2025)

Python 46 1 Updated Sep 2, 2025

Nano vLLM

Python 9,904 1,247 Updated Nov 3, 2025

Roo Code gives you a whole dev team of AI agents in your code editor.

TypeScript 21,314 2,685 Updated Dec 22, 2025

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…

Python 2,587 346 Updated Dec 22, 2025

Minimal yet performant LLM examples in pure JAX

Python 218 28 Updated Dec 4, 2025

(EasyDel Former) is a utility library designed to simplify and enhance the development in JAX

Python 29 5 Updated Nov 26, 2025

A lightweight data processing framework built on DuckDB and 3FS.

Python 4,873 431 Updated Mar 5, 2025
Python 15 7 Updated May 11, 2025

A toolkit for describing model features and intervening on those features to steer behavior.

Python 223 20 Updated Dec 12, 2025

AutoBound automatically computes upper and lower bounds on functions.

Python 364 19 Updated Oct 24, 2025

A framework for few-shot evaluation of language models.

Python 1 Updated Dec 19, 2024

Evaluation suite for LLMs

Python 376 47 Updated Jul 11, 2025

Implementation of PSGD optimizer in JAX

Python 35 2 Updated Dec 31, 2024

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,208 402 Updated Dec 15, 2025

Entropy Based Sampling and Parallel CoT Decoding

Python 3,431 325 Updated Nov 13, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,747 271 Updated Jul 18, 2025
Next