willieneis

Organizations

@ermongroup @sailinglab @dragonfly @fusion-ml @self-optimizing-systems @naszilla @realworldml @uncertainty-toolbox @TorchUQ


Can LLMs beat classical HPO? A benchmark comparing classical, LLM-based, and hybrid methods on Karpathy's autoresearch.

Python 21 2 Updated Apr 2, 2026
Rust 1 Updated Jan 22, 2026

[NeurIPS 2025] PARCO: Parallel AutoRegressive Combinatorial Optimization

Python 49 5 Updated Dec 3, 2025

Structured Primitives for Efficient Architecture Research

Python 19 1 Updated Dec 22, 2025

Inference and numerics for multi-hybrid AI model architectures

Jupyter Notebook 100 32 Updated Mar 3, 2026

Pretraining infrastructure for multi-hybrid AI model architectures

Python 221 29 Updated Feb 20, 2026

A MAD laboratory to improve AI architecture designs 🧪

Python 141 16 Updated Dec 17, 2024
Python 5 Updated Jan 26, 2026

Hubble is a suite of fully open-source large language models (LLMs) for the scientific study of LLM memorization.

Jupyter Notebook 15 2 Updated Jan 23, 2026

PyTorch-native post-training at scale

Python 664 97 Updated Apr 2, 2026

Python implementation of Bayesian online changepoint detection

Python 108 31 Updated Sep 3, 2023

Tora: Torchtune-LoRA for RL

Python 87 7 Updated Dec 2, 2025
Python 557 51 Updated Aug 28, 2025

AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and computer science problems. The goal is to write code that solves each problem and runs faster than existing implementations.

Python 95 14 Updated Mar 12, 2026

A lightweight library for beautiful game of life embeds.

JavaScript 27 1 Updated Aug 1, 2025

A command-line tool for building static sites from Observable Notebooks

TypeScript 280 16 Updated Feb 24, 2026

A comprehensive toolkit for streamlining data editing, search, and inspection for large-scale language model training and interpretability.

Jupyter Notebook 20 Updated Oct 30, 2025

Official Implementation of wd1

Python 25 1 Updated Sep 25, 2025

Reproduce ICLR2025 Energy-Based Diffusion Language Models for Text Generation

Python 66 8 Updated Jul 22, 2025

Implementation of Sharpe-ratio-based active learning strategies for aligning large language models using Direct Preference Optimization (DPO).

Python 3 Updated Jul 4, 2025

Sample-Efficient Preference Alignment in LLMs via Active Exploration

Python 5 Updated Jul 4, 2025

Official implementation for the paper "Toward Scientific Reasoning in LLMs: Training from Expert Discussions via Reinforcement Learning"

Python 56 8 Updated Jun 4, 2025

A High-Efficiency System of Large Language Model Based Search Agents

Python 77 5 Updated Jul 2, 2025

Resa: Transparent Reasoning Models via SAEs

Python 48 5 Updated Sep 23, 2025

Uncertainty-guided Likelihood Tree Search

Python 8 Updated Nov 15, 2024

[ICLR 2026] Tina: Tiny Reasoning Models via LoRA

Python 327 41 Updated Sep 23, 2025
Python 37 6 Updated May 15, 2025

This package contains the original 2012 AlexNet code.

Cuda 2,854 371 Updated Mar 12, 2025

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 32,089 4,020 Updated Apr 2, 2026
Python 68 9 Updated Mar 27, 2025