Skip to content
View rossjillian's full-sized avatar

Highlights

  • Pro

Organizations

@cvlab-columbia

Block or report rossjillian

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

HTML 781 112 Updated Dec 22, 2025

An autoregressive character-level language model for making more things

Python 3,527 887 Updated Jun 4, 2024

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 14,119 2,113 Updated Aug 8, 2024

The best ChatGPT that $100 can buy.

Python 39,117 4,954 Updated Dec 9, 2025

Extremely fast Query Engine for DataFrames, written in Rust

Rust 36,655 2,523 Updated Dec 23, 2025

Download market data from Yahoo! Finance's API

Python 20,279 2,950 Updated Dec 23, 2025

Scrape papers from OpenReview using OpenReview API

Python 57 17 Updated Mar 3, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,983 1,652 Updated Nov 19, 2025

Python SEC EDGAR Filings API. Over 18 million filings, all 150 filing types supported. Query, full-text search and real-time stream API. Convert XBRL-to-JSON and access standardized financial state…

Python 277 38 Updated Apr 23, 2025

Documenting large text datasets 🖼️ 📚

Python 14 3 Updated Dec 17, 2024

What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets

Python 225 21 Updated Nov 16, 2024

Implementation of Rank-biased Overlap

Python 151 17 Updated May 3, 2024

A Conversational Speech Generation Model

Python 14,368 1,458 Updated May 27, 2025

Collection of LLM completions for reasoning-gym task datasets

Python 30 7 Updated Jul 4, 2025
Jupyter Notebook 816 180 Updated Aug 26, 2024

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,283 106 Updated Dec 15, 2025

A comprehensive repository of reasoning tasks for LLMs (and beyond)

JavaScript 453 60 Updated Sep 27, 2024

Code for "Is There a Replication Crisis in Finance" by Jensen, Kelly and Pedersen (2023)

R 348 142 Updated Mar 5, 2025

Fully open reproduction of DeepSeek-R1

Python 25,749 2,406 Updated Nov 24, 2025
Python 137 8 Updated Nov 3, 2023

A collection of projects designed to help developers quickly get started with building deployable applications using the Claude API

Python 12,727 2,148 Updated Dec 19, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,749 271 Updated Jul 18, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,720 2,214 Updated Mar 11, 2025

Extracting spatial and temporal world models from LLMs

Jupyter Notebook 256 27 Updated Oct 17, 2023
Python 8 Updated Jun 17, 2024

An interpreter for RASP as described in the ICML 2021 paper "Thinking Like Transformers"

Python 322 32 Updated Sep 16, 2024

Training Sparse Autoencoders on Language Models

Python 1,126 207 Updated Dec 23, 2025

Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models know themselves through automated interpretability.

Python 234 55 Updated Dec 22, 2025
Next