Skip to content
View dwillis's full-sized avatar

Highlights

  • Pro

Organizations

@unitedstates

Block or report dwillis

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

55 stars written in Jupyter Notebook
Clear filter

21 Lessons, Get Started Building with Generative AI

Jupyter Notebook 101,389 53,789 Updated Nov 3, 2025

Data and code behind the articles and graphics at FiveThirtyEight

Jupyter Notebook 17,217 11,141 Updated Feb 25, 2025

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 8,633 964 Updated Oct 23, 2025

🪼 a python library for doing approximate and phonetic matching of strings.

Jupyter Notebook 2,161 164 Updated Oct 27, 2025

Developer APIs to Accelerate LLM Projects

Jupyter Notebook 1,732 165 Updated Oct 18, 2024

Official Implementation of "KBLaM: Knowledge Base augmented Language Model"

Jupyter Notebook 1,417 121 Updated Oct 13, 2025

Croissant is a high-level format for machine learning datasets that brings together four rich layers.

Jupyter Notebook 746 90 Updated Nov 5, 2025

Analyze documents with Amazon Textract and generate output in multiple formats.

Jupyter Notebook 469 163 Updated Apr 24, 2025

🎲 Notes explaining Dirichlet Processes, HDPs, and Latent Dirichlet Allocation

Jupyter Notebook 413 108 Updated Mar 18, 2019

How to use OpenAIs Whisper to transcribe and diarize audio files

Jupyter Notebook 363 47 Updated Oct 12, 2022

potato: portable text annotation tool

Jupyter Notebook 353 65 Updated Oct 24, 2025

Python for Data Science (Seminar Course at UC Berkeley; AY 250)

Jupyter Notebook 332 161 Updated Apr 25, 2022

Predict Race and Ethnicity Based on the Sequence of Characters in a Name

Jupyter Notebook 248 64 Updated Oct 7, 2025

Scripts and data for various Vox Media stories and news projects

Jupyter Notebook 187 66 Updated Aug 14, 2023

Code for the paper: "Large Language Models as Corporate Lobbyists" (2023).

Jupyter Notebook 171 15 Updated Jan 13, 2023

A pytest plugin for running and analyzing LLM evaluation tests.

Jupyter Notebook 144 4 Updated Feb 5, 2025

This is an introduction to tensorflow

Jupyter Notebook 134 52 Updated May 31, 2016

Notebooks covering introductory material to ML, ML with sklearn and tips.

Jupyter Notebook 77 38 Updated Oct 17, 2018

Python 3.x notebooks about real-world data cleaning and visualization

Jupyter Notebook 72 28 Updated May 4, 2016

A tutorial on optical character recognition using tesseract, ImageMagick and other open source tools

Jupyter Notebook 69 14 Updated Jan 31, 2025

Course materials for a data visualization course taught at the University of Nebraska-Lincoln's College of Journalism and Mass Communications

Jupyter Notebook 68 31 Updated Apr 17, 2018

A Los Angeles Times analysis of Every shot in Kobe Bryant's NBA career

Jupyter Notebook 64 11 Updated Mar 19, 2021

A Los Angeles Times analysis of serious assaults misclassified by LAPD

Jupyter Notebook 63 11 Updated Oct 21, 2018

A collection of Jupyter notebooks demonstrating ways to analyze Census data

Jupyter Notebook 51 17 Updated Jul 27, 2017

Data and materials to reproduce Bloomberg's investigation into racial and gender bias in OpenAI's GPT

Jupyter Notebook 39 9 Updated Mar 7, 2024

Inspect a URL and estimate if it contains a news story

Jupyter Notebook 38 2 Updated Oct 31, 2025

Data and analysis supporting several passages in the BuzzFeed News article, "The New American Slavery: Invited To The U.S., Foreign Workers Find A Nightmare," published July 24, 2015.

Jupyter Notebook 28 1 Updated Dec 27, 2016

Suggestions, schedules, and other information about the Engineering Chapter's Tech Talk meetings.

Jupyter Notebook 28 15 Updated Nov 9, 2023
Jupyter Notebook 27 12 Updated Mar 27, 2016

Whisper Audio Transcriber: Streamlined tool for converting audio to text using the powerful Whisper ASR model. User-friendly and efficient.

Jupyter Notebook 24 4 Updated Dec 9, 2023
Next