View ctharve's full-sized avatar

Organizations

@piquelab


Best Practices on Recommendation Systems

Python 21,622 3,310 Updated Apr 11, 2026

Lecture notes and example code for teaching C & C++

Python 243 27 Updated May 9, 2020

Material from the Big Data course at Chicago Booth

TeX 93 90 Updated Feb 16, 2019

Optimization for Data Science Course

Jupyter Notebook 12 6 Updated Jan 19, 2017
HTML 12 1 Updated Feb 7, 2017

Teaching repo for Applied Data Science @ Columbia, a project-based course for data science skills (statistical thinking, machine learning, data engineering, team work, presentation, endurance of fr…

HTML 190 343 Updated Apr 10, 2024

Curated reinforcement learning resources

9,715 1,907 Updated May 25, 2023

A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.

Jupyter Notebook 673 208 Updated Jul 9, 2022

Quantitative research and educational materials

Jupyter Notebook 2,787 1,720 Updated Nov 3, 2020

Deploy & Monitor ML Models directly from R

R 3 1 Updated Jul 12, 2016

🐢 bayesAB: Fast Bayesian Methods for A/B Testing

R 314 42 Updated Jun 25, 2021

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

C++ 28,256 8,867 Updated Apr 10, 2026

Portfolio and risk analytics in Python

Jupyter Notebook 6,283 1,876 Updated Dec 23, 2023

Scalable Topic Modeling using Variational Inference in MapReduce

Java 149 95 Updated Oct 20, 2015

Uber trip data from a freedom of information request to NYC's Taxi & Limousine Commission

731 378 Updated May 8, 2022

Import public NYC taxi and for-hire vehicle (Uber, Lyft) trip data into a PostgreSQL or ClickHouse database

R 2,068 567 Updated Aug 18, 2025

Files for Modern Statistical Workflow workshop

HTML 10 4 Updated Jul 16, 2016

Spark reference applications

Scala 650 335 Updated Oct 3, 2024

Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters, extended Kalman filters, unscented Kalman filters, particle filte…

Jupyter Notebook 18,883 4,483 Updated Aug 7, 2024
Jupyter Notebook 574 537 Updated May 16, 2018

C++ implementation of IRLS algorithm for generalized linear model

C++ 21 16 Updated Nov 15, 2017

Class materials for a distributed systems lecture series

9,649 712 Updated Mar 18, 2025

Some tutorial-type code to introduce map-reduce style of programming

Scala 28 4 Updated Feb 21, 2013

Principled Functional Programming in Scala

Scala 4,674 701 Updated Apr 7, 2026

Predicts League of Legends playoff games for the 2015 season

Python 2 Updated Mar 16, 2016

A C++ version of the R Package "SQUAREM"

C++ 8 2 Updated Feb 25, 2015

Gaussian Mixture Model Implementation in PySpark

Python 31 4 Updated Dec 2, 2014

Official content for the Fall 2014 Harvard CS109 Data Science course

CSS 319 716 Updated Feb 1, 2017

Bayesian Macroeconometrics in R

C++ 92 58 Updated Jul 18, 2022

Lectures on scientific computing with Python, as IPython notebooks.

Jupyter Notebook 3,632 1,807 Updated Oct 15, 2023