Stars
Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼
A game theoretic approach to explain the output of any machine learning model.
Draw pretty maps from OpenStreetMap data! Built with osmnx +matplotlib + shapely
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
A better notebook for Scala (and more)
A library for debugging/inspecting machine learning classifiers and explaining their predictions
🎨 数学公式识别增强版:中英文手写印刷公式、支持初级符号推导(数据结构基于 LaTeX 抽象语法树)Math Formula OCR Pro, supports handwrite, Chinese-mixed formulas and simple symbol reasoning (based on LaTeX AST).
Re-implementation of TensorFlow in pure python, with an emphasis on code understandability
A collection of all my datasets
一个微型的正则表达式引擎 | A micro regular expression engine
Wrapper around Google APIs to create charts in Google Slides with python
Tutorial session from PyData Washington DC, Fri 7 October 2016
Render 3D animations as a series of text files.
China Biographical Database project(CBDB) put the data.csv here for synchronizing CBDB data in biog_ref
My project assignments for the Data Engineer Nanodegree course in Udacity material from
Jupyter Notebooks for PySpark Workshop using NYC Taxi Trip data