Skip to content
View tx2016's full-sized avatar

Block or report tx2016

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code repo for the book "Feature Engineering for Machine Learning," by Alice Zheng and Amanda Casari, O'Reilly 2018

Jupyter Notebook 1,497 674 Updated Aug 11, 2020

A collection of tutorials and examples for solving and understanding machine learning and pattern classification tasks

Jupyter Notebook 4,208 1,278 Updated Nov 26, 2023

A repo for data science related questions and answers

Jupyter Notebook 2,413 648 Updated Oct 6, 2022

My solution to the book A Collection of Data Science Take-Home Challenges

Jupyter Notebook 1,718 852 Updated Mar 9, 2019

A compiled list of kaggle competitions and their winning solutions for classification problems.

276 103 Updated Jul 21, 2016

📖 [译] Sklearn 与 TensorFlow 机器学习实用指南【版权问题,网站已下线!!】

CSS 3,781 1,527 Updated Aug 9, 2021

⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 or handson-mlp instead.

Jupyter Notebook 29,942 13,179 Updated May 19, 2026

Python code for common Machine Learning Algorithms

Jupyter Notebook 4,583 4,775 Updated Jun 5, 2025

Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.

Jupyter Notebook 2,380 1,655 Updated Mar 31, 2024

Python Driver for Apache Cassandra®

Python 1,425 582 Updated Jun 19, 2026

⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 or handson-mlp instead.

Jupyter Notebook 25,607 12,779 Updated May 19, 2026

A Python scikit for building and analyzing recommender systems

Python 6,796 1,051 Updated May 30, 2026

aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)

Jupyter Notebook 28,162 7,924 Updated Jun 25, 2024

Apache Spark (PySpark) Practice on Real Data

Jupyter Notebook 270 137 Updated Jan 31, 2020

Practice with "Real" SQL Problems

1,507 598 Updated Nov 11, 2023

Apache Hadoop

Java 15,568 9,220 Updated Jun 18, 2026

Apache Spark - A unified analytics engine for large-scale data processing

Scala 43,477 29,246 Updated Jun 19, 2026

Run MapReduce jobs on Hadoop or Amazon Web Services

Python 2,611 580 Updated Apr 2, 2026

Ways of doing Data Science Engineering and Machine Learning in R and Python

Jupyter Notebook 616 253 Updated Apr 25, 2021

Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks

Jupyter Notebook 1,661 906 Updated Mar 16, 2024

Deep learning library featuring a higher-level API for TensorFlow.

Python 9,579 2,362 Updated May 6, 2024

Short tutorial for TensorFlow, designed to be presented in-person

Jupyter Notebook 298 223 Updated Sep 24, 2016

matplotlib: plotting with Python

Python 22,905 8,356 Updated Jun 18, 2026

Numenta Platform for Intelligent Computing is an implementation of Hierarchical Temporal Memory (HTM), a theory of intelligence based strictly on the neuroscience of the neocortex.

Python 6,353 1,535 Updated Dec 3, 2024

scikit-learn: machine learning in Python

Python 66,370 27,078 Updated Jun 19, 2026

An Open Source Machine Learning Framework for Everyone

C++ 195,773 75,196 Updated Jun 19, 2026