Stars
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Example code for Our Python Fundamentals LiveLessons Videos
Bootstrap Kubernetes the hard way. No scripts.
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
RAPIDS Community Notebooks
⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 or handson-mlp instead.
Evaluating access and memory usage of different matrix types
Benchmarking R & Bioconductor performance working with large-scale single-cell data
a long-read error correction tool using the multi-string Burrows Wheeler Transform
Active Learning Workshop Materials
TripletLoss used in Google's FaceNet paper
Face recognition with deep neural networks.
A global, black box optimization engine for real world metric optimization.
In-memory nucleotide sequence k-mer counting, filtering, graph traversal and more
Deep Learning and Unsupervised Feature Learning Tutorial Solutions
Deep Learning (Python, C, C++, Java, Scala, Go)
A flexible framework of neural networks for deep learning
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running mat…
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K…