Skip to content
View pombero's full-sized avatar

Block or report pombero

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The resources of the preparation course for Databricks Data Engineer Associate certification exam

Python 625 994 Updated Dec 26, 2025

Ready-to-run Docker images containing Jupyter applications

Python 8,427 2,991 Updated Apr 7, 2026

This repository contains code examples for the Stanford's course: TensorFlow for Deep Learning Research.

Python 10,381 4,263 Updated Dec 22, 2020

https://huyenchip.com/ml-interviews-book/

HTML 4,590 666 Updated Mar 21, 2025

Summaries and resources for Designing Machine Learning Systems book (Chip Huyen, O'Reilly 2022)

4,641 921 Updated Oct 31, 2025

Multi-user server for Jupyter notebooks

Python 8,262 2,108 Updated Apr 8, 2026

Visualizations for machine learning datasets

Jupyter Notebook 7,360 889 Updated May 24, 2023

Run Kubernetes locally

Go 31,688 5,202 Updated Apr 10, 2026

A place in which we publish scripts for reproducible benchmarks.

Python 105 38 Updated Dec 13, 2019

Terraform enables you to safely and predictably create, change, and improve infrastructure. It is a source-available tool that codifies APIs into declarative configuration files that can be shared …

Go 48,121 10,280 Updated Apr 11, 2026

The Prometheus monitoring system and time series database.

Go 63,534 10,322 Updated Apr 10, 2026

Performance optimization for Spark running on Kubernetes

Scala 87 30 Updated Aug 18, 2020

Apache Iceberg

Java 8,715 3,150 Updated Apr 11, 2026

Lens - The way the world runs Kubernetes

23,152 1,486 Updated Feb 11, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 36,631 5,144 Updated Apr 9, 2026

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

Python 21,387 3,165 Updated Apr 10, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 159,200 32,833 Updated Apr 11, 2026

Official Kaggle CLI

Python 7,245 1,342 Updated Apr 10, 2026

Kaggle Python docker image

Python 2,696 1,011 Updated Mar 20, 2026

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,744 2,080 Updated Apr 10, 2026

Readings in Databases

8,047 922 Updated Sep 9, 2024

Apache Spark - A unified analytics engine for large-scale data processing

Scala 43,110 29,153 Updated Apr 11, 2026

A curated list of data engineering tools for software developers

8,487 1,470 Updated Apr 5, 2026

A curated list of data engineering tools for software developers

507 91 Updated Jun 23, 2017

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 45,007 16,849 Updated Apr 11, 2026

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 97,074 9,049 Updated Apr 10, 2026

Parallel computing with task scheduling

Python 13,799 1,862 Updated Apr 7, 2026

Easily install and load packages from the tidyverse

R 1,784 294 Updated Jun 18, 2025

The fundamental package for scientific computing with Python.

Python 31,826 12,289 Updated Apr 11, 2026

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 99,031 27,460 Updated Apr 11, 2026
Next