hadoop
Here are 25 public repositories matching this topic...
Big Data essentials: Hadoop, MapReduce, Spark. Explore tutorials and demos in Jupyter notebooks—most are self-contained and live, ready to run with a click.
-
Updated
Nov 24, 2025 - Jupyter Notebook
AI course Notebooks and Exercises
-
Updated
Jun 16, 2025 - Jupyter Notebook
Local playground for Spark and Jupyter notebooks, plus Iceberg support
-
Updated
Apr 20, 2025 - Dockerfile
Apache Hadoop development environment integrated with Jupyter Notebook using Docker
-
Updated
Jan 10, 2025 - Dockerfile
重庆大学2024年秋大数据架构与技术课程,本仓库基于学校提供的原开源项目进行了优化和扩展,包含最新的工具版本、简化的环境配置流程、以及对更便捷高效的开发工具(如 Jupyter Notebook)的全面支持。
-
Updated
Jan 2, 2025 - Jupyter Notebook
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
-
Updated
Mar 20, 2024 - Python
This project walks through how you can create recommendations using Apache Spark machine learning. There are a number of jupyter notebooks that you can run on IBM Data Science Experience, and there a live demo of a movie recommendation web application you can interact with. The demo also uses IBM Message Hub (kafka) to push application events to…
-
Updated
Apr 17, 2023 - Jupyter Notebook
Hadoop beginner exercise in analyzing European football teams' statistics over the last 20 years. The goal is to determine which team had the highest win percentage-rate.
-
Updated
Oct 29, 2022 - Makefile
Exercise of using the Streaming API with Hadoop to determine the word count of Wikipedia articles.
-
Updated
Oct 29, 2022 - Jupyter Notebook
Hadoop environment with HDFS, Spark, Hue, Jupyter Notebooks, etc. all installed in docker-compose
-
Updated
Mar 25, 2022 - Jupyter Notebook
Improve this page
Add a description, image, and links to the hadoop topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the hadoop topic, visit your repo's landing page and select "manage topics."