This projectis part of the course requirements for Udacity's Data Scientist Nanodegree certification.
In this work, we look at correlation analysis of the cryptocurrency market, using Python, to analyze and visualize how these markets have been evolving the years, and provide a base framework for deeper analysis. New to crypto ? You can find a great overview of Satoshi Nakamoto’s innovation here
Modified Kaggle dataset using data from 2014 -2019, for the top 100 cryptocurrencies by marketcap. Table attributes include:
- Date: refers to the calendar date for the particular row - 24 hours midnight to midnight
- Currency: e.g Bitcoin, price in USD at the end of the day
The project followed the CRISP-DM method of data exploration's six steps of:
- Business Understanding
- Data Understanding
- Data Preparation
- Modeling
- Evaluation
- Deployment
The correlation and trend between various cryptocurrencies are evaluated to understand:
- Which coins can help diversify your portfolio: There is generally a strong correlation between coins, hence it is difficult to diversify risk (Just buying and holding Bitcoin is a safe bet).
- Are stable coins actually stable: Generally they appear to be decoupled from altcoin price fluctuations.
- If it is possible to predict the next bull run: Speculative at best. Nevertheless, end of 2019 looks similar to 2015, 2016 and 2017 … get ready to fill your bags.
All code used for the analysis can he found in the notebook.
To read the blog post associated with the article, you can go here.
- 1-EDA.ipynb: notebook for the analysis
- raw
- Coinmarketcap.csv: Raw data
- img1.png
- img2a.png
- img2b.png
- img3.png
- img4.png
- img5.png
- img6.png
pandas, numpy, matplotlib, seaborn, pyviz