Skip to content

PriceTT/DSND3

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 

Repository files navigation

DSND3: Identify Customer Segments

This projectis part of the course requirements for Udacity's Data Scientist Nanodegree certification.

Motivation:

In this project, you will apply unsupervised learning techniques to identify segments of the population that form the core customer base for a mail-order sales company in Germany. These segments can then be used to direct marketing campaigns towards audiences that will have the highest expected rate of returns. The data that you will use has been provided by our partners at Bertelsmann Arvato Analytics, and represents a real-life data science task.

Data Sources

There are four files associated with this project (not including due to copyright):

  • Udacity_AZDIAS_Subset.csv: Demographics data for the general population of Germany; 891211 persons (rows) x 85 features (columns).
  • Udacity_CUSTOMERS_Subset.csv: Demographics data for customers of a mail-order company; 191652 persons (rows) x 85 features (columns).
  • Data_Dictionary.md: Detailed information file about the features in the provided datasets.
  • AZDIAS_Feature_Summary.csv: Summary of feature attributes for demographics data; 85 features (rows) x 4 columns

Methods

Sklearn's KMeans class is used to perform k-means clustering on the PCA-transformed data.

About

Identify Customer Segments

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors