🌸 Iris Flower Classification Project

Objective

Classify iris flowers into three species (Setosa, Versicolor, Virginica) based on measurements of their petals and sepals.

📊 Dataset

The classic Iris dataset from the UCI Repository, loaded via scikit-learn:

150 samples (50 per species)
4 features:
- Sepal length (cm)
- Sepal width (cm)
- Petal length (cm)
- Petal width (cm)
3 classes: Setosa, Versicolor, Virginica

🛠️ Technologies Used

Python 3.x
Libraries:
- pandas - Data manipulation
- numpy - Numerical operations
- matplotlib - Visualization
- seaborn - Advanced visualization
- scikit-learn - Machine learning models and metrics

📋 Project Steps

1. Load the Dataset

Load the Iris dataset from scikit-learn
Create a pandas DataFrame for easier manipulation
Display basic information and statistics

2. Exploratory Data Analysis (EDA)

Pairplot: Visualize relationships between all feature pairs
Histograms: Show distribution of each feature by species
Box Plots: Display feature distributions and outliers
Correlation Heatmap: Show correlation between features

3. Data Preprocessing

Check for missing values (none found)
Split data into training (80%) and test (20%) sets
Apply feature scaling using StandardScaler
Use stratified sampling to maintain class balance

4. Model Training

Train and compare three classifiers:

Logistic Regression
K-Nearest Neighbors (KNN)
Decision Tree Classifier

5. Model Evaluation

Evaluate models using multiple metrics:

Accuracy: Overall correctness
Precision: Positive prediction accuracy
Recall: True positive detection rate
F1-Score: Harmonic mean of precision and recall
Confusion Matrix: Detailed prediction breakdown

6. Feature Importance

Analyze which features are most important for classification using the Decision Tree model.

🚀 How to Run

Prerequisites

Install required libraries:

pip install pandas numpy matplotlib seaborn scikit-learn

Execution

Run the main script:

python iris_classification.py

📈 Expected Results

All three models typically achieve high accuracy (95%+) on this dataset:

Logistic Regression: ~97-100%
K-Nearest Neighbors: ~97-100%
Decision Tree: ~97-100%

📁 Output Files

The script generates the following visualization files:

iris_pairplot.png - Scatter plots of all feature combinations
iris_distributions.png - Histograms showing feature distributions
iris_boxplots.png - Box plots for each feature by species
iris_correlation_heatmap.png - Feature correlation matrix
iris_confusion_matrices.png - Confusion matrices for all models
iris_model_comparison.png - Performance metrics comparison
iris_feature_importance.png - Feature importance ranking

🎯 Skills Gained

✅ Loading and exploring datasets
✅ Data visualization techniques (scatter plots, histograms, heatmaps)
✅ Data preprocessing and scaling
✅ Train-test split methodology
✅ Classification modeling with multiple algorithms
✅ Model evaluation using various metrics
✅ Confusion matrix interpretation
✅ Feature importance analysis
✅ Model comparison and selection

🔍 Key Insights

Petal measurements (length and width) are typically more discriminative than sepal measurements
Setosa is linearly separable from the other two species
Versicolor and Virginica have some overlap, making them slightly harder to distinguish
All three simple classifiers perform excellently on this dataset
The dataset is well-balanced with no missing values

📚 Additional Resources

👨‍💻 Author

Created as a beginner-friendly machine learning project to demonstrate classification techniques.

📝 License

This project is open source and available for educational purposes.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
VERIFICATION_REPORT.md		VERIFICATION_REPORT.md
iris_boxplots.png		iris_boxplots.png
iris_classification.py		iris_classification.py
iris_correlation_heatmap.png		iris_correlation_heatmap.png
iris_distributions.png		iris_distributions.png
iris_pairplot.png		iris_pairplot.png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🌸 Iris Flower Classification Project

Objective

📊 Dataset

🛠️ Technologies Used

📋 Project Steps

1. Load the Dataset

2. Exploratory Data Analysis (EDA)

3. Data Preprocessing

4. Model Training

5. Model Evaluation

6. Feature Importance

🚀 How to Run

Prerequisites

Execution

📈 Expected Results

📁 Output Files

🎯 Skills Gained

🔍 Key Insights

📚 Additional Resources

👨‍💻 Author

📝 License

About

Uh oh!

Releases

Packages

Languages

1234-ad/Iris-Flower-Classification

Folders and files

Latest commit

History

Repository files navigation

🌸 Iris Flower Classification Project

Objective

📊 Dataset

🛠️ Technologies Used

📋 Project Steps

1. Load the Dataset

2. Exploratory Data Analysis (EDA)

3. Data Preprocessing

4. Model Training

5. Model Evaluation

6. Feature Importance

🚀 How to Run

Prerequisites

Execution

📈 Expected Results

📁 Output Files

🎯 Skills Gained

🔍 Key Insights

📚 Additional Resources

👨‍💻 Author

📝 License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages