Machine Learning Algorithms from Scratch

A comprehensive collection of fundamental machine learning algorithms implemented from scratch in Python. This repository is designed for educational purposes, helping developers understand the core concepts behind popular ML algorithms without relying on library abstractions.

Overview

This repository demonstrates hands-on implementations of key machine learning algorithms, complete with:

Clean, well-documented code
Example usage scripts for each algorithm
Detailed README files for each implementation
Educational focus on understanding fundamentals

Perfect for students, educators, and developers looking to deepen their understanding of machine learning concepts.

Project Structure

ML-Algorithm/
├── KNN/
│   ├── knn.py                  # K-Nearest Neighbors implementation
│   ├── main.py                 # Example usage of KNN
│   └── README.md               # Detailed KNN documentation
├── Linear-Regression/
│   ├── linear.py               # Linear Regression implementation
│   ├── main.py                 # Example usage of Linear Regression
│   └── README.md               # Detailed Linear Regression documentation
├── Logistic-Regression/
│   ├── logistic.py             # Logistic Regression implementation
│   ├── main.py                 # Example usage of Logistic Regression
│   └── README.md               # Detailed Logistic Regression documentation
├── SVM/
│   ├── svm.py                  # Support Vector Machine implementation
│   ├── main.py                 # Example usage of SVM
│   └── README.md               # Detailed SVM documentation
├── requirements.txt            # Project dependencies
├── setup.py                    # Package setup and installation
├── LICENSE                     # MIT License file
└── README.md                   # This file

Implemented Algorithms

1. K-Nearest Neighbors (KNN)

Description: A non-parametric, instance-based classification algorithm that predicts the class of a data point based on the majority class among its k nearest neighbors in the feature space.

Key Characteristics:

Non-parametric approach
Instance-based learning
Suitable for both classification and regression
Sensitive to feature scaling

Files: KNN/knn.py, KNN/main.py, KNN/README.md

2. Linear Regression

Description: A supervised learning algorithm that models the linear relationship between a dependent variable and one or more independent variables using the least squares method.

Key Characteristics:

Parametric approach
Assumes linear relationship
Minimizes mean squared error (MSE)
Works well with linearly separable data

Files: Linear-Regression/linear.py, Linear-Regression/main.py, Linear-Regression/README.md

3. Logistic Regression

Description: A supervised learning algorithm for binary classification that applies the logistic (sigmoid) function to model the probability of class membership.

Key Characteristics:

Binary classification
Uses sigmoid activation function
Gradient descent optimization
Produces probability estimates

Files: Logistic-Regression/logistic.py, Logistic-Regression/main.py, Logistic-Regression/README.md

4. Support Vector Machine (SVM)

Description: A supervised learning algorithm that finds the optimal hyperplane to separate classes by maximizing the margin between them, using hinge loss and gradient descent.

Key Characteristics:

Effective for binary classification
Finds optimal decision boundary
Robust to outliers
Supports both linear and non-linear classification

Files: SVM/svm.py, SVM/main.py, SVM/README.md

Getting Started

Prerequisites

Python 3.6 or higher
pip (Python package manager)

Installation

Clone or download this repository:

git clone <repository-url>
cd ML-Algorithm

Install dependencies:

pip install -r requirements.txt

Or install manually:

pip install numpy scikit-learn

Usage Examples

Running Individual Algorithms

Each algorithm directory contains a main.py file with example usage. To run any algorithm:

# KNN Example
cd KNN
python main.py

# Linear Regression Example
cd ../Linear-Regression
python main.py

# Logistic Regression Example
cd ../Logistic-Regression
python main.py

# SVM Example
cd ../SVM
python main.py

Using in Your Own Code

from KNN.knn import KNN

# Create and train a KNN classifier
knn = KNN(k=3)
knn.fit(X_train, y_train)

# Make predictions
predictions = knn.predict(X_test)

Algorithm Details

For in-depth information about each algorithm, please refer to the individual README files:

Each README includes:

Mathematical foundations
Algorithm complexity analysis
Usage examples
Performance metrics
Advantages and disadvantages

Dependencies

Package	Version	Purpose
NumPy	>= 1.19.0	Numerical computations
scikit-learn	>= 0.24.0	Data generation and evaluation metrics

Contributing

Contributions are welcome! Here's how you can help:

Fork the repository
Create a feature branch (git checkout -b feature/improvement)
Commit your changes (git commit -am 'Add new feature')
Push to the branch (git push origin feature/improvement)
Submit a Pull Request

Please ensure your code follows the existing style and includes appropriate documentation.

License

This project is licensed under the MIT License. See the LICENSE file for more details.

Acknowledgments

These implementations are designed for educational purposes and serve as learning resources for understanding fundamental machine learning concepts.

Notes

All implementations are written from scratch without relying on high-level ML libraries
Code is optimized for clarity and understanding rather than performance
Each algorithm includes detailed comments explaining key concepts
Example datasets use scikit-learn for convenience, but the core algorithms are custom-built

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Machine Learning Algorithms from Scratch

Table of Contents

Overview

Project Structure

Implemented Algorithms

1. K-Nearest Neighbors (KNN)

2. Linear Regression

3. Logistic Regression

4. Support Vector Machine (SVM)

Getting Started

Prerequisites

Installation

Usage Examples

Running Individual Algorithms

Using in Your Own Code

Algorithm Details

Dependencies

Contributing

License

Acknowledgments

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
KNN		KNN
Linear-Regression		Linear-Regression
Logistic-Regression		Logistic-Regression
SVM		SVM
.gitignore		.gitignore
LICENSE		LICENSE
README.Md		README.Md
requirements.txt		requirements.txt
setup.py		setup.py

Folders and files

Latest commit

History

Repository files navigation

Machine Learning Algorithms from Scratch

Table of Contents

Overview

Project Structure

Implemented Algorithms

1. K-Nearest Neighbors (KNN)

2. Linear Regression

3. Logistic Regression

4. Support Vector Machine (SVM)

Getting Started

Prerequisites

Installation

Usage Examples

Running Individual Algorithms

Using in Your Own Code

Algorithm Details

Dependencies

Contributing

License

Acknowledgments

Notes

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages