Semantic model router with parallel LLM classification, prompt caching, and vision short-circuiting. Optimizes request routing with sub-100ms overhead for Open WebUI.
Updated Feb 13, 2026 - Python
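Vision short-circuiting means requests carrying images bypass the classification step entirely and go straight to a vision-capable model. A minimal sketch of that control flow, where the model names and the keyword fallback are illustrative stand-ins rather than the repository's actual classifier:

```python
# Sketch of a router that short-circuits image-bearing requests to a
# vision model before any text classification runs. Model ids and the
# keyword heuristic are hypothetical; the real project classifies with
# parallel LLM calls.

def route_request(message: dict) -> str:
    """Pick a model id for an Open WebUI-style chat message."""
    # Vision short-circuit: any image attachment skips classification.
    if message.get("images"):
        return "vision-model"

    # Fallback heuristic classifier (stand-in for the LLM classifier).
    text = message.get("text", "").lower()
    if any(kw in text for kw in ("def ", "import ", "traceback")):
        return "code-model"
    return "general-model"

print(route_request({"images": ["cat.png"], "text": "what is this?"}))  # vision-model
print(route_request({"text": "import numpy as np"}))                    # code-model
```

Because the image check is a constant-time dictionary lookup, the short-circuit path adds effectively no routing overhead.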
This repository presents an efficient approach for fine-tuning large language models for the medical domain using 4-bit quantization and LoRA techniques.
Tools and experiments for converting Human Activity Recognition (HAR) models to TensorFlow Lite for efficient on-device inference on mobile and wearable devices.
Learning and Knowledge Extraction.
Predicts telecom customer churn with machine learning and an interactive Streamlit app. Features include single/batch predictions, dashboards, and actionable insights for improved retention.
A minimal, high-performance starter kit for running AI model inference on NVIDIA GPUs using CUDA. Includes environment setup, sample kernels, and guidance for integrating ONNX/TensorRT pipelines for fast, optimized inference on modern GPU hardware.
Minimal reproducibility study of https://arxiv.org/abs/1911.05248: experiments with compression of deep neural networks.
An ML journey exploring concepts and frameworks through code and math. It serves as a personal log of learning experiences, revisiting foundational topics and delving into new areas within the field.
An end-to-end project and API deployment for Spain electricity shortfall prediction.
Practical experience with hyperparameter tuning techniques using the Keras Tuner library. Hyperparameter tuning plays a crucial role in optimizing machine learning models, and this project offers hands-on learning by exploring different tuning methods, including random search, grid search, and Bayesian optimization.
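The difference between grid search and random search can be shown on a toy objective. Keras Tuner automates this over real model hyperparameters; in this sketch the "model" is just a quadratic with a known optimum at lr = 0.1, and all values are illustrative:

```python
# Toy comparison of grid search vs. random search on a 1-D objective.
import random

def validation_loss(lr: float) -> float:
    # Stand-in for "train a model with this learning rate, return val loss".
    return (lr - 0.1) ** 2

# Grid search: evaluate every candidate in a fixed set.
grid = [0.001, 0.01, 0.1, 1.0]
best_grid = min(grid, key=validation_loss)

# Random search: sample candidates from a continuous range.
random.seed(0)
samples = [random.uniform(0.0, 1.0) for _ in range(20)]
best_random = min(samples, key=validation_loss)

print(best_grid)  # 0.1
```

Random search tends to win when only a few hyperparameters matter, since it does not waste trials on redundant grid points; Bayesian optimization goes further by modeling the objective to pick the next candidate.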
This project is built to detect spam messages using a Long Short-Term Memory (LSTM) model combined with Word2Vec as the word embedding technique. The model has been optimized using Grid Search, achieving a best accuracy of 95.65%.
NU Bootcamp Module 21
The "Predicting Startup Outcomes with XGBoost and Machine Learning" project uses machine learning algorithms, particularly XGBoost, to predict the success or failure of startups based on historical data. It leverages feature engineering and model optimization to enhance prediction accuracy.
An advanced study on optimizing transfer learning pipelines (VGG16 & ResNet50) for the CIFAR-10 dataset. Implements fine-tuning, L2 regularization, dropout, and learning rate scheduling to combat overfitting and boost classification accuracy.
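Learning rate scheduling, one of the techniques listed above, can be sketched as a simple step decay: the rate is dropped by a fixed factor every few epochs so fine-tuning takes smaller steps as training progresses. The initial rate and decay factor below are illustrative, not the study's actual settings:

```python
# Step-decay learning-rate schedule of the kind commonly used when
# fine-tuning pretrained backbones. All hyperparameters are hypothetical.
def step_decay(epoch: int, initial_lr: float = 1e-3,
               drop: float = 0.5, epochs_per_drop: int = 10) -> float:
    """Multiply the rate by `drop` every `epochs_per_drop` epochs."""
    return initial_lr * (drop ** (epoch // epochs_per_drop))

print(step_decay(0))   # 0.001
print(step_decay(10))  # 0.0005
print(step_decay(25))  # 0.00025
```

A function like this plugs directly into a Keras `LearningRateScheduler` callback, which calls it once per epoch.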
Comprehensive performance analysis of DeepSeek V3 quantization levels (FP16, Q8_0, Q4_0) on 16GB GPU environments.
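The trade-off between those quantization levels comes down to how coarsely each weight is rounded. A simplified symmetric round-to-nearest scheme (a stand-in for llama.cpp's block-wise Q8_0/Q4_0 formats, not DeepSeek's actual kernels) shows why 8-bit reconstructs weights more faithfully than 4-bit:

```python
# Illustrative symmetric quantize/dequantize round trip; the weight
# values are made up for the example.
def quantize_dequantize(weights, bits):
    qmax = 2 ** (bits - 1) - 1            # e.g. 127 for 8-bit, 7 for 4-bit
    scale = max(abs(w) for w in weights) / qmax
    return [round(w / scale) * scale for w in weights]

weights = [0.31, -0.87, 0.05, 0.44, -0.12]
for bits in (8, 4):
    recon = quantize_dequantize(weights, bits)
    err = max(abs(a - b) for a, b in zip(weights, recon))
    print(f"{bits}-bit max reconstruction error: {err:.4f}")
```

Halving the bit width halves memory but roughly sixteen-fold coarsens the grid (127 levels per side vs. 7), which is why Q4_0 fits in 16 GB where FP16 cannot, at a measurable quality cost.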
Optimized IDKL Model for Visible-Infrared Person Re-Identification focusing on efficiency for resource-constrained hardware.
Vision-language model example code.
NLP pipeline with parameter-efficient LoRA fine-tuning on FLAN-T5-XXL (11B params). Achieves +2.6 ROUGE-1 improvement with <1% trainable parameters and 8-bit quantization for scientific paper summarization.
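The "<1% trainable parameters" figure follows directly from LoRA's low-rank decomposition: each frozen weight matrix W (d_out × d_in) gets an additive update B·A of rank r, so only r·(d_in + d_out) parameters train per adapted matrix. A back-of-the-envelope check with illustrative dimensions (not FLAN-T5-XXL's actual shapes):

```python
# Parameter-count arithmetic behind LoRA's efficiency claim.
# Dimensions are hypothetical round numbers for illustration.
d_in, d_out, r = 4096, 4096, 8

full_params = d_in * d_out            # frozen weight matrix W
lora_params = r * (d_in + d_out)      # trainable low-rank factors B and A

print(lora_params / full_params)      # ~0.0039, well under 1%
```

Because r is tiny relative to the matrix dimensions, the ratio shrinks as models grow, which is why an 11B-parameter model remains fine-tunable on modest hardware, especially combined with 8-bit quantization of the frozen weights.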
Training and fine-tuning pipeline for a custom GPT-style language model built exclusively for Amharic. Pretrained on a 12+ GB corpus and adapted on curated datasets, with support for SentencePiece tokenization, LoRA fine-tuning, and efficient inference tools.
The nonprofit foundation Alphabet Soup wants a tool that can help it select the funding applicants with the best chance of success in their ventures. Using machine learning and neural networks, the features in the provided dataset are used to create a binary classifier that predicts whether applicants will be successful if funded.