Analyzing logistics data to optimize shipment efficiency, reduce delays, and enhance supply chain visibility using Power BI. Insights include top routes, delays, supplier trends, and peak shipments.
Single-Cell RNA-seq Analysis of Bone Marrow Dataset Using Scanpy: This repository reproduces a complete scRNA-seq analysis pipeline using the Scanpy library on a modified bone marrow dataset (originally from CZI). The workflow includes preprocessing, normalization, clustering, marker-based annotation, and biological interpretation.
A lightweight content-based movie recommendation system built using TMDB metadata. It extracts key movie features, processes them into vectors, computes similarity scores, and serves the recommendations through an interactive Streamlit UI.
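The feature-to-vector-to-similarity flow described above can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the repository's actual code: the `MOVIES` data, the bag-of-words vectorization, and the function names `vectorize`, `cosine`, and `recommend` are all assumptions made for the example, and the real project works from TMDB metadata and may use a different vectorizer or similarity measure.

```python
import math
from collections import Counter

# Hypothetical toy metadata; the real project derives features from TMDB fields.
MOVIES = {
    "Space Quest": "space adventure alien crew",
    "Star Drift": "space alien war crew",
    "Love in Paris": "romance paris comedy",
}


def vectorize(text):
    """Turn a whitespace-separated tag string into a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def recommend(title, movies, top_n=2):
    """Return (title, score) pairs for the top_n most similar movies, best first."""
    query = vectorize(movies[title])
    scores = [
        (other, cosine(query, vectorize(tags)))
        for other, tags in movies.items()
        if other != title
    ]
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return scores[:top_n]
```

Calling `recommend("Space Quest", MOVIES)` ranks the other two toy titles by how many tags they share with the query; a Streamlit front end would simply call such a function with the title the user selects.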
This project uses the S.Y. 2020-2021 DepEd Schools Masterlist, which contains information on 64,000+ schools across the Philippines, including location, sector, and classification details.
Enterprise-grade training data curation bot for LLM fine-tuning using Decodo and Python automation. It provides an async, modular pipeline for document loading, preprocessing, task-specific data generation (Q&A, summarization, classification), quality evaluation, and dataset export, all through a unified API.
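As a rough picture of what an async, staged curation pipeline of this kind might look like, here is a small sketch built on `asyncio`. Everything in it is hypothetical: the function names (`load_document`, `preprocess`, `generate_example`, `passes_quality`, `curate`, `export_jsonl`), the character-length quality check, and the first-sentence "summary" are placeholders and do not reflect the repository's API or Decodo's.

```python
import asyncio
import json


async def load_document(source):
    """Stand-in for real I/O (HTTP fetch, file read); here the source is the text itself."""
    await asyncio.sleep(0)
    return source


def preprocess(text):
    """Collapse runs of whitespace into single spaces."""
    return " ".join(text.split())


def generate_example(text, task):
    """Placeholder generation step; a real pipeline would call a model here."""
    if task == "summarization":
        first_sentence = text.split(".")[0].strip()
        return {"task": task, "input": text, "output": first_sentence + "."}
    raise ValueError(f"unsupported task: {task}")


def passes_quality(example, min_chars=10):
    """Hypothetical quality gate: drop very short inputs and empty outputs."""
    return len(example["input"]) >= min_chars and bool(example["output"].strip("."))


async def curate(sources, task="summarization"):
    """Load all sources concurrently, then preprocess, generate, and filter."""
    texts = await asyncio.gather(*(load_document(s) for s in sources))
    examples = [generate_example(preprocess(t), task) for t in texts]
    return [ex for ex in examples if passes_quality(ex)]


def export_jsonl(examples):
    """Serialize examples as JSON Lines, one record per line."""
    return "\n".join(json.dumps(ex) for ex in examples)
```

A caller would run `asyncio.run(curate([...]))` and pass the result to `export_jsonl`. Keeping each stage as its own function is what makes the pipeline modular: any stage can be swapped without touching the others.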
Public Repository: Machine Learning & Data Mining project using the South African Heart Disease dataset. Applied PCA, Regularized Linear Regression, ANN, Logistic Regression, and Decision Trees with cross-validation for regression and classification. Includes feature scaling, EDA, and statistical tests.
The project demonstrates how to build an end-to-end supervised learning workflow for credit approval prediction. It documents the data preparation steps, the training of several classification algorithms, the model selection criteria, and the interpretation of the final solution so that the process can be replicated.
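One common way to compare several classifiers and pick one is k-fold cross-validation. The sketch below illustrates that idea with the standard library only. It is not the project's workflow: the toy `DATA`, the two candidate models (a majority-class baseline and a nearest-centroid classifier), and mean accuracy as the selection criterion are all assumptions chosen to keep the example small.

```python
from statistics import mean

# Hypothetical toy data: (income, debt) -> approved (1) or rejected (0).
DATA = [
    ((5.0, 1.0), 1), ((6.0, 1.5), 1), ((5.5, 0.5), 1), ((7.0, 2.0), 1),
    ((1.0, 3.0), 0), ((1.5, 4.0), 0), ((2.0, 3.5), 0), ((0.5, 2.5), 0),
]


def fit_majority(train):
    """Baseline: always predict the most common training label."""
    labels = [y for _, y in train]
    majority = max(set(labels), key=labels.count)
    return lambda x: majority


def fit_nearest_centroid(train):
    """Predict the label whose class mean is closest to x (squared Euclidean)."""
    centroids = {}
    for label in {y for _, y in train}:
        points = [x for x, y in train if y == label]
        centroids[label] = tuple(mean(dim) for dim in zip(*points))

    def predict(x):
        return min(
            centroids,
            key=lambda lb: sum((a - b) ** 2 for a, b in zip(x, centroids[lb])),
        )

    return predict


def cross_val_accuracy(fit, data, k=4):
    """Mean accuracy over k folds; fold i holds out rows i, i + k, i + 2k, ..."""
    scores = []
    for i in range(k):
        test = data[i::k]
        train = [row for j, row in enumerate(data) if j % k != i]
        model = fit(train)
        scores.append(mean(1.0 if model(x) == y else 0.0 for x, y in test))
    return mean(scores)


def select_model(candidates, data, k=4):
    """Score every candidate with the same folds and return the best name plus all scores."""
    results = {name: cross_val_accuracy(fit, data, k) for name, fit in candidates.items()}
    best = max(results, key=results.get)
    return best, results
```

Calling `select_model({"majority": fit_majority, "nearest_centroid": fit_nearest_centroid}, DATA)` scores both candidates on identical folds, so the comparison is like-for-like; a baseline is included so that any chosen model can be shown to beat trivial guessing.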