Stars
The simplest, fastest repository for training/finetuning small-sized VLMs.
Implementation of gradient descent from scratch with a linear regression toy example
Statsmodels: statistical modeling and econometrics in Python
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
locality sensitive hashing (LSHASH) for Python3
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
In this project, I explored how local LLMs can be used to label data and support analyses. Specifically, I used Llama2 model to automatically categorise my bank transaction data.
The definitive Python library to receive livestream events (comments, gifts, etc.) in realtime from TikTok LIVE.
The Amazon Athena Query Federation SDK allows you to customize Amazon Athena with your own data sources and code.
The official home of the Presto distributed SQL query engine for big data
Collection of articles listing reasons why data science projects fail.
Phomint / Fiberhome_TL1
Forked from igorcardoso14/Fiberhome_TL1A Python script to configure a OLT with a list of users using TL1
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊
shiftkey / desktop
Forked from desktop/desktopFork of GitHub Desktop to support various Linux distributions
The official repository for ROOT: analyzing, storing and visualizing big data, scientifically
Introdução a Redes Neurais com PyTorch. Ministrado no Python Brasil 2020.
Multi-platform Electron template, using React & Redux Toolkit on the front-end and Python/Flask for microservices on the back-end.
Build cross-platform desktop apps with JavaScript, HTML, and CSS
⚡ Dynamically generated stats for your github readmes
Cheat Sheets
Reverb is an efficient and easy-to-use data storage and transport system designed for machine learning research