- UK or HU
- http://tamas.szuromi.me
- @tamas__szuromi
Stars
A curated list of awesome Machine Learning frameworks, libraries and software.
scikit-learn: machine learning in Python
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
The fundamental package for scientific computing with Python.
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials,โฆ
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integraโฆ
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Always know what to expect from your data.
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Bayesian Modeling and Probabilistic Programming in Python
A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrโฆ
An open source python library for automated feature engineering
Numenta Platform for Intelligent Computing is an implementation of Hierarchical Temporal Memory (HTM), a theory of intelligence based strictly on the neuroscience of the neocortex.
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
A simple Flask boilerplate app with SQLAlchemy, Redis, User Authentication, and more.