-
Hopsworks
- Stockholm
- @jim_dowling
- in/jim-dowling-206a98
Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Extremely fast Query Engine for DataFrames, written in Rust
DuckDB is an analytical in-process SQL database management system
DSPy: The framework for programmingβnot promptingβlanguage models
an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
A playbook for systematically maximizing the performance of deep learning models.
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
βοΈ DEPRECATED β See https://github.com/ageron/handson-ml3 instead.
A game theoretic approach to explain the output of any machine learning model.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
π Cube Core is open-source semantic layer for AI, BI and embedded analytics
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
An orchestration platform for the development, production, and observation of data assets.
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
π Open source distributed and RESTful search engine.
A framework for few-shot evaluation of language models.
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance β¦
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Modin: Scale your Pandas workflows by changing a single line of code
An open-source, low-code machine learning library in Python