Stars
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
FastAPI framework, high performance, easy to learn, fast to code, ready for production
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
Fast, flexible name matching for large datasets
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
Implementation of a linear-chain CRF in PyTorch
The complete load testing platform. Everything you need for production-grade load tests. Serverless & distributed. Load test with Playwright. Load test HTTP APIs, GraphQL, WebSocket, and more. Use …
Deep Learning for Time Series Classification
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
A Java 8 (and up) compatibility kit for Scala.
A curated list of awesome Machine Learning frameworks, libraries and software.
A high performance caching library for Java
💾 Database Tools incl. ORM, Migrations and Admin UI (Postgres, MySQL & MongoDB) [deprecated]
A curated list for awesome kubernetes sources 🚢🎉
CockroachDB: the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placement.
Jenkins X provides automated CI+CD for Kubernetes with Preview Environments on Pull Requests using Cloud Native pipelines from Tekton
Code to accompany Advanced Analytics with Spark from O'Reilly Media
C++ library to develop competitive programming problems
A set of best practices for JavaScript projects
Mirror of Apache Gossip Incubator
This guide should help fellow researchers and hobbyists to easily automate and accelerate their deep learning training with their own Kubernetes GPU cluster.