Stars
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Multi-user server for Jupyter notebooks
An open source python library for automated feature engineering
An industrial deep learning framework for high-dimension sparse data
Resilience4j is a fault tolerance library designed for Java8 and functional programming
Simple scala wrapper for HttpURLConnection. OAuth included.
Generate Java types from JSON or JSON Schema and annotate those types for data-binding with Jackson, Gson, etc
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Learning embeddings for classification, retrieval and ranking.
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
An ONNX (Open Neural Network eXchange) API and backend for typeful, functional deep learning and classical machine learning in Scala 3
The missing Java distribution of native C++ libraries
tensorboard for pytorch (and chainer, mxnet, numpy, ...)
Uber-project for standard Jackson binary format backends: avro, cbor, ion, protobuf, smile
Intellij plugin that shows an object layout in memory to help optimize it. Uses OpenJDK JOL tool
📄 🇨🇳 📃 论文阅读笔记(分布式系统、虚拟化、机器学习)Papers Notebook (Distributed System, Virtualization, Machine Learning)
Data model generator based on Scala case classes
Avro schema generation and serialization / deserialization for Scala
🐘 Elasticsearch real-time search and analytics natively integrated with Hadoop
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.