Skip to content
View hangelwen's full-sized avatar

Block or report hangelwen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.

Python 9,564 2,614 Updated Sep 4, 2025

Simhash and near-duplicate detection

Python 421 115 Updated May 15, 2023

Multi-user server for Jupyter notebooks

Python 8,207 2,095 Updated Dec 15, 2025

An open source python library for automated feature engineering

Python 7,591 908 Updated Dec 23, 2025

ADMM based large scale logistic regression

Java 337 76 Updated Dec 16, 2023

An industrial deep learning framework for high-dimension sparse data

PureBasic 4,306 1,029 Updated Sep 25, 2024

Resilience4j is a fault tolerance library designed for Java8 and functional programming

Java 10,493 1,436 Updated Dec 3, 2025

Simple scala wrapper for HttpURLConnection. OAuth included.

Scala 972 118 Updated Apr 3, 2022

The Scala HTTP client you always wanted!

Scala 1,493 327 Updated Dec 24, 2025

Data-Centric Pipelines and Data Versioning

Go 6,277 570 Updated Feb 3, 2025

Generate Java types from JSON or JSON Schema and annotate those types for data-binding with Jackson, Gson, etc

Java 6,355 1,671 Updated Dec 23, 2025

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala 3,560 576 Updated Nov 4, 2025

Learning embeddings for classification, retrieval and ranking.

C++ 3,958 527 Updated Dec 4, 2022

A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)

Python 3,679 530 Updated Dec 6, 2025

An ONNX (Open Neural Network eXchange) API and backend for typeful, functional deep learning and classical machine learning in Scala 3

Scala 143 9 Updated Nov 18, 2025

The missing Java distribution of native C++ libraries

Java 2,816 755 Updated Dec 26, 2025

tensorboard for pytorch (and chainer, mxnet, numpy, ...)

Python 7,984 858 Updated Nov 2, 2025

Uber-project for standard Jackson binary format backends: avro, cbor, ion, protobuf, smile

Java 337 147 Updated Dec 19, 2025

Intellij plugin that shows an object layout in memory to help optimize it. Uses OpenJDK JOL tool

Java 148 10 Updated Sep 14, 2024

📄 🇨🇳 📃 论文阅读笔记(分布式系统、虚拟化、机器学习)Papers Notebook (Distributed System, Virtualization, Machine Learning)

2,198 252 Updated Jun 1, 2022

Automated Machine Learning on Kubernetes

Python 1,648 495 Updated Dec 16, 2025
Scala 11 5 Updated Aug 22, 2023

Data model generator based on Scala case classes

Scala 29 14 Updated Nov 5, 2020

Avro schema generation and serialization / deserialization for Scala

Scala 727 240 Updated Dec 1, 2025

spark typed udf

Scala 10 4 Updated Jul 17, 2018

Responsive dashboard templates 📊✨

HTML 11,050 1,408 Updated Nov 2, 2021

🐘 Elasticsearch real-time search and analytics natively integrated with Hadoop

Java 2,023 999 Updated Dec 19, 2025

The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊

Clojure 45,308 6,125 Updated Dec 26, 2025

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

Python 28,099 4,537 Updated Dec 19, 2025

The missing MatPlotLib for Scala + Spark

Scala 731 97 Updated Jan 30, 2022
Next