minhash

Here are 126 public repositories matching this topic...

Sparkling Water is a scalable system for detecting, merging, and clustering similar server processes based on interaction logs. Using Apache Spark, MinHash, LSH, and time-series hashing (SSH, BSeSH), it efficiently identifies behavior patterns in large server infrastructures for performance optimization, anomaly detection, and system analysis.

  • Updated Jun 28, 2025
  • Jupyter Notebook
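
For readers new to the topic, the MinHash technique named in the description above can be illustrated with a minimal sketch. This is not code from the listed repository: the function names `minhash_signature` and `estimate_jaccard`, the choice of salted BLAKE2b as the hash family, and the sample event names are all assumptions made for illustration, using only the Python standard library.

```python
import hashlib


def minhash_signature(items, num_perm=128):
    """Return a MinHash signature: one minimum hash value per salted hash function.

    Hypothetical helper for illustration; `items` must be a non-empty
    collection of strings.
    """
    if not items:
        raise ValueError("items must be non-empty")
    signature = []
    for seed in range(num_perm):
        # A distinct salt per seed simulates an independent hash function.
        salt = seed.to_bytes(8, "little")
        signature.append(
            min(
                int.from_bytes(
                    hashlib.blake2b(
                        item.encode("utf-8"), digest_size=8, salt=salt
                    ).digest(),
                    "big",
                )
                for item in items
            )
        )
    return signature


def estimate_jaccard(sig_a, sig_b):
    """Estimate Jaccard similarity as the fraction of matching signature slots."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(1 for a, b in zip(sig_a, sig_b) if a == b)
    return matches / len(sig_a)


# Made-up example data: two sets sharing 100 of 300 distinct elements,
# so the true Jaccard similarity is 1/3.
set_a = {f"event-{i}" for i in range(0, 200)}
set_b = {f"event-{i}" for i in range(100, 300)}

sig_a = minhash_signature(set_a, num_perm=256)
sig_b = minhash_signature(set_b, num_perm=256)
estimate = estimate_jaccard(sig_a, sig_b)
```

The estimate approaches the true Jaccard similarity as `num_perm` grows, at the cost of longer signatures. LSH schemes typically split such signatures into bands and hash each band to find candidate pairs without comparing every pair of sets.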
