simhash This is a Python implementation of Simhash. The Simhash index (used to detected near-duplicates) is based on Scylla. Getting Started http://leons.im/posts/a-python-implementation-of-simhash-algorithm/ Build Status