Stars
andykram / openscoring
Forked from openscoring/openscoringREST web service for scoring PMML models
OSQA / osqa
Forked from evgenyfadeev/askbot-develAn open source Q&A(question and answer) eco-system. Issue tracking is at http://jira.osqa.net
The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
haifengl / unicorn
Forked from adplabs/unicornBigTable, Document and Graph Database with Full Text Search
Alir3z4 / python-sanitize
Forked from aaronsw/sanitizeBringing sanity to world of messed-up data
Scalable Bloom Filter implemented in Python
scrapinghub / page_finder
Forked from plafl/page_finderFind which links on a web page are pagination links
Fast Python Bloom Filter using Mmap
nkhuyu / quark
Forked from qubole/quarkQuark is a data virtualization engine over analytic databases.
Continually updated Data Science Python Notebooks: Spark, Hadoop MapReduce, HDFS, AWS, Kaggle, scikit-learn, matplotlib, pandas, NumPy, SciPy, and various command lines.
nkhuyu / t-SNE-tutorial
Forked from oreillymedia/t-SNE-tutorialA tutorial on the t-SNE learning algorithm
nkhuyu / dedupe
Forked from dedupeio/dedupeA python library for accurate and scaleable data deduplication and entity-resolution.
Kaggle's competition for using Google's word2vec package for sentiment analysis
nkhuyu / langid.py
Forked from saffsd/langid.pyStand-alone language identification system
nkhuyu / h2o
Forked from h2oai/h2o-2h2o = fast statistical, machine learning & math runtime for bigdata
nkhuyu / svmjs
Forked from karpathy/svmjsSupport Vector Machine in Javascript (SMO algorithm, supports arbitrary kernels) + GUI demo
nkhuyu / mincemeatpy
Forked from michaelfairley/mincemeatpyLightweight MapReduce in python