Starred repositories
Source code for the X Recommendation Algorithm
Apache Spark - A unified analytics engine for large-scale data processing
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text.
Modeling high-frequency limit order book dynamics with support vector machines
Former GraphX development repository. GraphX has been merged into Apache Spark; please submit pull requests there.
Ollie is an open information extractor that uses bootstrapped dependency paths.
An implementation of a multi-class/multi-label classifier trained with AdaBoost.MH on Apache Spark.
Random Walk (Personalized PageRank) Algorithms for Large Graphs
fanfannothing / goose
Forked from GravityLabs/goose
Html Content / Article Extractor in Scala - open sourced from Gravity Labs - http://gravity.com
fanfannothing / dnmar
Forked from srivastava-sar/dnmar
Distant Supervision with Missing Data