Stars
From the Alibaba Cloud computing platform DataWorks (https://help.aliyun.com/document_detail/137663.html) team: a database connection pool built for monitoring
Pentaho Data Integration ( ETL ) a.k.a Kettle
A Flexible and Powerful Parameter Server for large-scale machine learning
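ansj word segmentation. A true Java implementation of ict. Its segmentation quality and speed both exceed the open-source version of ict. Chinese word segmentation, person-name recognition, part-of-speech tagging, user-defined dictionaries
A Spark-based movie recommendation system, comprising a crawler project, a web site, a back-end administration system, and the Spark recommendation system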
The missing Java distribution of native C++ libraries
ZooKeeper-Monitor, a monitor for zookeeper in java. Download https://github.com/alibaba/taokeeper/downloads
Apache Aurora - A Mesos framework for long-running services, cron jobs, and ad-hoc jobs
Source code to the book "Effective Java Second Edition" created by Joshua Bloch
RSS Owl is a powerful application to organize, search and read your RSS, RDF & Atom news feeds in a comfortable way. Highlights are saved searches, google reader sync, notifications, filters, fast …
1st Place Solution for DataCastle-CashBus Competition
Warcbase is an open-source platform for managing and analyzing web archives
make non-root mountable encrypted disk shares
Solr-like data import handler to migrate data from SQL systems to NoSQL
Efficient Execution of Perl Scripts on Hadoop Clusters