
Starred repositories

25 stars written in Scala

Source code for the X Recommendation Algorithm

Scala 68,081 12,658 Updated Sep 8, 2025

Apache Spark - A unified analytics engine for large-scale data processing

Scala 42,527 28,978 Updated Dec 23, 2025

A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.

Scala 13,243 3,582 Updated Dec 19, 2025

A fault tolerant, protocol-agnostic RPC system

Scala 8,854 1,450 Updated Dec 16, 2025

Old repo for Linkerd 1.x. See the linkerd2 repo for Linkerd 2.x.

Scala 5,336 502 Updated Mar 4, 2023

Simple and Distributed Machine Learning

Scala 5,192 854 Updated Dec 17, 2025

Fault tolerant job scheduler for Mesos which handles dependencies and ISO8601 based schedules

Scala 4,386 522 Updated Jun 29, 2022

Deploy and manage containers (including Docker) on top of Apache Mesos at scale.

Scala 4,049 835 Updated Sep 8, 2022

酷玩 Spark (Coolplay Spark): Spark source code analysis, Spark libraries, and more

Scala 3,485 1,397 Updated May 18, 2022

REST job server for Apache Spark

Scala 2,846 985 Updated Jul 8, 2025

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,286 971 Updated Dec 22, 2025

Streaming MapReduce with Scalding and Storm

Scala 2,130 264 Updated Jan 19, 2022

A collection of test cases and related reference material gathered while using Scala and Spark

Scala 1,086 427 Updated Feb 9, 2019

Livy is an open source REST interface for interacting with Apache Spark from anywhere

Scala 1,007 312 Updated Oct 5, 2022

Lightweight real-time big data streaming engine over Akka

Scala 758 152 Updated Mar 1, 2022

Apache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the kubernetes scheduler back-end is now on https://github.com/apa…

Scala 612 117 Updated Jan 8, 2020

Real Time Analytics and Data Pipelines based on Spark Streaming

Scala 529 196 Updated Oct 24, 2019

Apache Spark training material

Scala 400 357 Updated Nov 24, 2015

Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.

Scala 390 51 Updated Dec 6, 2025

Serverless proxy for Spark cluster

Scala 324 69 Updated Oct 29, 2020

k-Nearest Neighbors algorithm on Spark

Scala 239 108 Updated Nov 14, 2023

Spark RDD with Lucene's query and entity linkage capabilities

Scala 128 37 Updated Sep 8, 2025

Scala binding for ZeroMQ

Scala 71 22 Updated May 10, 2013

An experiment that uses the PostgreSQL 'cube' extension to store and query word vectors

Scala 6 1 Updated Apr 3, 2018

Kyuubi is an enhanced edition of Apache Spark's primordial Thrift JDBC/ODBC Server.

Scala 6 3 Updated Jul 16, 2018