-
Ruijie Networks
- Shanghai
- http://hslovelal.top:8080
Lists (5)
Sort Name ascending (A-Z)
Starred repositories
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
Streaming data platform. Real-time stream processing, low-latency serving, and Iceberg table management.
《Agentic Design Patterns》中文翻译版
cuTile is a programming model for writing parallel kernels for NVIDIA GPUs
Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
A super fast Graph Database uses GraphBLAS under the hood for its sparse adjacency matrix graph representation. Our goal is to provide the best Knowledge Graph for LLM (GraphRAG).
Build fast and accurate GenAI apps with GraphRAG SDK at scale.
Fire框架是由中通大数据自主研发并开源的、专门用于进行Spark和Flink任务开发的大数据框架,可节约70%以上的代码量。首创基于注解进行Spark和Flink任务开发,具备实时血缘、根因诊断、动态调优、参数热调整等众多平台化功能。Fire框架在中通内部每天处理数据量高达数千亿,在外部已被数十家公司所使用。
FastDFS is a high performance distributed file system (DFS). It's major functions include: file storing, file syncing and file accessing, and design for high capacity and load balance. Wechat/Weixi…
MCP Server for Apache Spark History Server. The bridge between Agentic AI and Apache Spark.
pg_lake: Postgres with Iceberg and data lake access
sql 血缘解析(hive sql、spark sql、starrocks sql、doris sql)
Analyze SQL and stored procedure data lineage using Java
Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data types and format characters) using Java.
一个通过类Raid技术,将文件分布式存储于多个消费级网盘,以实现极致下载加速的开源云盘分布式文件系统
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
解析SQL,获取字段、表级别的血缘关系。转换成血缘模型,在图数据库neo4j上呈现。
基于 antlr4 的多种数据库SQL解析器,获取SQL中元数据,可用于数据平台产品中的多个场景:ddl语句提取元数据、sql 权限校验、表级血缘、sql语法校验等场景。支持spark、flink、gauss、starrocks、Oracle、MYSQL、Postgresql,sqlserver,、db2等
Nebula-Algorithm is a Spark Application based on GraphX, which enables state of art Graph Algorithms to run on top of NebulaGraph and write back results to NebulaGraph.
One-liner NebulaGraph playground with allllllllll-in-one toolchain integrated on single Linux Server
A dataset generator/graph modeling demo of Shareholding Breakthrough with Distributed open-source Graph Database: Nebula Graph. 图数据库应用示例、数据集、图建模:股权关系穿透
China Fake Dataset Generator for Covid Track
Fraud detection data generation with configurable degree distribution& community structure, ready for NebulaGraph.
Spark Sql on Yarn Cluster or Kubeflow-Spark-Operator
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…