- Shanghai, China
- https://www.linkedin.com/in/chufeng-gao
- @ChufengGao
-
dbt-spark Public
Forked from dbt-labs/dbt-sparkdbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks
Python Apache License 2.0 UpdatedMar 10, 2026 -
chill-worklog Public
agentic workflow to write weekly / monthly worklogs
-
spark-kubernetes-operator Public
Forked from apache/spark-kubernetes-operatorApache Spark Kubernetes Operator
Java Apache License 2.0 UpdatedNov 4, 2025 -
spark Public
Forked from apache/sparkApache Spark - A unified analytics engine for large-scale data processing
Scala Apache License 2.0 UpdatedOct 30, 2025 -
dolphinscheduler Public
Forked from apache/dolphinschedulerApache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and provi…
-
kyuubi Public
Forked from apache/kyuubiApache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Scala Apache License 2.0 UpdatedMay 30, 2025 -
alibaba_hive_operator Public
Apache Airflow Hive Operator for Alibaba Cloud Serverless Spark
Python Other UpdatedMay 9, 2025 -
airflow_alibaba_provider Public archive
airflow_alibaba_provider
-
-
dbt-core Public
Forked from dbt-labs/dbt-coredbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Python Apache License 2.0 UpdatedOct 11, 2024 -
airflow Public
Forked from apache/airflowApache Airflow - A platform to programmatically author, schedule, and monitor workflows
Python Apache License 2.0 UpdatedSep 13, 2024 -
jaffle-shop-classic Public
Forked from dbt-labs/jaffle-shop-classicA self-contained dbt project for testing purposes
Apache License 2.0 UpdatedSep 12, 2024 -
-
LearningSparkV2 Public
Forked from databricks/LearningSparkV2This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Scala Apache License 2.0 UpdatedMay 8, 2024 -
spark-operator Public
Forked from kubeflow/spark-operatorKubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Go Apache License 2.0 UpdatedApr 24, 2024 -
micrometer Public
Forked from micrometer-metrics/micrometerAn application metrics facade for the most popular monitoring tools. Think SLF4J, but for metrics.
Java Apache License 2.0 UpdatedDec 28, 2023 -
bigdata-charts Public
Forked from Gradiant/bigdata-chartsCurated Big Data Applications for Kubernetes
PLpgSQL Apache License 2.0 UpdatedJul 19, 2023 -
mage-ai Public
Forked from mage-ai/mage-ai🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
Python Apache License 2.0 UpdatedJul 13, 2023 -
hue Public
Forked from cloudera/hueOpen source SQL Query Assistant service for Databases/Warehouses
JavaScript Apache License 2.0 UpdatedJul 4, 2023 -
-
kafka Public
Forked from apache/kafkaMirror of Apache Kafka
Java Apache License 2.0 UpdatedMay 8, 2023 -
linkis Public
Forked from apache/linkisApache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
Java Apache License 2.0 UpdatedMar 28, 2023 -
chatgpt-java Public
Forked from kezhenxu94/chatgpt-javaChatGPT SDK and CLI for Java
Java Apache License 2.0 UpdatedMar 7, 2023 -
OpenLineage Public
Forked from OpenLineage/OpenLineageAn Open Standard for lineage metadata collection
Java Apache License 2.0 UpdatedFeb 20, 2023 -
DocsGPT Public
Forked from arc53/DocsGPTGPT-powered chat for documentation search & assistance.
Python MIT License UpdatedFeb 15, 2023 -
-
skywalking Public
Forked from apache/skywalkingAPM, Application Performance Monitoring System
Java Apache License 2.0 UpdatedFeb 2, 2023 -
kubernetes-client Public
Forked from fabric8io/kubernetes-clientJava client for Kubernetes & OpenShift
Java Apache License 2.0 UpdatedJan 28, 2023 -
cassandra Public
Forked from apache/cassandraMirror of Apache Cassandra
Java Apache License 2.0 UpdatedJan 27, 2023 -
keda Public
Forked from kedacore/kedaKEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event driven scale for any container running in Kubernetes
Go Apache License 2.0 UpdatedJan 10, 2023