Organizations

@DMALab @PKU-DAIR

Python 34 Updated Oct 16, 2025

Easy Data Preparation with the latest LLM-based Operators and Pipelines.

Python 5 Updated Jun 29, 2025

Easy Data Preparation with the latest LLM-based Operators and Pipelines.

Python 1,613 116 Updated Dec 19, 2025

A comprehensive guide for beginners in the field of data management and artificial intelligence.

506 21 Updated Apr 8, 2025

A collection of awesome papers on RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in the paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

314 18 Updated Jun 21, 2024

A collection of awesome papers on RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in the paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

1,775 123 Updated Aug 20, 2024

Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs). If you are interested, please visit/star/fork https://github.com/P…

Python 23 18 Updated Oct 22, 2025

Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs).

Python 174 15 Updated Dec 16, 2025

A curated reading list of research in Mixture-of-Experts (MoE).

653 44 Updated Oct 30, 2024

[IJCAI 2023] An automated parallel training system that combines the advantages of both data and model parallelism. If you are interested, please visit/star/fork https://github.com/Youhe-Jiang…

Python 52 5 Updated May 31, 2023

A scalable graph learning toolkit for extremely large graph datasets. (WWW'22, 🏆 Best Student Paper Award)

Python 157 24 Updated May 10, 2024

A high-performance distributed deep learning system targeting large-scale and automated distributed training.

Python 329 40 Updated Dec 13, 2025

[CVPR 2022] PointCLIP: Point Cloud Understanding by CLIP

Python 402 37 Updated Nov 24, 2022

An efficient open-source AutoML system for automating machine learning lifecycle, including feature engineering, neural architecture search, and hyper-parameter tuning.

Python 60 9 Updated Nov 11, 2025

A high-performance distributed deep learning system targeting large-scale and automated distributed training. If you are interested, please visit/star/fork https://github.com/PKU-DAIR/Hetu

Python 122 57 Updated Dec 18, 2023

Towards Generalized and Efficient Blackbox Optimization System/Package (KDD 2021 & JMLR 2024)

Python 430 57 Updated Sep 18, 2025

Generalized and Efficient Blackbox Optimization System.

Python 85 84 Updated Feb 21, 2023

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

275,047 20,997 Updated Aug 22, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 96,023 26,295 Updated Dec 20, 2025

The Julia Programming Language

Julia 48,118 5,692 Updated Dec 19, 2025

Ternary Gradients to Reduce Communication in Distributed Deep Learning (TensorFlow)

Python 182 48 Updated Nov 19, 2018

Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

Python 2,707 625 Updated Nov 22, 2022

😎 Awesome lists about all kinds of interesting topics

423,575 32,619 Updated Nov 22, 2025

A Detailed C++ Concurrency Tutorial (《C++ 并发编程指南》, "C++ Concurrent Programming Guide")

C++ 5,472 1,488 Updated Dec 29, 2022

LIBSVM -- A Library for Support Vector Machines

Java 4,674 1,641 Updated May 12, 2025

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …

C++ 17,941 3,967 Updated Dec 19, 2025

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

C++ 27,762 8,827 Updated Dec 18, 2025

A Flexible and Powerful Parameter Server for large-scale machine learning

Java 6,783 1,592 Updated Oct 13, 2025

A RISC-V simulator implemented in C++

Assembly 3 Updated Sep 25, 2017