kq-chen

Organizations

@shikras

Starred repositories

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 4,350 326 Updated Jan 14, 2026

ComfyUI's ControlNet Auxiliary Preprocessors

Python 3,949 355 Updated Apr 13, 2026

A convenient and user-friendly anime-style image data processing library that integrates various advanced anime-style image processing models

Python 386 26 Updated Oct 11, 2025

Papers, repository and other data about anime or manga research. Please let me know if you have information that the list does not include.

1,233 72 Updated Dec 31, 2025

🚀 Efficient implementations for emerging model architectures

Python 5,003 515 Updated Apr 28, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,514 756 Updated Apr 28, 2026

Ring attention implementation with flash attention

Python 1,014 98 Updated Sep 10, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,951 322 Updated Jan 14, 2026

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 186 50 Updated Apr 8, 2026

Machine Learning Engineering Open Book

Python 17,814 1,132 Updated Mar 16, 2026

Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.

Python 186 30 Updated Mar 17, 2026

Ongoing research training transformer models at scale

Python 16,178 3,883 Updated Apr 28, 2026

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 2,581 370 Updated Apr 28, 2026

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,276 163 Updated Apr 6, 2026

watchpoints is an easy-to-use, intuitive variable/object monitor tool for python that behaves similar to watchpoints in gdb.

Python 555 22 Updated Dec 23, 2024

A library that can print Python objects in human readable format

Python 710 52 Updated Apr 2, 2025

coredumpy saves your crash site for post-mortem debugging

Python 757 20 Updated Jan 5, 2026

An intuitive and low-overhead instrumentation tool for Python

Python 1,201 41 Updated Jul 8, 2025

A debugging and profiling tool that can trace and visualize python code execution

Python 7,621 468 Updated Feb 16, 2026

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 99,502 27,615 Updated Apr 28, 2026

Open-source unified multimodal model

Python 5,871 520 Updated Oct 27, 2025

[CVPR2024, Highlight] Official code for DragDiffusion

Python 1,255 94 Updated Jan 29, 2024

[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

Python 3,141 468 Updated Mar 3, 2025

Repository for ECCVW 2024 paper "Detect, Describe, Discriminate: Moving Beyond VQA for MLLM Evaluation"

Python 6 Updated Oct 6, 2024

Papers, datasets, and resources related to 2D cartoon video research. Contributions welcome.

198 13 Updated Apr 21, 2026

Python 1,562 228 Updated Mar 25, 2026

Riichi-Mahjong score calculator

TypeScript 73 9 Updated Feb 18, 2025

🚀🀄️ A fast and strong AI for riichi mahjong, powered by Rust and deep reinforcement learning.

Rust 1,436 191 Updated Sep 28, 2025