Skip to content
View kq-chen's full-sized avatar

Organizations

@shikras

Block or report kq-chen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 4,260 311 Updated Jan 14, 2026

ComfyUI's ControlNet Auxiliary Preprocessors

Python 3,914 350 Updated Feb 16, 2026

A convenient and user-friendly anime-style image data processing library that integrates various advanced anime-style image processing models

Python 384 25 Updated Oct 11, 2025

Papers, repository and other data about anime or manga research. Please let me know if you have information that the list does not include.

1,232 71 Updated Dec 31, 2025

🚀 Efficient implementations for emerging model architectures

Python 4,846 487 Updated Apr 11, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,238 711 Updated Apr 9, 2026

Ring attention implementation with flash attention

Python 1,004 97 Updated Sep 10, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,939 320 Updated Jan 14, 2026

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 186 50 Updated Apr 8, 2026

Machine Learning Engineering Open Book

Python 17,662 1,120 Updated Mar 16, 2026

Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.

Python 182 30 Updated Mar 17, 2026

Ongoing research training transformer models at scale

Python 15,996 3,812 Updated Apr 11, 2026

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 2,434 344 Updated Apr 11, 2026

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,267 160 Updated Apr 6, 2026

watchpoints is an easy-to-use, intuitive variable/object monitor tool for python that behaves similar to watchpoints in gdb.

Python 555 21 Updated Dec 23, 2024

A library that can print Python objects in human readable format

Python 712 51 Updated Apr 2, 2025

coredumpy saves your crash site for post-mortem debugging

Python 757 20 Updated Jan 5, 2026

An intuitive and low-overhead instrumentation tool for Python

Python 1,203 41 Updated Jul 8, 2025

A debugging and profiling tool that can trace and visualize python code execution

Python 7,609 469 Updated Feb 16, 2026

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 99,034 27,462 Updated Apr 11, 2026

Open-source unified multimodal model

Python 5,798 513 Updated Oct 27, 2025

[CVPR2024, Highlight] Official code for DragDiffusion

Python 1,253 94 Updated Jan 29, 2024

[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

Python 3,084 452 Updated Mar 3, 2025

Repository for ECCVW 2024 paper "Detect, Describe, Discriminate: Moving Beyond VQA for MLLM Evaluation"

Python 6 Updated Oct 6, 2024

Papers, datasets, and resources related to 2D cartoon video research. Contributions welcome.

195 13 Updated Dec 8, 2025
Python 1,554 223 Updated Mar 25, 2026

Riichi-Mahjong score calculator

TypeScript 73 9 Updated Feb 18, 2025

🚀🀄️ A fast and strong AI for riichi mahjong, powered by Rust and deep reinforcement learning.

Rust 1,409 188 Updated Sep 28, 2025
Next