Skip to content
View joseph-chan's full-sized avatar

Block or report joseph-chan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 248 27 Updated Jun 9, 2026

2026TAAC腾讯广告算法大赛-KDDCUP方案,best score:0.832321,rank:51

Python 24 7 Updated Jun 7, 2026

UniRank: A Ranking Model Benchmark for Unified Sequential Modeling and Feature Interaction

Python 43 12 Updated May 21, 2026

TAAC2025初赛第十四名O_o队伍代码

Python 134 22 Updated Oct 27, 2025

Accelerating MoE with IO and Tile-aware Optimizations

Python 711 89 Updated Jun 13, 2026

CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs

Python 208 22 Updated Jun 10, 2026

TokenSpeed is a speed-of-light LLM inference engine.

Python 1,421 154 Updated Jun 13, 2026

A self-hosted ML coding practice platform. 68 problems from ReLU to flow matching — attention, training, RLHF, diffusion, and more. Instant feedback in the browser.

Python 1,136 103 Updated May 12, 2026

A hyperparameter optimization framework

Python 14,349 1,338 Updated Jun 12, 2026

CUDA kernels for linear attention variants, written in CuTe DSL and CUTLASS C++.

Python 519 64 Updated Jun 12, 2026

AI agents running research on single-GPU nanochat training automatically

Python 86,423 12,520 Updated Mar 26, 2026

Pytorch domain library for recommendation systems

Python 2,563 653 Updated Jun 12, 2026

🎓从0开始训练一个大模型Minimind项目的超详细解析,包括但不限于用到的架构,算法,以及大模型面试经验

Python 906 55 Updated May 25, 2026

The code repository for the KDD 2026 paper "Q-Regularized Generative Auto-Bidding: From Suboptimal Trajectories to Optimal Policies"

Python 14 Updated Dec 16, 2025

Implementation of "FlashPreill: Instantaneous Pattern Discovery and Thresholding for Ultra-Fast Long-Context Prefilling"

Python 53 6 Updated Apr 27, 2026

flash attention tutorial written in python, triton, cuda, cutlass

Cuda 523 54 Updated Jan 20, 2026

Offers a toolset for comprehensive, multi-faceted large-scale data analysis and optimizations

Python 80 21 Updated Oct 22, 2025

Inference Speed Benchmark for Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Cuda 90 5 Updated Jul 14, 2024

IntelliFold: A Controllable Foundation Model for General and Specialized Biomolecular Structure Prediction.

Python 223 24 Updated May 28, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 16,195 2,004 Updated Mar 17, 2026

Research code accompanying AlphaGenome

Python 779 130 Updated Jun 3, 2026
Python 17 4 Updated Nov 6, 2025

【Accepted by WWW 2026 🎉🎉】Generative Regression Based Watch Time Prediction for Short-Video Recommendation

Python 220 2 Updated Mar 3, 2026

High Performance LLM Inference Operator Library

C++ 931 96 Updated Jun 11, 2026

RePo: Language Models with Context Re-Positioning

Python 77 9 Updated Mar 30, 2026

Kernels, of the mega variety :)

Python 751 59 Updated May 26, 2026
Next