Lifann
  • Tencent
  • Guangzhou, China

Organizations

@tensorflow

Starred repositories

Efficient operation implementation based on the Cambricon Machine Learning Unit (MLU).

C++ 169 127 Updated Jun 16, 2026

A Python DSL to write Nvidia PTX for Hopper and Blackwell in JAX and PyTorch

Python 313 26 Updated May 8, 2026

A Claude Code plugin that shows what's happening - context usage, active tools, running agents, and todo progress

JavaScript 25,582 1,167 Updated Jun 20, 2026

🤯 LobeHub is your Chief Agent Operator, organizing your agents into 7×24 operations by hiring, scheduling, and reporting on your entire AI team.

TypeScript 78,963 15,473 Updated Jun 22, 2026

A configuration framework that enhances Claude Code with specialized commands, cognitive personas, and development methodologies.

Python 23,347 1,984 Updated Jun 13, 2026

A Claude Skill to give your agent the ability to use a web browser

TypeScript 6,305 400 Updated Jun 5, 2026

Persistent file-based planning for AI coding agents and long-running agentic tasks. Crash-proof markdown plans that survive context loss and /clear, plus a deterministic completion gate and multi-a…

Python 23,760 2,074 Updated Jun 16, 2026

Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu — one CLI, zero API fees.

Python 37,700 2,992 Updated Jun 16, 2026

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Python 1,078 90 Updated Mar 4, 2026

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

Cuda 441 28 Updated Mar 30, 2026

Allow torch tensor memory to be released and resumed later

Python 251 60 Updated May 16, 2026

A unified architecture deep learning framework designed specifically for ultra-large-scale sparse models.

Python 348 24 Updated Feb 9, 2026

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Python 3,336 586 Updated Apr 15, 2024

XSLT 171 11 Updated May 2, 2024

Production-ready platform for agentic workflow development.

TypeScript 146,159 22,985 Updated Jun 22, 2026

An Unbiased Sequential Recommendation Dataset with Randomly Exposed Videos

HTML 131 11 Updated Jan 6, 2026

Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure

C++ 1,032 436 Updated Jun 18, 2026

Examples for Recommenders - easy to train and deploy on accelerated infrastructure.

Python 282 71 Updated Jun 17, 2026

LibreCAD is a cross-platform 2D CAD program. It can read DXF/DWG, and write DXF/DWG/PDF/SVG files. It supports point/line/circle/ellipse/parabola/hyperbola/spline primitives. The GUI is highly cust…

C++ 5,978 1,235 Updated Jun 22, 2026

Efficient Top-K implementation on the GPU

Cuda 191 25 Updated Apr 9, 2019

cuVS - a library for vector search and clustering on the GPU

Cuda 784 195 Updated Jun 22, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 7,401 1,059 Updated Jun 4, 2026

DeepEP: an efficient expert-parallel communication library

Cuda 9,751 1,293 Updated Jun 15, 2026

Sass 4 3 Updated Sep 14, 2024

Fastest kernels written from scratch

Cuda 583 76 Updated Sep 18, 2025

Learning CUDA

HTML 6 Updated Jun 17, 2026

Cuda 1 Updated Sep 26, 2024

High-performance GEMM implementation optimized for NVIDIA H100 GPUs, leveraging Hopper architecture's TMA, WGMMA, and Thread Block Clusters for near-peak theoretical performance.

Cuda 11 Updated Dec 4, 2024

Network Analysis in Python

Python 17,032 3,521 Updated Jun 20, 2026

Make huge neural nets fit in memory

Python 2,840 279 Updated Apr 26, 2020