An agentic skills framework & software development methodology that works.

Shell 174,462 15,394 Updated Apr 28, 2026

Convert arxiv papers to markdown

Python 159 14 Updated Apr 11, 2026

Agent skills for Obsidian. Teach your agent to use Markdown, Bases, JSON Canvas, and use the CLI.

27,785 1,834 Updated Apr 2, 2026

Multi-agent reconnaissance skill for Claude Code + Obsidian

23 7 Updated Feb 21, 2026

WeChat Official Account to RSS

Shell 1,387 144 Updated Apr 9, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 4,362 330 Updated Jan 14, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,615 1,020 Updated Apr 30, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 7,149 955 Updated Apr 24, 2026

DeepEP: an efficient expert-parallel communication library

Cuda 9,591 1,217 Updated Apr 29, 2026

Magnum IO community repo

C++ 115 19 Updated Mar 23, 2026

Train transformer language models with reinforcement learning.

Python 18,216 2,680 Updated Apr 30, 2026

ByteCheckpoint: A Unified Checkpointing Library for LFMs

Python 278 19 Updated Feb 2, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,435 930 Updated Apr 30, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,034 3,779 Updated Apr 30, 2026

Infiniband Verbs Performance Tests

C 951 398 Updated Apr 15, 2026

RDMA core userspace libraries and daemons

C 2,212 852 Updated Apr 20, 2026

Large Context Attention

Python 770 53 Updated Oct 13, 2025

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,646 1,339 Updated Apr 29, 2026

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 33,507 6,957 Updated Apr 30, 2026

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 5,189 372 Updated Apr 20, 2026

A PyTorch native platform for training generative AI models

Python 5,287 802 Updated Apr 30, 2026

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

5,006 536 Updated Sep 25, 2024

Rotary Transformer

Python 1,107 62 Updated Mar 21, 2022

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,192 1,983 Updated Jan 9, 2026

Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs

Python 1,010 61 Updated Mar 3, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 160,126 33,066 Updated Apr 30, 2026

Paragraph-by-paragraph close reading of classic and new deep learning papers

32,946 2,784 Updated Mar 22, 2025

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,247 366 Updated Aug 14, 2025

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,625 933 Updated Aug 21, 2024

A QoS-based scheduling system brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, etc.

Go 1,678 417 Updated Apr 29, 2026