bugggggggg

Starred repositories

An autonomous agent for deep financial research

TypeScript 14,494 1,734 Updated Feb 10, 2026

LLM inference in C/C++

C++ 94,853 14,869 Updated Feb 11, 2026

mHC kernels implemented in CUDA

Cuda 251 20 Updated Jan 14, 2026

Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality

HTML 317 18 Updated Jan 5, 2026

Sampling profiler for Python programs

Rust 14,933 499 Updated Feb 5, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 19,151 3,228 Updated Feb 11, 2026

My learning notes for ML SYS.

Python 5,313 346 Updated Jan 30, 2026
Python 63 5 Updated Jul 10, 2025

Scaling RL on advanced reasoning models

Python 662 41 Updated Oct 20, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 20,090 2,139 Updated Feb 10, 2026

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,514 207 Updated Jan 25, 2026
Python 434 34 Updated Oct 16, 2025

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 35,759 5,740 Updated Feb 11, 2026

DeepEP: an efficient expert-parallel communication library

Cuda 8,975 1,092 Updated Feb 9, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,473 984 Updated Feb 6, 2026
Python 923 84 Updated Dec 11, 2025

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 8,985 876 Updated Feb 6, 2026

Updates ASR papers every day

Python 451 22 Updated Feb 11, 2026

[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents

Python 299 14 Updated Jul 18, 2025

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 9,030 691 Updated Feb 11, 2026
Python 45 9 Updated Dec 12, 2024

A Zsh theme

Shell 52,816 2,396 Updated Jan 28, 2026

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,893 371 Updated Dec 17, 2025

A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.

Python 1,593 102 Updated Dec 20, 2025

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 1,482 86 Updated Feb 4, 2026

PyMuPDF4LLM

Python 1,293 176 Updated Jan 30, 2026

A machine learning software for extracting information from scholarly documents

Java 4,634 533 Updated Feb 11, 2026

LLM101n: Let's build a Storyteller

36,301 1,977 Updated Aug 1, 2024

Python SDK for the Reka AI API

Python 18 3 Updated Sep 30, 2024