Skip to content
View johndpope's full-sized avatar

Organizations

@BellGeorge

Block or report johndpope

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AGILE: Lightweight and Efficient Asynchronous GPU-SSD Integration (SC25)

C++ 21 8 Updated Apr 14, 2026

Enhancing CUDA Intra-Streaming-Multiprocessor Parallelism for Large Language Models via Fine-Grained Task Graph

Jupyter Notebook 1 1 Updated Jul 6, 2025

An Activation Offloading Framework to SSDs for Faster Large Language Model Training

Python 7 2 Updated Apr 18, 2025

Alibaba Cloud AS02MC04 hack

49 16 Updated Mar 28, 2026

Evaluation harness and norm-direction method for KV cache compression. Cross-model worst-case quality metrics.

Python 1 Updated May 16, 2026

DGX Spark / GB10 vLLM image for Gemma 4 31B Deckard Heretic Uncensored NVFP4 with z-lab DFlash speculative decoding.

Python 23 2 Updated May 15, 2026

Official Repository for ICML 2026 paper Coevolutionary Continuous Discrete Diffusion: Make Your Diffusion Language Model a Latent Reasoner

2 Updated May 13, 2026

A Minimal and Elegant Framework for Real-Time Interactive World Models

95 Updated May 12, 2026
Python 174 12 Updated May 15, 2026

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,925 192 Updated May 16, 2026

The codebase of Cola DLM

Python 35 1 Updated May 15, 2026

Experimenting with a visual representation of Wikipedia

JavaScript 9 Updated May 4, 2026

Open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.

Python 614 45 Updated May 11, 2026
TypeScript 4 1 Updated Apr 21, 2026

Free and open-source chat SDK. Build fast, real-time apps and generative AI agents with a high-performance, customizable, cross-platform UI.

Dart 2,287 854 Updated Apr 18, 2026

Modelence is a full-stack framework for building production web apps with a built-in database, authentication and monitoring. Modelence is opinionated and AI agent-first, which means it's optimized…

TypeScript 402 38 Updated May 15, 2026

implementing minimal versions of joint-embedding predictive architecture (JEPA)

Python 97 7 Updated May 17, 2026

Zero-shot expressive voice cloning and speech generation. Generate anything from short clips to full-length audiobooks with realistic emotional delivery, pacing, and breath control. Clone any voice…

Python 402 63 Updated May 15, 2026

🔥 Search, scrape, and clean the web for AI agents.

TypeScript 120,914 7,392 Updated May 17, 2026

This is an example flask backend to interface with a custom version of the Huggingface Chatui

Python 2 Updated May 25, 2024

LLM inference in C/C++

C++ 50 11 Updated May 17, 2026

Spawn any agent, on any cloud

TypeScript 159 26 Updated May 17, 2026

DeepSeek 4 Flash local inference engine for Metal and CUDA

C 10,262 838 Updated May 16, 2026

vLLM TurboQuant

Python 588 101 Updated Apr 16, 2026

Aurora optimizer release

Python 119 5 Updated May 8, 2026

API client for AUTOMATIC111/stable-diffusion-webui for nodejs/browser

TypeScript 5 1 Updated Mar 14, 2025

Windsurf-to-OpenAI compatible API proxy

JavaScript 2,450 526 Updated May 13, 2026

TTS voice notifications for Claude Code — hear when Claude finishes or needs your input

Shell 3 Updated May 9, 2026

[ICLR 2026] TS-Attn: Temporal-wise Separable Attention for Multi-Event Video Generation

Python 8 1 Updated Apr 25, 2026

Official implementation of Paper "System-Aware 4-Bit KV-Cache Quantization for Real-World LLM Serving"

Shell 19 3 Updated Apr 17, 2026
Next