Skip to content
View ntsd's full-sized avatar

Block or report ntsd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

DFlash: Block Diffusion for Flash Speculative Decoding

Python 4,623 329 Updated May 10, 2026

An Enhanced TOP program to monitor your Nvidia DGX SPARK's Hardware

Python 27 10 Updated Jan 6, 2026

High-performance interactive system monitor for NVIDIA DGX systems — GPU, CPU, memory, disk, network in a beautiful TUI

Rust 30 5 Updated Apr 20, 2026

Bidirectional Telegram bot plugin for Paperclip - push notifications, bot commands, inline approve/reject buttons, reply routing

TypeScript 61 26 Updated May 10, 2026

Bidirectional Discord integration for Paperclip: notifications, slash commands, and community intelligence

TypeScript 28 21 Updated May 10, 2026

A benchmark for LLMs on complicated tasks in the terminal

Python 2,214 514 Updated Jan 22, 2026

A coding agent optimized to smaller LLMs

TypeScript 1,099 69 Updated May 16, 2026

sparkrun - launch, manage, and stop LLM inference workloads on NVIDIA DGX Spark systems

Python 230 22 Updated May 18, 2026

Docker configuration for running VLLM on dual DGX Sparks

Shell 1,384 248 Updated May 17, 2026

Collection of the best Paperclip plugins

677 84 Updated May 15, 2026

The open-source app everyone uses to manage agents at work

TypeScript 66,127 12,058 Updated May 18, 2026

🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like caveman

JavaScript 61,391 3,418 Updated May 17, 2026

DFlash vLLM for DGX Spark — Plug & Play Block-Diffusion Speculative Decoding

Python 44 7 Updated May 1, 2026

Qwen3.6-35B-A3B-heretic NVFP4 + DFlash speculative decoding on DGX Spark (GB10/sm_121a). Source-built vLLM image + 7 patches + comprehensive deployment guide.

Python 70 6 Updated May 1, 2026

Lossless abliteration of Qwen3.6-27B with NVFP4 hardware quantization for DGX Spark / Blackwell. BF16 (51 GB) + NVFP4 (26 GB) deployment guide, docker-compose, and QuickStart.

Python 204 23 Updated May 15, 2026

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 2,682 400 Updated May 18, 2026

Menubar Tool to set Charge Limits and Prolong Battery Lifespan

Swift 9,045 329 Updated Apr 13, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 80,284 16,885 Updated May 18, 2026

MLX: An array framework for Apple silicon

C++ 26,288 1,809 Updated May 17, 2026

LLM inference in C/C++

C++ 110,667 18,331 Updated May 18, 2026

Community recipes for serving LLMs on RTX 3090. Multi-engine (vLLM, llama.cpp, SGLang) and model-agnostic. Currently shipping Qwen3.6-27B configs for 1× and 2× cards.

Python 979 49 Updated May 18, 2026

Composio powers 1000+ toolkits, tool search, context management, authentication, and a sandboxed workbench to help you build AI agents that turn intent into action.

TypeScript 28,302 4,570 Updated May 17, 2026

The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.

Python 2,298 647 Updated May 15, 2026

From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.

TypeScript 23,961 2,081 Updated May 18, 2026

An extension suite that turns Pi into a multi-agent orchestration platform

TypeScript 172 31 Updated May 16, 2026

AGENTS.md — a simple, open format for guiding coding agents

TypeScript 21,449 1,571 Updated Mar 12, 2026

The agent that grows with you

Python 154,985 24,839 Updated May 18, 2026

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 27,353 2,793 Updated May 18, 2026

This project demonstrates Agent-to-Agent (A2A) communication between different agent frameworks, enabling distributed tracing and conversation across multiple…

Python 4 Updated Jan 16, 2026
Next