- Lahore, Pakistan
- https://llamatelemetry.github.io/
- in/mohammad-waqas-3a1384270
- @waqasm86
floci Public
Forked from floci-io/floci
Light, fluffy, and always free - The AWS Local Emulator alternative
Java MIT License Updated May 16, 2026
skills-github-pages Public
Exercise: Create a site or blog from your GitHub repositories with GitHub Pages
MIT License Updated Apr 30, 2026
llm-observability-stack Public
An opinionated umbrella Helm chart for a local single-node **k3s + NVIDIA GPU + Ollama + Open WebUI + LangChain/LangSmith** setup.
Jupyter Notebook Updated Mar 24, 2026
ClaudeHistoryMCP Public
Forked from jhammant/ClaudeHistoryMCP
MCP server for searching and surfacing Claude Code conversation history
TypeScript Updated Feb 24, 2026
llamatelemetry Public
CUDA-first OpenTelemetry Python SDK for LLM inference observability and explainability.
Python MIT License Updated Feb 23, 2026
hive Public
Forked from aden-hive/hive
Outcome-driven agent development framework that evolves
Python Apache License 2.0 Updated Feb 12, 2026
mito Public
Forked from mito-ds/mito
Jupyter extensions that help you write code faster: context-aware AI chat, autocomplete, and spreadsheet
Jupyter Notebook Other Updated Feb 6, 2026
grafana-com-public-clients Public
Forked from grafana/grafana-com-public-clients
grafana.com API Clients
Shell Apache License 2.0 Updated Feb 6, 2026
nbdev Public
Forked from AnswerDotAI/nbdev
Create delightful software with Jupyter Notebooks
Jupyter Notebook Apache License 2.0 Updated Feb 4, 2026
lon-mirror Public
Forked from Tuttotorna/lon-mirror
MB-X.01 · Logical Origin Node (L.O.N.) — TruthΩ → Co⁺ → Score⁺. Verifiable demo and specs. https://massimiliano.neocities.org/
Python MIT License Updated Feb 3, 2026
llcuda Public
Forked from llcuda/llcuda
CUDA 12-first backend inference for Unsloth on Kaggle — optimized for small GGUF models (1B-5B) on dual Tesla T4 GPUs (15GB each, SM 7.5)
Jupyter Notebook MIT License Updated Feb 1, 2026
GRIT Public
Forked from eric-ai-lab/GRIT
Official code for NeurIPS 2025 paper "GRIT: Teaching MLLMs to Think with Images"
Python MIT License Updated Jan 16, 2026
notebooks Public
Forked from roboflow/notebooks
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like RF-DETR, YOLO11, SAM …
Jupyter Notebook Updated Jan 15, 2026
Pre-built llama.cpp CUDA binary for Ubuntu 22.04. No compilation required - download, extract, and run! Works with llcuda Python package for JupyterLab integration. Tested on GeForce 940M to RTX 4090.
cuda-nvidia-systems-engg Public
Production-grade C++20/CUDA distributed LLM inference system with TCP networking, MPI scheduling, and content-addressed storage. Features comprehensive benchmarking (p50/p95/p99 latencies), epoll a…
C++ MIT License Updated Dec 27, 2025
local-llama-cuda Public
Custom CUDA implementation for LLM inference with MPI-based distributed computing. Memory-efficient layer offloading, multi-rank coordination, and GPU optimization for constrained hardware (1GB VRAM).
C++ MIT License Updated Dec 25, 2025
cuda-tcp-llama.cpp Public
High-performance TCP inference gateway with epoll async I/O for CUDA-accelerated LLM serving. Binary protocol, connection pooling, streaming responses. Zero dependencies beyond POSIX and CUDA.
C++ Updated Dec 23, 2025
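The gateway above is built on readiness-based async I/O. A minimal sketch of that pattern, using Python's stdlib `selectors` module (epoll-backed on Linux) with a `socketpair` standing in for a real TCP client; the repo's binary protocol, connection pooling, and CUDA serving are not modeled:

```python
import selectors
import socket

# Readiness-based I/O: register sockets with the selector and only
# touch the ones the kernel reports as readable, instead of blocking
# on each connection in its own thread.
sel = selectors.DefaultSelector()
client, server = socket.socketpair()  # stand-in for a TCP connection
client.setblocking(False)
server.setblocking(False)

sel.register(server, selectors.EVENT_READ)

client.sendall(b"ping")
for key, _events in sel.select(timeout=1.0):
    request = key.fileobj.recv(4096)          # guaranteed not to block
    key.fileobj.sendall(b"pong:" + request)   # echo-style response

reply = client.recv(4096)
print(reply)
sel.close()
```

A real gateway would keep the `sel.select()` call in a loop, registering a listening socket for accepts and each accepted connection for reads.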
cuda-openmpi Public
CUDA-aware OpenMPI integration for GPU-accelerated distributed computing. Multi-GPU LLM inference with MPI communication, performance benchmarking, and collective operations testing.
Cuda MIT License Updated Dec 23, 2025
cuda-llm-storage-pipeline Public
Content-addressed LLM model distribution with SHA256 verification and SeaweedFS integration. Distributed storage, manifest management, LRU caching, and integrity checking for GGUF models.
C++ Updated Dec 23, 2025
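A minimal sketch of the content-addressing idea above: each blob is keyed by the SHA256 of its bytes, so the key doubles as an integrity check on read. The in-memory `store` dict is a stand-in for illustration; SeaweedFS, manifests, and LRU eviction are not modeled:

```python
import hashlib

# Content-addressed store: the address of a blob is the hash of its
# contents, so identical content always maps to the same key.
store = {}

def put(blob: bytes) -> str:
    digest = hashlib.sha256(blob).hexdigest()
    store[digest] = blob  # idempotent: same content, same key
    return digest

def get(digest: str) -> bytes:
    blob = store[digest]
    # Re-hash on read; any corruption changes the digest.
    if hashlib.sha256(blob).hexdigest() != digest:
        raise ValueError("integrity check failed")
    return blob

key = put(b"GGUF model shard")
assert get(key) == b"GGUF model shard"
```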
cuda-mpi-llama-scheduler Public
Distributed MPI scheduler with work-stealing algorithm for LLM inference. Percentile latency analysis (p50/p95/p99), throughput benchmarking, multi-rank load balancing, and empirical performance me…
Cuda Updated Dec 23, 2025
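The p50/p95/p99 figures such benchmarks report can be sketched with the nearest-rank percentile method; the sample latencies below are invented for illustration:

```python
import math

def percentile(samples, p):
    """Nearest-rank percentile: the smallest sample with at least
    p% of all samples at or below it."""
    ordered = sorted(samples)
    rank = math.ceil(p / 100 * len(ordered))  # 1-based rank
    return ordered[rank - 1]

# Hypothetical per-request latencies in milliseconds; the tail
# (90, 250) is what p95/p99 are designed to surface.
latencies_ms = [12, 15, 11, 90, 14, 13, 250, 16, 12, 13]
summary = {p: percentile(latencies_ms, p) for p in (50, 95, 99)}
print(summary)  # {50: 13, 95: 250, 99: 250}
```

Note how the median (p50) hides the two slow requests while p95/p99 expose them, which is why tail percentiles matter for load-balanced inference.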
cmake-superbuild-toolkit Public
Qt-style CMake superbuild demo: FetchContent deps, feature flags, install/export targets, CI matrix, tests, and CPack packaging.
CMake Other Updated Dec 16, 2025
MCP stdio server for Windsurf that routes tool calls to a local llama.cpp llama-server (GGUF), optimized for low-VRAM GPUs.
Python Updated Dec 13, 2025
Wolfram-llama.cpp Public
A sample project using Wolfram with llama.cpp
Updated Nov 18, 2025