Skip to content
View philschmid's full-sized avatar

Block or report philschmid

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

64 results for sponsorable starred repositories
Clear filter

The beautiful & flexible React.js docs framework.

TypeScript 9,869 546 Updated Dec 17, 2025

A Next.js 15 Starter Kit Deployed to Cloudflare

TypeScript 367 11 Updated May 25, 2025

A minimalistic MCP client with a good feature set.

TypeScript 819 212 Updated Dec 11, 2025

Source code for the website geminibyexample.com which provides simple Python code examples for the Gemini SDK

Python 22 3 Updated Apr 8, 2025

Core building blocks for AI apps. High-quality, accessible, and customizable components for AI interfaces.

TypeScript 2,412 129 Updated Dec 11, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 2,037 340 Updated Dec 17, 2025

Fast Semantic Text Deduplication & Filtering

Python 854 53 Updated Oct 27, 2025

GenAI Agent Framework, the Pydantic way

Python 13,818 1,478 Updated Dec 17, 2025

A curated list of awesome Docusaurus resources.

152 5 Updated Aug 18, 2024

A python module to repair invalid JSON from LLMs

Python 4,170 161 Updated Dec 17, 2025

Safely deploy OpenAI's Realtime APIs in less than 5 minutes!

Rust 159 15 Updated Oct 1, 2024

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Python 756 108 Updated Dec 17, 2025

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Python 6,392 201 Updated Dec 15, 2025

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 23,571 2,708 Updated Dec 11, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,546 4,087 Updated Dec 17, 2025

A vector search SQLite extension that runs anywhere!

C 6,547 257 Updated Jan 24, 2025

https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Python 1,296 88 Updated Mar 27, 2025

Check for data drift between two OpenAI multi-turn chat jsonl files.

Jupyter Notebook 39 6 Updated Apr 11, 2024

A curated list of awesome things related to shadcn/ui.

TypeScript 18,423 1,093 Updated Dec 12, 2025

FastAPI Tips by The FastAPI Expert!

3,279 134 Updated Oct 11, 2025

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

Python 2,570 166 Updated Dec 15, 2025

Simple and fast JSON database

JavaScript 22,406 956 Updated Jul 24, 2025

Everything you want to know about Google Cloud TPU

Python 553 31 Updated Jul 16, 2024

I've developed a ChatGPT clone using Next.js 14, Shadcn-UI, Prisma ORM, and integrated it with the OpenAI API. It offers a user-friendly conversational AI experience.

TypeScript 350 134 Updated Aug 7, 2024

Python bindings for llama.cpp

Python 9,827 1,260 Updated Aug 15, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,639 12,031 Updated Dec 17, 2025

Deploy llama.cpp compatible Generative AI LLMs on AWS Lambda!

Python 175 26 Updated Apr 16, 2024

Fast ML inference & training for ONNX models in Rust

Rust 1,812 188 Updated Dec 16, 2025

HTTP mocking for Rust!

Rust 762 62 Updated Nov 30, 2025
Next