- Nuremberg
- www.philschmid.de
- @_philschmid
Starred repositories
The beautiful & flexible React.js docs framework.
A Next.js 15 Starter Kit Deployed to Cloudflare
A minimalistic MCP client with a good feature set.
Source code for the website geminibyexample.com which provides simple Python code examples for the Gemini SDK
Core building blocks for AI apps. High-quality, accessible, and customizable components for AI interfaces.
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
GenAI Agent Framework, the Pydantic way
A curated list of awesome Docusaurus resources.
A python module to repair invalid JSON from LLMs
Safely deploy OpenAI's Realtime APIs in less than 5 minutes!
Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
A vector search SQLite extension that runs anywhere!
https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
Check for data drift between two OpenAI multi-turn chat jsonl files.
A curated list of awesome things related to shadcn/ui.
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
Everything you want to know about Google Cloud TPU
I've developed a ChatGPT clone using Next.js 14, Shadcn-UI, Prisma ORM, and integrated it with the OpenAI API. It offers a user-friendly conversational AI experience.
A high-throughput and memory-efficient inference and serving engine for LLMs
Deploy llama.cpp compatible Generative AI LLMs on AWS Lambda!
Fast ML inference & training for ONNX models in Rust