- Nuremberg
- www.philschmid.de
- @_philschmid
Starred repositories
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal domains, for both inference and training.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Stay ahead of AI trends with automated Reddit insights! 🚀 This tool scans AI-related Reddit communities in English & Chinese, using the official Reddit API and DeepSeek R1 via OpenRouter to analyze posts, …
The AWS Cloud Development Kit is a framework for defining cloud infrastructure in code
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
SGLang is a fast serving framework for large language models and vision language models.
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
FastAPI framework, high performance, easy to learn, fast to code, ready for production
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
An extremely fast Python linter and code formatter, written in Rust.
Environments for LLM Reinforcement Learning
A developer toolkit to implement Serverless best practices and increase developer velocity.
Empowering everyone to build reliable and efficient software.
Training and inference on AWS Trainium and Inferentia chips.
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
A high-throughput and memory-efficient inference and serving engine for LLMs
Get your documents ready for gen AI
Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with co…
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
Extremely fast Query Engine for DataFrames, written in Rust
Development repository for the Triton language and compiler
LLRT (Low Latency Runtime) is an experimental, lightweight JavaScript runtime designed to address the growing demand for fast and efficient Serverless applications.
A python module to repair invalid JSON from LLMs