Skip to content
View philschmid's full-sized avatar

Block or report philschmid

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 153,966 31,464 Updated Dec 17, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 95,971 26,256 Updated Dec 17, 2025

Stay ahead of AI trends with automated Reddit insights! 🚀 This tool scans AI-related Reddit communities in English & Chinese, using Reddit Official API, DeepSeek R1 by OpenRouter to analyze posts, …

Python 770 76 Updated Dec 17, 2025

The AWS Cloud Development Kit is a framework for defining cloud infrastructure in code

TypeScript 12,578 4,348 Updated Dec 17, 2025

The React Framework

JavaScript 136,609 30,068 Updated Dec 17, 2025

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…

MDX 23,645 2,525 Updated Dec 17, 2025

A Rust compiler front-end for IDEs

Rust 15,803 1,892 Updated Dec 17, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,569 3,780 Updated Dec 17, 2025

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 3,795 288 Updated Dec 17, 2025

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 93,152 8,380 Updated Dec 17, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,533 4,088 Updated Dec 17, 2025

An extremely fast Python linter and code formatter, written in Rust.

Rust 44,446 1,649 Updated Dec 17, 2025

Environments for LLM Reinforcement Learning

Python 3,635 453 Updated Dec 17, 2025

A developer toolkit to implement Serverless best practices and increase developer velocity.

Python 3,205 464 Updated Dec 17, 2025

The MongoDB Database

C++ 27,845 5,721 Updated Dec 17, 2025

Empowering everyone to build reliable and efficient software.

Rust 108,584 14,202 Updated Dec 17, 2025

Training and inference on AWS Trainium and Inferentia chips.

Jupyter Notebook 252 88 Updated Dec 17, 2025

The unified stack for multi-agent systems.

Python 36,068 4,761 Updated Dec 17, 2025

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 30,583 3,621 Updated Dec 17, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,101 6,611 Updated Dec 17, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,609 12,027 Updated Dec 17, 2025

AllenAI's post-training codebase

Python 3,452 474 Updated Dec 17, 2025

Get your documents ready for gen AI

Python 47,005 3,315 Updated Dec 17, 2025

Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with co…

TypeScript 9,452 823 Updated Dec 17, 2025

Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.

Rust 530 26 Updated Dec 17, 2025

LLM inference in C/C++

C++ 91,454 14,136 Updated Dec 17, 2025

Extremely fast Query Engine for DataFrames, written in Rust

Rust 36,583 2,519 Updated Dec 17, 2025

Development repository for the Triton language and compiler

MLIR 17,861 2,454 Updated Dec 17, 2025

LLRT (Low Latency Runtime) is an experimental, lightweight JavaScript runtime designed to address the growing demand for fast and efficient Serverless applications.

Rust 8,667 383 Updated Dec 17, 2025

A python module to repair invalid JSON from LLMs

Python 4,170 161 Updated Dec 17, 2025
Next