Skip to content
View anencore94's full-sized avatar
🔥
🔥
  • South Korea, Seoul
  • 06:30 (UTC +09:00)

Organizations

@kubeflow @mlops-for-all

Block or report anencore94

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Onboarding game built for OpenAI Agent Hackathon NYC

Python 7 3 Updated Jun 8, 2025

agentsculptor is an experimental AI-powered development agent designed to analyze, refactor, and extend Python projects automatically. It uses an OpenAI-like planner–executor loop on top of a vLLM …

Python 11 Updated Sep 17, 2025
Jupyter Notebook 6 Updated Jan 26, 2025

Fastest enterprise AI gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ models support & <100 µs overhead at 5k RPS.

Go 4,451 528 Updated Apr 29, 2026

Comprehensive Claude Code project configuration example with hooks, skills, agents, commands, and GitHub Actions workflows

JavaScript 5,853 545 Updated Jan 6, 2026

Hugging Face MCP Server

TypeScript 226 58 Updated Apr 29, 2026

AI Agent Evaluator & Red Team Platform

Python 1,025 161 Updated Apr 29, 2026

Financial Services Interest Group

52 5 Updated Jan 14, 2026

Typescript/React Library for AI Chat💬🚀

TypeScript 9,829 1,000 Updated Apr 29, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,533 942 Updated Apr 29, 2026

Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.

82,206 9,074 Updated Apr 4, 2025

If you want to become good at system design, join this newsletter now 👇

24,241 3,022 Updated Apr 29, 2026

Universal Python SDK to run AI workloads on Kubernetes

Python 105 173 Updated Apr 29, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,698 1,074 Updated Apr 29, 2026

The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.

TypeScript 2,219 356 Updated Apr 15, 2025

Generate spreadsheets based on GitHub contributions

Go 89 19 Updated Mar 27, 2026

Learn how to design systems at scale and prepare for system design interviews

43,047 5,514 Updated Apr 2, 2026

FastAPI framework plugins

Python 616 26 Updated Jul 9, 2025

Manages Unified Access to Generative AI Services built on Envoy Gateway

Go 1,568 232 Updated Apr 29, 2026

Kubernetes-native Job Queueing

Go 2,473 595 Updated Apr 29, 2026

Efficient and easy multi-instance LLM serving

Python 547 49 Updated Mar 12, 2026

DRA Driver for NVIDIA GPUs

Go 633 146 Updated Apr 29, 2026

HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container

C 298 156 Updated Apr 29, 2026

Heterogeneous GPU Sharing on Kubernetes

Go 3,381 544 Updated Apr 29, 2026

GenAI inference performance benchmarking tool

Python 180 87 Updated Apr 29, 2026

Repository for the next iteration of composite service (e.g. Ingress) and load balancing APIs.

Go 2,817 710 Updated Apr 28, 2026

Gateway API Inference Extension

Go 659 283 Updated Apr 29, 2026

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

JavaScript 1,399 89 Updated Dec 3, 2024

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 3,169 494 Updated Apr 29, 2026
Next