- Seoul
Highlights
Stars
DSPy: The framework for programming—not prompting—language models
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Supercharge your workflow automation with this curated collection of n8n templates! Instantly connect your favorite apps-like Gmail, Telegram, Google Drive, Slack, and more-with ready-to-use, AI-po…
Research project. A Memory solution for users, teams, and applications.
The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
Build Real-Time Knowledge Graphs for AI Agents
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini
This collection demonstrates how to help you to quickly embed Watson NLP in your own applications.
[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero
SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
Siege is an http load tester and benchmarking utility
Examples for using ONNX Runtime for machine learning inferencing.
Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes
A Locust metrics exporter for Prometheus
Distributed load testing using Kubernetes on Google Container Engine
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Former GUI client for gRPC services. No longer maintained.
Protocol Buffers - Google's data interchange format
Java client for Kubernetes & OpenShift
🏗 Build container images for your Java applications.