Skip to content
View ebbunnim's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@re-Active @SSAFY-ML @bcaitech1 @Boost-Up-AI @VumBleBot @FeedGate

Block or report ebbunnim

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DSPy: The framework for programming—not prompting—language models

Python 31,006 2,501 Updated Dec 23, 2025

Backup and restore tool for Milvus

Go 203 65 Updated Dec 24, 2025

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 19,694 1,930 Updated Dec 24, 2025
Python 542 111 Updated Dec 7, 2025

Supercharge your workflow automation with this curated collection of n8n templates! Instantly connect your favorite apps-like Gmail, Telegram, Google Drive, Slack, and more-with ready-to-use, AI-po…

17,059 4,952 Updated Nov 2, 2025

Research project. A Memory solution for users, teams, and applications.

C# 2,130 397 Updated Dec 18, 2025

The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.

Python 1,628 469 Updated Dec 23, 2025

Universal memory layer for AI Agents

Python 44,631 4,852 Updated Dec 17, 2025

Build Real-Time Knowledge Graphs for AI Agents

Python 21,337 2,066 Updated Dec 24, 2025

Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini

Roff 24,438 3,730 Updated Dec 22, 2025

Distributed Model Serving Framework

Java 181 78 Updated Sep 30, 2025

This collection demonstrates how to help you to quickly embed Watson NLP in your own applications.

Jupyter Notebook 58 36 Updated Dec 20, 2024

Controller for ModelMesh

Go 242 134 Updated Jun 10, 2025

[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero

Python 30,796 2,754 Updated Nov 25, 2025

Model compression for ONNX

Python 99 9 Updated Nov 18, 2024

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

Python 2,552 288 Updated Dec 24, 2025

🤗 Optimum Intel: Accelerate inference with Intel optimization tools

Jupyter Notebook 521 167 Updated Dec 23, 2025

Siege is an http load tester and benchmarking utility

C 6,168 394 Updated Sep 8, 2025

Examples for using ONNX Runtime for machine learning inferencing.

C++ 1,569 399 Updated Dec 15, 2025

The OpenTelemetry C++ Client

C++ 1,178 513 Updated Dec 24, 2025

Numbers every LLM developer should know

4,277 140 Updated Jan 16, 2024

Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes

Go 4,935 1,325 Updated Dec 22, 2025

Helm Charts ⛵ @ Delivery Hero ⭐

Mustache 551 316 Updated Dec 9, 2025

A Locust metrics exporter for Prometheus

Go 113 39 Updated Apr 25, 2024

Distributed load testing using Kubernetes on Google Container Engine

Smarty 449 254 Updated Dec 5, 2025

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 18,745 3,609 Updated Dec 24, 2025

Former GUI client for gRPC services. No longer maintained.

TypeScript 9,014 469 Updated Jan 4, 2023

Protocol Buffers - Google's data interchange format

C++ 69,932 15,969 Updated Dec 24, 2025

Java client for Kubernetes & OpenShift

Java 3,601 1,500 Updated Dec 23, 2025

🏗 Build container images for your Java applications.

Java 14,095 1,457 Updated Dec 22, 2025
Next