Skip to content
View cloudmelon's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report cloudmelon

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Underlay and RDMA network solution of the Kubernetes, for bare metal, VM and any public cloud

Go 629 87 Updated Dec 19, 2025

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

C++ 1,595 357 Updated Dec 10, 2025

KEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event driven scale for any container running in Kubernetes

Go 9,759 1,288 Updated Dec 17, 2025

A collection of Data & AI research or white papers for learning and researching across partners

1 Updated Jun 10, 2024

Development repository for the Triton language and compiler

MLIR 17,887 2,461 Updated Dec 20, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 34,378 3,312 Updated Dec 20, 2025

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 12,434 1,969 Updated Dec 20, 2025

This repository stores a collection of LLMs and Multimodal AI research papers for learning and researching.

1 Updated Aug 13, 2024

AI CodePlaybook contains a series of POVs for iMelonArt's AI MVP and help community create AI SaaS solutions with cloud-native technologies

Python 2 Updated Oct 30, 2025

A Generic Low-Code Framework Built on a Config-Driven Tree Walker

C# 314 50 Updated Nov 3, 2025

All things prompt engineering

Python 5,720 328 Updated Jun 4, 2024

Building Ferret multimodality open-source AI on Kubernetes

Python 1 Updated Jan 27, 2024

Large Language Model Text Generation Inference

Python 10,709 1,246 Updated Dec 19, 2025

Repo for SIG release

Shell 589 434 Updated Dec 17, 2025

Mistral AI test for commercial licenses and open-source for sovereign air-gapped solutions.

Python 1 Updated Apr 27, 2024

Official inference library for pre-processing of Mistral models

Python 831 121 Updated Dec 19, 2025

NVIDIA device plugin for Kubernetes

Go 3,596 765 Updated Dec 18, 2025

GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm

C 9,876 356 Updated Oct 25, 2025

Helm chart for Ollama on Kubernetes

Smarty 529 82 Updated Dec 14, 2025

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 157,944 13,972 Updated Dec 19, 2025

12 Weeks, 24 Lessons, AI for All!

Jupyter Notebook 44,485 8,891 Updated Dec 19, 2025

AI-in-a-Box leverages the expertise of Microsoft across the globe to develop and provide AI and ML solutions to the technical community. Our intent is to present a curated collection of solution ac…

Jupyter Notebook 587 197 Updated Dec 12, 2024

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Svelte 118,280 16,661 Updated Dec 20, 2025

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

Jupyter Notebook 3,668 941 Updated Dec 19, 2025

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,789 869 Updated Jun 10, 2024

DSPy: The framework for programming—not prompting—language models

Python 30,889 2,484 Updated Dec 19, 2025

Microsoft Official Build Modern AI Apps reference solutions and content. Demonstrate how to build Copilot applications that incorporate Hero Azure Services including Azure OpenAI Service, Azure Con…

C# 187 87 Updated Dec 1, 2024

A programming framework for agentic AI

Python 52,695 8,006 Updated Oct 8, 2025
Next