The source code of CVPR 2019 paper "Deep Exemplar-based Video Colorization".

Python 365 88 Updated Oct 29, 2022

Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)

C++ 758 54 Updated Dec 22, 2025

oneAPI Collective Communications Library (oneCCL)

C++ 252 91 Updated Dec 18, 2025

Enables GPU utilization and inference on an Intel Arc GPU even when Resizable BAR (ReBAR) is unavailable because the chipset does not support it

Shell 1 Updated Dec 7, 2025

Glances an Eye on your system. A top/htop alternative for GNU/Linux, BSD, Mac OS and Windows operating systems.

Python 30,995 1,659 Updated Dec 22, 2025

The vLLM XPU kernels for Intel GPU

C++ 12 16 Updated Dec 19, 2025

Run Generative AI models with simple C++/Python API and using OpenVINO Runtime

C++ 400 310 Updated Dec 22, 2025

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

C++ 9,391 2,898 Updated Dec 22, 2025

A Model Context Protocol (MCP) server for ATLAS, a Neo4j-powered task management system for LLM Agents - implementing a three-tier architecture (Projects, Tasks, Knowledge) to manage complex workfl…

TypeScript 455 63 Updated Jul 22, 2025
Python 6 Updated Dec 2, 2025

Optimizing inference proxy for LLMs

Python 3,238 259 Updated Dec 3, 2025
1 Updated Dec 13, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 11 Updated Nov 16, 2025

Kilo is the all-in-one agentic engineering platform. Build, ship, and iterate faster with the most popular open source coding agent. #1 on OpenRouter. 750k+ Kilo Coders. 6.1 trillion tokens/month.

TypeScript 13,385 1,541 Updated Dec 22, 2025

Delete all your messages in groups / supergroups using this python script

Python 420 101 Updated Aug 5, 2025

Hybrid Schema-Guided Reasoning (SGR) agentic system design, created by the neuraldeep community

Python 875 159 Updated Dec 22, 2025

A scalable inference server for models optimized with OpenVINO™

C++ 804 231 Updated Dec 22, 2025

Developer kits reference setup scripts for various kinds of Intel platforms and GPUs

Python 40 10 Updated Dec 22, 2025

Lets llama3 perform web searches and retrieve information using searXNG

Python 66 6 Updated Jul 29, 2025

Boost your efficiency with Fish Speech Batch Inference. Easily process multiple texts and achieve consistently great results. 🗨️🐟

Python 24 3 Updated Aug 4, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 6,403 806 Updated Dec 22, 2025

Create Custom LLMs

Python 1,786 239 Updated Nov 8, 2025
Jupyter Notebook 49 23 Updated Dec 22, 2025

Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.

Python 265 14 Updated Dec 20, 2025

OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model.

Python 28 3 Updated May 24, 2025

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

C++ 380 18 Updated Apr 13, 2025

21 Lessons, Get Started Building with Generative AI

Jupyter Notebook 104,119 55,392 Updated Dec 22, 2025

A repository for Skyline, Strato, Vita3K and Yuzu Android compatible Adreno drivers.

4,466 120 Updated Dec 22, 2025
Python 3 Updated Jan 17, 2025

Make use of Intel Arc Series GPU to Run Ollama, StableDiffusion, Whisper and Open WebUI, for image generation, speech recognition and interaction with Large Language Models (LLM).

Dockerfile 215 21 Updated Dec 1, 2025