View AlexKoff88's full-sized avatar


Real-time face swap and one-click video deepfake with only a single image

Python 92,373 13,415 Updated Apr 23, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,675 1,066 Updated Apr 27, 2026

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,738 893 Updated Apr 27, 2026

Work in progress.

Jupyter Notebook 79 9 Updated Nov 25, 2025

MLX: An array framework for Apple silicon

C++ 25,801 1,728 Updated Apr 27, 2026

Examples in the MLX framework

Python 8,543 1,153 Updated Apr 6, 2026

Tools for easier OpenVINO development/debugging

Python 10 6 Updated Jul 16, 2025

Sky-T1: Train your own O1 preview model within $450

Python 3,378 343 Updated Jul 12, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,679 8,634 Updated Apr 27, 2026

[NeurIPS'24 Spotlight, ICLR'25, ICML'25] Speeds up long-context LLM inference by computing attention approximately with dynamic sparsity, which reduces inference latency by up to 10x for pre-filli…

Python 1,210 78 Updated Apr 8, 2026

SOTA Open Source TTS

Python 29,965 2,527 Updated Apr 6, 2026

State-of-the-Art Text Embeddings

Python 18,601 2,776 Updated Apr 24, 2026

Build resilient language agents as graphs.

Python 30,582 5,228 Updated Apr 27, 2026

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

C++ 391 18 Updated Apr 13, 2025

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 34,415 4,860 Updated Apr 27, 2026

Vane is an AI-powered answering engine.

TypeScript 34,020 3,705 Updated Apr 11, 2026

The open source repository for Electricity Maps data parsers that powers the world's most comprehensive electricity data platform

Python 3,981 1,034 Updated Apr 27, 2026

First base model for full-duplex conversational audio

Python 1,784 113 Updated Jan 5, 2025

o1-engineer is a command-line tool designed to assist developers in managing and interacting with their projects efficiently. Leveraging the power of OpenAI's API, this tool provides functionalitie…

Python 2,853 281 Updated Dec 16, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 7,076 618 Updated Jul 4, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,459 4,797 Updated Jun 2, 2025

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

Python 21,460 3,189 Updated Apr 27, 2026

Mamba SSM architecture

Python 18,105 1,711 Updated Apr 27, 2026

Fast and memory-efficient exact attention

Python 23,562 2,649 Updated Apr 27, 2026

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Python 6,941 773 Updated Apr 20, 2026

General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for…

C++ 2,495 191 Updated Apr 13, 2026

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 2,331 300 Updated May 11, 2025

The agent engineering platform

Python 135,142 22,350 Updated Apr 27, 2026

RayLLM - LLMs on Ray (Archived). Read README for more info.

1,267 91 Updated Mar 13, 2025

Convmelspec: Convertible Melspectrograms via 1D Convolutions

Python 147 10 Updated May 13, 2024