Skip to content
View Utopiah's full-sized avatar

Sponsoring

@FolkComputer
@bovine3dom

Block or report Utopiah

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
85 stars written in Python
Clear filter

A feature-rich command-line audio/video downloader

Python 153,693 12,468 Updated Mar 28, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 55,724 9,500 Updated Nov 12, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,454 4,785 Updated Jun 2, 2025

Let us control diffusion models!

Python 33,782 3,003 Updated Feb 25, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 33,186 6,877 Updated Mar 28, 2026

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,261 4,002 Updated Jul 17, 2024

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 22,743 4,098 Updated Mar 28, 2026

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 20,951 2,204 Updated Mar 25, 2026

Magenta: Music and Art Generation with Machine Intelligence

Python 19,775 3,787 Updated Jan 6, 2026

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 18,554 1,430 Updated Feb 27, 2026

Topic Modelling for Humans

Python 16,382 4,409 Updated Nov 1, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,907 1,808 Updated Mar 17, 2026

Generate 3D objects conditioned on text or images

Python 12,226 1,073 Updated Jun 22, 2024

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Python 10,397 2,452 Updated Jun 8, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,935 921 Updated Mar 4, 2026

Ready-to-run Docker images containing Jupyter applications

Python 8,426 2,992 Updated Mar 22, 2026

🛡️ Windows Hello™ style facial authentication for Linux

Python 7,427 367 Updated Jul 29, 2025

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,951 510 Updated Dec 13, 2025

Point cloud diffusion for 3D model synthesis

Python 6,873 798 Updated Jul 4, 2024

Hide screen when boss is approaching.

Python 6,282 1,079 Updated Oct 31, 2018

Voilà turns Jupyter notebooks into standalone web applications

Python 5,909 527 Updated Mar 2, 2026
Python 4,432 409 Updated Sep 27, 2024

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 3,109 299 Updated Jan 14, 2025

Vigil, the eternal morally vigilant programming language

Python 3,029 62 Updated Sep 26, 2022

OCR powered screen-capture tool to capture information instead of images

Python 2,560 117 Updated Mar 26, 2026

Neural Artistic Style in Python

Python 2,175 478 Updated Oct 23, 2016

Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI

Python 2,057 130 Updated Jul 22, 2024

This is a ZSH plugin that enables you to use OpenAI's Codex AI in the command line.

Python 1,716 99 Updated Mar 22, 2025

pix2pix3D: Generating 3D Objects from 2D User Inputs

Python 1,713 148 Updated Sep 13, 2023

WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.

Python 1,645 128 Updated Jul 31, 2024
Next