WebXR prototypist
- Brussels
- https://fabien.benetou.fr
- @utopiah
- @utopiah@mastodon.pirateparty.be
- https://git.benetou.fr
Stars
A feature-rich command-line audio/video downloader
The simplest, fastest repository for training/finetuning medium-sized GPTs.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Code and documentation to train Stanford's Alpaca models, and generate the data.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Magenta: Music and Art Generation with Machine Intelligence
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Wan: Open and Advanced Large-Scale Video Generative Models
Generate 3D objects conditioned on text or images
YOLOX is a high-performance anchor-free YOLO, exceeding YOLOv3 through YOLOv5, with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
Ready-to-run Docker images containing Jupyter applications
🛡️ Windows Hello™ style facial authentication for Linux
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Voilà turns Jupyter notebooks into standalone web applications
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
Vigil, the eternal morally vigilant programming language
OCR powered screen-capture tool to capture information instead of images
Neural Artistic Style in Python
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
This is a ZSH plugin that enables you to use OpenAI's Codex AI in the command line.
pix2pix3D: Generating 3D Objects from 2D User Inputs
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.