- Ho Chi Minh City, Vietnam
Stars
Python framework for creating, editing, and invoking Noisy Intermediate-Scale Quantum (NISQ) circuits.
The simplest, fastest repository for training/finetuning small-sized VLMs.
RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Underthesea - Vietnamese NLP Toolkit
[CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
Tools for handling multimodal data in machine learning projects.
Example notebooks that show how to apply quantum computing with Amazon Braket.
Installer for D-Wave's Ocean tools
UniSpeech - Large Scale Self-Supervised Learning for Speech
Monty is a sensorimotor learning framework based on the thousand brains theory of the neocortex.
The purpose of this repository is to make prototypes as case studies in the context of proof of concept (PoC) and research and development (R&D) that I have written about on my website. The main research top…
DiffFace: Diffusion-based Face Swapping with Facial Guidance
DEPRECATED in favor of our newer libraries (see www.fatiando.org). Python toolkit for modeling and inversion in geophysics.
Cook up amazing multimodal AI applications effortlessly with MiniCPM-o
PyTorch -> ONNX -> Caffe, PyTorch to Caffe, or other deep learning frameworks to ONNX and ONNX to Caffe.