Lists (1)
Sort Name ascending (A-Z)
Stars
The world's simplest facial recognition api for Python and the command line
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
We write your reusable computer vision tools. 💜
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Generate 3D objects conditioned on text or images
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
a state-of-the-art-level open visual language model | 多模态预训练模型
中文古诗自动作诗机器人,x炸天,基于tensorflow1.10 api,正在积极维护升级中,快star,保持更新!
Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced Reranking and Noise-injected Training.