Lists (8)
Sort Name ascending (A-Z)
Stars
A Conversational Speech Generation Model
AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
Code release for CVPR'24 submission 'OmniGlue'
Official repo for Images that sound: a special spectrogram that can be seen as images and played as sound generated by diffusions
Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024
Data release for the ImageInWords (IIW) paper.
code for CVPR2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction
Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
The official repo for "LLoCo: Learning Long Contexts Offline"
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Reaching LLaMA2 Performance with 0.1M Dollars
Fast and memory-efficient exact attention
A 3D graphics and physics engine coded from scratch in C++.
[CVPR'2024] Official implementation of the paper "ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation"
Instant voice cloning by MIT and MyShell. Audio foundation model.
Repository for the Paper "Multi-LoRA Composition for Image Generation"
Official code repository for the paper: "Neural Spline Fields for Burst Image Fusion and Layer Separation"
A quick example of how one can "synchronize" a 3d scene across multiple windows using three.js and localStorage
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation
Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability
Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功能实现,在线Demo,本地部署试用,代码、模型已全部开源,支持Windows, macOS, Linux)
[ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"