A Conversational Speech Generation Model

Python 14,140 1,401 Updated May 27, 2025

Humanity's Last Exam

Python 1,121 69 Updated Oct 7, 2025

AIHawk aims to ease the job hunt by automating the job application process. Using artificial intelligence, it enables users to apply for multiple jobs in a tailored way.

Python 28,949 4,393 Updated May 28, 2025

Code release for CVPR'24 submission 'OmniGlue'

Python 680 63 Updated Aug 12, 2024

Official repo for Images that sound: a special spectrogram that can be seen as images and played as sound generated by diffusions

Python 245 13 Updated Feb 4, 2025

Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" ICLR 2024

Jupyter Notebook 1,587 92 Updated Jun 28, 2024

Data release for the ImageInWords (IIW) paper.

JavaScript 220 8 Updated Nov 17, 2024

code for CVPR2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction

Python 431 51 Updated Jun 13, 2024

Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!

Jupyter Notebook 1,407 177 Updated Jan 15, 2025

The official repo for "LLoCo: Learning Long Contexts Offline"

Python 117 8 Updated Jun 15, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.

Python 9,298 722 Updated Sep 22, 2025

Reaching LLaMA2 Performance with 0.1M Dollars

Python 985 80 Updated Jul 23, 2024

Fast and memory-efficient exact attention

Python 19,836 2,045 Updated Oct 8, 2025

A 3D graphics and physics engine coded from scratch in C++.

C++ 52 2 Updated Aug 28, 2025

[CVPR'2024] Official implementation of the paper "ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation"

Python 194 20 Updated Aug 18, 2025

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 34,586 3,808 Updated Apr 19, 2025

Windows boot logo changer for UEFI systems

C# 2,633 260 Updated Jul 29, 2025

Repository for the Paper "Multi-LoRA Composition for Image Generation"

Python 484 48 Updated Mar 31, 2024

A tool for retrosynthetic planning

Python 729 154 Updated Jul 3, 2025

Official code repository for the paper: "Neural Spline Fields for Burst Image Fusion and Layer Separation"

Jupyter Notebook 302 15 Updated Feb 18, 2025

A quick example of how one can "synchronize" a 3d scene across multiple windows using three.js and localStorage

JavaScript 18,881 2,933 Updated Nov 29, 2023

CoRL 2024

Python 441 56 Updated Oct 29, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 22,526 2,455 Updated Mar 13, 2025

Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation

Python 994 130 Updated Dec 4, 2023

Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability

Python 944 83 Updated Nov 11, 2023

one-click face swap

Python 30,255 6,886 Updated Aug 19, 2024

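Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (full-featured DragGAN implementation with an online demo and local deployment for trial use; code and models are fully open source; supports Windows, macOS, Linux)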

Python 4,979 489 Updated Jul 17, 2023

[ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"

Python 845 59 Updated Oct 12, 2023