hithereai

hithereai

33 followers · 1 following

Achievements

Lists (8)

Sort

Stars

SesameAILabs / csm

A Conversational Speech Generation Model

Python 14,140 1,401 Updated May 27, 2025

centerforaisafety / hle

Humanity's Last Exam

Python 1,121 69 Updated Oct 7, 2025

feder-cr / Jobs_Applier_AI_Agent_AIHawk

AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.

Python 28,949 4,393 Updated May 28, 2025

google-research / omniglue

Code release for CVPR'24 submission 'OmniGlue'

Python 680 63 Updated Aug 12, 2024

IFICL / images-that-sound

Official repo for Images that sound: a special spectrogram that can be seen as images and played as sound generated by diffusions

Python 245 13 Updated Feb 4, 2025

mhamilton723 / FeatUp

Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024

Jupyter Notebook 1,587 92 Updated Jun 28, 2024

google / imageinwords

Data release for the ImageInWords (IIW) paper.

JavaScript 220 8 Updated Nov 17, 2024

Kroery / DiffMOT

code for CVPR2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction

Python 431 51 Updated Jun 13, 2024

verlab / accelerated_features

Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!

Jupyter Notebook 1,407 177 Updated Jan 15, 2025

jeffreysijuntan / lloco

The official repo for "LLoCo: Learning Long Contexts Offline"

Python 117 8 Updated Jun 15, 2024

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 9,298 722 Updated Sep 22, 2025

myshell-ai / JetMoE

Reaching LLaMA2 Performance with 0.1M Dollars

Python 985 80 Updated Jul 23, 2024

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 19,836 2,045 Updated Oct 8, 2025

Hykudoru / Pescado-3D-Engine

A 3D graphics and physics engine coded from scratch in C++.

C++ 52 2 Updated Aug 28, 2025

Aradhye2002 / EcoDepth

[CVPR'2024] Official implementation of the paper "ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation"

Python 194 20 Updated Aug 18, 2025

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 34,586 3,808 Updated Apr 19, 2025

Metabolix / HackBGRT

Windows boot logo changer for UEFI systems

C# 2,633 260 Updated Jul 29, 2025

maszhongming / Multi-LoRA-Composition

Repository for the Paper "Multi-LoRA Composition for Image Generation"

Python 484 48 Updated Mar 31, 2024

MolecularAI / aizynthfinder

A tool for retrosynthetic planning

Python 729 154 Updated Jul 3, 2025

princeton-computational-imaging / NSF

Official code repository for the paper: "Neural Spline Fields for Burst Image Fusion and Layer Separation"

Jupyter Notebook 302 15 Updated Feb 18, 2025

bgstaal / multipleWindow3dScene

A quick example of how one can "synchronize" a 3d scene across multiple windows using three.js and localStorage

JavaScript 18,881 2,933 Updated Nov 29, 2023

mlzxy / devit

CoRL 2024

Python 441 56 Updated Oct 29, 2024

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 22,526 2,455 Updated Mar 13, 2025

harlanhong / CVPR2022-DaGAN

Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation

Python 994 130 Updated Dec 4, 2023

ali-vilab / videocomposer

Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability

Python 944 83 Updated Nov 11, 2023

s0md3v / roop

one-click face swap

Python 30,255 6,886 Updated Aug 19, 2024

AUTOMATIC1111 / stable-diffusion-webui-tensorrt

Python 315 22 Updated Jul 25, 2023

OpenGVLab / DragGAN

Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" （DragGAN 全功能实现，在线Demo，本地部署试用，代码、模型已全部开源，支持Windows, macOS, Linux）

Python 4,979 489 Updated Jul 17, 2023

YBYBZhang / ControlVideo

[ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"

Python 845 59 Updated Oct 12, 2023

haha-lisa / Style-A-Video

55 Updated Apr 8, 2024

hithereai

Lists (8)

De-Flicker

Interpolation

Language Models

Optical Flow

Others

SD

Upscalers

Voice

Stars