Skip to content
View h6kplus's full-sized avatar

Highlights

  • Pro

Block or report h6kplus

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

VEFX-Bench: A Holistic Benchmark for Generic Video Editing and Visual Effects

Python 215 16 Updated May 16, 2026

Official implementation of paper "PhyMotion: Structured 3D Motion Reward for Physics-Grounded Human Video Generation"

Python 38 5 Updated May 15, 2026

2026 AI/ML internship & new graduate job list updated daily

5,463 217 Updated Jun 15, 2026
Python 229 174 Updated Aug 18, 2025

Official implementation of paper "Planning with Sketch-Guided Verification for Physics-Aware Video Generation"

Python 14 Updated Nov 24, 2025

🔥Deepfake + LLM (CVPR25 Oral)

Python 113 7 Updated Jul 11, 2025
Python 11 Updated Jan 23, 2025

GH200 drone build templates (pytorch, torchvision, triton, vllm...)

6 Updated Jan 6, 2025

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

826 41 Updated Oct 10, 2025

NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomous vehicles, smart infrastructure, and more.

Jupyter Notebook 10,260 673 Updated Jun 15, 2026

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1,504 129 Updated Aug 5, 2025

This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and follow me if you like what you see🤩.

194 5 Updated Jan 30, 2025

[IROS 2024] Official implementation of paper: DriVLMe: "Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experience"s

JavaScript 65 19 Updated Nov 16, 2024

The world's simplest facial recognition api for Python and the command line

Python 56,503 13,700 Updated Aug 21, 2024

Character Animation (AnimateAnyone, Face Reenactment)

Python 3,506 296 Updated May 31, 2024

A one-stop library to standardize the inference and evaluation of all the conditional image generation models. [ICLR 2024]

Python 180 18 Updated Dec 2, 2025

📖 A curated list of resources dedicated to talking face.

1,541 121 Updated Dec 23, 2024
JavaScript 4,226 1,919 Updated Jun 21, 2024

[TMLR] Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control

75 2 Updated Nov 29, 2024

[CVPR 2024] Official implementation, Inversion-Free Image Editing with Natural Language"

Python 361 9 Updated May 28, 2024

[ICLR2024] Official repo for paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"

Jupyter Notebook 400 20 Updated Mar 12, 2024

[ICRA 2024] Chat with NeRF enables users to interact with a NeRF model by typing in natural language.

Python 320 22 Updated Oct 10, 2025

[NeurIPS 2023] Official Code for CycleNet: Rethinking Cycle Consistent in Text‑Guided Diffusion for Image Manipulation

Python 96 9 Updated Oct 24, 2023

collection of diffusion model papers categorized by their subareas

2,211 102 Updated Mar 16, 2026

Summer 2026 software engineering, data science, AI, quant, product management, and hardware internship postings. Updated daily by Simplify and Pitt CSC.

Python 44,938 3,182 Updated Jun 16, 2026

Official Code for DOROTHIE: Spoken Dialogue for Handling Unexpected Situations in Interactive Autonomous Driving Agents (Findings of EMNLP 2022)

Python 22 4 Updated Oct 24, 2023

[PRICAI 2023] A-ESRGAN aims to provide better super-resolution images by using multi-scale attention U-net discriminators.

Python 134 11 Updated Feb 13, 2022

A modular RL library to fine-tune language models to human preferences

Python 2,388 202 Updated Mar 1, 2024

The agent engineering platform.

Python 139,443 23,112 Updated Jun 16, 2026

Programmer's guide about how to cook at home.

100,783 11,003 Updated Jun 16, 2026
Next