

[ICCV 2025] The official PyTorch implementation of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs".

Python 20 4 Updated Oct 28, 2025

Suite of motion imitation methods for training motion controllers.

Python 872 86 Updated Nov 5, 2025

[CVPR 2025] InteractVLM: 3D Interaction Reasoning from 2D Foundational Models

Python 117 8 Updated Oct 12, 2025

[ICCV 2025] The official implementation of MotionLab

Python 167 4 Updated Sep 17, 2025

We write your reusable computer vision tools. 💜

Python 35,819 2,992 Updated Nov 5, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,724 110 Updated Sep 16, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,216 4,766 Updated Jun 2, 2025

Official repository for LTX-Video

Python 8,704 797 Updated Oct 25, 2025

PyTorch media decoding and encoding

Python 790 67 Updated Nov 5, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 15,999 1,265 Updated Oct 27, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,904 7,485 Updated Nov 6, 2025

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Python 2,445 269 Updated Mar 10, 2025

A pipeline parallel training script for diffusion models.

Python 1,690 226 Updated Nov 5, 2025
Python 54 4 Updated Aug 1, 2024

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,237 1,119 Updated Aug 27, 2025

Easy-to-use glTF 2.0-compliant OpenGL renderer for visualization of 3D scenes.

Python 1,440 252 Updated Feb 7, 2025

[3DV'24] GAN-Avatar: Controllable Personalized GAN-based Human Head Avatar

Python 83 3 Updated May 18, 2024

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 2,987 346 Updated Oct 30, 2025

Code and dataset for photorealistic Codec Avatars driven from audio

Python 2,841 279 Updated Sep 15, 2024

[CVPR 2024] TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation

Python 322 24 Updated Oct 3, 2025

Code for the NAACL 2019 paper: "Cross-topic distributional semantic representations via unsupervised mappings"

Python 8 Updated Jul 24, 2019

Official codebase for the SCULPT paper published in CVPR 2024

Python 19 1 Updated Aug 19, 2024

Code for the PoseScript (ECCV 22) and PoseFix (ICCV 23) papers.

Python 191 15 Updated Feb 13, 2025

Hydra is a framework for elegantly configuring complex applications

Python 9,905 723 Updated Oct 28, 2025

Flexible Python configuration system. The last one you will ever need.

Python 2,281 138 Updated Oct 29, 2025

AI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day.

TypeScript 4,313 350 Updated Jul 29, 2024

[ICCV 2023] Official PyTorch implementation of "BeLFusion: Latent Diffusion for Behavior-Driven Human Motion Prediction".

Python 119 10 Updated Oct 9, 2023

Learning to Regress 3D Face Shape and Expression from an Image without 3D Supervision

Python 863 173 Updated Mar 24, 2023

CVPR 2021 - Official code repository for the paper: On Self-Contact and Human Pose.

Python 56 4 Updated Mar 11, 2024