Skip to content
View atnikos's full-sized avatar
❤️‍🔥
❤️‍🔥

Block or report atnikos

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICCV 2025] The official pytorch implement of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs".

Python 20 4 Updated Oct 28, 2025

Suite of motion imitation methods for training controllers.

Python 915 91 Updated Nov 9, 2025

[CVPR 2025] InteractVLM: 3D Interaction Reasoning from 2D Foundational Models

Python 117 8 Updated Oct 12, 2025

[ICCV 2025] The official implementation of MotionLab

Python 170 4 Updated Sep 17, 2025

We write your reusable computer vision tools. 💜

Python 35,871 2,998 Updated Nov 10, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,732 112 Updated Sep 16, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,247 4,774 Updated Jun 2, 2025

Official repository for LTX-Video

Python 8,747 805 Updated Oct 25, 2025

PyTorch media decoding and encoding

Python 803 67 Updated Nov 11, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 16,279 1,298 Updated Nov 10, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 62,294 7,538 Updated Nov 12, 2025

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Python 2,461 273 Updated Mar 10, 2025

A pipeline parallel training script for diffusion models.

Python 1,701 229 Updated Nov 7, 2025
Python 54 4 Updated Aug 1, 2024

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,265 1,127 Updated Aug 27, 2025

Easy-to-use glTF 2.0-compliant OpenGL renderer for visualization of 3D scenes.

Python 1,441 252 Updated Feb 7, 2025

[3DV'24] GAN-Avatar: Controllable Personalized GAN-based Human Head Avatar

Python 83 3 Updated May 18, 2024

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 3,023 350 Updated Nov 11, 2025

Code and dataset for photorealistic Codec Avatars driven from audio

Python 2,842 279 Updated Sep 15, 2024

[CVPR 2024] TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation

Python 326 25 Updated Oct 3, 2025

Code for the NAACL 2019 paper: "Cross-topic distributional semantic representations via unsupervised mappings"

Python 8 Updated Jul 24, 2019

Official codebase for the SCULPT paper published in CVPR 2024

Python 19 1 Updated Aug 19, 2024

Code for the PoseScript (ECCV 22) and PoseFix (ICCV 23) papers.

Python 191 15 Updated Feb 13, 2025

Hydra is a framework for elegantly configuring complex applications

Python 9,933 725 Updated Nov 9, 2025

Flexible Python configuration system. The last one you will ever need.

Python 2,287 138 Updated Oct 29, 2025

AI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day.

TypeScript 4,314 350 Updated Jul 29, 2024

[ICCV2023] Official PyTorch Implementation of "BeLFusion: Latent Diffusion for Behavior-Driven Human Motion Prediction". ICCV 2023

Python 119 10 Updated Oct 9, 2023

Learning to Regress 3D Face Shape and Expression from an Image without 3D Supervision

Python 864 173 Updated Mar 24, 2023

CVPR 2021 - Official code repository for the paper: On Self-Contact and Human Pose.

Python 56 4 Updated Mar 11, 2024
Next