-
Max Planck Institute for Intelligent Systems
- Germany
-
18:10
(UTC -12:00) - atnikos.github.io
- @_nikos_athan
Lists (4)
Sort Name ascending (A-Z)
Stars
[ICCV 2025] The official pytorch implement of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs".
Suite of motion imitation methods for training motion controllers.
[CVPR 2025] InteractVLM: 3D Interaction Reasoning from 2D Foundational Models
[ICCV 2025] The official implementation of MotionLab
We write your reusable computer vision tools. 💜
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
SkyReels V1: The first and most advanced open-source human-centric video foundation model
A pipeline parallel training script for diffusion models.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Easy-to-use glTF 2.0-compliant OpenGL renderer for visualization of 3D scenes.
[3DV'24] GAN-Avatar: Controllable Personalized GAN-based Human Head Avatar
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Code and dataset for photorealistic Codec Avatars driven from audio
[CVPR 2024] TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation
Code for the NAACL 2019 paper: "Cross-topic distributional semantic representations via unsupervised mappings"
Official codebase for the SCULPT paper published in CVPR 2024
Code for the PoseScript (ECCV 22) and PoseFix (ICCV 23) papers.
Hydra is a framework for elegantly configuring complex applications
Flexible Python configuration system. The last one you will ever need.
AI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day.
[ICCV2023] Official PyTorch Implementation of "BeLFusion: Latent Diffusion for Behavior-Driven Human Motion Prediction". ICCV 2023
Learning to Regress 3D Face Shape and Expression from an Image without 3D Supervision
CVPR 2021 - Official code repository for the paper: On Self-Contact and Human Pose.