zhshj0110/README.md

Hi! 👋

  • 🌱 I’m currently studying at the School of Artificial Intelligence, Beijing University of Posts and Telecommunications.
  • 🤔 My research interests include Human Activity Analysis, Human Motion Synthesis, and Multimodal Large Language Models.
  • 📫 Email me at zhshj0110@gmail.com

Papers

  • [ICCV 2025] Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs [paper]
  • [PR 2025] A Generically Contrastive Spatiotemporal Representation Enhancement for 3D Skeleton Action Recognition [paper]
  • [ROBIO 2024] Temporal Text Prompts for Skeleton-based Action Recognition [paper]
  • [KBS 2024] MLP-AIR: An effective MLP-based module for actor interaction relation learning in group activity recognition [paper]
  • [TCSVT 2024] SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recognition [paper]
  • [PR 2024] Kinematics Modeling Network for Video-based Human Pose Estimation [paper]
  • [TIP 2022] Relation-Based Associative Joint Location for Human Pose Estimation in Videos [paper]

Pinned

  1. huggingface/transformers Public

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    Python · 154k stars · 31.5k forks

  2. modelscope/ms-swift Public

    Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

    Python · 11.8k stars · 1.1k forks

  3. EvolvingLMMs-Lab/lmms-eval Public

    One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

    Python · 3.4k stars · 461 forks

  4. xiaomi-research/q-frame Public

    [ICCV 2025] Implementation of the paper "Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs"

    Python · 61 stars · 2 forks

  5. SiT-MLP Public

    [TCSVT 2024] Implementation of the paper "SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recognition"

    Python · 18 stars · 1 fork

  6. CSRE Public

    [PR 2025] The official implementation of the paper "A Generically Contrastive Spatiotemporal Representation Enhancement for 3D Skeleton Action Recognition"

    Python · 7 stars · 2 forks