[NeurIPS 2025] YOLOv12: Attention-Centric Real-Time Object Detectors

Python 2,539 354 Updated Oct 3, 2025

[CVPR 2025] Adaptive Keyframe Sampling for Long Video Understanding

Python 128 10 Updated Aug 26, 2025

[NeurIPS 2025 Spotlight] ReasonFlux Series - ReasonFlux, ReasonFlux-PRM and ReasonFlux-Coder

Python 496 34 Updated Sep 27, 2025

Context-Aware Chart Element Detection

Python 48 7 Updated Sep 25, 2025

Matplotlib-based software for generating chart datasets, each containing a chart image, its data, and a JSON file of visual attributes. Originated from chart editing projects.

Python 1 Updated Sep 30, 2024
Python 9 Updated Jan 23, 2025
Python 27 Updated Apr 8, 2025

vHeat: Building Vision Models upon Heat Conduction

Python 260 10 Updated Jun 12, 2025

The official Meta Llama 3 GitHub site

Python 29,072 3,474 Updated Jan 26, 2025

[ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Python 4,236 483 Updated Jul 10, 2024

[Neurocomputing] The official code for "H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation".

Python 119 8 Updated Jan 23, 2025
Python 31 Updated Sep 24, 2024

A simple and efficient Mamba implementation in pure PyTorch and MLX.

Python 1,355 115 Updated Dec 4, 2024

VMamba: Visual State Space Models; the code is based on Mamba

Python 2,873 205 Updated Mar 7, 2025

[AAAI2025] ChatterBox: Multi-round Multimodal Referring and Grounding, Multimodal, Multi-round dialogues

Python 57 2 Updated May 2, 2025

[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"

Python 10,867 1,106 Updated Aug 29, 2025

Mamba SSM architecture

Python 16,331 1,480 Updated Oct 10, 2025

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,991 395 Updated Jul 10, 2024

✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL

Python 1,110 94 Updated Jan 23, 2024

[CSUR] A Survey on Video Diffusion Models

2,219 110 Updated Jun 27, 2025

Generative Models by Stability AI

Python 26,564 2,975 Updated Nov 3, 2025

Official implementation of AnimateDiff.

Python 11,842 1,019 Updated Jul 31, 2024

[ICML 2024] MagicPose (also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

Python 769 66 Updated Jul 3, 2024

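Provides a practical interactive interface for GPT/GLM and other large language models (LLMs), with particular optimization for reading, polishing, and writing papers. Modular design; supports custom shortcut buttons and function plugins; supports analysis and self-explanation of Python, C++, and other projects; PDF/LaTeX paper translation and summarization; supports querying multiple LLMs in parallel; supports local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…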

Python 69,555 8,389 Updated Sep 20, 2025

This repository is "SSL for Image Representation", one of the OpenLab projects of PseudoLab.

14 2 Updated Sep 11, 2023
Jupyter Notebook 72 1 Updated Mar 1, 2023

A toolbox for object skeleton detection that can also be used for edge detection, building extraction, and road extraction. TIP (2021)

Python 138 28 Updated Feb 9, 2023

(CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling

Python 207 6 Updated Jul 28, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 31,426 3,819 Updated Jul 23, 2024