Generative Models by Stability AI

Python 26,569 2,976 Updated Nov 3, 2025

Google Research

Jupyter Notebook 36,673 8,232 Updated Oct 30, 2025

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch

Python 24,361 3,427 Updated Oct 28, 2025

Mamba SSM architecture

Python 16,347 1,481 Updated Oct 10, 2025

[NeurIPS 2025] YOLOv12: Attention-Centric Real-Time Object Detectors

Python 2,542 355 Updated Oct 3, 2025

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

Python 5,090 804 Updated Sep 30, 2025

[NeurIPS 2025 Spotlight] ReasonFlux Series - ReasonFlux, ReasonFlux-PRM and ReasonFlux-Coder

Python 496 34 Updated Sep 27, 2025

Context-Aware Chart Element Detection

Python 48 7 Updated Sep 25, 2025

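A practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins, project analysis and self-explanation for Python, C++, and other languages, PDF/LaTeX paper translation and summarization, parallel queries to multiple LLMs, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…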

Python 69,574 8,390 Updated Sep 20, 2025

[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"

Python 10,869 1,107 Updated Aug 29, 2025

[CVPR 2025] Adaptive Keyframe Sampling for Long Video Understanding

Python 129 9 Updated Aug 26, 2025

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 14,434 2,779 Updated Aug 22, 2025

[CSUR] A Survey on Video Diffusion Models

2,221 110 Updated Jun 27, 2025

The GitHub repository for the paper "Informer" accepted by AAAI 2021.

Python 6,288 1,276 Updated Jun 20, 2025

vHeat: Building Vision Models upon Heat Conduction

Python 260 10 Updated Jun 12, 2025

[AAAI2025] ChatterBox: Multi-round Multimodal Referring and Grounding, Multimodal, Multi-round dialogues

Python 57 2 Updated May 2, 2025
Python 27 Updated Apr 8, 2025
Python 1 Updated Mar 7, 2025

VMamba: Visual State Space Models, code is based on mamba

Python 2,874 205 Updated Mar 7, 2025

The implementation of GOLD_NAS

Python 24 1 Updated Feb 17, 2025

The official Meta Llama 3 GitHub site

Python 29,073 3,476 Updated Jan 26, 2025
Python 9 Updated Jan 23, 2025

[Neurocomputing] The official code for "H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation".

Python 120 8 Updated Jan 23, 2025

A simple and efficient Mamba implementation in pure PyTorch and MLX.

Python 1,357 115 Updated Dec 4, 2024

Matplotlib-based software for generating chart datasets containing chart images, data, and visual-attribute JSON files. Originated from chart editing projects.

Python 1 Updated Sep 30, 2024
Python 31 Updated Sep 24, 2024

Official implementation of AnimateDiff.

Python 11,845 1,022 Updated Jul 31, 2024

(CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling

Python 207 6 Updated Jul 28, 2024

PyTorch implementation of MAE: https://arxiv.org/abs/2111.06377

Python 8,074 1,328 Updated Jul 23, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 31,444 3,821 Updated Jul 23, 2024