Skip to content
View cshizhe's full-sized avatar

Highlights

  • Pro

Organizations

@AIM3-RUC

Block or report cshizhe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
100 stars written in Python
Clear filter

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 55,932 17,301 Updated Nov 2, 2025

CLI platform to experiment with codegen. Precursor to: https://lovable.dev

Python 54,998 7,333 Updated May 14, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 39,688 6,869 Updated Nov 6, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,673 5,058 Updated Nov 6, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,500 6,473 Updated Nov 6, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 27,549 2,532 Updated Nov 5, 2025

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 24,353 3,424 Updated Oct 28, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,812 2,664 Updated Jul 3, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 19,112 2,955 Updated Nov 6, 2025

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 15,384 2,191 Updated Jul 24, 2024

An open source implementation of CLIP.

Python 12,890 1,192 Updated Nov 4, 2025

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 12,320 1,675 Updated Apr 7, 2025

Enjoy the magic of Diffusion models!

Python 10,586 987 Updated Nov 5, 2025

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 10,133 1,225 Updated Aug 4, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 9,428 735 Updated Sep 22, 2025

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Python 9,260 386 Updated Aug 12, 2025

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 7,162 1,285 Updated Oct 27, 2025

Example models using DeepSpeed

Python 6,709 1,109 Updated Oct 15, 2025

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,539 375 Updated Jun 2, 2025

Unified framework for robot learning built on NVIDIA Isaac Sim

Python 5,358 2,594 Updated Nov 6, 2025

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

Python 5,089 804 Updated Sep 30, 2025

Google Drive Public File Downloader when Curl/Wget Fails

Python 4,949 389 Updated Aug 12, 2025

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,800 1,317 Updated Aug 14, 2024
Python 4,370 416 Updated Sep 14, 2025

OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.

Python 3,810 614 Updated Sep 19, 2023

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

Python 3,771 598 Updated May 16, 2024

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…

Python 3,100 425 Updated May 8, 2024

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,761 142 Updated Jul 10, 2025
Next