cshizhe

Shizhe Chen cshizhe

192 followers · 33 following

Achievements

Highlights

Organizations

Lists (5)

Sort

🚀 My stack

Resources

9 repositories

Stars

100 stars written in Python

Clear filter

ultralytics / yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 55,932 17,301 Updated Nov 2, 2025

AntonOsika / gpt-engineer

CLI platform to experiment with codegen. Precursor to: https://lovable.dev

Python 54,998 7,333 Updated May 14, 2025

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 39,688 6,869 Updated Nov 6, 2025

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,673 5,058 Updated Nov 6, 2025

chenfei-wu / TaskMatrix

Python 34,354 3,272 Updated Jan 6, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,500 6,473 Updated Nov 6, 2025

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 27,549 2,532 Updated Nov 5, 2025

lucidrains / vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 24,353 3,424 Updated Oct 28, 2025

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,812 2,664 Updated Jul 3, 2025

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 19,112 2,955 Updated Nov 6, 2025

microsoft / Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 15,384 2,191 Updated Jul 24, 2024

mlfoundations / open_clip

An open source implementation of CLIP.

Python 12,890 1,192 Updated Nov 4, 2025

jacobgil / pytorch-grad-cam

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 12,320 1,675 Updated Apr 7, 2025

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 10,586 987 Updated Nov 5, 2025

lucidrains / denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 10,133 1,225 Updated Aug 4, 2025

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 9,428 735 Updated Sep 22, 2025

arogozhnikov / einops

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Python 9,260 386 Updated Aug 12, 2025

Physical-Intelligence / openpi

Python 8,630 1,065 Updated Oct 19, 2025

facebookresearch / SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 7,162 1,285 Updated Oct 27, 2025

deepspeedai / DeepSpeedExamples

Example models using DeepSpeed

Python 6,709 1,109 Updated Oct 15, 2025

google-research / arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,539 375 Updated Jun 2, 2025

isaac-sim / IsaacLab

Unified framework for robot learning built on NVIDIA Isaac Sim

Python 5,358 2,594 Updated Nov 6, 2025

facebookresearch / moco

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

Python 5,089 804 Updated Sep 30, 2025

wkentaro / gdown

Google Drive Public File Downloader when Curl/Wget Fails

Python 4,949 389 Updated Aug 12, 2025

open-mmlab / mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,800 1,317 Updated Aug 14, 2024

LLaVA-VL / LLaVA-NeXT

Python 4,370 416 Updated Sep 14, 2025

open-mmlab / mmtracking

OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.

Python 3,810 614 Updated Sep 19, 2023

fundamentalvision / Deformable-DETR

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

Python 3,771 598 Updated May 16, 2024

facebookresearch / ijepa

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…

Python 3,100 425 Updated May 8, 2024

UX-Decoder / Semantic-SAM

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,761 142 Updated Jul 10, 2025