cshizhe

Shizhe Chen cshizhe

192 followers · 33 following

Achievements

Highlights

Organizations

Lists (5)

Sort

🚀 My stack

Resources

9 repositories

Stars

100 stars written in Python

Clear filter

ultralytics / yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 55,940 17,302 Updated Nov 2, 2025

AntonOsika / gpt-engineer

CLI platform to experiment with codegen. Precursor to: https://lovable.dev

Python 54,999 7,333 Updated May 14, 2025

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 39,697 6,870 Updated Nov 6, 2025

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,683 5,062 Updated Nov 6, 2025

chenfei-wu / TaskMatrix

Python 34,355 3,272 Updated Jan 6, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,507 6,476 Updated Nov 6, 2025

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 27,559 2,535 Updated Nov 6, 2025

lucidrains / vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 24,361 3,427 Updated Oct 28, 2025

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,816 2,665 Updated Jul 3, 2025

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 19,129 2,957 Updated Nov 6, 2025

microsoft / Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 15,384 2,191 Updated Jul 24, 2024

mlfoundations / open_clip

An open source implementation of CLIP.

Python 12,895 1,193 Updated Nov 4, 2025

jacobgil / pytorch-grad-cam

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 12,326 1,675 Updated Apr 7, 2025

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 10,591 988 Updated Nov 6, 2025

lucidrains / denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 10,136 1,226 Updated Aug 4, 2025

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 9,431 734 Updated Sep 22, 2025

arogozhnikov / einops

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Python 9,262 386 Updated Aug 12, 2025

Physical-Intelligence / openpi

Python 8,640 1,069 Updated Oct 19, 2025

facebookresearch / SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 7,163 1,285 Updated Oct 27, 2025

deepspeedai / DeepSpeedExamples

Example models using DeepSpeed

Python 6,709 1,109 Updated Oct 15, 2025

google-research / arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,540 375 Updated Jun 2, 2025

isaac-sim / IsaacLab

Unified framework for robot learning built on NVIDIA Isaac Sim

Python 5,363 2,599 Updated Nov 6, 2025

facebookresearch / moco

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

Python 5,090 804 Updated Sep 30, 2025

wkentaro / gdown

Google Drive Public File Downloader when Curl/Wget Fails

Python 4,950 390 Updated Aug 12, 2025

open-mmlab / mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,801 1,317 Updated Aug 14, 2024

LLaVA-VL / LLaVA-NeXT

Python 4,373 417 Updated Sep 14, 2025

open-mmlab / mmtracking

OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.

Python 3,811 614 Updated Sep 19, 2023

fundamentalvision / Deformable-DETR

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

Python 3,773 598 Updated May 16, 2024

facebookresearch / ijepa

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…

Python 3,103 425 Updated May 8, 2024

UX-Decoder / Semantic-SAM

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,761 142 Updated Jul 10, 2025