SCUT · Guangzhou (UTC +08:00)
Google Scholar: https://scholar.google.com/citations?user=dW7AgfgAAAAJ&hl=zh-CN
Stars
A latent text-to-image diffusion model
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
12 Lessons to Get Started Building AI Agents
《开源大模型食用指南》 (A Practical Guide to Open-Source LLMs): a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying open-source LLMs and multimodal large models (MLLMs), both Chinese and international, in a Linux environment
A simple screen parsing tool towards pure vision based GUI agent
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
High-Resolution Image Synthesis with Latent Diffusion Models
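Latent diffusion models run the standard DDPM noising/denoising process in a VAE latent space rather than in pixel space. A minimal numpy sketch of the forward (noising) step, with an illustrative linear schedule — all shapes and schedule values here are assumptions for demonstration, not taken from the repository:

```python
import numpy as np

rng = np.random.default_rng(0)

T = 1000
betas = np.linspace(1e-4, 0.02, T)    # illustrative linear noise schedule
alphas_bar = np.cumprod(1.0 - betas)  # cumulative product \bar{alpha}_t

def q_sample(x0, t, eps):
    # Closed-form forward process:
    # x_t = sqrt(abar_t) * x0 + sqrt(1 - abar_t) * eps
    return np.sqrt(alphas_bar[t]) * x0 + np.sqrt(1.0 - alphas_bar[t]) * eps

x0 = rng.normal(size=(4, 8, 8))   # a toy stand-in for a VAE latent
eps = rng.normal(size=x0.shape)   # Gaussian noise

# Early steps barely perturb the latent; by the final step it is almost
# pure noise, which is what the denoiser learns to invert.
x_early = q_sample(x0, 10, eps)
x_late = q_sample(x0, T - 1, eps)
```

The denoising network (omitted here) is trained to predict `eps` from `x_t` and `t`, which is what makes sampling by iterative denoising possible.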
QLoRA: Efficient Finetuning of Quantized LLMs
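QLoRA fine-tunes a quantized base model through LoRA adapters. A hedged numpy sketch of the low-rank update at the heart of LoRA — the dimensions, rank, and scaling below are illustrative, and the quantization half of QLoRA is omitted:

```python
import numpy as np

rng = np.random.default_rng(0)
d_out, d_in, r, alpha = 64, 64, 4, 8   # illustrative sizes; r << d

W = rng.normal(size=(d_out, d_in))     # frozen base weight
A = rng.normal(size=(r, d_in)) * 0.01  # trainable down-projection
B = np.zeros((d_out, r))               # trainable up-projection, zero-init

def adapted_forward(x):
    # y = W x + (alpha / r) * B (A x): frozen path plus scaled low-rank path
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.normal(size=d_in)

full_params = d_out * d_in          # what full fine-tuning would train
lora_params = r * (d_out + d_in)    # what LoRA trains instead
```

Because `B` starts at zero, the adapter is an exact no-op before training, and only `r * (d_out + d_in)` parameters receive gradients instead of `d_out * d_in`.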
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving a 3x+ generation speedup on reasoning tasks
Reference PyTorch implementation and models for DINOv3
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
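CLIP-style cross-modal retrieval reduces to nearest-neighbor search over normalized embeddings. A sketch of that retrieval step — the actual image/text encoders are assumed, and the embeddings below are random stand-ins for their outputs:

```python
import numpy as np

rng = np.random.default_rng(0)

def l2_normalize(x):
    return x / np.linalg.norm(x, axis=-1, keepdims=True)

# Pretend these 5 vectors came from an image encoder (dim 32).
image_embs = l2_normalize(rng.normal(size=(5, 32)))
# Simulate matched captions by lightly perturbing each image embedding.
text_embs = l2_normalize(image_embs + 0.05 * rng.normal(size=(5, 32)))

# Retrieval = argmax cosine similarity (a dot product of unit vectors).
sims = text_embs @ image_embs.T
best_image_per_text = sims.argmax(axis=1)  # each caption finds an image
```

Real deployments replace the brute-force `argmax` with an approximate nearest-neighbor index, but the similarity computation is the same.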
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
AIInfra (AI infrastructure) covers the AI systems stack, from underlying hardware such as chips up to the software layers that support training and inference of large AI models.
Align Anything: Training All-modality Model with Feedback
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
Chinese NLP solutions (large models, data, models, training, inference)
Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'
Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
Train a 1B LLM from scratch on 1T tokens, as a personal project
This repository provides training & testing code, the dataset, detection & recognition annotations, an evaluation script, an annotation tool, and a leaderboard.
Run Segment Anything Model 2 on a live video stream
Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)