Skip to content
View lcy0604's full-sized avatar
😮‍💨
😮‍💨

Block or report lcy0604

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
28 stars written in Jupyter Notebook
Clear filter

A latent text-to-image diffusion model

Jupyter Notebook 71,755 10,515 Updated Jun 18, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,382 6,133 Updated Sep 18, 2024

12 Lessons to Get Started Building AI Agents

Jupyter Notebook 43,948 14,829 Updated Nov 6, 2025

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 25,728 2,587 Updated Nov 4, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 23,797 2,039 Updated Sep 12, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 17,552 2,183 Updated Dec 25, 2024

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 16,007 1,268 Updated Oct 27, 2025

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 13,505 1,684 Updated Feb 29, 2024

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,735 865 Updated Jun 10, 2024

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,467 541 Updated May 18, 2025

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,407 520 Updated Oct 8, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,136 548 Updated Nov 3, 2025

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Jupyter Notebook 5,597 527 Updated Aug 29, 2025

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Jupyter Notebook 5,457 554 Updated Nov 20, 2024

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 4,990 689 Updated Nov 6, 2025

Align Anything: Training All-modality Model with Feedback

Jupyter Notebook 4,582 505 Updated Aug 25, 2025

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 4,281 364 Updated Jun 15, 2025

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 4,261 360 Updated Oct 26, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,767 295 Updated Jun 12, 2025

中文nlp解决方案(大模型、数据、模型、训练、推理)

Jupyter Notebook 3,692 440 Updated Aug 5, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’

Jupyter Notebook 2,244 100 Updated Oct 29, 2025
Jupyter Notebook 1,706 166 Updated Sep 27, 2024

Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)

Jupyter Notebook 929 72 Updated Nov 7, 2023

Train a 1B LLM with 1T tokens from scratch by personal

Jupyter Notebook 745 76 Updated Apr 27, 2025

This repository provides train&test code, dataset, det.&rec. annotation, evaluation script, annotation tool, and ranking.

Jupyter Notebook 652 158 Updated Jul 20, 2020

Run Segment Anything Model 2 on a live video stream

Jupyter Notebook 532 83 Updated Jun 3, 2025

Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)

Jupyter Notebook 196 20 Updated Jun 17, 2024
Jupyter Notebook 99 11 Updated Dec 23, 2024