Skip to content
View Xv-M-S's full-sized avatar

Block or report Xv-M-S

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Model Context Protocol(MCP) 编程极速入门

2,968 178 Updated Apr 23, 2025

The official Java SDK for Model Context Protocol servers and clients. Maintained in collaboration with Spring AI

Java 2,522 678 Updated Oct 6, 2025

Official Pytorch Implementation of DenseDiffusion (ICCV 2023)

Jupyter Notebook 497 34 Updated Nov 14, 2023

Code for "Semantic Object Accuracy for Generative Text-to-Image Synthesis" (TPAMI 2020)

Python 105 23 Updated Jan 13, 2022

Quick scripts to calculate CLIP text-image similarity

Python 277 18 Updated Apr 14, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 24,940 1,740 Updated Sep 28, 2025

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,858 6,611 Updated Sep 30, 2025

[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

Python 1,418 139 Updated Dec 20, 2023

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

Python 176 12 Updated Apr 29, 2024

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 33,465 7,826 Updated Aug 27, 2025

🔥 公益免费的ChatGPT API,Free ChatGPT API,GPT4 API,可直连,无需代理,使用标准 OpenAI APIKEY 格式访问 ChatGPT,可搭配ChatGPT-next-web、ChatGPT-Midjourney、Lobe-chat、Botgem、FastGPT、沉浸式翻译等项目使用

5,375 515 Updated Jul 28, 2025

[ICLR 2025] Benchmarking Agentic Workflow Generation

Python 130 8 Updated Feb 19, 2025
Jupyter Notebook 61 2 Updated Oct 13, 2023

Repository containing all necessary codes to get started on the SoccerNet Dense Video Captioning challenge.

Python 31 6 Updated Apr 12, 2024

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,695 290 Updated Jun 12, 2025
Python 17 3 Updated Sep 22, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 27 2 Updated Jun 23, 2025
Python 996 85 Updated Oct 9, 2025

Qwen-Image-Lightning: Speed up Qwen-Image model with distillation

Python 1 Updated Oct 9, 2025
Python 63 2 Updated Aug 16, 2025
40 Updated Aug 7, 2025

[CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization

Python 70 2 Updated Jun 7, 2024

Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024)

Python 80 5 Updated Feb 22, 2024

Qwen-Image-Lightning: Speed up Qwen-Image model with distillation

Python 780 32 Updated Oct 9, 2025

🚴 Call stack profiler for Python. Shows you why your code is slow!

Python 7,395 250 Updated Oct 6, 2025

Training-free Regional Prompting for Diffusion Transformers 🔥

Python 1 Updated Aug 17, 2025

[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models

Python 655 44 Updated Jul 17, 2024

This is an official repository for the paper, NoiseCollage, which is a revolutionary extension of text-to-image diffusion models for layout-aware image generation.

Python 60 3 Updated May 16, 2024
Python 550 15 Updated Sep 30, 2025

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,286 464 Updated Aug 7, 2024
Next