Skip to content
View zhi-xuan-chen's full-sized avatar

Highlights

  • Pro

Block or report zhi-xuan-chen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 18,376 2,880 Updated Dec 19, 2025

a multiscale multimodal large language models for radiology report generation (RRG) tasks

Python 271 21 Updated Dec 11, 2025
Python 7,538 444 Updated Dec 14, 2025
Python 1,662 99 Updated Sep 30, 2025

【ICML 2025 Spotlight】 Official Repo for Paper ‘’HealthGPT : A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation‘’

Python 1,574 234 Updated Nov 2, 2025

Merlin is a 3D VLM for computed tomography that leverages both structured electronic health records (EHR) and unstructured radiology reports for pretraining.

Python 173 17 Updated Oct 22, 2025

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 6,272 727 Updated Dec 21, 2025

Free-Text Promptable Universal 3D Medical Image Segmentation

Python 56 1 Updated Dec 17, 2025
Python 153 11 Updated Aug 29, 2024

This repository contains code to train a self-supervised learning model on chest X-ray images that lack explicit annotations and evaluate this model's performance on pathology-classification tasks.

Python 214 47 Updated Aug 28, 2023

Powerful macOS menu bar customization tool

Swift 3,600 108 Updated Nov 7, 2025

(ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.

Python 44 5 Updated Jul 1, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,197 2,685 Updated Aug 12, 2024

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,878 654 Updated Nov 20, 2025

Official implementation of "VIRAL: Visual Representation Alignment for MLLMs".

Python 140 8 Updated Sep 21, 2025

Contexts Optical Compression

Python 21,514 1,925 Updated Oct 25, 2025

(ICCV 2025) Enhance CLIP and MLLM's fine-grained visual representations with generative models.

Python 74 4 Updated Jun 25, 2025

Open-Source Frontier Voice AI

Python 18,798 2,079 Updated Dec 17, 2025

Improving Performance, Robustness, and Fairness of Radiographic AI Models with Finely-Controllable Synthetic Data

Python 17 1 Updated Sep 12, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 6,444 361 Updated Dec 19, 2025

[AAAI2026] X-SAM: From Segment Anything to Any Segmentation

Python 334 10 Updated Nov 28, 2025
7 1 Updated Jul 29, 2025
Python 18 1 Updated Nov 28, 2025

OpenMMLab Detection Toolbox and Benchmark

Python 32,190 9,831 Updated Aug 21, 2024

开源的端到端产品级通用智能体

Java 11,394 1,415 Updated Dec 16, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’

Jupyter Notebook 2,286 103 Updated Oct 29, 2025

An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.

TypeScript 19,082 2,697 Updated Dec 18, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,879 12,103 Updated Dec 21, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,760 1,071 Updated Dec 21, 2025
Next