-
The Chinese University of Hong Kong
- Hong Kong
-
09:23
(UTC +08:00) - https://shengze-xu.github.io/
Lists (13)
Sort Name ascending (A-Z)
Stars
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
High-Resolution Image Synthesis with Latent Diffusion Models
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
《动手学大模型Dive into LLMs》系列编程实践教程
《大模型白盒子构建指南》:一个全手搓的Tiny-Universe
Segment Anything in Medical Images
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing the classic linear transformation of the convolution to learna…
MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts
Topological Priors for Image segmentation
This is a project for solving differentia and integral equations, as well as system of equations using ANN