Starred repositories
torchcomms: a modern PyTorch communications API
Official code repo for our work "Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models"
Qwen3-VL is the multimodal large language model series developed by the Qwen team at Alibaba Cloud.
SGLang is a fast serving framework for large language models and vision language models.
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Train a 1B LLM on 1T tokens from scratch, as a personal project.
Phi2-Chinese-0.2B: train your own small Chinese Phi2 chat model from scratch, with support for langchain integration to load a local knowledge base for retrieval-augmented generation (RAG).
A 0.2B-parameter Chinese chat model (ChatLM-Chinese-0.2B), open-sourcing the full pipeline: dataset sources, data cleaning, tokenizer training, model pretraining, SFT instruction fine-tuning, and RLHF optimization. Supports SFT fine-tuning for downstream tasks, with a fine-tuning example for triple-based information extraction.
Reference PyTorch implementation and models for DINOv3
A single-file educational implementation for understanding vLLM's core concepts and running LLM inference.
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.
[ICCV2025] PyTorch implementation of "Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models"
High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
A high-throughput and memory-efficient inference and serving engine for LLMs
[CVPR 2025] Official implementation for "Empowering LLMs to Understand and Generate Complex Vector Graphics" https://arxiv.org/abs/2412.11102
From nobody to large language model (LLM) hero ~ stay tuned for more!
🚀 Train a 26M-parameter multimodal vision-language model (VLM) from scratch in just 1 hour! 🌏
A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed at low training cost, including base models, domain-specific fine-tunes and applications, datasets, and tutorials.
Solve Visual Understanding with Reinforced VLMs
Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025)
[ICLR 2025] Animate-X: Universal Character Image Animation with Enhanced Motion Representation
[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation
Exploiting unlabeled data with vision and language models for object detection, ECCV 2022
🚀🚀 Train a 26M-parameter GPT completely from scratch in just 2 hours! 🌏
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding