Skip to content
View tkhe's full-sized avatar

Block or report tkhe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

torchcomms: a modern PyTorch communications API

C++ 241 23 Updated Nov 5, 2025

Official code repo for our work "Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models"

Python 51 3 Updated Jun 17, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 15,962 1,256 Updated Oct 27, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 19,697 3,271 Updated Nov 5, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,784 572 Updated May 3, 2024

Train a 1B LLM with 1T tokens from scratch by personal

Jupyter Notebook 745 76 Updated Apr 27, 2025

个人构建MoE大模型:从预训练到DPO的完整实践

Python 1,751 137 Updated Nov 5, 2025

训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。

Python 75 12 Updated Sep 6, 2024

Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.

Jupyter Notebook 576 64 Updated Jul 11, 2024

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

Python 1,631 185 Updated Apr 20, 2024

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,113 545 Updated Nov 3, 2025

Nano vLLM

Python 8,269 1,011 Updated Nov 3, 2025

A single-file educational implementation for understanding vLLM's core concepts and running LLM inference.

Python 25 4 Updated Jun 22, 2025

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 2,622 256 Updated Oct 28, 2025

将SmolVLM2的视觉头与Qwen3-0.6B模型进行了拼接微调

Python 421 44 Updated Sep 8, 2025
Python 41 2 Updated Oct 2, 2025

[ICCV2025] PyTorch implementation of "Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models"

Python 107 4 Updated Jul 25, 2025

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 1,325 88 Updated Nov 3, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,058 11,029 Updated Nov 5, 2025

[CVPR 2025] Official implementation for "Empowering LLMs to Understand and Generate Complex Vector Graphics" https://arxiv.org/abs/2412.11102

Python 583 12 Updated May 22, 2025

从无名小卒到大模型(LLM)大英雄~ 欢迎关注后续!!!

Jupyter Notebook 1,794 124 Updated Oct 19, 2025

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 5,202 544 Updated Oct 30, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

21,615 2,050 Updated May 19, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,671 366 Updated Oct 21, 2025

Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025)

Python 51 2 Updated Sep 21, 2025

[ICLR 2025] Animate-X: Universal Character Image Animation with Enhanced Motion Representation

Python 366 36 Updated Sep 17, 2025

[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation

Python 1,376 89 Updated Nov 2, 2025

Exploiting unlabeled data with vision and language models for object detection, ECCV 2022

Python 93 7 Updated Jan 16, 2024

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 32,482 3,756 Updated Nov 2, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 5,113 1,790 Updated Feb 26, 2025
Next