Xiangtai Li lxtGH

💬

At home

Working on multi-modal models.

709 followers · 287 following

Bytedance (Tiktok)
Singapore
https://lxtgh.github.io/
@xtl994

lxtGH.github.io Public
Forked from RayeRen/acad-homepage.github.io

AcadHomepage: A Modern and Responsive Academic Personal Homepage

JavaScript 2 MIT License Updated May 16, 2026
Awesome-HumanView-VideoUnderstanding Public
Forked from marinero4972/Awesome-HumanView-VideoUnderstanding

Updated May 12, 2026
Awesome-Visual-Tokenizer Public
Forked from Shi-qingyu/Awesome-Visual-Tokenizer

Updated May 10, 2026
Open-o3-Video Public
Forked from marinero4972/Open-o3-Video

[ICML 2026] Official implementation of "Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence"

Python Apache License 2.0 Updated May 1, 2026
RecTok Public
Forked from Shi-qingyu/RecTok

[CVPR 26] Official PyTorch Implementation of RecTok

Python Updated Apr 22, 2026
latex-vscode-config Public
Forked from shinyypig/latex-vscode-config

Use LaTeX in VSCode.

Updated Jan 30, 2026
lxtGH Public

Updated Nov 14, 2025
OMG-Seg Public

Official Repo For OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]

Python 1,347 54 Other Updated Oct 15, 2025
DenseWorld-1M Public

Code and dataset link for "DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World"

129 2 Other Updated Oct 2, 2025
VLMEvalKit Public
Forked from open-compass/VLMEvalKit

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python Apache License 2.0 Updated Jul 14, 2025
describe-anything Public
Forked from NVlabs/describe-anything

Implementation for Describe Anything: Detailed Localized Image and Video Captioning

Python Apache License 2.0 Updated Jun 26, 2025
Sa2VA Public
Forked from bytedance/Sa2VA

Code for our work: Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Python 1 Apache License 2.0 Updated Jan 8, 2025
Panoptic-PartFormer Public

[ECCV-2022] The First Unified End-to-End System for Panoptic Part Segmentation

Python 63 3 Updated Sep 2, 2024
Awesome-Segmentation-With-Transformer Public

[T-PAMI-2024] Transformer-Based Visual Segmentation: A Survey

757 54 Updated Aug 25, 2024
segment-anything-2 Public
Forked from facebookresearch/sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 1 Apache License 2.0 Updated Aug 14, 2024
Tube-Link Public

[ICCV-2023]-Universal Video Segmentation For VSS, VPS and VIS

Python 109 3 1 issue needs help Updated Mar 18, 2024
SFSegNets Public

[ECCV-2020-oral]-Semantic Flow for Fast and Accurate Scene Parsing

semanticsegmentation

Python 385 46 Updated Mar 12, 2024
- Public

Updated Mar 11, 2024
DiT Public
Forked from facebookresearch/DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python Other Updated Feb 20, 2024
PixArt-alpha Public
Forked from PixArt-alpha/PixArt-alpha

Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python GNU Affero General Public License v3.0 Updated Feb 19, 2024
awesome-3D-gaussian-splatting Public
Forked from MrNeRF/awesome-3D-gaussian-splatting

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

1 MIT License Updated Jan 13, 2024
LLaVA Public
Forked from haotian-liu/LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python Apache License 2.0 Updated Jan 3, 2024
PointNeXt Public
Forked from guochengqian/PointNeXt

[NeurIPS'22] PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies

Shell MIT License Updated Dec 1, 2023
Fast_Seg Public

This repo provides ⚡ fast⚡ semantic segmentation models on CityScapes/Camvid DataSet by Pytorch

Python 211 36 Apache License 2.0 Updated Aug 28, 2023
Video-K-Net Public

[CVPR-2022 (oral)]-Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation

Python 156 15 MIT License Updated Aug 19, 2023
Awesome-CV-Foundational-Models Public
Forked from awaisrauf/Awesome-CV-Foundational-Models

Updated Jul 29, 2023
PFSegNets Public

PointFlow (CVPR-2021)

Python 124 16 Updated Jul 6, 2023
TemporalPyramidRouting Public

Temporal Pyramid Routing For Video Instance Segmentation-T-PAMI-2022

Python 25 Apache License 2.0 Updated Jul 6, 2023
learning_research Public
Forked from pengsida/learning_research

Updated Jun 21, 2023
InternGPT Public
Forked from OpenGVLab/InternGPT

InternGPT / InternChat allows you to interact with ChatGPT by clicking, dragging and drawing using a pointing device.

Python Apache License 2.0 Updated May 19, 2023
