- ByteDance (TikTok)
- Singapore
- https://lxtgh.github.io/
- @xtl994
Highlights
- Pro
-
lxtGH.github.io Public
Forked from RayeRen/acad-homepage.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage
-
Awesome-HumanView-VideoUnderstanding Public
Forked from marinero4972/Awesome-HumanView-VideoUnderstanding
Updated May 12, 2026
-
Awesome-Visual-Tokenizer Public
Forked from Shi-qingyu/Awesome-Visual-Tokenizer
Updated May 10, 2026
-
Open-o3-Video Public
Forked from marinero4972/Open-o3-Video
[ICML 2026] Official implementation of "Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence"
Python Apache License 2.0 Updated May 1, 2026
-
RecTok Public
Forked from Shi-qingyu/RecTok
[CVPR 26] Official PyTorch Implementation of RecTok
Python Updated Apr 22, 2026
-
latex-vscode-config Public
Forked from shinyypig/latex-vscode-config
Use LaTeX in VSCode.
Updated Jan 30, 2026
-
-
OMG-Seg Public
Official Repo For OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
-
DenseWorld-1M Public
Code and dataset link for "DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World"
-
VLMEvalKit Public
Forked from open-compass/VLMEvalKit
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
Python Apache License 2.0 Updated Jul 14, 2025
-
describe-anything Public
Forked from NVlabs/describe-anything
Implementation for Describe Anything: Detailed Localized Image and Video Captioning
Python Apache License 2.0 Updated Jun 26, 2025
-
Sa2VA Public
Forked from bytedance/Sa2VA
Code for our work: Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
-
Panoptic-PartFormer Public
[ECCV-2022] The First Unified End-to-End System for Panoptic Part Segmentation
-
[T-PAMI-2024] Transformer-Based Visual Segmentation: A Survey
-
segment-anything-2 Public
Forked from facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
-
Tube-Link Public
[ICCV-2023] Universal Video Segmentation for VSS, VPS and VIS
-
SFSegNets Public
[ECCV-2020-oral] Semantic Flow for Fast and Accurate Scene Parsing
-
-
DiT Public
Forked from facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Python Other Updated Feb 20, 2024
-
PixArt-alpha Public
Forked from PixArt-alpha/PixArt-alpha
Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Python GNU Affero General Public License v3.0 Updated Feb 19, 2024
-
awesome-3D-gaussian-splatting Public
Forked from MrNeRF/awesome-3D-gaussian-splatting
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
-
LLaVA Public
Forked from haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Python Apache License 2.0 Updated Jan 3, 2024
-
PointNeXt Public
Forked from guochengqian/PointNeXt
[NeurIPS'22] PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies
Shell MIT License Updated Dec 1, 2023
-
Fast_Seg Public
This repo provides ⚡ fast ⚡ semantic segmentation models on the Cityscapes/CamVid datasets in PyTorch
-
Video-K-Net Public
[CVPR-2022 (oral)] Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation
-
Awesome-CV-Foundational-Models Public
Forked from awaisrauf/Awesome-CV-Foundational-Models
Updated Jul 29, 2023
-
-
TemporalPyramidRouting Public
[T-PAMI-2022] Temporal Pyramid Routing for Video Instance Segmentation
-
-
InternGPT Public
Forked from OpenGVLab/InternGPT
InternGPT / InternChat allows you to interact with ChatGPT by clicking, dragging and drawing using a pointing device.
Python Apache License 2.0 Updated May 19, 2023