taokong
🎯
Focusing

Organizations

@tsinghua-rll

Python 460 21 Updated Nov 29, 2025

Official implementation of GR-MG

Python 92 8 Updated Jan 12, 2025
Python 147 11 Updated Jul 8, 2025

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

Python 308 16 Updated Apr 22, 2024
Python 7 Updated Nov 3, 2023

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

45 Updated Apr 19, 2024
Python 1,843 61 Updated Jun 28, 2024

A batched offline inference oriented version of segment-anything

Python 1,324 81 Updated Aug 22, 2025

Code for RoboFlamingo

Python 429 39 Updated May 8, 2024

[CVPR 2023] Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis

Python 540 51 Updated Apr 9, 2024

paper: https://arxiv.org/abs/2307.02469 page: https://lynx-llm.github.io/

Python 270 12 Updated Aug 9, 2023

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 53,909 6,315 Updated Sep 18, 2024

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,800 251 Updated Dec 12, 2023

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,471 1,225 Updated Jul 30, 2024

2023 Mobile Robot Grasping and Navigation Challenge

21 1 Updated Mar 15, 2023

Code release for "Learning Video Representations from Large Language Models"

Python 534 45 Updated Oct 1, 2023

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Python 997 47 Updated Jan 17, 2024

[ICLR2024] Exploring Target Representations for Masked Autoencoders

Python 56 8 Updated Jan 17, 2024

hand-eye calibration, tool-flange calibration

Python 52 18 Updated Oct 30, 2019

ROS package for calibrating sensors to a known reference frame.

Python 53 28 Updated Oct 25, 2020

Official implementation of Adabins: Depth Estimation using adaptive bins

Python 784 160 Updated May 29, 2022

Efficient 3D Backbone Network for Temporal Modeling

Python 109 6 Updated Apr 20, 2021

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Python 3,313 509 Updated Jul 29, 2024

X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)

Python 505 52 Updated Nov 25, 2022
Python 14 6 Updated Sep 29, 2021

Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".

Python 1,994 225 Updated Mar 21, 2024

Referring Expression Datasets API

Jupyter Notebook 562 85 Updated Aug 27, 2024

The toolbox for the Google Refexp dataset proposed in this paper: http://arxiv.org/abs/1511.02283

Jupyter Notebook 166 43 Updated Mar 1, 2017

VisualGPT, CVPR 2022 Proceeding, GPT as a decoder for vision-language models

Python 342 54 Updated May 16, 2023