taokong

Follow

🎯

Focusing

Tao Kong taokong

🎯

Focusing

Follow

XXX

381 followers · 14 following

Haidian Beijing
http://www.taokong.org

Organizations

Lists (1)

Sort

🚀 My stack

Stars

Robot-VLAs / RoboVLMs

Python 462 21 Updated Apr 14, 2026

bytedance / GR-MG

Official implementation of GR-MG

Python 91 8 Updated Jan 12, 2025

bytedance / IRASim

Python 149 11 Updated Jul 8, 2025

bytedance / GR-1

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

Python 309 16 Updated Apr 22, 2024

ZhangHanbo / InViG-Dataset-API

Python 7 Updated Nov 3, 2023

GR1-Manipulation / GR-1

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

45 Updated Apr 19, 2024

ytongbai / LVM

Python 1,842 61 Updated Jun 28, 2024

meta-pytorch / segment-anything-fast

A batched offline inference oriented version of segment-anything

Python 1,323 81 Updated Aug 22, 2025

RoboFlamingo / RoboFlamingo

Code for RoboFlamingo

Python 429 39 Updated May 8, 2024

ZhangHanbo / invig-dataset

3 1 Updated Oct 30, 2023

ZrrSkywalker / Point-NN

[CVPR 2023] Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis

Python 540 51 Updated Apr 9, 2024

bytedance / lynx-llm

paper: https://arxiv.org/abs/2307.02469 page: https://lynx-llm.github.io/

Python 270 12 Updated Aug 9, 2023

facebookresearch / segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 53,960 6,321 Updated Sep 18, 2024

PhoebusSi / Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,798 251 Updated Dec 12, 2023

CompVis / taming-transformers

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,473 1,225 Updated Jul 30, 2024

robotgnchallenge / challenge2023

2023 Mobile Robot Grasping and Navigation Challenge

21 1 Updated Mar 15, 2023

facebookresearch / LaViLa

Code release for "Learning Video Representations from Large Language Models"

Python 533 46 Updated Oct 1, 2023

google-research / magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Python 997 47 Updated Jan 17, 2024

liuxingbin / dbot

[ICLR2024] Exploring Target Representations for Masked Autoencoders

Python 56 8 Updated Jan 17, 2024

eayvali / Pose-Estimation-for-Sensor-Calibration

hand-eye calibration, tool-flange calibration

Python 52 18 Updated Oct 30, 2019

crigroup / handeye

ROS package for calibrating sensors to a known reference frame.

Python 53 28 Updated Oct 25, 2020

shariqfarooq123 / AdaBins

Official implementation of Adabins: Depth Estimation using adaptive bins

Python 784 160 Updated May 29, 2022

youngwanLEE / VoV3D

Efficient 3D Backbone Network for Temporal Modeling

Python 109 6 Updated Apr 20, 2021

facebookresearch / Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Python 3,321 509 Updated Jul 29, 2024

zengyan-97 / X-VLM

X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)

Python 505 52 Updated Nov 25, 2022

IrisLi17 / bridge_construction

Python 14 6 Updated Sep 29, 2021

facebookresearch / Detic

Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".

Python 1,994 226 Updated Mar 21, 2024

lichengunc / refer

Referring Expression Datasets API

Jupyter Notebook 566 85 Updated Aug 27, 2024

mjhucla / Google_Refexp_toolbox

The toolbox for the Google Refexp dataset proposed in this paper: http://arxiv.org/abs/1511.02283

Jupyter Notebook 166 43 Updated Mar 1, 2017

Vision-CAIR / VisualGPT

VisualGPT, CVPR 2022 Proceeding, GPT as a decoder for vision-language models

Python 342 54 Updated May 16, 2023