Institute of Automation, Chinese Academy of Sciences
Beijing, China

Stars
Official implementation of Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions (NeurIPS 2024 Datasets and Benchmarks Track, Spotlight).
Official implementation of WebVLN: Vision-and-Language Navigation on Websites
Everyday Object Disrupts Vision-and-Language Navigation Agent via Backdoor (VLN-ATT)
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
A human-annotated, fine-grained dataset for Vision-and-Language Navigation
Official implementation of Frequency-enhanced Data Augmentation for Vision-and-Language Navigation (NeurIPS 2023)
Inpaint anything using Segment Anything and inpainting models.
[TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"
Official implementation of Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation (CVPR'22 Oral).
Room-across-Room (RxR) is a large-scale, multilingual dataset for Vision-and-Language Navigation (VLN) in Matterport3D environments. It contains 126k navigation instructions in English, Hindi and T…
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Code and Data of the CVPR 2022 paper: Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation
Code for NeurIPS 2021 paper "Curriculum Learning for Vision-and-Language Navigation"
A curated list of Multimodal Related Research.
Collection of CVPR 2017-2024 papers, code, interpretations, and livestreams, curated by the 极市 (CVMart) team
Official implementation of History Aware Multimodal Transformer for Vision-and-Language Navigation (NeurIPS'21).
Recent Transformer-based computer vision works and related research.
Reading list for research topics in multimodal machine learning
PyTorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"
Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
Reading list for research topics in embodied vision
Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation
awesome grounding: A curated list of research papers in visual grounding
[ACM MM 2021 Oral] Official repo of "Neighbor-view Enhanced Model for Vision and Language Navigation"
PyTorch implementation of CVPR 2021 paper "MUST-GAN: Multi-level Statistics Transfer for Self-driven Person Image Generation"
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)