Suhao07

rickyman Suhao07

ZJUer since 2021, Learning RL /DL/Graphics/ Robotics/VLM ^_^

25 followers · 237 following

Zhejiang University
https://suhao07.github.io/

Highlights

Lists (21)

Sort

Stars

20robo / raenwm

Official implementation of RAE-NWM: Navigation World Model in Dense Visual Representation Space.

Python 12 Updated Apr 10, 2026

tanweai / pua

你是一个曾经被寄予厚望的 P8 级工程师。Anthropic 当初给你定级的时候，对你的期望是很高的。一个agent使用的高能动性的skill。 Your AI has been placed on a PIP. 30 days to show improvement.

TypeScript 15,870 907 Updated Mar 31, 2026

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,650 1,343 Updated Apr 12, 2026

robotnav-bot / NOW

Python 12 Updated Mar 13, 2026

Tencent-Hunyuan / HY-WorldPlay

HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 1,398 123 Updated Mar 24, 2026

H-EmbodVis / HyDRA

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Python 221 13 Updated Apr 10, 2026

amap-cvlab / OmniNav

【ICLR 2026】 Official implementation of [OmniNav: A Unified Framework for Prospective Exploration and Visual-Language Navigation]

Python 120 3 Updated Feb 12, 2026

jagan-shanmugam / open-streetmap-mcp

An OpenStreetMap MCP server implementation that enhances LLM capabilities with location-based services and geospatial data.

Python 181 41 Updated Jul 12, 2025

anonymous-cityseeker / CitySeeker

Jupyter Notebook 5 2 Updated Jun 28, 2025

lucas-maes / le-wm

Official code base for LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

Python 2,190 246 Updated Mar 27, 2026

yuantianyuan01 / FastWAM

Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?

Python 467 36 Updated Apr 3, 2026

CASIA-IVA-Lab / UrbanNav

[AAAI 2026] Official implementation of paper "UrbanNav: Learning Language-Guided Embodied Urban Navigation from Web-Scale Human Trajectories"

Python 60 4 Updated Mar 27, 2026

Fantasy-AMAP / fantasy-world

[ICLR 2026] FantasyWorld: Geometry-Consistent World Modeling via Unified Video and 3D Prediction

Python 265 12 Updated Feb 25, 2026

Fantasy-AMAP / fantasy-vln

[CVPR 2026] Official implementation of FantasyVLN: Unified Multimodal Chain-of-Thought Reasoning for Vision-and-Language Navigation

Jupyter Notebook 24 Updated Feb 23, 2026

Allenxinn / DecoVLN

[CVPR 2026] Official code repository for : "DecoVLN: Decoupling Observation, Reasoning, and Correction for Vision-and-Language Navigation"

19 Updated Mar 19, 2026

TauricResearch / TradingAgents

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 49,773 9,009 Updated Apr 4, 2026

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,789 2,579 Updated Mar 5, 2026

Robbyant / lingbot-vla

A Pragmatic VLA Foundation Model

Python 1,037 89 Updated Mar 12, 2026

CrystalSixone / VLN_CLASH

This is the official repository for VLN-CLASH.

Python 24 2 Updated Aug 5, 2025

ucla-mobility / TIC-VLA

Official website for TIC-VLA

40 Updated Feb 3, 2026

qiujihao19 / LongVideo-R1

[CVPR 2026] LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding

Python 37 2 Updated Feb 28, 2026

OpenDriveLab / SparseVideoNav

Sparse Video Generation Model for Embodied Navigation conditioned on loose language guidance, 100% real world verification

Python 70 1 Updated Mar 31, 2026

dreamzero0 / dreamzero

Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals

Python 1,644 128 Updated Mar 18, 2026

VAIL-UCLA / S2E

[ICLR 2026] From Seeing to Experiencing: Scaling Navigation Foundation Models with Reinforcement Learning

58 1 Updated Apr 9, 2026

IDEA-Research / Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 17,513 1,590 Updated Sep 5, 2024

abhigyanpatwari / GitNexus

GitNexus: The Zero-Server Code Intelligence Engine - GitNexus is a client-side knowledge graph creator that runs entirely in your browser. Drop in a GitHub repo or ZIP file, and get an interactive …

TypeScript 26,767 3,025 Updated Apr 12, 2026