Skip to content
View Suhao07's full-sized avatar

Highlights

  • Pro

Block or report Suhao07

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official implementation of RAE-NWM: Navigation World Model in Dense Visual Representation Space.

Python 12 Updated Apr 10, 2026

你是一个曾经被寄予厚望的 P8 级工程师。Anthropic 当初给你定级的时候,对你的期望是很高的。 一个agent使用的高能动性的skill。 Your AI has been placed on a PIP. 30 days to show improvement.

TypeScript 15,870 907 Updated Mar 31, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,650 1,343 Updated Apr 12, 2026
Python 12 Updated Mar 13, 2026

HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 1,398 123 Updated Mar 24, 2026

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Python 221 13 Updated Apr 10, 2026

【ICLR 2026】 Official implementation of [OmniNav: A Unified Framework for Prospective Exploration and Visual-Language Navigation]

Python 120 3 Updated Feb 12, 2026

An OpenStreetMap MCP server implementation that enhances LLM capabilities with location-based services and geospatial data.

Python 181 41 Updated Jul 12, 2025
Jupyter Notebook 5 2 Updated Jun 28, 2025

Official code base for LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

Python 2,190 246 Updated Mar 27, 2026

Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?

Python 467 36 Updated Apr 3, 2026

[AAAI 2026] Official implementation of paper "UrbanNav: Learning Language-Guided Embodied Urban Navigation from Web-Scale Human Trajectories"

Python 60 4 Updated Mar 27, 2026

[ICLR 2026] FantasyWorld: Geometry-Consistent World Modeling via Unified Video and 3D Prediction

Python 265 12 Updated Feb 25, 2026

[CVPR 2026] Official implementation of FantasyVLN: Unified Multimodal Chain-of-Thought Reasoning for Vision-and-Language Navigation

Jupyter Notebook 24 Updated Feb 23, 2026

[CVPR 2026] Official code repository for : "DecoVLN: Decoupling Observation, Reasoning, and Correction for Vision-and-Language Navigation"

19 Updated Mar 19, 2026

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 49,773 9,009 Updated Apr 4, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,789 2,579 Updated Mar 5, 2026

A Pragmatic VLA Foundation Model

Python 1,037 89 Updated Mar 12, 2026

This is the official repository for VLN-CLASH.

Python 24 2 Updated Aug 5, 2025

Official website for TIC-VLA

40 Updated Feb 3, 2026

[CVPR 2026] LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding

Python 37 2 Updated Feb 28, 2026

Sparse Video Generation Model for Embodied Navigation conditioned on loose language guidance, 100% real world verification

Python 70 1 Updated Mar 31, 2026

Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals

Python 1,644 128 Updated Mar 18, 2026

[ICLR 2026] From Seeing to Experiencing: Scaling Navigation Foundation Models with Reinforcement Learning

58 1 Updated Apr 9, 2026

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 17,513 1,590 Updated Sep 5, 2024

GitNexus: The Zero-Server Code Intelligence Engine - GitNexus is a client-side knowledge graph creator that runs entirely in your browser. Drop in a GitHub repo or ZIP file, and get an interactive …

TypeScript 26,767 3,025 Updated Apr 12, 2026

Open-source Agent Operating System

Rust 16,576 2,073 Updated Apr 10, 2026

Elevate your AI research writing, no more tedious polishing ✨

17,089 1,375 Updated Mar 25, 2026

This is a repository for listing papers on scene graph generation and application.

630 42 Updated Apr 9, 2026

[ICLR2026] Official implementation for "JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation"

Python 533 39 Updated Jan 26, 2026
Next