Skip to content
View ylwhxht's full-sized avatar

Block or report ylwhxht

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICLR 2026] CompassNav:Steering From Path Imitation To Decision Understanding In Navigation

Python 9 1 Updated Apr 9, 2026

The agent that grows with you

Python 100,077 14,200 Updated Apr 19, 2026
Python 52 3 Updated Feb 12, 2026
Python 137 21 Updated Jul 9, 2024

你是一个曾经被寄予厚望的 P8 级工程师。Anthropic 当初给你定级的时候,对你的期望是很高的。 一个agent使用的高能动性的skill。 Your AI has been placed on a PIP. 30 days to show improvement.

TypeScript 16,383 943 Updated Apr 18, 2026

每天早上打开通知,高质量论文摘要已经为你准备好 Ciallo~(∠・ω< )⌒☆

Python 284 38 Updated Apr 17, 2026
Python 28 1 Updated Sep 25, 2025

Qwen3.6 is the large language model series developed by Qwen team, Alibaba Group.

2,848 164 Updated Apr 16, 2026

[CVPR 2026] This repository is the official implementation of MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Referring Expression Segmentation

Python 111 1 Updated Mar 24, 2026

一个基于nano banana pro🍌的原生AI PPT生成应用,迈向真正的"Vibe PPT"; 支持上传任意模板图片;上传任意素材&智能解析;一句话/大纲/页面描述自动生成PPT;口头修改指定区域、一键导出可编辑ppt - An AI-native slides generator based on nano banana pro🍌

TypeScript 13,980 1,634 Updated Apr 19, 2026

AcadHomepage: A Modern and Responsive Academic Personal Homepage

SCSS 1 Updated Apr 18, 2026

Imitation Learning; Robotics; Policy; VLA;

Python 35 5 Updated Apr 9, 2026

CVPR 2026 - MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation

Python 44 1 Updated Mar 23, 2026

[ICLR 2026] FastVGGT: Fast Visual Geometry Transformer

Python 737 43 Updated Jan 28, 2026

[ACM MM2025]: Unleashing the Power of Data Generation in One-Pass Outdoor LiDAR Localization

Python 19 Updated Oct 29, 2025

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,624 490 Updated Aug 7, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 17,529 1,589 Updated Sep 5, 2024

This checklist is designed to help you systematically prepare and polish academic papers for top conferences and journals (e.g., ICML, NeurIPS, CVPR). It incorporates widely recommended best practi…

213 14 Updated Jun 30, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,885 1,424 Updated Mar 3, 2026

This is a repository for listing papers on scene graph generation and application.

637 44 Updated Apr 9, 2026

Rank 1st in the leaderboard of SemanticKITTI semantic segmentation (both single-scan and multi-scan) (Nov. 2020) (CVPR2021 Oral)

Python 947 182 Updated Apr 5, 2023
2 Updated Mar 6, 2025

⚙️ Create and run workflows (RPA 2.0)

Python 3,949 313 Updated Apr 17, 2026

[CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning"

Python 239 15 Updated Oct 2, 2025
Python 10 Updated Apr 22, 2025

This project was developed for providing a multi-agent 3D visualization tools for collaborative perception of 3D object detection tasks

Python 15 2 Updated Apr 19, 2025

在没有sudo权限的情况下,在linux上使用clash

Shell 199 21 Updated Nov 14, 2024

(2025 AAAI) CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework

Python 14 Updated Jul 31, 2025

Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method (CVPR-25)

Jupyter Notebook 233 15 Updated Aug 20, 2025

Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels. (CVPR2025)

Python 40 3 Updated Dec 11, 2025
Next