Skip to content
View lidingm's full-sized avatar
🎯
Day Day Up
🎯
Day Day Up

Highlights

  • Pro

Block or report lidingm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu — one CLI, zero API fees.

Python 12,441 939 Updated Mar 27, 2026

Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.

Python 273 14 Updated Mar 25, 2026

Official code for PEARL: Personalized Streaming Video Understanding Model

Python 41 3 Updated Mar 24, 2026

This repo accompanies the research paper, ARKitScenes - A Diverse Real-World Dataset for 3D Indoor Scene Understanding Using Mobile RGB-D Data and contains the data, scripts to visualize and proces…

Python 890 80 Updated Sep 21, 2024

Advancing AI by embracing human-likeness for better AI understanding, human–AI collaboration, and social simulation, bridging technology and genuine human experience.

Python 82 9 Updated Mar 27, 2026
Python 17 Updated Mar 19, 2026

Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning

Python 27 1 Updated Mar 17, 2026

[CVPR'26] SimRecon: SimReady Compositional Scene Reconstruction from Real Videos

Python 71 4 Updated Mar 19, 2026

A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular configuration across SFT, RLVR, and evaluation workflows.

Python 514 37 Updated Mar 27, 2026

Foundations of Medical Large Language Model Learning

441 70 Updated Mar 5, 2026

WebVR: Benchmarking Multimodal LLMs for WebPage Recreation from Videos via Human-Aligned Visual Rubrics

Python 8 Updated Mar 11, 2026

CoCo: CoCo as CoT for Text-to-Image Preview and Rare Concept Generation

49 2 Updated Mar 10, 2026

[🚀 ICLR 2026 Oral] NextStep-1: SOTA Autogressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s Multimodal Intelligence team.

Python 652 23 Updated Feb 27, 2026
Python 51 Updated Feb 25, 2026

VIGA: Vision-as-Inverse-Graphics Agent

Python 910 84 Updated Feb 25, 2026
Python 23 Updated Feb 3, 2026

Fast, Sharp & Reliable Agentic Intelligence

C++ 1,933 76 Updated Mar 22, 2026

[NeurIPS 2025] Official code implementation of Perception R1: Pioneering Perception Policy with Reinforcement Learning

Python 289 12 Updated Jul 15, 2025

CoV: Chain-of-View Prompting for Spatial Reasoning

Python 52 1 Updated Jan 23, 2026

Holistic Evaluation of Multimodal LLMs on Spatial Intelligence

Dockerfile 91 7 Updated Mar 26, 2026

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 2,925 373 Updated Mar 28, 2026

[CVPR 2026] InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields

Python 852 26 Updated Mar 25, 2026

Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its size.

402 29 Updated Jan 21, 2026

Stable Virtual Camera: Generative View Synthesis with Diffusion Models

Python 1,581 115 Updated Mar 3, 2026

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Python 335 14 Updated Feb 5, 2026

Code for "In-Context Former: Lightning-fast Compressing Context for Large Language Model" (Findings of EMNLP 2024)

Python 21 2 Updated Nov 21, 2024

STEP-GUI: The top GUI agent solution in the galaxy. Developed by the StepFun-GELab team and powered by StepFun’s cutting-edge research capabilities.

Python 2,103 178 Updated Mar 14, 2026

Depth Anything 3

Python 4,823 498 Updated Mar 21, 2026
Python 2 Updated Jan 29, 2026
Next