Skip to content
View lidingm's full-sized avatar
🎯
Day Day Up
🎯
Day Day Up

Highlights

  • Pro

Block or report lidingm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[Notice] The repo temporarily locked while ownership transfer. in the meantime we maintain on here: https://github.com/ultraworkers/claw-code-parity. The fastest repo in history to surpass 100K sta…

Rust 151,069 101,330 Updated Apr 2, 2026

Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu — one CLI, zero API fees.

Python 14,621 1,206 Updated Apr 2, 2026

Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.

Python 301 19 Updated Apr 2, 2026

Official code for PEARL: Personalized Streaming Video Understanding Model

Python 43 3 Updated Mar 24, 2026

This repo accompanies the research paper, ARKitScenes - A Diverse Real-World Dataset for 3D Indoor Scene Understanding Using Mobile RGB-D Data and contains the data, scripts to visualize and proces…

Python 893 80 Updated Sep 21, 2024

Advancing AI by embracing human-likeness for better AI understanding, human–AI collaboration, and social simulation, bridging technology and genuine human experience.

Python 85 9 Updated Mar 31, 2026
Python 18 Updated Mar 31, 2026

Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning

Python 28 1 Updated Mar 17, 2026

[CVPR'26] SimRecon: SimReady Compositional Scene Reconstruction from Real Videos

Python 72 4 Updated Mar 19, 2026

A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular configuration across SFT, RLVR, and evaluation workflows.

Python 524 37 Updated Mar 31, 2026

Foundations of Medical Large Language Model Learning

492 77 Updated Mar 5, 2026

WebVR: Benchmarking Multimodal LLMs for WebPage Recreation from Videos via Human-Aligned Visual Rubrics

Python 8 Updated Mar 11, 2026

CoCo: CoCo as CoT for Text-to-Image Preview and Rare Concept Generation

50 2 Updated Mar 10, 2026

[🚀 ICLR 2026 Oral] NextStep-1: SOTA Autogressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s Multimodal Intelligence team.

Python 653 23 Updated Feb 27, 2026
Python 51 Updated Feb 25, 2026

VIGA: Vision-as-Inverse-Graphics Agent

Python 913 83 Updated Feb 25, 2026
Python 23 Updated Feb 3, 2026

Fast, Sharp & Reliable Agentic Intelligence

C++ 1,970 77 Updated Mar 22, 2026

[NeurIPS 2025] Official code implementation of Perception R1: Pioneering Perception Policy with Reinforcement Learning

Python 290 12 Updated Jul 15, 2025

CoV: Chain-of-View Prompting for Spatial Reasoning

Python 52 1 Updated Jan 23, 2026

Holistic Evaluation of Multimodal LLMs on Spatial Intelligence

Dockerfile 93 7 Updated Apr 2, 2026

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 2,974 378 Updated Apr 2, 2026

[CVPR 2026] InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields

Python 878 29 Updated Apr 2, 2026

Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its size.

402 29 Updated Jan 21, 2026

Stable Virtual Camera: Generative View Synthesis with Diffusion Models

Python 1,582 116 Updated Mar 3, 2026

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Python 336 14 Updated Feb 5, 2026

Code for "In-Context Former: Lightning-fast Compressing Context for Large Language Model" (Findings of EMNLP 2024)

Python 21 2 Updated Nov 21, 2024

STEP-GUI: The top GUI agent solution in the galaxy. Developed by the StepFun-GELab team and powered by StepFun’s cutting-edge research capabilities.

Python 2,108 178 Updated Mar 14, 2026

Depth Anything 3

Python 4,878 504 Updated Mar 21, 2026
Next