Official code for "SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization"

Python 162 5 Updated Apr 7, 2026

The repo is finally unlocked. Enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd. Built in Rust using oh-my-codex.

Rust 179,526 106,484 Updated Apr 9, 2026

Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu — one CLI, zero API fees.

Python 16,646 1,441 Updated Apr 2, 2026

Claw-Eval is a harness for evaluating LLMs as agents. All tasks are verified by humans.

Python 360 22 Updated Apr 9, 2026

Official code for PEARL: Personalized Streaming Video Understanding Model

Python 49 3 Updated Mar 24, 2026

This repo accompanies the research paper, ARKitScenes - A Diverse Real-World Dataset for 3D Indoor Scene Understanding Using Mobile RGB-D Data and contains the data, scripts to visualize and proces…

Python 893 80 Updated Sep 21, 2024

Advancing AI by embracing human-likeness for better AI understanding, human–AI collaboration, and social simulation, bridging technology and genuine human experience.

Python 88 9 Updated Apr 9, 2026

[ACL 2026] CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution

Python 21 Updated Apr 6, 2026

Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning

Python 28 2 Updated Mar 17, 2026

[CVPR'26 Highlight] SimRecon: SimReady Compositional Scene Reconstruction from Real Videos

Python 75 4 Updated Apr 9, 2026

A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular configuration across SFT, RLVR, and evaluation workflows.

Python 540 37 Updated Apr 7, 2026

Foundations of Medical Large Language Model Learning

562 92 Updated Mar 5, 2026

WebVR: Benchmarking Multimodal LLMs for WebPage Recreation from Videos via Human-Aligned Visual Rubrics

Python 8 Updated Apr 7, 2026

CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation

Python 50 2 Updated Apr 9, 2026

[🚀 ICLR 2026 Oral] NextStep-1: SOTA Autoregressive Image Generation with Continuous Tokens. A research project developed by StepFun’s Multimodal Intelligence team.

Python 655 23 Updated Feb 27, 2026
Python 51 Updated Feb 25, 2026

VIGA: Vision-as-Inverse-Graphics Agent

Python 917 83 Updated Feb 25, 2026
Python 23 Updated Feb 3, 2026

Fast, Sharp & Reliable Agentic Intelligence

C++ 1,994 79 Updated Apr 3, 2026

[NeurIPS 2025] Official code implementation of Perception R1: Pioneering Perception Policy with Reinforcement Learning

Python 291 12 Updated Jul 15, 2025

[ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning

Python 52 1 Updated Apr 7, 2026

Holistic Evaluation of Multimodal LLMs on Spatial Intelligence

Python 98 7 Updated Apr 3, 2026

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 3,037 392 Updated Apr 9, 2026

[CVPR 2026] InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields

Python 903 31 Updated Apr 3, 2026

Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its size.

402 28 Updated Jan 21, 2026

Stable Virtual Camera: Generative View Synthesis with Diffusion Models

Python 1,586 116 Updated Mar 3, 2026

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Python 337 14 Updated Feb 5, 2026

Code for "In-Context Former: Lightning-fast Compressing Context for Large Language Model" (Findings of EMNLP 2024)

Python 21 2 Updated Nov 21, 2024

STEP-GUI: The top GUI agent solution in the galaxy. Developed by the StepFun-GELab team and powered by StepFun’s cutting-edge research capabilities.

Python 2,116 180 Updated Mar 14, 2026