  • Beijing


Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…

Python 13,610 1,717 Updated Jun 15, 2026

[ICML 2026] What Does Vision Tool-Use Reinforcement Learning Really Learn? Disentangling Tool-Induced and Intrinsic Effects for Crop-and-Zoom

Python 22 Updated May 15, 2026

Code for paper OpenWebRL: Online Multi-Turn Reinforcement Learning for Visual Web Agents

Python 37 2 Updated Jun 6, 2026

InfoSFT is a modified supervised fine-tuning algorithm that generalizes better and forgets less.

Python 4 Updated May 19, 2026

CUA-Gym-Hub: mock web apps as reproducible RL training environments for computer-use agents

JavaScript 56 4 Updated May 26, 2026

Scalable pipeline for synthesizing verifiable RLVR training data for computer-use agents

Python 161 12 Updated May 26, 2026
Python 206 13 Updated Jun 15, 2026

Symphony turns project work into isolated, autonomous implementation runs, allowing teams to manage work instead of supervising coding agents.

Elixir 25,549 2,590 Updated Jun 9, 2026

OpenSeeker: A search agent with open-source data and models

Python 750 56 Updated May 22, 2026

[NeurIPS 2025 Spotlight] OpenCUA: Open Foundations for Computer-Use Agents

Python 788 104 Updated May 25, 2026

Code for "WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"

Python 1,096 119 Updated Mar 4, 2024

An Illusion of Progress? Assessing the Current State of Web Agents

Python 182 12 Updated May 28, 2026

Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.

Python 874 95 Updated Apr 13, 2026

This is the official code base of AgentNetTool in OpenCUA. Website: https://opencua.xlang.ai/

TypeScript 48 10 Updated Sep 3, 2025

SWE-bench: Can Language Models Resolve Real-world GitHub Issues?

Python 5,232 902 Updated Apr 1, 2026

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 2,957 484 Updated Jun 10, 2026

Implementation of paper: Scaling the Scaling Logic

Python 3 Updated Mar 1, 2026
Python 155 7 Updated May 14, 2025

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python 9,022 766 Updated Mar 25, 2026

[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.

Python 510 43 Updated Jan 28, 2026

Easy and Efficient dLLM Fine-Tuning

Python 259 15 Updated Mar 2, 2026

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation

Python 825 57 Updated Jul 9, 2025

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)

Python 1,654 89 Updated Feb 14, 2026

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 1,047 130 Updated May 30, 2026

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,831 268 Updated Nov 12, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 17,404 1,400 Updated Mar 25, 2026

GLM-OCR: Accurate × Fast × Comprehensive

Python 7,029 641 Updated Apr 21, 2026
Python 284 12 Updated Mar 4, 2026

Multimodal OCR: Parse Anything from Documents

Python 272 21 Updated Mar 20, 2026

DMax: Aggressive Parallel Decoding for dLLMs

Python 126 7 Updated May 25, 2026