View thuqinyj16's full-sized avatar
🎯
Focusing


Open-source unified multimodal model

Python 5,783 512 Updated Oct 27, 2025

Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).

Python 13,339 823 Updated Mar 31, 2026

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,566 65 Updated Jun 14, 2025

[ICLR 2026] Computer Agent Arena: Toward Human-Centric Evaluation and Analysis of Computer-Use Agents

HTML 59 4 Updated Feb 26, 2026

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,782 168 Updated Mar 31, 2026

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

TypeScript 29,179 2,860 Updated Mar 27, 2026

A series of technical reports on Slow Thinking with LLMs

Python 764 41 Updated Aug 13, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,347 3,540 Updated Mar 31, 2026

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,978 287 Updated May 15, 2025

Pioneering Automated GUI Interaction with Native Agents

Python 10,016 732 Updated Jan 27, 2026

OS-ATLAS: A Foundation Action Model For Generalist GUI Agents

Python 442 28 Updated Apr 20, 2025
Python 1,346 53 Updated Nov 21, 2024

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 21,258 2,266 Updated Mar 11, 2025

🚀 Efficient implementations for emerging model architectures

Python 4,773 472 Updated Mar 31, 2026

Utilities intended for use with Llama models.

Python 7,539 1,352 Updated Feb 11, 2026

Agentic components of the Llama Stack APIs

4,300 641 Updated Aug 5, 2025

Universal memory layer for AI Agents

Python 51,562 5,771 Updated Mar 31, 2026
Jupyter Notebook 1,735 110 Updated Nov 5, 2025

🍎APPL: A Prompt Programming Language. Seamlessly integrate LLMs with programs.

Python 264 7 Updated Feb 20, 2025

Grok open release

Python 51,519 8,467 Updated Aug 30, 2024

A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.

Python 222 22 Updated Apr 15, 2025

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 4,086 585 Updated Apr 24, 2024

ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K question-answer pairs collected by human annotators for ~35K…

Python 142 9 Updated Feb 7, 2025

UFO³: Weaving the Digital Agent Galaxy

Python 8,306 1,007 Updated Mar 25, 2026

Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"

Python 62 7 Updated Feb 20, 2024

[ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.

Python 409 36 Updated Dec 23, 2024

A keyboard shortcut browser extension for keyboard-based navigation and tab operations with an advanced omnibar

TypeScript 4,334 299 Updated Feb 3, 2026

The agent engineering platform

Python 131,754 21,715 Updated Mar 31, 2026

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,907 305 Updated Jan 16, 2024

The repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models".

Python 86 12 Updated Jul 13, 2024