Python 5 Updated May 27, 2026

AI agents running research on single-GPU nanochat training automatically

Python 87,204 12,627 Updated Mar 26, 2026
Python 5 Updated Mar 31, 2026

[Arxiv] Official repo for "Subspace Control: Turning Constrained Model Steering into Controllable Spectral Optimization"

Python 5 Updated Apr 5, 2026

This code implements the algorithm of FIPO, a value-free RL recipe for eliciting deeper reasoning from a clean base model.

Python 17 1 Updated Mar 24, 2026

Repo for vLLM Hook, a vLLM plug-in for programming internal states of models deployed on vLLM

Jupyter Notebook 85 21 Updated Jun 11, 2026

This code implements the algorithm of FIPO, a value-free RL recipe for eliciting deeper reasoning from a clean base model.

Python 126 6 Updated Apr 7, 2026
Python 15 1 Updated May 2, 2026

🚀 Efficient implementations for emerging model architectures

Python 5,224 557 Updated Jun 11, 2026

Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".

Python 406 110 Updated Jun 13, 2025

Official Implementation for NorMuon paper

Python 81 4 Updated Apr 30, 2026

A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard.

Python 12,565 1,425 Updated Aug 20, 2024

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,338 173 Updated May 16, 2026

Muon is an optimizer for hidden layers in neural networks

Python 2,664 123 Updated May 24, 2026

Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

Python 884 123 Updated Aug 20, 2024

A comprehensive benchmark framework for evaluating the physical safety of Large Language Models (LLMs).

Python 2 1 Updated Oct 2, 2025

[CVPR 2025] Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents

Python 33 1 Updated Jun 3, 2025

[ICCV 2025] GUIOdyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUIOdyssey consists of 8,834 episodes from 6 mobile devices, spanning 6 types of cross-app…

Python 158 9 Updated Jan 3, 2026

[ICLR 2026] Variation in Verification: Understanding Verification Dynamics in Large Language Models

Jupyter Notebook 6 1 Updated Apr 29, 2026

Awesome GUI Agent Paper List

TypeScript 821 41 Updated Jun 17, 2026
Python 7 1 Updated Jun 22, 2025

Official Repo for LayerCraft

Python 15 1 Updated May 3, 2026

Mobile-Agent: The Powerful GUI Agent Family

Python 8,841 887 Updated May 14, 2026

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,953 91 Updated Jan 8, 2026

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Python 848 109 Updated Feb 3, 2025

We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for complex reasoning tasks. Building on this resource, we propose Sou…

Python 1,111 131 Updated Nov 26, 2025

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Python 1,516 242 Updated Nov 26, 2025

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

TypeScript 36,552 3,692 Updated May 18, 2026

Pioneering Automated GUI Interaction with Native Agents

Python 10,964 824 Updated Jan 27, 2026