  • University of California, Santa Barbara

Organizations

@eric-ai-lab

Python 2 Updated Mar 18, 2026

Official repository for paper: "OmniTrace: A Unified Framework for Generation-Time Attribution in Omni-Modal LLMs"

Python 3 1 Updated Mar 22, 2026
Jupyter Notebook 20 2 Updated Dec 15, 2025

Official implementation of the paper CM2

Python 12 2 Updated Feb 15, 2026

[Up-To-Date] Awesome Agent Memory Paper Resource

114 6 Updated Feb 11, 2026

SafeGround: Know When to Trust GUI Grounding Models via Uncertainty Calibration

Python 9 2 Updated Feb 11, 2026

This is the source code for the SafePro paper

Python 2 Updated Mar 10, 2026

ICLR2026 SAFER: Risk-Constrained Sample-then-Filter in Large Language Models

Python 7 1 Updated Feb 14, 2026

Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space"

Python 70 5 Updated Mar 20, 2026

[ICLR26] Official codebase for the paper "Presenting a Paper is an Art: Self-Improvement Aesthetic Agents for Academic Presentations"

Python 335 22 Updated Oct 14, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,559 239 Updated Jan 8, 2026

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Python 4,300 442 Updated Feb 1, 2026

[NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Python 396 49 Updated Feb 11, 2026

[NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models

Python 75 5 Updated May 31, 2025

[EMNLP 2025] Official code for the paper "SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning"

Python 14 1 Updated Jun 30, 2025

Official code for NeurIPS 2025 paper "GRIT: Teaching MLLMs to Think with Images"

Python 181 11 Updated Jan 16, 2026

Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"

Python 323 39 Updated Jan 26, 2026

Agent S: an open agentic framework that uses computers like a human

Python 10,651 1,235 Updated Feb 21, 2026

Universal memory layer for AI Agents

Python 51,235 5,730 Updated Mar 27, 2026

[ICLR 2025] EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing

Python 24 5 Updated Apr 1, 2025

LLM101n: Let's build a Storyteller

36,621 2,002 Updated Aug 1, 2024

[ACL 2025 Findings] "Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models"

Python 14 1 Updated Feb 25, 2025

Official repo for the paper "Mojito: Motion Trajectory and Intensity Control for Video Generation"

Python 33 1 Updated Jun 11, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,785 1,698 Updated Jan 30, 2026

Large Concept Models: Language modeling in a sentence representation space

Python 2,343 207 Updated Jan 29, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,096 516 Updated Jan 6, 2026

A simple screen-parsing tool for pure-vision-based GUI agents

Jupyter Notebook 24,583 2,152 Updated Sep 12, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 21,240 2,260 Updated Mar 11, 2025

[ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"

Python 32 2 Updated Jun 23, 2025