Skip to content
View Evanwu1125's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.
  • 香港科技大学(广州)
  • 00:28 (UTC +08:00)

Block or report Evanwu1125

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Native Multimodal Models are World Learners

Python 1,129 39 Updated Nov 5, 2025

Next paradigm for LLM Agent. Unify plan and action through recursive code generation for adaptive, human-like decision-making.

Python 354 38 Updated Nov 1, 2025

Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm or Overstated Hype?"

Python 218 10 Updated Oct 29, 2025

VisJudgeBench: A comprehensive benchmark for aesthetics and quality assessment of visualizations, featuring 3,090 expert-annotated samples with six-dimensional quality scores.

55 1 Updated Nov 4, 2025

C++17 Low latency log lib

C++ 2 Updated Oct 26, 2025

交易模块

Python 7,434 1,691 Updated Sep 10, 2025

UI-Venus is a native UI agent designed to perform precise GUI element grounding and effective navigation using only screenshots as input.

Python 495 34 Updated Aug 25, 2025

神级Cursor Rule

1,753 269 Updated Oct 5, 2025

DataMosaic: Explainable and Verifiable Document-Based Data Analytics

TypeScript 20 Updated Jun 30, 2025

🌟100+ 原创 LLM / RL 原理图📚,《大模型算法》作者巨献!💥(100+ LLM/RL Algorithm Maps )

Python 1,756 187 Updated Nov 4, 2025
Python 8 Updated Sep 15, 2025

🤗 smolagents: a barebones library for agents that think in code.

Python 23,784 2,097 Updated Oct 30, 2025

✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Python 267 22 Updated May 9, 2025
Python 15 Updated Jun 10, 2025

🔥[NeurIPS'25] DeepFund: Pilot for Your Next Fund Investment

Python 213 35 Updated Oct 30, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,130 2,427 Updated Nov 5, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’

Jupyter Notebook 2,243 100 Updated Oct 29, 2025

Witness the aha moment of VLM with less than $3.

Python 3,975 290 Updated May 19, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,318 807 Updated Oct 31, 2025

一个由大语言模型驱动的AI版骗子酒馆对战框架

Python 633 118 Updated Mar 7, 2025

ArxivFlow - Periodic Track on arXiv Paper

JavaScript 49 1 Updated Sep 10, 2025

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 14,703 1,614 Updated Nov 5, 2025

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 41,154 5,215 Updated Jun 27, 2024

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

Python 1,503 227 Updated Apr 3, 2024

A fork to add multimodal model training to open-r1

Python 1,416 70 Updated Feb 8, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,673 366 Updated Oct 21, 2025

OpenSeek aims to unite the global open source community to drive collaborative innovation in algorithms, data and systems to develop next-generation models that surpass DeepSeek.

Python 238 39 Updated Sep 12, 2025

Famous Vision Language Models and Their Architectures

Markdown 1,065 50 Updated Feb 24, 2025

Doge Family of Small Language Models

Python 180 12 Updated Aug 13, 2025
Next