XuGW-Kevin

Guowei Xu XuGW-Kevin

Student in Tsinghua University

99 followers · 19 following

Achievements

Highlights

Stars

126 results for source starred repositories

Clear filter

Lyy-iiis / imeanflow

Official Implementation of iMF https://arxiv.org/abs/2512.02012

Python 159 2 Updated Jan 31, 2026

test-time-training / discover

Python 382 43 Updated Jan 29, 2026

huggingface / smolagents

🤗 smolagents: a barebones library for agents that think in code.

Python 25,259 2,280 Updated Jan 23, 2026

anthropics / claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 63,978 4,836 Updated Feb 4, 2026

compling-wat / vlm-lens

[EMNLP 2025 Demo] Extracting internal representations from vision-language models. Beta version.

Python 107 5 Updated Nov 13, 2025

ZHZisZZ / dllm

dLLM: Simple Diffusion Language Modeling

Python 1,704 169 Updated Jan 6, 2026

piesauce / awesome-dLLM-resources

Frequently updated list of dLLM (Diffusion Large Language Models) papers, models, and other resources

Python 22 Updated Jan 30, 2026

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 11,356 3,016 Updated Feb 3, 2026

AMAP-ML / Tree-GRPO

[ICLR 2026] Tree Search for LLM Agent Reinforcement Learning

Python 281 24 Updated Jan 26, 2026

Shalev-Lifshitz / MultiAgentVerification

Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers

Python 27 2 Updated Mar 1, 2025

aakaran / reasoning-with-sampling

Python 386 51 Updated Nov 7, 2025

zou-group / metatextgrad

metaTextGrad: Automatically optimizing language model optimizers. Published in NeurIPS 2025.

Python 8 2 Updated Nov 5, 2025

huggingface / lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,291 418 Updated Jan 21, 2026

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 12,680 1,549 Updated Apr 24, 2025

suninghuang19 / mentor

MENTOR is a highly efficient visual RL algorithm that excels in both simulation and real-world complex robotic learning tasks.

Python 27 1 Updated Jul 9, 2025

dllm-reasoning / d1

Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"

Python 402 50 Updated Jan 26, 2026

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,546 238 Updated Nov 12, 2025

yongliang-wu / DFT

[ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 531 21 Updated Jan 4, 2026

Unispac / shallow-vs-deep-alignment

Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep

Python 172 14 Updated Apr 23, 2025

arcprize / arc-agi-benchmarking

Testing baseline LLMs performance across various models

Python 336 60 Updated Feb 3, 2026

SWE-bench / SWE-bench

SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 4,232 748 Updated Feb 3, 2026

haykgrigo3 / TimeCapsuleLLM

A LLM trained only on data from certain time periods to reduce modern bias

Python 1,814 62 Updated Feb 2, 2026

PKU-YuanGroup / Look-Back

This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".

Python 84 4 Updated Jul 10, 2025

microsoft / rho

Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.

456 14 Updated Apr 18, 2024

gaojl19 / LfVoid

[NeurIPS 2023] Official code release for the paper: "Can Pre-Trained Text-to-Image Models Generate Visual Goals for Reinforcement Learning?"

Python 6 Updated Sep 29, 2024

PKU-YuanGroup / UniWorld

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Python 838 25 Updated Dec 23, 2025

YuyangSunshine / bioprotocolbench

Python 38 1 Updated Jun 12, 2025

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,683 3,322 Updated Feb 4, 2026

hendrycks / math

The MATH Dataset (NeurIPS 2021)

Python 1,298 112 Updated Sep 6, 2025

nickscamara / open-deep-research

An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl

TypeScript 6,165 741 Updated May 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Guowei Xu XuGW-Kevin

Achievements

Achievements

Highlights

Block or report XuGW-Kevin

Stars

Lyy-iiis / imeanflow

test-time-training / discover

huggingface / smolagents

anthropics / claude-code

compling-wat / vlm-lens

ZHZisZZ / dllm

piesauce / awesome-dLLM-resources

EleutherAI / lm-evaluation-harness

AMAP-ML / Tree-GRPO

Shalev-Lifshitz / MultiAgentVerification

aakaran / reasoning-with-sampling

zou-group / metatextgrad

huggingface / lighteval

Jiayi-Pan / TinyZero

suninghuang19 / mentor

dllm-reasoning / d1

ML-GSAI / LLaDA

yongliang-wu / DFT

Unispac / shallow-vs-deep-alignment

arcprize / arc-agi-benchmarking

SWE-bench / SWE-bench

haykgrigo3 / TimeCapsuleLLM

PKU-YuanGroup / Look-Back

microsoft / rho

gaojl19 / LfVoid

PKU-YuanGroup / UniWorld

YuyangSunshine / bioprotocolbench

NVIDIA-NeMo / NeMo

hendrycks / math

nickscamara / open-deep-research