Skip to content
View XuGW-Kevin's full-sized avatar

Highlights

  • Pro

Block or report XuGW-Kevin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
126 results for source starred repositories
Clear filter

Official Implementation of iMF https://arxiv.org/abs/2512.02012

Python 159 2 Updated Jan 31, 2026

🤗 smolagents: a barebones library for agents that think in code.

Python 25,259 2,280 Updated Jan 23, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 63,978 4,836 Updated Feb 4, 2026

[EMNLP 2025 Demo] Extracting internal representations from vision-language models. Beta version.

Python 107 5 Updated Nov 13, 2025

dLLM: Simple Diffusion Language Modeling

Python 1,704 169 Updated Jan 6, 2026

Frequently updated list of dLLM (Diffusion Large Language Models) papers, models, and other resources

Python 22 Updated Jan 30, 2026

A framework for few-shot evaluation of language models.

Python 11,356 3,016 Updated Feb 3, 2026

[ICLR 2026] Tree Search for LLM Agent Reinforcement Learning

Python 281 24 Updated Jan 26, 2026

Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers

Python 27 2 Updated Mar 1, 2025

metaTextGrad: Automatically optimizing language model optimizers. Published in NeurIPS 2025.

Python 8 2 Updated Nov 5, 2025

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,291 418 Updated Jan 21, 2026

Minimal reproduction of DeepSeek R1-Zero

Python 12,680 1,549 Updated Apr 24, 2025

MENTOR is a highly efficient visual RL algorithm that excels in both simulation and real-world complex robotic learning tasks.

Python 27 1 Updated Jul 9, 2025

Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"

Python 402 50 Updated Jan 26, 2026

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,546 238 Updated Nov 12, 2025

[ICLR 2026] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.

Python 531 21 Updated Jan 4, 2026

Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep

Python 172 14 Updated Apr 23, 2025

Testing baseline LLMs performance across various models

Python 336 60 Updated Feb 3, 2026

SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 4,232 748 Updated Feb 3, 2026

A LLM trained only on data from certain time periods to reduce modern bias

Python 1,814 62 Updated Feb 2, 2026

This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".

Python 84 4 Updated Jul 10, 2025

Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.

456 14 Updated Apr 18, 2024

[NeurIPS 2023] Official code release for the paper: "Can Pre-Trained Text-to-Image Models Generate Visual Goals for Reinforcement Learning?"

Python 6 Updated Sep 29, 2024

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Python 838 25 Updated Dec 23, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,683 3,322 Updated Feb 4, 2026

The MATH Dataset (NeurIPS 2021)

Python 1,298 112 Updated Sep 6, 2025

An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl

TypeScript 6,165 741 Updated May 7, 2025
Next