yueliu1999

🎯

Focusing

Yue Liu, a Ph.D. student at NUS.

393 followers · 544 following

National University of Singapore
Singapore
yueliu1999.github.io

Achievements

Lists (1)

🔮 Future ideas

Stars

zz-haooo / ReCreate

Python 101 Updated Jan 30, 2026

Buyun-Liang / SECA

[NeurIPS 2025] SECA: Semantically Equivalent and Coherent Attacks for Eliciting LLM Hallucinations

Python 67 Updated Dec 10, 2025

YuyaoGe / Awesome-Vibe-Coding

112 14 Updated Oct 29, 2025

IBM / ares

AI Robustness Evaluation System

Python 34 19 Updated Feb 4, 2026

WenkeHuang / MAPO

MAPO: MIXED ADVANTAGE POLICY OPTIMIZATION

Python 38 Updated Sep 24, 2025

eval-sys / mcpmark

MCPMark is a comprehensive, stress-testing MCP benchmark designed to evaluate model and agent capabilities in real-world MCP use.

Python 383 29 Updated Jan 27, 2026

laude-institute / terminal-bench

A benchmark for LLMs on complicated tasks in the terminal

Python 1,474 466 Updated Jan 22, 2026

zhirui-gao / Curve-Gaussian

[ICCV 2025] Official PyTorch Implementation of "Curve-Aware Gaussian Splatting for 3D Parametric Curve Reconstruction"

Python 51 1 Updated Sep 5, 2025

zhirui-gao / PartGS

[ICCV 2025] Official PyTorch Implementation of "Learning Self-supervised Part-aware 3D Hybrid Representations of 2D Gaussians and Superquadrics"

Python 61 2 Updated Dec 22, 2025

AndrewWTY / UNIT

Python 33 1 Updated Jun 24, 2025

HKUST-KnowComp / Awesome-LLM-Scientific-Discovery

[EMNLP2025] From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery

296 36 Updated Nov 5, 2025

NiceRingNode / Awesome-Generative-Models-for-OCR

[arXiv 25] Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR

247 3 Updated Aug 28, 2025

MoonshotAI / Kimi-Dev

open-source coding LLM for software engineering tasks

Python 1,122 139 Updated Sep 30, 2025

real-absolute-AI / SynthRL

SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis

Python 68 Updated Jul 24, 2025

sail-sg / VeriFree

Reinforcing General Reasoning without Verifiers

Python 96 6 Updated Jun 24, 2025

chchenhui / mlrbench

MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research

Python 22 Updated Sep 23, 2025

chchenhui / awesome-research-agents

🤖️ A collection of papers, blogs and projects of research agents.

6 Updated Feb 2, 2026

JusperLee / AudioTrust

AudioTrust: Benchmarking the Multi-faceted Trustworthiness of Audio Large Language Models

Shell 210 22 Updated Jan 28, 2026

openags / Awesome-AI-Scientist-Papers

A collection of resources and papers on AI Scientist / Robot Scientist

124 4 Updated Sep 30, 2025

sail-sg / AnytimeReasoner

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

Python 51 3 Updated Jul 15, 2025

LukeChen-go / indirect-pia-detection

The official implementation of the work "Can Indirect Prompt Injection Attacks Be Detected and Removed?"

Python 5 1 Updated Dec 25, 2025

LukeChen-go / pia-defense-by-attack

The official implementation of the work "Defense Against Prompt Injection Attack by Leveraging Attack Techniques"

Python 8 3 Updated Jul 22, 2025

yueliu1999 / GuardReasoner-VL

[NeurIPS 2025] Official source code for the paper "GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning".

Python 115 9 Updated Sep 19, 2025

zhiyuanhubj / Meta-Ability-Alignment

Official code of paper "Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models"

Python 86 6 Updated May 27, 2025

MASWorks / MAS-GPT

Official implementation of MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems

Python 73 3 Updated Jun 26, 2025

zhiyuanhubj / social-reasonser

1 Updated May 3, 2025

deepseek-ai / DeepSeek-Prover-V2

1,233 94 Updated Jul 18, 2025

WangCheng0116 / Awesome-LRMs-Safety

Official repository for "Safety in Large Reasoning Models: A Survey" - Exploring safety risks, attacks, and defenses for Large Reasoning Models to enhance their security and reliability.

87 3 Updated Aug 25, 2025

sail-sg / FlowReasoner

Python 144 7 Updated May 6, 2025

yingweima2022 / SWE-Reasoner

25 Updated Aug 2, 2025