-
Nanyang Technological University
- Singapore
- https://www.zhihu.com/people/warrior-18-53
Stars
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
Fogsight is an AI agent and animation engine powered by Large Language Models.
π€ smolagents: a barebones library for agents that think in code.
π Make websites accessible for AI agents. Automate tasks online with ease.
Playwright Model Context Protocol Server - Tool to automate Browsers and APIs in Claude Desktop, Cline, Cursor IDE and More π
TPAMI 2026 | This repository collects awesome survey, resource, and paper for lifelong learning LLM agents
OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's AlphaEvolve.
LIFEBENCH: Evaluating Length Instruction Following in Large Language Models
Open-source implementation of AlphaEvolve
π The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
An awesome repository & A comprehensive survey on interpretability of LLM attention heads.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 π and reasoning techniques.
The code for AED which's a method to help LLM defend jailbreaks
[ICML 2024] Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications
S-Eval: Towards Automated and Comprehensive Safety Evaluation for Large Language Models
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
Using sparse coding to find distributed representations used by neural networks.
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models [NeurIPS 2024 Datasets and Benchmarks Track]
Repository for "StrongREJECT for Empty Jailbreaks" paper
Train transformer language models with reinforcement learning.
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
Papers and resources related to the security and privacy of LLMs π€
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models