lifan-yuan
  • Urbana-Champaign, IL



Scalable RL solution for advanced reasoning of language models

Python 1,843 110 Updated Mar 18, 2025

Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"

Python 62 1 Updated Jun 3, 2024

A large-scale, fine-grained, diverse preference dataset (and models).

Python 367 16 Updated Dec 29, 2023

Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihan Wang*, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng and …

Python 135 8 Updated Jun 4, 2024

[NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evaluations".

Python 37 4 Updated Jun 8, 2023

Source code for ACL 2023 Findings paper "From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework"

Python 8 1 Updated Jun 15, 2023

Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained Language Models"

Python 11 1 Updated May 9, 2023

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Jupyter Notebook 2,770 143 Updated Aug 4, 2024

A Parallel Completion Python Library that boosts your OpenAI-API query with task queue & multiprocessing.

Python 25 1 Updated May 15, 2023

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 73,156 7,887 Updated Mar 11, 2026

Paragraph-by-paragraph close readings of classic and new deep learning papers

32,838 2,782 Updated Mar 22, 2025

An (unofficial) implementation of Focal Loss, as described in the RetinaNet paper, generalized to the multi-class case.

Python 240 27 Updated Jan 22, 2024

An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)

Python 203 27 Updated Apr 10, 2023

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 159,209 32,834 Updated Apr 11, 2026