Skip to content
View brightjade's full-sized avatar
☁️
☁️

Highlights

  • Pro

Block or report brightjade

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Harbor is a framework for running agent evaluations and creating and using RL environments.

Python 213 119 Updated Dec 19, 2025

Must-read papers on Repository-level Code Generation & Issue Resolution 🔥

225 21 Updated Dec 9, 2025

Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses.

1,121 97 Updated Dec 17, 2025

[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

Jupyter Notebook 3,484 317 Updated Dec 24, 2024

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

1,808 151 Updated Jun 17, 2025

The AILuminate v1.1 benchmark suite is an AI risk assessment benchmark developed with broad involvement from leading AI companies, academia, and civil society.

64 14 Updated Jun 11, 2025

s1: Simple test-time scaling

Python 6,614 764 Updated Jun 25, 2025

[BOOSTCAMP AI 3rd][NLP][🥉3등] 문장의 단어(Entity)에 대한 속성과 관계를 예측하는 인공지능 만들기

Jupyter Notebook 2 Updated Aug 23, 2022

A simple evaluation of generative language models and safety classifiers.

Python 80 22 Updated Dec 11, 2025

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

19,772 2,488 Updated Dec 16, 2025

A curated list of awesome open-source libraries for production LLM

506 54 Updated Dec 31, 2024

A curated list of awesome Multimodal studies.

301 23 Updated Dec 14, 2025

Set of tools to assess and improve LLM security.

Python 3,932 681 Updated Dec 19, 2025

✨✨Latest Advances on Multimodal Large Language Models

17,018 1,094 Updated Dec 12, 2025

A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…

HTML 1,718 87 Updated Dec 19, 2025

Official repository for "Evaluating Visual and Cultural Interpretation: The K-Viscuit Benchmark with Human-VLM Collaboration, arXiv 2024.06" (https://arxiv.org/pdf/2406.16469)

Python 7 2 Updated Aug 17, 2024

Instruction Tuning with GPT-4

HTML 4,339 306 Updated Jun 11, 2023

Holistic evaluation of multimodal foundation models

Python 47 1 Updated Aug 11, 2024

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,393 461 Updated Dec 18, 2025

A resource repository for machine unlearning in large language models

514 31 Updated Dec 17, 2025

Grok open release

Python 50,572 8,373 Updated Aug 30, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,217 985 Updated Jul 1, 2024
Python 231 31 Updated Mar 30, 2021

Awesome list of Korean Large Language Models.

474 33 Updated Oct 31, 2023

A framework for few-shot evaluation of language models.

Python 10,974 2,909 Updated Dec 18, 2025

Editing Models with Task Arithmetic

Python 522 48 Updated Jan 11, 2024

Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".

Python 107 10 Updated Jun 8, 2023
Next