Skip to content
View brightjade's full-sized avatar
☁️
☁️

Highlights

  • Pro

Block or report brightjade

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
56 results for source starred repositories
Clear filter

Demonstration and Template Projects

GDScript 8,171 2,027 Updated Feb 3, 2026

A curated list of free/libre plugins, scripts and add-ons for Godot

9,325 493 Updated Feb 5, 2026

Large Language Models for Software Engineering: A Systematic Literature Review

TeX 104 7 Updated Dec 8, 2025

[ICLR 2026] LLM/VLM gaming agents and model evaluation through games.

Python 858 93 Updated Nov 16, 2025

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 11,267 1,265 Updated Feb 2, 2026

Unix ASCII games

HTML 970 69 Updated Aug 27, 2025

Harbor is a framework for running agent evaluations and creating and using RL environments.

Python 556 316 Updated Feb 6, 2026

Must-read papers on Repository-level Code Generation & Issue Resolution 🔥

250 22 Updated Dec 22, 2025

Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses.

1,197 101 Updated Feb 6, 2026

[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

Jupyter Notebook 3,550 317 Updated Dec 24, 2024

The AILuminate v1.1 benchmark suite is an AI risk assessment benchmark developed with broad involvement from leading AI companies, academia, and civil society.

69 14 Updated Jun 11, 2025

s1: Simple test-time scaling

Python 6,634 766 Updated Jun 25, 2025

A simple evaluation of generative language models and safety classifiers.

Python 85 22 Updated Dec 11, 2025

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

20,102 2,511 Updated Feb 1, 2026

A curated list of awesome open-source libraries for production LLM

508 56 Updated Dec 31, 2024

A curated list of awesome Multimodal studies.

312 23 Updated Dec 14, 2025

Set of tools to assess and improve LLM security.

Python 4,010 695 Updated Feb 5, 2026

✨✨Latest Advances on Multimodal Large Language Models

17,316 1,111 Updated Jan 27, 2026

A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…

HTML 1,768 91 Updated Feb 1, 2026

Official repository for "Evaluating Visual and Cultural Interpretation: The K-Viscuit Benchmark with Human-VLM Collaboration, arXiv 2024.06" (https://arxiv.org/pdf/2406.16469)

Python 7 2 Updated Aug 17, 2024

Instruction Tuning with GPT-4

HTML 4,340 308 Updated Jun 11, 2023

Holistic evaluation of multimodal foundation models

Python 49 1 Updated Aug 11, 2024

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,625 512 Updated Feb 4, 2026

A resource repository for machine unlearning in large language models

533 30 Updated Jan 6, 2026

Grok open release

Python 51,464 8,495 Updated Aug 30, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,296 1,003 Updated Jul 1, 2024
Next