Focused Papers, Delivered Simply :)

Python 46 1 Updated Nov 21, 2025

PRISM: Robust VLM Alignment with Principled Reasoning for Integrated Safety in Multimodality

Python 4 Updated Sep 12, 2025

A toolbox for benchmarking the trustworthiness of multimodal LLM agents across truthfulness, controllability, safety, and privacy dimensions through 34 interactive tasks

Python 60 4 Updated Jun 30, 2025
Python 704 14 Updated Nov 20, 2025

[ISMIR 2025] A curated list of vision-to-music generation: methods, datasets, evaluation and challenges.

113 3 Updated Aug 9, 2025

[NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"

Python 181 21 Updated Apr 12, 2025
Python 33 10 Updated Mar 6, 2025

Official codebase for "STAIR: Improving Safety Alignment with Introspective Reasoning"

Python 86 6 Updated Feb 26, 2025

AISafetyLab: A comprehensive framework covering safety attack, defense, evaluation and paper list.

Python 220 15 Updated Aug 29, 2025
Python 25 1 Updated Nov 4, 2024

Codebase for PINNacle: A Comprehensive Benchmark of Physics-Informed Neural Networks for Solving PDEs.

Python 3 Updated Sep 20, 2023

Open source AI/ML capabilities for the FiftyOne ecosystem

Python 152 13 Updated Dec 18, 2025

DanceCamAnimator: Keyframe-Based Controllable 3D Dance Camera Synthesis. [ACMMM 2024] Official PyTorch implementation

Python 38 1 Updated Sep 24, 2024

[NTIRE2024] Official code for "Towards Real-world Video Face Restoration: A New Benchmark"

Python 31 2 Updated Jul 29, 2024

Adversarial Distributional Training (NeurIPS 2020)

Python 63 9 Updated Mar 17, 2021

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

Python 1,603 83 Updated Oct 29, 2025

A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)

Python 173 11 Updated Jun 27, 2025

[NeurIPS 2024 Spotlight ⭐️ & TPAMI 2025] Parameter-Inverted Image Pyramid Networks (PIIP)

Python 107 5 Updated Aug 5, 2025

[CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompting for Multimodal Large Language Models" has been accepted …

Python 46 Updated Dec 20, 2024

[EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models

Python 212 19 Updated Feb 11, 2024

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Python 3,173 615 Updated Jul 19, 2024

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Jupyter Notebook 857 108 Updated Jan 16, 2025

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

1,807 151 Updated Jun 17, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,321 4,781 Updated Jun 2, 2025

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,568 128 Updated Nov 24, 2025

Refine high-quality datasets and visual AI models

Python 10,156 693 Updated Dec 19, 2025

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,254 4,029 Updated Jul 17, 2024