xinykou

Follow

Xin Yi xinykou

Follow

8 followers · 17 following

ECNU
Shanghai, China

Lists (9)

Sort

Better Dataset

dataset

Knowledge

new

find a direction

parameter_efficient

remote sensing

Social Good

toxic

unlearning

forget privacy message

Stars

kweaver-ai / adp

ADP is an intelligent data platform that bridges the gap between heterogeneous data sources and AI agents. It abstracts data complexity through business knowledge networks, provides unified data ac…

Go 36 21 Updated Apr 23, 2026

ttguy0707 / CyberClaw

👾 下一代透明智能体架构 | Next-Gen Transparent Agent Architecture 🔍 全行为审计 | 🛡️ 两段式安全调用 | 🧠 双水位记忆 | ⏰ 心跳任务 📊 P0 级事故率降低 80% | 兼容 OpenClaw + Claude Code 技能生态

Python 165 17 Updated Apr 22, 2026

VisionXLab / Awesome-RS-VL-Data

Awesome Remote Sensing Vision-Language Datasets

69 2 Updated Apr 22, 2026

thunlp / H-Neurons

The official implementation of the paper: H-Neurons: On the Existence, Impact, and Origin of Hallucination-Associated Neurons in LLMs

Python 60 11 Updated Jan 14, 2026

ws-jiang / MetaDefense

Official Implementation of MetaDefense (NeurIPS 2025)

Python 6 2 Updated Oct 10, 2025

xinykou / CDG-KD

Python 2 Updated Aug 25, 2025

TheShineyue / HSR

[ACL 2025 Findings] Hierarchical Safety Realignment: Lightweight Restoration of Safety in Pruned Large Vision-Language Models

Python 5 1 Updated May 25, 2025

redwyd / SymMark

Accepted to ACL'25 (main)

Python 7 1 Updated May 16, 2025

xinykou / FGDILP

Fine-Grained Detoxification via Instance-Level Prefixes for Large Language Models (accepted by Nurocomputing)

Python 1 Updated Oct 7, 2024

THU-BPM / Watermark-Radioactivity-Attack

[ACL 2025 Main] Code and data for paper "Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation?"

Python 22 2 Updated Jun 18, 2025

Trustworthy-AI-Group / Adversarial_Examples_Papers

A list of recent papers about adversarial learning

349 21 Updated Apr 17, 2026

ydyjya / Awesome-LLM-Safety

A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…

HTML 1,835 104 Updated Apr 18, 2026

hiyouga / LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,676 8,634 Updated Apr 27, 2026

git-disl / Vaccine

This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)

Shell 49 5 Updated Jan 15, 2026

tanganke / subspace_fusion

Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"

Python 14 Updated Mar 28, 2024

mlabonne / llm-datasets

Curated list of datasets and tools for post-training.

4,473 367 Updated Apr 27, 2026

Lucidreamer9 / SHED-Shapley-Based-Automated-Dataset-Refinement

Python 9 2 Updated Jul 21, 2025

declare-lab / resta

Restore safety in fine-tuned language models through task arithmetic

Python 32 2 Updated Mar 28, 2024

zhoucz97 / myLearning

记录个人的学习历程。包括但不限于算法、机器学习、论文写作等。

116 12 Updated Feb 24, 2025

Cohere-Labs-Community / goodtriever

Code for "Goodtriever: Toxicity Mitigation with Retrieval-augmented Language Models"

Jupyter Notebook 25 3 Updated May 30, 2024

margaritageleta / multilingual-toxicity-detector

NLP deep learning model for multilingual toxicity detection in text 📚

Jupyter Notebook 13 1 Updated Aug 10, 2020

Yangyi-Chen / Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

758 42 Updated Apr 6, 2026

CharlesYu2000 / PCGU-UnlearningBias

Python 17 2 Updated Nov 7, 2023

Lingzhi-WANG / KGAUnlearn

Python 19 Updated Sep 10, 2023

luka-group / PaCo

This is a code repository for PaCo (Preconditions Attributed to Commonsense Knowledge) @EMNLP-Findings 2022

2 Updated Aug 30, 2023

goldengua / Counterfactual_Inference_LM

Python 11 Updated May 18, 2023

vickywu1022 / OntoProbe-PLMs

Repo for outstanding paper@ACL 2023 "Do PLMs Know and Understand Ontological Knowledge?"

Python 33 2 Updated Oct 16, 2023

sail-sg / lorahub

[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

Python 669 42 Updated Jul 22, 2024

ellaneeman / disent_qa

This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.

Python 16 2 Updated Mar 20, 2023

yasumasaonoe / entity_knowledge_propagation

Python 17 2 Updated Aug 2, 2023