NFDS

This repository conatins the implementation of the paper "Navigating Towards Fairness with Data Selection," accepted at AAAI 2025.

Abstract

Machine learning algorithms often struggle to eliminate inherent data biases, particularly those arising from unreliable labels, which poses a significant challenge in ensuring fairness. Existing fairness techniques that address label bias typically involve modifying models and intervening in the training process, but these lack flexibility for large-scale datasets. To address this limitation, we introduce a data selection method designed to efficiently and flexibly mitigate label bias, tailored to more practical needs. Our approach utilizes a zero-shot predictor as a proxy model that simulates training on a clean holdout set. This strategy, supported by peer predictions, ensures the fairness of the proxy model and eliminates the need for an additional holdout set, which is a common requirement in previous methods. Without altering the classifier's architecture, our modality-agnostic method effectively selects appropriate training data and has proven efficient and effective in handling label bias and improving fairness across diverse datasets in experimental evaluations.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
dataset		dataset
README.md		README.md
filter_function.py		filter_function.py
readme		readme
reducible_loss.py		reducible_loss.py
run.sh		run.sh
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

NFDS

Abstract

About

Uh oh!

Releases

Packages

Languages

co234/NFDS

Folders and files

Latest commit

History

Repository files navigation

NFDS

Abstract

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages