Fairness between African Americans (AA) and white Americans (WA) in RSFC-based behavioral prediction using the HCP dataset

Reference

Jingwei Li, Danilo Bzdok, Jianzhong Chen, Angela Tam, Leon Qi Rong Ooi, Avram J. Holmes, Tian Ge, Kaustubh R. Patil, Mbemba Jabbi, Simon B. Eickhoff, B.T. Thomas Yeo*, Sarah Genon*, (2022), Cross-ethnicity/race generalization failure of behavioral prediction from resting-state functional connectivity, Science Advances, 8(11):eabj1812.

Background

Algorithmic biases that favor majority populations pose a key challenge to the application of machine learning for precision medicine. Here, we assessed such bias in prediction models of behavioral phenotypes from brain functional magnetic resonance imaging. We examined the prediction bias using two independent datasets (pre-adolescent versus adult) of mixed ethnic/racial composition. When predictive models were trained on data dominated by white Americans (WA), out-of-sample prediction errors were generally higher in African Americans (AA) than for WA. This bias towards WA corresponds to more WA-like brain-behavioral association patterns learned by models. When models were trained on AA only, compared to training only on WA or an equal number of AA and WA participants, AA prediction accuracy improved but stayed below that for WA. Overall, the results point to the need for caution and further research regarding the application of current brain-behavior prediction models in minority population.

Usage

First, this repository relies on multiple utility functions in the Computational Brain Imaging Group repository (CBIG; https://github.com/ThomasYeoLab/CBIG), e.g. kernel ridge regression package. Please follow the configuration instructions of CBIG repository before you use the current repository. Also, make sure you have the HCP csv files prepared on your devices.

After that, this repository should be used as the following steps:

Run HCP_addpath when everytime you open a new matlab session, to add all subfolders of the current repository into your matlab paths.
Follow the README in match_split folder to find matched AA and WA pairs, and split all subjects into multiple folds to facilitate the nested cross-validation procedure for later steps.
Follow the README in KRR folder to run kernel ridge regression in a nested cross-validation manner as split in step 1.
Follow the README in AAvsWA folder to calculate the out-of-sample prediction accuracy of matched AA and WA, and test the significancy of the accuracy difference.
Follow the README in plot folder to plot the accuracy differencies.

Name		Name	Last commit message	Last commit date
Latest commit History 76 Commits
AAvsWA		AAvsWA
FC_no_censor		FC_no_censor
FC_subsample		FC_subsample
Interpretation		Interpretation
KRR		KRR
LRR		LRR
command_history		command_history
elasticNet		elasticNet
external_packages		external_packages
match_split		match_split
others		others
plot		plot
HCP_addpath.m		HCP_addpath.m
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Fairness between African Americans (AA) and white Americans (WA) in RSFC-based behavioral prediction using the HCP dataset

Reference

Background

Usage

About

Uh oh!

Releases

Packages

Languages

License

jingwei-li/Unfairness_HCP_process

Folders and files

Latest commit

History

Repository files navigation

Fairness between African Americans (AA) and white Americans (WA) in RSFC-based behavioral prediction using the HCP dataset

Reference

Background

Usage

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages