AIM-AHEAD AI/ML in Biomedical Research and Clinical Practice That Embodies Ethics and Equity (ABC-EE)

Call for Proposals

Key Dates:

Solicitation Release Date: April 26, 2024
Application Due Date: June 16, 2024 (11:59 EDT)
Application Review Date: July 26, 2024
Earliest Start Date: September 16, 2024

Informational Webinar:

View the informational webinar recording and presentation on the AIM-AHEAD AI/ML in Biomedical Research and Clinical Practice That Embodies Ethics and Equity (ABC-EE) program, which addresses the program as a whole and the application process.

Click here to view the webinar recording.

Click here to view the webinar presentation deck.

Questions:

Please refer to the Frequently Asked Questions document before creating a help-desk ticket.

AI/ML in Biomedical Research and Clinical Practice FAQs

If your question is not answered in the above FAQ document, please create a help-desk ticket using the link below.

https://helpdesk.aim-ahead.net/ticket/create/ABC

Issued by

Artificial Intelligence/Machine Learning Consortium to Advance Health Equity and Researcher Diversity (AIM-AHEAD) program

Overview

The goal of the AIM-AHEAD AI/ML in Biomedical Research and Clinical Practice That Embodies Ethics and Equity (ABC-EE) program is to support multidisciplinary teams in the domains of biomedical research and clinical applications to develop and/or use AI/ML technologies that align with and/or extend the AIM-AHEAD ethics and equity principles [1]. This program will support teams at any stage of development, including those creating and/or validating frameworks for reasoning about the creation of AI/ML tools, on application of such tools in biomedical or behavioral health research, and on their translation into clinical care in a socially responsible manner. Topics of interest for this program include but are not limited to: the co-creation of data and/or AI/ML tools with target populations; the development of privacy enhancing technologies to support the collection, sharing, and/or use of such diverse data in AI/ML; and investigations into the trustworthiness and reliability of AI/ML, including emerging technologies such as generative AI and chatbots.

Applicants are expected to propose projects that address emerging issues to reduce health disparities while improving healthcare and outcomes in populations that experience health disparities in the United States.

[1] Hendricks-Sturrup R, et al. Developing ethics and equity principles, terms, and engagement tools to advance health equity and researcher diversity in AI and machine learning: modified delphi approach. JMIR AI. 2023; 2: e52888.

Background

The National Institutes of Health’s AIM-AHEAD program has established mutually beneficial, coordinated, and trusted partnerships to promote broad participation in the development of AI/ML models and to improve the capabilities of this emerging technology, beginning with electronic health records (EHR) and extending to other diverse data to address health disparities and inequities. Led by the AIM-AHEAD Coordinating Center, the consortium of institutions and organizations maintain as its core mission to serve underserved groups impacted by health disparities.

The rapid increase in the volume of data (e.g., in EHRs, public health activities, and biomedical and social science research) about factors that contribute to health and disease presents exciting opportunities for developing data science approaches (e.g., AI/ML methods) for improving healthcare. Many challenges hinder more widespread use of AI/ML technologies, such as the cost, capability for and acceptability of widespread application, and access to appropriate infrastructure, resources, and training. Additionally, without focuses on including representative data and promoting diversity in the AI/ML field, the field runs the risk of creating and perpetuating harmful biases in its practice, algorithms, and outcomes, thus fostering continued health disparities and inequities and undermining trust. Many underrepresented and underserved communities, which are often disproportionately affected by diseases and health conditions, have the potential to contribute expertise, perspectives, data, recruitment and engagement strategies, and cutting-edge science to inform the field on the most urgent research questions. Many communities and institutions, however, often lack financial, infrastructural, and data science training capacity to apply AI/ML approaches to research questions of interest to them. This program also seeks to build trust with individuals and groups affected by AI/ML.

AIM-AHEAD is committed to leveraging the potential of AI/ML to accelerate the pace of biomedical innovation, while prioritizing and addressing health disparities and inequities. Tackling the complex drivers of health disparities and inequities requires building a culture of innovative and transdisciplinary collaboration that transcends scientific and organizational silos and the status quo. Mutually beneficial and trusted partnerships can be established to encourage broad participation in AI/ML modeling and application and to improve capabilities for data curation and this emerging technology.

Please note that consistent with NIH practice and applicable law, funded programs may not use the race, ethnicity, or sex of prospective program participants or faculty as an eligibility or selection criteria. The race, ethnicity, or sex of candidates will not be considered by NIH in the application review process or when making funding decisions.

Purpose/Objectives

A major commitment of AIM-AHEAD is ensuring that AI/ML address health disparities and inequities and increase acceptance of these tools to improve healthcare access and health outcomes for populations that experience health disparities. Achieving these outcomes requires adherence to ethics and equity principles and building capacity among investigators and communities from diverse backgrounds. The primary objectives of this solicitation are to support: (a) multidisciplinary research projects that assess and enhance the role of ethics and equity in AI/ML for health and (b) projects that develop and implement strategies to overcome ethics and equity challenges.

Through this program, AIM-AHEAD envisions research projects that will help AI/ML research collaborators to:

Better understand how decisions are made throughout the development of AI/ML, focusing on:
- How fully issues of ethics and equity are identified and addressed at all phases;
- The efficacy and timeliness of engagement with participants and communities in promoting ethics and equity and decreasing health disparities; and
- Identifying and overcoming institutional challenges in conducting AI/ML research;
Better understand how AI/ML can be used to detect and mitigate bias in AI/ML models to avoid perpetuating, automating, and/or exacerbating bias and unfairness and reduce health disparities; and
Improve healthcare and health outcomes.

Proposals to this program must include a plan for ensuring work is guided by a concern for human rights and social impacts, and attention to ethical, legal, and socio-economic implications of AI/ML, including but not limited to: (1) biases in datasets, algorithms, and applications; (2) issues related to identifiability and privacy; (3) impacts on disadvantaged or marginalized groups; (4) health disparities; and (5) unintended or adverse social, individual, and community consequences of development of implementation and research on AI/ML solutions.

Applicants are encouraged to propose projects that consider ethical issues and potential unintended adverse outcomes of bias and other issues in applying AI/ML approaches to improving health outcomes. It is expected that proposals will most likely be successful if they are engaged in and examining existing AI/ML research, whether collecting data prospectively or using retrospective data given the time frame. Applications that address freestanding normative and/or legal analyses that promise to provide new knowledge are within the scope of this program as well. Examples of the types of research projects that may be supported include but are not limited to:

Examining the extent and impact of community engagement on data collected and/or research design for the creation or application of AI/ML
Examining the extent and impact of inclusive research teams on research design and conduct
Developing strategies for continual assessment of ethical and equity issues throughout the design, development, and use of AI/ML
Documenting institutional challenges or barriers that arise during the course of research on AI/ML and, if applicable, steps that you took to address them
Developing freestanding normative or legal analyses, for example,
- To what extent should community engagement be involved with existing datasets including the collection and interpretation of data?
- Evaluating evolving policy guidance regarding regulation of AI/ML or privacy protection on federal and/or state levels
Developing human-centered and/or values-sensitive approaches to the creation and application of AI/ML
Creating and/or evaluating methodologies that detect and/or mitigate biases in AI/ML

The NIH defines health disparities as differences in the incidence, prevalence, morbidity, mortality, and burden of diseases and other adverse health outcomes among specific population groups. Research projects should focus on one or more NIH-designated health disparity populations in the United States. These population groups include racial and ethnic minorities (African Americans, American Indians, Alaska Natives, Asian Americans, Hispanic Americans, Native Hawaiians, and other U.S. Pacific Islanders, as well as subpopulations of all of these racial/ethnic groups), socioeconomically disadvantaged individuals, sexual and gender minorities, underserved rural communities, and people with disabilities (see https://www.nimhd.nih.gov/about/strategic-plan/nih-strategic-plan-definitions-and-parameters.html).Research projects that include populations that identify across more than one subpopulation with and/or focus on intersectional effects on health disparities are encouraged.

Eligible Organizations

Consistent with the goals of the AIM-AHEAD Coordinating Center, the following types of Higher Education and other institutions/organizations are highly encouraged to apply for support:

Higher Education Institutions

Public/State Controlled Institutions of Higher Education
Private Institutions of Higher Education

The following types of Higher Education Institutions are always encouraged to apply for NIH support as Public or Private Institutions of Higher Education:

Hispanic-serving Institutions
Historically Black Colleges and Universities (HBCUs)
Tribally Controlled Colleges and Universities (TCCUs)
Alaska Native and Native Hawaiian Serving Institutions
Asian American Native American Pacific Islander Serving Institutions (AANAPISIs)

Nonprofits Other Than Institutions of Higher Education

Nonprofits with 501(c)(3) IRS Status
Nonprofits without 501(c)(3) IRS Status
Tribal health and/or human service organizations or Tribally derived institutions (Urban Indian Health Organizations, Tribal Epidemiology Centers)
- Small Businesses
- For-Profit Organizations (Other than Small Businesses)

To be eligible for this CFP, the applicant institution must be a domestic institution located in the United States and its territories which meet both of the following requirements:

Has received an average of less than $50 million per year total costs of NIH support for the past three fiscal years.
This requirement can be checked online on https://report.nih.gov/award/index.cfm.
Follow this quick tutorial here to check your institutional NIH support.
The administrative review (before scientific review) will check this requirement.
Except for community organizations, nonprofits and non-academic institutions, has a documented historical mission to educate students from any of the populations that have been identified as underrepresented in biomedical, behavioral and social science research as defined by the National Science Foundation NSF, see http://www.nsf.gov/statistics/wmpd/) (i.e., African Americans or Blacks, Hispanic or Latino Americans, American Indians, Alaska Natives, Native Hawaiians, U.S. Pacific Islanders, and persons with disabilities) or has a documented historical track record of:
- Recruiting, training and/or educating, and graduating underrepresented students as defined by NSF (see above),
- Working with community stakeholders (e.g., Community-based organizations, Nonprofits (with or without 501(c)(3) IRS status, Faith-based organizations, Healthcare Providers, Health Systems, Small businesses, Large businesses, start-ups.) that have historically not participated in biomedical, behavioral, and social sciences research in the areas of AIM/ML

Community organizations, Nonprofits and Non-academic institutions are strongly encouraged to apply but should have a documented interest in working with health disparity populations. Before applying, these organizations must be registered with System for Award Management (SAM; see https://sam.gov/content/home) and must maintain active SAM registration throughout the award period.

Foreign Institutions

Non-domestic (non-U.S.) Entities (Foreign Institutions) are not eligible to apply.

Non-domestic (non-U.S.) components of U.S. Organizations are not eligible to apply.

Foreign components, as defined in the NIH Grants Policy Statement, are not allowed.

Required Registrations

Applicant organizations

Applicant organizations must complete and maintain the following registrations as described in the SF 424 (R&R) Application Guide to be eligible to apply for or receive an award. All registrations must be completed prior to the application being submitted. Registration can take 6 weeks or more, so applicants should begin the registration process as soon as possible. The NIH Policy on Late Submission of Grant Applications states that failure to complete registrations in advance of a due date is not a valid reason for a late submission.

System for Award Management (SAM)Applicants must complete and maintain an active registration, which requires renewal at least annually. The renewal process may require as much time as the initial registration. SAM registration includes the assignment of a Commercial and Government Entity (CAGE) Code for domestic organizations which have not already been assigned a CAGE Code.
- NATO Commercial and Government Entity (NCAGE) Code Foreign organizations must obtain an NCAGE code (in lieu of a CAGE code) in order to register in SAM.
- Unique Entity Identifier (UEI) - A UEI is issued as part of the SAM.gov registration process. The same UEI must be used for all registrations, as well as on the grant application.
eRA Commons- Once the unique organization identifier is established, organizations can register with eRA Commons in tandem with completing their Grants.gov registration; all registrations must be in place by time of submission. eRA Commons requires organizations to identify at least one Signing Official (SO) and at least one Program Director/Principal Investigator (PD/PI) account in order to submit an application.
Applicants must have an active SAM registration in order to complete the Grants.gov registration.

Program Directors/Principal Investigators (PD(s)/PI(s))

All PD(s)/PI(s) must have an eRA Commons account. PD(s)/PI(s) should work with their organizational officials to either create a new account or to affiliate their existing account with the applicant organization in eRA Commons. If the PD/PI is also the organizational Signing Official, they must have two distinct eRA Commons accounts, one for each role. Obtaining an eRA Commons account can take up to 2 weeks.

Application Submission Guidelines, Components, and Review Process

Application Milestones

Milestones	Deadlines
Application Open	04/26/2024
Application Submission Deadline	06/16/2024
Reviews Complete	08/30/2024
Notice of Award Dispatch	09/03/2024
Program Start	09/16/2024

Required Format:

Arial font and no smaller than 11 point; margins at least 0.5 inches (sides, top and bottom); single-spaced lines. Submit the application as a single Word or pdf document to the application portal here.

Required Elements of the Proposal

Title: The title should describe the project in the concise, informative language.
Project Summary/Abstract (1-page limit): Provide a succinct description of the proposed work including the project’s long-term objectives, and a description of the research design and methods for the entire AI/ML project.
Project Description: The project description should contain the following components adhering to the page limits.
Specific Aims (1 page): Provide a clear, concise summary of the aims of the work proposed and its relationship to your long-term goals. State the hypothesis to be tested.
Research Plan (5 pages)
Background and Significance: Sketch the background and problem of the statement leading to this proposal. Summarize important results outlined by others in the same field, critically evaluating existing knowledge. Identify gaps that this project is intended to fill. State concisely the importance and relevance of the research to AI and health disparities research. Describe the health disparity population and ethical issues of interest. Also, it is incumbent upon the applicant to make a clear link between the project and the mission of the AIM-AHEAD. The significance section will be assessed in terms of the potential impact on the AIM-AHEAD mission; this will be factored into the overall priority score as noted in the peer review criteria.
Preliminary Studies: Describe concisely previous work by the applicant related to the proposed research that will help to establish the experience and competence of the investigator to pursue the proposed project. Include pilot studies showing the work is feasible. (If none, so state.)

iii. Research Design and Methods: Description of proposed tests, methods or procedures should be explicit, sufficiently detailed, and well-defined to allow adequate evaluation of the approach to the problem. Describe any new methodology and how your research idea innovates or has an advantage over existing methods/applications and brings together diverse disciplines. Clearly describe the overall design of the study, with careful consideration to AI/ML and health disparities aspects of the approach or, and how your methods will control for bias and ethics, as well as how results will be analyzed. Include details of any collaborative arrangements that have been made. Applicants must explain how relevant social determinants of health and biological variables, such as sex, are factored into the research design, analysis and reporting. Furthermore, describe the infrastructure modality that would be used, data sources needed and used, compute infrastructure needed and used (see data and infrastructure section). Describe plan for ensuring work is guided by a concern for human and social impact and attention to ethical, legal, and social implications of AI/ML including but not limited to (1) ethics and equity, (2) biases in datasets, algorithms, and applications; (2) privacy, security, and trust; (3) impacts on disadvantaged or marginalized groups; (4) health disparities; and (5) unintended, adverse social, individual, and community consequences of research and development.

Data: Applicants are required to describe (1) reference(s) to the data under consideration and reasons for this choice; (2) the potential impact of scientific advances that could be made from AI/ML applications developed with the data; (3) proposed methods/data modalities to be used; (4) how the data will be made available to AI/ML applications and researchers, for example, through NIH repositories, NIH knowledge bases, or other data sharing resources including those appropriate for controlled access data; and (5) how the ethical implications of the data will be identified and addressed.
References Cited (maximum 3 pages):List only references cited in the Project Description or supplementary documents of the proposal.
Detailed Budget and Budget Justification: Funding Amounts

In order to ensure that the AIM-AHEAD –Coordinating Center is able to fund substantial projects, the budget for the projects will be in the range of $150,000 to $250,000 in total costs. The total cost includes indirect cost allowances. It is possible that the budget for the projects could be higher (with appropriate justification) or lower than this estimate based on the resources needed to complete the project.

Funding Period

Applicants may request up to 1 year of funding support commensurate with project scope.

Budget: Year One

Personnel	Base Salary	Months Effort	Requested Salary	Fringe	Total Funds Requested
Principal Investigator:	$		$	$	$
MPI:	$		$	$	$
Co-Investigator:	$		$	$	$
Co-Investigator:	$		$	$	$
Research Coordinator:	$		$	$	$
Research Assistant	$		$	$	$
Subtotal: Personnel	$		$	$	$

Maintenance & Operation					Funds Requested
Subject Payments					$
Vertebrate Animals					$
Consumable Supplies					$
Equipment					$
Travel					$
Subtotal: Maintenance & Operation					$

Total Direct Costs					$
Indirect Costs/F&A (xx% of Direct Costs)					$
Total Costs					$

Allowable costs:

Academic Institutions: The award may be used for salary and fringe benefits of the Co-Principal Investigators, collaborating investigator(s), and other participants with faculty appointments, commensurate with percent effort, and for project-related expenses, such as salaries of technical personnel essential to the conduct of the project, supplies, equipment, computers/electronics, travel (including international travel), volunteer subject costs, data management, publication costs, etc. Tuition support for graduate students may also be requested.
Other non-Academic Institutions/Organizations: The award may be used for salary and fringe benefits of the Principal Investigators, collaborating investigator(s), and other participants consistent with salary structure, and for project-related expenses, such as salaries of technical personnel essential to the conduct of the project, supplies, equipment, computers/electronics, travel (including international travel), volunteer subject costs, data management, publication costs, etc.

The budget justification should entail a narrative explanation of each of the components of the costs for performing proposed work. The budget explanations should focus on how each budget item is required to achieve the project’s objectives and how the estimated costs in the budget were calculated.

6. Facilities, Equipment and Other Resources: Facilities, Equipment and Other Resources should describe the resources needed and those that are available to for the proposed research project. Collaborators should also provide this information. Applicants should explain how well the scientific environment in which they will conduct their research contributes to the probability of success.

7. Senior Personnel Documents:

Biographical Sketches: Biographical sketches are required for the PI, any co-PIs, and each of the participating Senior Personnel listed in the Project Description (including postdocs, staff, and /or students). All biographical sketches submitted in response to this solicitation are expected to follow the NIH format. Biosketch Format Pages, Instructions and Samples | grants.nih.gov
Current and Pending Support: Include any funding in the last three years from NIH or AIM-AHEAD or Federal sources.
Collaborators and Other Affiliations Information.

8. Human Subjects Research.

Does any of the proposed research involve human subjects and/or data (Including EHR and/or repository data)? Describe how such subjects and/or data are or are not regulated by the Common Rule
Does your research require a Data Use Agreement or other agreement(s) for use of data?
If yes, what is the plan to execute these agreements among all partners/collaborators?

Note: Definition of Human subject: A living individual about whom an investigator (whether professional or student) conducting research: (1) obtains information or biospecimens through intervention or interaction with the individual, and uses, studies, or analyzes the information or biospecimens; or (2) obtains, uses, studies, analyzes, or generates identifiable private information or identifiable biospecimens. [45 CFR 46.102(e)].

Required Training/Certifications

Awardees will be expected to complete Human Subjects Research training (e.g., CITI or the free training offered by OHRP found at https://www.hhs.gov/ohrp/education-and-outreach/human-research-protection-training/human-research-protection-foundational-training/index.html) and any other training required by datasets they intend to use in their projects.

9. Other Supplementary Documents:

Institutional Need and Support Statement (up to 3 pages). This statement must be signed by the leadership of the institution (e.g., University president, provost or college/school dean as applicable to show support for the partnership and commitment to additional resources necessary to ensure that these partnerships will have the maximum sustainability. This letter should include
- Assessment of the institution’s current AI research, education as well as data and infrastructure capacity;
- A statement of commitment of institutional support for the proposed activities. List the specific resources, space, protected time, etc. These statements should also identify the specific number of positions that are wholly dedicated to AI data and infrastructure under the partnership. In addition, if American Indians are involved, a Letter of Commitment from the Tribal Nation Leader is required.
Letter from Sponsor Office: This letter should indicate that your institution has received an average of less than $50 million per year total costs of NIH support for the past three fiscal years.
Results from Prior NIH/AIM-AHEAD Support.If applicable, you must submit the Results from Prior NIH/AIM-AHEAD Support as a supplementary document, rather than as part of your Project Description. This allows you to maximize the use of the page allowance to describe the proposed activities.
Cloud Computing Resources (if applicable).

Project Personnel and Partner Organizations (required).

Review Process

A Review Committee composed of AIM-AHEAD Consortium members will oversee the following steps and criteria to evaluate proposals and recommend award recipients to NIH, which makes the final decision on awards.

Proposals will be reviewed with respect to their consideration of:

Human-centered design of AI/ML
Multidisciplinary approaches to the creation and application of AI/ML
Community engagement
Access to data, algorithms, and software
Health disparities
Algorithmic fairness
Privacy, security, and trust
Attention to ethics and equity
Legal issues where appropriate

Assessment of scientific merit will be guided by the specific questions related to the proposed project:

How well does the project collect empirical evidence about the application of ethics and equity principles in the design, development, and/or use of AI/ML and their impact on addressing health disparities?
How well does the project collect information about challenges, inclusive of ethics and equity issues, in the design, development, and/or use of AI/ML and their impact on addressing health disparities in their institution?
Does the team describe plans to identify, develop, and evaluate strategies to overcome any challenges observed?
To what extent does the proposal appropriately consider ethical, legal, and privacy considerations?
To what extent the proposal describes how the selected dataset(s) will contribute to addressing the research topic and approaches and/or creating tools that will benefit the functionality of the AIM-AHEAD ecosystem?
How well does the proposal clearly articulate the goals of the project and expected outcomes in terms of research products or other artifacts?
To what extent is the proposed approach reasonably likely to achieve the goals of the project? What is the likelihood of a successful outcome? Are the measures of success clearly defined?
Does the team have the necessary background and capabilities to accomplish the proposed work in the expected funding period?

A Review Committee composed of AIM-AHEAD Consortium members will evaluate proposals and recommend award recipients to NIH, which makes the final decision on awards.

Compliance review: Program staff will do an initial check of applications for compliance with requirements of this Call for Proposals. Non-compliant applications may be rejected or require remediation prior to Scientific Review and Programmatic Evaluation.

Scientific evaluation: A panel of reviewers will apply standard NIH scoring ranges to each application on each of the nine scientific merit criteria listed above.

Programmatic evaluation:

It is expected that approximately 10 awards will be assigned to the highest-scoring applications based on Scientific Merit and that the rest of the awards could be, but not necessarily will be, recommended for NIH consideration based on both Scientific Merit scores and Programmatic Evaluation considerations. The consortium will take steps to avoid any conflicts of interest with reviewers or voting MPIs. This is particularly important because this CFP allows for participation of AIM-AHEAD Hub or Core partners in the projects.

Applicants will receive their scores and notes from the Scientific Merit reviewers.

Awardee Expectations

Applicants are encouraged to propose projects that consider ethical issues and potential unintended adverse outcomes of bias in applying AI/ML approaches to improving health outcomes. It is expected that proposals will most likely be successful if they are engaged in and examining existing AI/ML research, whether collecting data prospectively or using retrospective data, given the time frame. Applications that address freestanding normative and/or legal analyses that promise to provide new knowledge are within the scope of this program as well. Therefore, awardees are expected to fulfill the following expectations:

Participation and Reports

Awardees will participate in regular awardee virtual meetings.
Awards will submit bimonthly reports and budget reports.
Awardees will participate in meetings with the AIM-AHEAD Coordinating Center to inform overall AIM-AHEAD data sharing / data access strategies.
Awardees will act as AIM-AHEAD community ambassadors to help onboard colleagues.
Awardees must be willing to attend and present the results of their work at future AIM-AHEAD events as well as volunteer to review applications for future AIM-AHEAD programs.
Awardees agree to have AIM-AHEAD promote the project online through websites, social media, and other communication channels.
All awardees will be expected to be involved in AIM-AHEAD activities during the year, including attending the in-person annual meetings.
Awardees must provide a summary of research status about milestones listed in the proposal, challenges faced and plans to overcome those challenges, usage of funds, and next steps.
By one month after the project end date, awardees must provide a final report of research findings, use of funds, and a list of publications, grant applications, articles, and conference talks emerging from the research.

Compliance and Governance

For all research projects involving human subjects research (including secondary data analysis), awardees are expected to submit their study for review to an IRB and obtain a determination or letter/response (See section Application Components section, subsection 10). **Even if the projects do not involve human subjects, a “letter of determination” is required. When a local IRB is not available, alternative options should be used.
Awardees are required to obtain any required Data Use/Sharing or Regulatory/Contractual Agreements for data needed for their research.
Any current or future data sharing must follow relevant governance documents and agreements.
Awardees must comply with all applicable Federal statutes (such as those included in appropriations acts), regulations, and policies in addition to their institutional and state policies to receive research funding.

Note: No funds can be drawn down from NIH payment system, and no obligations may be made by any site of AIM-AHEAD’s coordinating center for research for any period not covered by both an OHRP-approved Federal Wide Assurance and approval from the IRB, as required, consistent with 45 CFR Part 46 and any NIH required policies for research involving human subjects.

Ethical Use of Data

The Applied Ethical AI (AEAI) Subcore was established to provide support in surfacing, reasoning about, and resolving ethical issues in the AIM-AHEAD program. It is expected that all projects will work with the AEAI on the development of an initial ethics review within the first 4-6 weeks of support and will commit to participating in two AEAI-supported ethics forums during the course of their sponsorship. These forums are designed to highlight and address ethical challenges in the development and implementation of AI and machine learning and range from open office hours to moderated discussions on hot topics in ethics and AI/ML. Topics that may be addressed during these forums include but are not limited to:

Investigation into the ethical grounding of the project and the plans for embedding ethical reasoning into the project.

Consultation on how to test for algorithmic fairness and data bias with respect to potentially sensitive variables (e.g., socio-demographics)

Detection of potential data privacy concerns and alternatives for their resolution

Sharing of research products

AIM-AHEAD Service Workbench Data Repository

The AIM-AHEAD Service Workbench platform https://aim-ahead.net/swb is a FISMA (Federal Information Security Modernization Act) secure environment working towards Authority to Operate (ATO) status that can serve as a repository for the data sharing needs of AIM-AHEAD projects and members. The AIM-AHEAD Service Workbench Centralized FISMA secure Infrastructure can provide secure hosting of data, management of access controls with appropriate authentication and authorization and continuous monitoring of the centralized environment that ensures system cybersecurity. More information in this document from NIH on how AIM-AHEAD repository was established: Establishing a Controlled Access Repository for AIM-A H E AD

AIM-AHEAD will assist awarded investigators to identify an appropriate data repository, prepare data for transfer, and meet requirements of the data repository.

Questions

Please direct questions regarding the ABC-EE program to applicant support at Carolyn.diehl@vumc.org or Laurie.l.novak@vumc.org.

Answers to frequently asked questions are posted here.

Awardee Resources

Glossary from Developing Ethics and Equity Principles, Terms, and Engagement Tools to Advance Health Equity and Researcher Diversity in AI and Machine Learning: Modified Delphi Approach, R. Hendricks-Sturrup, M. Simmons, S. Anders, K. Aneni, E. W. Clayton, J. Coco, et al., JMIR AI 2023 Vol. 2 Issue 1 Pages e52888

No.	Glossary Term	AIM-AHEAD Definition
1	Ethnicity	Distinct patterns of language, lifestyle, illness, and health beliefs encountered among an individual or representative population, regardless of race, and that may subject the individual or population to bias or discrimination.
2	Race	A social construct or assumption based on patterns in an individual’s or representative population’s language, lifestyle, and/or health beliefs, and immutable characteristics, such as skin tone/color or hair texture, regardless of immigration status, socioeconomic status, genetic ancestry, or geographic origin, which may subject the individual’s or population to bias, structural racism, and/or discrimination that would warrant corrective anti-racism action(s).
3	Bias	Systematic error in information originating, gathering, or assessment activities, leading to selecting or encouraging one outcome or answer over others, which can result in human decisions and values that echo societal or historical inequities, and/or produce inconclusive or limited assumptions about the broader population.
4	Equity	Equity is fairness and justice in policy, practice, and opportunity designed to address the distinct challenges of non-dominant social groups, with an eye to progressive outcomes. Health equity is the state in which everyone has the opportunity to attain full health potential and no individual is disadvantaged from achieving this potential because of social position or any other socially defined circumstance.
5	Algorithmic Bias	Systematic and repeated errors in the collection and consideration of a variety of factors, including but not limited to the design of the algorithm; unintended or unanticipated use or decisions relating to the way data are collected, represented, or used; lack of sensitivity to identity factors that contribute to bias in the evaluation of the algorithm, or misappropriation of the algorithm through miscommunicating or misunderstanding its limitations.
6	Diversity	The wide variety of shared and different personal and group characteristics among human beings. There are many kinds of diversity, including gender, sexual orientation, class, age, country of origin, education, religion, geography, physical or cognitive abilities, or other characteristics. Valuing diversity means recognizing differences between people, acknowledging that these differences are a valued asset, and striving for diverse representation as a critical step towards equity.
7	Inclusive	Avoiding bias by providing equitable and open access to opportunities and resources for engagement. This can be accomplished, for example, by enforcing fairness in the data collection methods, enforcing fairness in the assignment of labels, developing explainable, transparent, and interpretable models, having diverse teams monitoring models and looking for biases and eliminating them.
8	Fairness	Intent to promote nondiscrimination and population representation when assessing a group’s eligibility for a benefit or penalty. This is particularly important given the statistical likelihood that AI/ML systems could produce discriminatory outputs once algorithms are implemented across one or more datasets.
9	Representative	An individual or body chosen or appointed to act or speak for an individual, population, or subpopulation sharing a set of features or characteristics, including but not limited to gender, race, and/or sexual orientation.
10	Representative Sample	A subset of a population that reflects the characteristics of the entire population from which it has been selected.
11	Gender Identity	An individual’s sense of oneself as male, female, or something else. When an individual’s gender identity and biological sex are not congruent, the individual may identify along the transgender spectrum. An individual may choose to change their gender one or more times. Varying cultural indicators of gender, such as clothing choice, speech patterns, and personality traits relate to gender but are not acceptable means to determine another’s gender identity. The change in an individual’s gender can be used to abuse, discriminate against, and misrepresent individuals and groups.
12	Sexual Orientation	An individual’s capacity for attraction to and sexual activity with the same or different sex. An individual’s sexual orientation is indicated by one or more of the following: how an individual identifies their own sexual orientation, an individual’s capacity for experiencing sexual and/or affectional attraction to people of the same and/or different gender, and/or an individual’s sexual behavior with people of the same and/or different gender. Sexual orientation incorporates three core ideas: consensual human relationships—sexual, romantic, or both—the biological sex of an individual’s actual or potential relationship partners, and enduring patterns of experience and behavior. Sexual minorities, or people whose sexual orientation does not conform to heteronormative cultural expectations, are vulnerable to violence and discrimination.

AIM-AHEAD consists of its Leadership Core made up of Regional Hubs and the following Cores:

Data sources and accessibility

Proposed projects may either identify their own datasets and/or be approved for use of existing AIM-AHEAD resources, including the OCHIN Community Health Equity Database on AIM-AHEAD Service Workbench or MedStar Health EHR through the AIM-AHEAD Data Bridge (AADB).

Requirements Prior to Obtaining Access to AIM-AHEAD Curated Data (OCHIN/AADB)

Mandatory Human Subjects Research Training such as CITI “Human Research (Protection of Human Subjects)” and “Responsible Conduct of Research.”
Data consult with the data source (i.e., OCHIN or MedStar) to finalize data request within 30 days of award.
Submission for IRB approval/determination within 60 days of award.
Signed data use agreement and IRB approval/determination within 90 days of award.

Dataset options (more information available on below datasets/cohorts):

Data set	Brief Description	Data Allowed	Size	Analysis platform tools
A customized subset from OCHIN Community Health Equity Database	EHR data from Underserved communities	HIPAA Limited dataset, individual-patient level data with dates and geographic indicators if needed for research	A customized subset will be created for the research question of awarded fellow from over 6 million records	AIM-AHEAD Service Workbench
Curated data from the MedStar Health EHR	EHR data from hospital system network with 31% African American patient representation	Multiple curated dataset options (further detail on website) pre-curated or custom curated de-identified EHR, Limited Dataset, Full PHI EHR dataset, Imaging, Select clinical notes, select genomics data, synthetic data	Pre-curated datasets and custom-curated datasets of varying sizes Curated from the EHR with over 5 million patient records	AIM-AHEAD Data Bridge
60+ studies from NHLBI BioData Catalyst	Selected large-scale cohorts related to heart, lung, blood and sleep disorders. Includes both prospective clinical studies and associated genomic TOPMED data.	De-identified dataset. Including individual level genomic (TOPMED full genomes) and clinical datasets.	Additional description List of studies: 60+ studies are available to choose from	NHLBI BioData Catalyst PIC-SURE and Seven Bridges Platforms
Selected 15 Open datasets on AWS	A variety of datasets available including clinical and genomic data	Public data, and controlled access data (depends on dataset)	Selected 15 Open datasets on AWS	AIM-AHEAD Service Workbench
NIH All of Us	The All of Us Research Program is building one of the largest biomedical data resources of its kind.	The All of Us Research Hub stores health data from a diverse group of participants from across the United States.	Additional descriptions 616,000+ Participants 360,000+ Electronic Health Records 444,000+ Biosamples Received	All of Us Researcher Workbench

Dataset Infrastructure

Infrastructure of 1) OCHIN and 2) AWS Open Datasets will use AIM-AHEAD Service Workbench (SWB). SWB is a tool for any cloud computing data analysis that will be conducted throughout this fellowship. More information is available here.

Infrastructure for BioData Catalyst will use the NHLBI BioData Catalyst PIC-SURE and Seven Bridges Platforms. An NIH eRA Commons ID in order to access these resources.

Infrastructure for AIM-AHEAD Data Bridge will connect to a Windows Virtual Machine (VM).

Infrastructure for All of Us Research Program will use the All of Us Workbench which is a cloud-based platform where registered researchers can access Registered and Controlled Tier data.

OCHIN Community Health Equity Database

Applicants may want to explore the OCHIN Community Health Equity Database for feasibility assessment as they write their proposals. OCHIN provides the means to do so through the use of Cohort Discovery, a web-based software tool for obtaining counts of patients matching user-specified inclusion/exclusion criteria.

To gain access to Cohort Discovery, AIM-AHEAD program applicants must have completed and be up to date with standard training in Human Subjects Research and Responsible Conduct of Research such as those offered by the CITI Program. To request access to OCHIN’s Cohort Discovery, AIM-AHEAD program applicants can complete the OCHIN i2b2 End User Agreement.

AIM-AHEAD DataBridge

Applicants whose research question can be addressed with AIM-AHEAD Data Bridge (AADB) data may choose to schedule a pre-award data consultation with AADB to discuss their research question with the AADB team and methodologist. The AADB can perform a preliminary cohort discovery to provide estimated sample sizes for custom dataset requests.

Available EHR data including (but not limited to): demographics, problems list, surgical history, observations, medical history, procedures, prescriptions, billing, provider, clinical notes, clinical unit/department, allergies, images, social history, diagnoses, admissions, encounters, social history, social determinants of health, device implants

NHLBI BioData Catalyst studies

Over the last several decades, NHLBI has invested in creating a significant resource for research and development by supporting the creation of many observational, epidemiological, and longitudinal datasets related to heart, lung, blood and sleep phenotypes, with the aim of uncovering insights that may be leveraged toward novel therapeutic, interventional, or preventive strategies resulting in improved patient outcomes. New technologies and favorable cost trajectories have enabled detailed characterization of these study participants, including whole genome sequencing (and other omics) and imaging on hundreds of thousands of participants. Together, this data coupled with animal and cellular models, increase opportunities for data-driven translational science. We have fully entered the “Big Data” arena, in which we encounter both unprecedented opportunities as well as challenges. Current paradigms for analyzing and combining these datasets are limited by both practical and conceptual constraints. The NHLBI BioData Catalyst, is a novel ecosystem of platforms and tools to enable and accelerate the mining of these rich datasets from diverse populations. Full Description (including patient, variables, and sample counts ) is available here.

Acronym	Name	Study Focus	Study Design
FHS	Framingham Cohort	Cardiovascular Disease	Prospective Longitudinal Cohort
JHS	Jackson Heart Study (JHS) Cohort	Cardiovascular Disease	Prospective Longitudinal Cohort
CARDIA	Coronary Artery Risk Development in Young Adults (CARDIA)	Cardiovascular Disease	Prospective Longitudinal Cohort
ARIC	NHLBI TOPMed - NHGRI CCDG: Atherosclerosis Risk in Communities (ARIC)	Cardiovascular Disease	Case-Control
WHI	Women's Health Initiative Clinical Trial and Observational Study	Women's Health	Prospective Longitudinal Cohort
ACTIV4a	A Multicenter, Adaptive, Randomized Controlled Platform Trial of the Safety and Efficacy of Antithrombotic Strategies in Hospitalized Adults with COVID-19 (ACTIV4A)	COVID-19	Interventional
ACTIV4b	COVID-19 Positive Outpatient Thrombosis Prevention in Adults Aged 40-80	COVID-19	Interventional
AMISH	NHLBI TOPMed: Genetics of Cardiometabolic Health in the Amish	Cardiovascular Disease	Family/Twin/Trios
BABYHUG	Hydroxyurea to Prevent Organ Damage in Children with Sickle Cell Anemia (BABY HUG) Phase III Clinical Trial and Follow-Up Observational Studies I and II	Sickle Cell Anemia	Clinical Trial
BAGS	NHLBI TOPMed: The Genetics and Epidemiology of Asthma in Barbados	Asthma	Family/Twin/Trios
C3PO	Clinical-trial of COVID-19 Convalescent Plasma in Outpatients	COVID-19	Clinical Trial
CATHGEN	CATHeterization GENetics (CATHGEN)	Coronary Disease	Cross-Sectional
CCAF	The Cleveland Clinic Foundation's Lone Atrial Fibrillation GWAS Study	Atrial Fibrillation	Case Set
CFS	NHLBI Cleveland Family Study (CFS) Candidate Gene Association Resource (CARe)	Sleep Apnea Syndromes	Prospective Longitudinal Cohort
COPDGENE	Genetic Epidemiology of COPD (COPDGene)	Pulmonary Disease, Chronic Obstructive	Case-Control
CRA	NHLBI TOPMed: The Genetic Epidemiology of Asthma in Costa Rica	Asthma	Family/Twin/Trios
CSSCD	Cooperative Study of Sickle Cell Disease (CSSCD)	Sickle Cell Disease	Clinical Trial
DHS	NHLBI TOPMed: Diabetes Heart Study (DHS) African American Coronary Artery Calcification (AA CAC)	Cardiovascular Disease	Cross-Sectional
ECLIPSE	Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints (ECLIPSE)	Pulmonary Disease, Chronic Obstructive	Case-Control
EOCOPD	NHLBI TOPMed: Boston Early-Onset COPD Study	Chronic Obstructive Pulmonary Disease	Family/Twin/Trios
GALAII	Genes-Environments and Admixture in Latino Asthmatics (GALA II) Study	Lung Diseases	Case-Control
GENESTAR	GeneSTAR (Genetic Study of Atherosclerosis Risk) NextGen Consortium: Functional Genomics of Platelet Aggregation Using iPS and Derived Megakaryocytes	Platelet Aggregation	Prospective Longitudinal Cohort
GENOA	Genetic Epidemiology Network of Arteriopathy (GENOA)	Hypertension	Prospective Longitudinal Cohort
GENSALT	Genetic Epidemiology Network of Salt Sensitivity (GenSalt)	Arterial Pressure, Mean	Interventional
GOLDN	NHLBI TOPMed: GOLDN Epigenetic Determinants of Lipid Response to Dietary Fat and Fenofibrate	Lipids	Prospective Longitudinal Cohort
HCHSSOL	Hispanic Community Health Study /Study of Latinos (HCHS/SOL)	Cardiovascular Disease	Prospective Longitudinal Cohort
HCT_for_SCD	Hematopoietic Cell Transplant for Sickle Cell Disease (HCT for SCD)	Sickle Cell Disease	Prospective Longitudinal Cohort
HVH	Heart and Vascular Health Study (HVH)	Cardiovascular Disease	Case-Control
HYPERGEN	NHLBI TOPMed: HyperGEN - Genetics of Left Ventricular (LV) Hypertrophy	Hypertrophy, Left Ventricular	Family/Twin/Trios
MAYOVTE	NHLBI TOPMed: Whole Genome Sequencing of Venous Thromboembolism (WGS of VTE)	Venous Thromboembolism	Case Set
MESA	Multi-Ethnic Study of Atherosclerosis (MESA) SHARe	Cardiovascular Disease	Prospective Longitudinal Cohort
MGHAF	Massachusetts General Hospital (MGH) Atrial Fibrillation Study	Atrial Fibrillation	Case Set
MSH	Multicenter Study of Hydroxyurea (MSH)	Sickle Cell Disease	Clinical Trial
NSRR-CFS	National Sleep Research Resource (NSRR): Cleveland Family Study (CFS)	Sleep Apnea Syndromes	Prospective Longitudinal Cohort
ORCHID	COVID19-ORCHID	COVID-19	Clinical Trial
PARTNERS	NHLBI TOPMed: Partners HealthCare Biobank	Atrial Fibrillation	Case Set
PCGC	The Pediatric Cardiac Genetics Consortium (PCGC) Study	Heart Defects, Congenital	Prospective Longitudinal Cohort
RED_CORAL	PETAL Repository of Electronic Data COVID-19 Observational Study (RED CORAL)	COVID-19	Control Set
SAFHS	NHLBI TOPMed: San Antonio Family Heart Study (SAFHS)	Cardiovascular Disease	Family/Twin/Trios
SAGE	NHLBI TOPMed: Study of African Americans, Asthma, Genes and Environment (SAGE) Study	Lung Diseases	Case Set
SARCOIDOSIS	NHLBI TOPMed: African American Sarcoidosis Genetics Resource	Sarcoidosis	Family/Twin/Trios
SARP	NHLBI GO-ESP: Lung Cohorts Exome Sequencing Project (Asthma): Genetic variants affecting susceptibility and severity	Asthma	Case Set
SAS	Genome-Wide Association Study of Adiposity in Samoans	Obesity	Cross-Sectional
SHARP	SNP Health Association Asthma Resource Project	Lung Diseases
STOP-II	Optimizing Primary Stroke Prevention in Children with Sickle Cell Anemia (STOP II)	Sickle Cell Disease	Clinical Trial
THRV	NHLBI TOPMed: Rare Variants for Hypertension in Taiwan Chinese (THRV)	Blood Pressure	Prospective Longitudinal Cohort
VAFAR	NHLBI TOPMed - NHGRI CCDG: The Vanderbilt AF Ablation Registry	Atrial Fibrillation	Case Set
VUAF	NHLBI TOPMed: The Vanderbilt Atrial Fibrillation Registry (VU_AF)	Atrial Fibrillation	Case Set
Walk-PHaSST	Treatment of Pulmonary Hypertension and Sickle Cell Disease with Sildenafil Therapy (Walk-PHaSST)	Sickle Cell Anemia	Clinical Trial
WGHS	NHLBI TOPMed: Novel Risk Factors for the Development of Atrial Fibrillation in Women	Atrial Fibrillation	Case Set

AWS Open Datasets

More information (description and links) is available here

Acronym	Name of AWS open dataset
EMBED	EMory BrEast Imaging Dataset
1000-genomes	1000 Genomes
tcga	The Cancer Genome Atlas
broad-gnomad	Genome Aggregation Database (gnomAD)
broad-pan-ukb	UK Biobank Pan-Ancestry Summary Statistics
kids-first	Gabriella Miller Kids First Pediatric Research Program (Kids First)
target	Therapeutically Applicable Research to Generate Effective Treatments (TARGET)
hcmi-cmdc	Human Cancer Models Initiative (HCMI) Cancer Model Development Center
cgci	Cancer Genome Characterization Initiatives - Burkitt Lymphoma, HIV+ Cervical Cancer
organoid-pancreatic	Pancreatic Cancer Organoid Profiling
nciccr-dlbcl	National Cancer Institute Center for Cancer Research - Diffuse Large B Cell Lymphoma (DLBCL) Genomics and Expression
mmrf-commpass	CoMMpass from the Multiple Myeloma Research Foundation
hcp-openaccess	The Human Connectome Project
cptac-2	Clinical Proteomic Tumor Analysis Consortium 2 (CPTAC-2)
cptac-3	Clinical Proteomic Tumor Analysis Consortium 3 (CPTAC-3)

NIH All of Us

The All of Us Research Program’s centralized, secure, cloud-based platform allows researchers across a wide range of settings and institutions and at all stages of their careers (e.g., students, early-stage investigators) to execute rapid, hypothesis-driven research with just a computer and an Internet connection.

All of Us facilitates equity in access in a deliberately inclusive way, creating a demographically diverse researcher cohort.

Currently, academic, not-for-profit and healthcare organizations are eligible to apply for an All of Us Data Use and Registration Agreement (DURA). This is the first step to accessing the All of Us Researcher Workbench.

Within the Researcher Workbench's Controlled Tier, data from nearly 250,000 whole genome sequences and more than 312,900 genotyping arrays are integrated alongside data from surveys, physical measurements, EHRs, and wearables. The data and tools are available only to registered researchers who have taken additional steps and training to access these data.

Submission using AIM-AHEAD Connect and InfoReady platform

Step 1: Click here to login or to register as a “mentor" on AIM-AHEAD Connect (our Community Building Platform)
Step 2: After you login to AIM-AHEAD Connect, click here to submit an application for review using the InfoReady platform^*.

*To submit your application in InfoReady, please use Chrome, Firefox, or Edge. If you're using Safari, make sure to clear your cache before logging in.

Please note both steps must be completed for consideration.

All applications must be received by June 16, 2024 — 11:59 PM Eastern Time.

About AIM-AHEAD

News & Events

Resources