Drug utilization
Epidemiology (incidence/prevalence)
Natural history
Risk factors
a Sentinel is the United States Food and Drug Administration’s national electronic system to proactively monitor medical product safety post-marketing, through rapidly and securely accessing data from large amounts of electronic healthcare records, insurance claims, and registries, from a diverse group of data partners [ 24 ]
PROBE prospective randomized open blinded endpoint, eCRF electronic case report form, SAE serious adverse event
Epidemiologic and observational research.
EHR data have been used to support observational studies, either as stand-alone data or following linkage to primary research data or other administrative data sets [ 3 , 11 – 14 ]. For example, the initial Euro Heart Survey [ 15 ] and subsequent Eurobservational Research Program (EORP) [ 16 ], the American College of Cardiology National Cardiovascular Data Registry (ACC-NCDR) [ 14 ], National Registry of Myocardial Infarction (NRMI), and American Heart Association Get With the Guidelines (AHA GWTG) [ 17 ] represent clinical data (collected from health records into an electronic case report form [eCRF] designed for the specific registry) on the management of patients across a spectrum of different cardiovascular diseases. However, modern EHR systems can minimize or eliminate the need for duplicate data collection (i.e., in a separate registry-specific eCRF), are capable of integrating large amounts of medical information accumulated throughout the patient’s life, enabling longitudinal study of diseases using the existing informatics infrastructure [ 18 ]. For example, EHR systems increasingly house imaging data which provide more detailed disease characterization than previously available in most observational data sets. In some countries (e.g., Farr Institute in Scotland [ 19 ]), the EHR can be linked, at an individual level, to other data sets, including general population health and lifestyle surveys, disease registries, and data collected by other sectors (e.g., education, housing, social care, and criminal justice). EHR data support a wide range of epidemiological research on the natural history of disease, drug utilization, and safety, as well as health services research.
Active post-marketing safety surveillance and signal detection are important, emerging applications for EHRs, because they can provide realistic rates of events (unlike spontaneous event reports) and information on real-world use of drugs [ 20 ]. The EU-ADR project linked 8 databases in four European countries (Denmark, Italy, The Netherlands, United Kingdom) to enable analysis of select target adverse drug events [ 21 ]. The European Medicines Agency (EMA) coordinates the European Network of Centres for Pharmacoepidemiology and Pharmacovigilance (ENCePP) which aims to conduct post-marketing risk assessment using various EHR sources [ 22 , 23 ]. In the United States, the Food and Drug Administration (FDA) uses EHR data from several different sources (e.g., Sentinel and Mini-Sentinel System [ 24 ], Centers for Medicare and Medicaid Services [CMS], Veterans Affairs, Department of Defense, Substance Abuse and Mental Health Services Administration) to support post-marketing safety investigations [ 25 ].
National patient registries that contain data extracted from the EHR are an accepted modality to assess guideline adherence and the effectiveness of performance improvement initiatives [ 26 – 33 ]. However, the use of EHRs for prospective clinical research is still limited, despite the fact that data collected for routine medical care overlap considerably with data collected for research. The most straightforward and generally accepted application for EHR is assessing trial feasibility and facilitating patient recruitment, and EHRs are currently used for this purpose in some centers. Using EHR technology to generate lists of patients who might be eligible for research is recognized as an option to meet meaningful use standards for EHR in the United States [ 6 ]. However, incomplete data may prohibit screening for the complete list of eligibility criteria [ 34 ], but EHRs may facilitate pre-screening of patients by age, gender, and diagnosis, particularly for exclusion of ineligible patients, and reduce the overall screening burden in clinical trials [ 35 ]. A second, and more complex, step involves the reuse of information collected in EHRs for routine clinical care as source data for research. Using EHRs as the source for demographic information, co-morbidities, and concomitant medications has several advantages over separately recording these data into an eCRF. Transcription errors may be reduced, since EHR data are entered by providers directly involved in a patient’s care as opposed to secondary eCRF entry by study personnel. The eCRF may be a redundant and costly step in a clinical trial, since local health records (electronic or paper) are used to verify source data entered into the eCRF. Finally, EHRs might enhance patient safety and reduce timelines if real-time EHR systems are used in clinical trials, in contrast to delays encountered with manual data entry into an eCRF. The EHR may facilitate implementation of remote data monitoring, which has the potential to greatly reduce clinical trial costs. The Innovative Medicine Initiative (IMI) Electronic Health Records for Clinical Research (EHR4CR, http://www.ehr4cr.eu ) project is one example, where tools and processes are being developed to facilitate reuse of EHR data for clinical research purposes. Systems to assess protocol feasibility and identify eligible patients for recruitment have been implemented, and efforts to link EHRs with clinical research electronic data collection are ongoing [ 36 ].
A shift towards pragmatic trials has been proposed as a mechanism to improve clinical trial efficiency [ 37 ]. Most of the data in a pragmatic trial are collected in the context of routine clinical care, which reduce trial-specific clinic visits and assessments, and should also reduce costs [ 38 ]. This concept is being applied in the National Institutes of Health (NIH) Health Care Systems Research Collaboratory. Trials conducted within the NIH Collaboratory aim to answer questions related to care delivery and the EHR contains relevant data for this purpose. Studies may have additional data collection modules if variables not routinely captured in the EHR are needed for a specific study. Similarly, the Patient-Centered Outcomes Research Institute (PCORI) has launched PCORnet, a research network that uses a common data platform alongside the existing EHR to conduct observational and interventional comparative effectiveness research [ 9 , 39 , 40 ].
The integration of EHRs in the conventional randomized controlled trials intended to support a new indication is more complex. EHRs may be an alternative to eCRFs when data collection is focused and limited to critical variables that are consistently collected in routine clinical care. Regulatory feedback indicates that while a new indication for a marketed drug might be achieved through EHRs, first marketing authorization using data entirely from EHRs would most likely not be possible with current systems until validation studies are performed and reviewed by regulatory agencies. The EHR could also be used to collect serious adverse events (SAE) that result in hospitalization, or to collect endpoints that do not necessarily require blinded adjudication (e.g., death), although the utility of EHRs for this purpose is dependent on the type of endpoint, whether it can reliably be identified in the EHR, and the timeliness of EHR data availability. Events that are coded for reimbursement (e.g., hospitalizations, MI) or new diagnoses, where disease-specific therapy is initiated (e.g., initiation of glucose lowering drugs to define new onset diabetes) tend to be more reliable. The reliability of endpoint collection varies by region and depends on the extent of linkage between different databases.
Challenges to using EHRs in clinical trials have been identified, related to data quality and validation, complete data capture, heterogeneity between systems, and developing a working knowledge across systems (Table 2 ). Ongoing projects, such as those conducted within the NIH Collaboratory and PCORnet [ 39 , 41 ] in the United States or the Farr Institute of Health Informatics Research in Scotland, have demonstrated the feasibility of using EHRs for aspects of clinical research, particularly comparative effectiveness. The success of these endeavors is connected to careful planning by a multi-stakeholder group committed to patient privacy, data security, fair governance, robust data infrastructure, and quality science from the outset. The next hurdle is to adapt the accrued knowledge for application to a broader base of clinical trials.
Challenges of using electronic health records in research
Problem | Example | Potential Solutions |
---|---|---|
Data quality and validation | Selecting measurement of interest for a clinical trial when multiple measurements are available (e.g., laboratory data) Inaccurate information in EHRs Coding errors | Specific parameters (e.g., using date or time windows) stated in protocol or operating procedures for extracting data from EHR into eCRF Use codes linked to reimbursement, which have greater likelihood of reliability Stakeholder collaboration to develop validation methodology Stakeholder collaboration to contribute data for EHR validation studies |
Complete data capture | Clinical endpoints SAEs Problematic in multiple-payer systems Death | Develop standards for data sharing and privacy Explore linking EHRs to national death registries |
Heterogeneity among systems | Multiple different vendors within a given country or region Inconfigurable systems Lack of flexible architecture Lack of common data fields, data definitions, and difficulty with data mapping Incomplete data capture Missing fields of interest (i.e. relevant to some diseases but not others) Inability to link systems (i.e. different patient identifiers) | Commit resources to harmonization efforts Form working group with representation from all stakeholders to develop consensus agreement on a common set of data variables to be included in all systems |
System knowledge | Inadequate understanding of database and its structure Researchers may not understand limitations of database | Transparency Develop and maintain data standards and operations manuals Report strengths, limitations, and nuances of databases in primary manuscripts Informatics training for investigators |
EHR electronic health record, SAE serious adverse event
Data quality and validation are key factors in determining whether EHRs might be suitable data sources in clinical trials. Concerns about coding inaccuracies or bias introduced by selection of codes driven by billing incentives rather than clinical care may be diminished when healthcare providers enter data directly into the EHRs or when EHRs are used throughout all areas of the health-system, but such systems have not yet been widely implemented [ 42 ]. Excessive or busy workloads may also contribute to errors in clinician data entry [ 43 ]. Indeed, errors in EHRs have been reported [ 43 – 45 ].
Complete data capture is also a critical aspect of using EHRs for clinical research, particularly if EHRs are used for endpoint ascertainment or SAE collection. Complete data capture can be a major barrier in regions, where patients receive care from different providers or hospitals operating in different EHR systems that are not linked.
Consistent, validated methods for assessing data quality and completeness have not yet been adopted [ 46 ], but validation is a critical factor for the regulatory acceptance of EHR data. Proposed validation approaches include using both an eCRF and EHRs in a study in parallel and comparing results using the two data collection methods. This approach will require collaborative efforts to embed EHR substudies in large cardiovascular studies conducted by several sponsors. Assessing selected outcomes of interest from several EHR-based trials to compare different methodologies with an agreed statistical framework will be required to gauge precision of data collection via EHRs. A hybrid approach has also been proposed, where the EHR is used to identify study endpoints (e.g., death, hospitalization, myocardial infarction, and cancer), followed by adjudication and validation of EHR findings using clinical data (e.g., electrocardiogram and laboratory data).
Validity should be defined a priori and should be specific to the endpoints of interest as well as relevant to the country or healthcare system. Validation studies should aim to assess both the consistency between EHR data and standard data collection methods, and also how identified differences influence a study’s results. Proposed uses of EHRs for registration trials and methods for their validation will likely be considered by regulatory agencies on a case-by-case basis, because of the limited experience with EHRs for this purpose at the current time. Collaboration among industry sponsors to share cumulative experiences with EHR validation studies might lead to faster acceptance by regulatory authorities.
The ESC-CRT recommends that initial efforts to integrate EHRs in clinical trials focus on a few efficacy endpoints of interest, preferably objective endpoints (e.g., all-cause or cause-specific mortality) that are less susceptible to bias or subjective interpretation. As noted above, mortality may be incompletely captured in EHRs, particularly if patients die outside of the hospital, or at another institution using a non-integrated EHR. Thus, methods to supplement endpoint ascertainment in the EHR may be necessary if data completeness is uncertain. Standardized endpoint definitions based on the EHR should be included in the study protocol and analysis plan. A narrow set of data elements for auditing should be prospectively defined to ensure the required variables which are contained in the EHR.
Early interaction between sponsors, clinical investigators, and regulators is recommended to enable robust designs for clinical trials aiming to use EHRs for endpoint ascertainment. Plans to translate Good Clinical Practice into an EHR facilitated research environment should be described. Gaps in personnel training and education should be identified and specific actions to address training deficiencies should be communicated to regulators and in place prior to the start of the trial.
The potential for delays in data access is an important consideration when EHRs are used in clinical trials. EHRs may contain data originally collected as free text that was later coded for the EHR. Thus, coded information may not be available for patient identification/recruitment during the admission. Similarly, coding may occur weeks or months after discharge. In nationally integrated systems, data availability may also be delayed. These delays may be critical depending on the purpose of data extracted from the EHR (e.g., SAE reporting, source data, or endpoints in a time-sensitive study).
Patients may be treated by multiple healthcare providers who operate independently of one another. Such patients may have more than one EHR, and these EHRs may not be linked. This heterogeneity adds to the complexity of using EHRs for clinical trials, since data coordinating centres have to develop processes for interacting or extracting data from any number of different systems. Differences in quality [ 47 ], non-standardized terminology, incomplete data capture, issues related to data sharing and data privacy, lack of common data fields, and the inability of systems to be configured to communicate with each other may also be problematic. Achieving agreement on a minimum set of common data fields to enable cross communication between systems would be a major step forward towards enabling EHRs to be used in clinical trials across centers and regions [ 48 , 49 ].
Privacy issues and information governance are among the most complex aspects of implementing EHRs for clinical research, in part because attitudes and regulations related to data privacy vary markedly around the world. Data security and appropriate use are high priorities, but access should not be restricted to the extent that the data are of limited usefulness. Access to EHR data by regulatory agencies will be necessary for auditing purposes in registration trials. Distributed analyses have the advantage of allowing data to remain with the individual site and under its control [ 39 , 41 ].
Pre-trial planning is critical to anticipate data security issues and to develop optimal standards and infrastructure. For pivotal registration trials, patients should be informed during the consent process about how their EHRs will be used and by whom. Modified approaches to obtaining informed consent for comparative effectiveness research studies of commonly used clinical practices or interventions may be possible [ 50 ]. A general upfront consent stating that EHR data may be used for research is a proactive step that may minimize later barriers to data access, although revision of existing legislation or ethics board rules may be needed to allow this approach. Patients and the public should be recognized as important stakeholders, and they can be advocates for clinical research using EHRs and improve the quality of EHR-based research if they are educated and engaged in the process and the purpose and procedures for EHR use are transparent. Developing optimal procedures for ensuring patients that are informed and protected, balanced with minimizing barriers to research is a major consideration as EHR-based research advances.
EHRs for use in clinical research need a flexible architecture to accommodate studies of different interventions or disease states. EHR systems may be capable of matching eligibility criteria to relevant data fields and flagging potential trial subjects to investigators. Patient questionnaires and surveys can be linked to EHRs to provide additional context to clinical data. Pre-population of eCRFs has been proposed as a potential role for EHRs, but the proportion of fields in an EHR that can be mapped to an eCRF varies substantially across systems.
EHRs may be more suitable for pragmatic trials where data collection mirrors those variables collected in routine clinical care. Whether regulators would require collection of additional elements to support a new drug or new indication depends on the drug, intended indication, patient population, and potential safety concerns.
The sustainability of EHRs in clinical research will largely depend on the materialization of their promised efficiencies. Programs like the NIH Collaboratory [ 41 ] and PCORnet [ 39 , 41 ], and randomized registry trials [ 51 , 52 ] are demonstrating the feasibility of these more efficient approaches to clinical research. The sustainability of using EHRs for pivotal registration clinical trials will depend on regulatory acceptance of the approach and whether the efficiencies support a business case for their use.
To make the vision of EHRs in clinical trials a reality, stakeholders should collaborate and contribute to the advancement of EHRs for research. Professional bodies, such as the ESC, can play a major role in the training and education of researchers and the public about the potential value of EHR. Clinical trialists and industry must be committed to advancing validation methodology [ 53 ]. Investigators should develop, conduct, and promote institutional EHR trials that change clinical practice; such experience may encourage EHR trial adoption by industry and the agencies. Development of core or minimal data sets could streamline the process, reduce redundancy and heterogeneity, and decrease start-up time for future EHR-based clinical trials. These and other stakeholder contributions are outlined in Table 3 .
Role and influence of stakeholders in advancing the use of electronic health records in clinical research
Stakeholder | Contribution |
---|---|
Professional societies | Training and education Global platform for education at annual meetings or congresses Leverage industry support Public education to foster public support Transform EORP into a prospective trial instrument; generate support from industry who may use this resource for future trials Develop data standards (CARDS-revisited) Organize working groups charged with generating common EHR templates or data sets, or achieving agreement on minimum standards Lobby regulatory agencies and industry sponsors |
Clinical trialists and industry | Engage other collaborators (e.g., ethicists, CROs, academic CROs, information governance, registries, IT providers, EHR companies, patient advocacy groups, data protection/security experts, legislators/agencies, public funders, legal experts, treating physicians, hospital administrators) Pilot the evaluation of EHR versus conventional non-EHR trials Pilot trials to compare event collection using EHRs versus usual eCRF Conduct actual EHR trials, initially in smaller countries, adapting the approach based on lessons learned, then applying to larger settings Adopt EHRs on an experimental basis for feasibility assessments and patient recruitment Lobby other stakeholders to collaborate towards developing robust methodology to incorporate EHRs in clinical trials Educate professionals and the public about potential value of EHRs |
Regulatory | Work with industry to identify appropriate ways to incorporate EHR data prospectively into study designs |
EHR vendors | Invest in building research capabilities on EHR platforms |
CARDS cardiology audit and registration data standards, CRO contract research organization, eCRF electronic case report form, IT information technology, EHR electronic health record, EORP European Observational Research Program
Electronic health records are a promising resource to improve the efficiency of clinical trials and to capitalize on novel research approaches. EHRs are useful data sources to support comparative effectiveness research and new trial designs that may answer relevant clinical questions as well as improve efficiency and reduce the cost of cardiovascular clinical research. Initial experience with EHRs has been encouraging, and accruing knowledge will continue to transform the application of EHRs for clinical research. The pace of technology has produced unprecedented analytic capabilities, but these must be pursued with appropriate measures in place to manage security, privacy, and ensure adequacy of informed consent. Ongoing programs have implemented creative solutions for these issues using distributed analyses to allow organizations to retain data control and by engaging patient stakeholders. Whether EHRs can be successfully applied to the conventional drug development in pivotal, registration trials remains to be seen and will depend on demonstration of data quality and validity, as well as realization of expected efficiencies.
This paper was generated from discussions during a cardiovascular round table (CRT) Workshop organized on 23–24 April 2015 by the European Society of Cardiology (ESC). The CRT is a strategic forum for high-level dialogues between academia, regulators, industry, and ESC leadership to identify and discuss key strategic issues for the future of cardiovascular health in Europe and other parts of the world. We acknowledge Colin Freer for his participation in the meeting. This article reflects the views of the authors and should not be construed to represent FDA’s views or policies. The opinions expressed in this paper are those of the authors and cannot be interpreted as the opinion of any of the organizations that employ the authors. MRC’s salary is supported by the National Institute for Health Research (NIHR) Cardiovascular Biomedical Research Unit at the Royal Brompton Hospital, London, UK.
Martin R. Cowie: Research grants from ResMed, Boston Scientific, and Bayer; personal fees from ResMed, Boston Scientific, Bayer, Servier, Novartis, St. Jude Medical, and Pfizer. Juuso Blomster: Astra Zeneca employee. Lesley Curtis: Funding from FDA for work with the Mini-Sentinel program and from PCORI for work with the PCORnet program. Sylvie Duclaux: None. Ian Ford: None. Fleur Fritz: None. Samantha Goldman: None. Salim Janmohamed: GSK employee and shareholder. Jörg Kreuzer: Employee of Boehringer-Ingelheim. Mark Leenay: Employee of Optum. Alexander Michel: Bayer employee and shareholder. Seleen Ong: Employee of Pfizer. Jill Pell: None. Mary Ross Southworth: None. Wendy Gattis Stough: Consultant to European Society of Cardiology, Heart Failure Association of the European Society of Cardiology, European Drug Development Hub, Relypsa, CHU Nancy, Heart Failure Society of America, Overcome, Stealth BioTherapeutics, Covis Pharmaceuticals, University of Gottingen, and University of North Carolina. Martin Thoenes: Employee of Edwards Lifesciences. Faiez Zannad: Personal fees from Boston Scientific, Servier, Pfizer, Novartis, Takeda, Janssen, Resmed, Eli Lilly, CVRx, AstraZeneca, Merck, Stealth Peptides, Relypsa, ZS Pharma, Air Liquide, Quantum Genomics, Bayer for Steering Committee, Advisory Board, or DSMB member. Andrew Zalewski: Employee of GSK.
The daily journal of the united states government.
This site displays a prototype of a “Web 2.0” version of the daily Federal Register. It is not an official legal edition of the Federal Register, and does not replace the official print version or the official electronic version on GPO’s govinfo.gov.
The documents posted on this site are XML renditions of published Federal Register documents. Each document posted on the site includes a link to the corresponding official PDF file on govinfo.gov. This prototype edition of the daily Federal Register on FederalRegister.gov will remain an unofficial informational resource until the Administrative Committee of the Federal Register (ACFR) issues a regulation granting it official legal status. For complete information about, and access to, our official publications and services, go to About the Federal Register on NARA's archives.gov.
The OFR/GPO partnership is committed to presenting accurate and reliable regulatory information on FederalRegister.gov with the objective of establishing the XML-based Federal Register as an ACFR-sanctioned publication in the future. While every effort has been made to ensure that the material on FederalRegister.gov is accurately displayed, consistent with the official SGML-based PDF version on govinfo.gov, those relying on it for legal research should verify their results against an official edition of the Federal Register. Until the ACFR grants it official status, the XML rendition of the daily Federal Register on FederalRegister.gov does not provide legal notice to the public or judicial notice to the courts.
A Notice by the Food and Drug Administration on 07/25/2024
Information about this document as published in the Federal Register .
Enhanced content.
Relevant information about this document from Regulations.gov provides additional context. This information is not part of the official Federal Register document.
This document has been published in the Federal Register . Use the PDF linked in the document sidebar for the official electronic format.
This table of contents is a navigational tool, processed from the headings within the legal text of Federal Register documents. This repetition of headings to form internal navigation links has no substantive legal effect.
Written/paper submissions, for further information contact:, supplementary information:, i. background, ii. paperwork reduction act of 1995, iii. electronic access, enhanced content - submit public comment.
Enhanced content - sharing.
These tools are designed to help you understand the official document better and aid in comparing the online edition to the print edition.
These markup elements allow the user to see how the document follows the Document Drafting Handbook that agencies use to create their documents. These can be useful for better understanding how a document is structured but are not part of the published document itself.
This document is available in the following developer friendly formats:.
More information and documentation can be found in our developer tools pages .
This PDF is the current document as it appeared on Public Inspection on 07/24/2024 at 8:45 am. It was viewed 0 times while on Public Inspection.
If you are using public inspection listings for legal research, you should verify the contents of the documents against a final, official edition of the Federal Register. Only official editions of the Federal Register provide legal notice of publication to the public and judicial notice to the courts under 44 U.S.C. 1503 & 1507 . Learn more here .
Food and Drug Administration, HHS.
Notice of availability.
The Food and Drug Administration (FDA or Agency) is announcing the availability of a final guidance for industry entitled “Real-World Data: Assessing Electronic Health Records and Medical Claims Data To Support Regulatory Decision-Making for Drug and Biological Products.” FDA is issuing this guidance as part of its Real-World Evidence (RWE) program and to satisfy, in part, the mandate under the Federal Food, Drug, and Cosmetic Act (FD&C Act) to issue guidance about the use of RWE in regulatory decision making. This guidance is intended to provide sponsors and other interested parties with considerations when proposing to use electronic health records (EHRs) or medical claims data in clinical studies to support a regulatory decision for effectiveness or safety. This guidance finalizes the draft guidance of the same title issued on September 30, 2021.
The announcement of the guidance is published in the Federal Register on July 25, 2024.
You may submit either electronic or written comments on Agency guidances at any time as follows:
Submit electronic comments in the following way:
Submit written/paper submissions as follows:
Instructions: All submissions received must include the Docket No. FDA-2020-D-2307 for “Real-World Data: Start Printed Page 60431 Assessing Electronic Health Records and Medical Claims Data To Support Regulatory Decision-Making for Drug and Biological Products.” Received comments will be placed in the docket and, except for those submitted as “Confidential Submissions,” publicly viewable at https://www.regulations.gov or at the Dockets Management Staff between 9 a.m. and 4 p.m., Monday through Friday, 240-402-7500.
Docket: For access to the docket to read background documents or the electronic and written/paper comments received, go to https://www.regulations.gov and insert the docket number, found in brackets in the heading of this document, into the “Search” box and follow the prompts and/or go to the Dockets Management Staff, 5630 Fishers Lane, Rm. 1061, Rockville, MD 20852, 240-402-7500.
You may submit comments on any guidance at any time (see 21 CFR 10.115(g)(5) ).
Submit written requests for single copies of this guidance to the Division of Drug Information, Center for Drug Evaluation and Research, Food and Drug Administration, 10001 New Hampshire Ave., Hillandale Building, 4th Floor, Silver Spring, MD 20993-0002 or to the Office of Communication, Outreach and Development, Center for Biologics Evaluation and Research (CBER), Food and Drug Administration, 10903 New Hampshire Ave., Bldg. 71, Rm. 3128, Silver Spring, MD 20993-0002. Send one self-addressed adhesive label to assist that office in processing your requests. See the SUPPLEMENTARY INFORMATION section for electronic access to the guidance document.
Dianne Paraoan, Center for Drug Evaluation and Research, Food and Drug Administration, 10903 New Hampshire Ave., Bldg. 51, Rm. 3226, Silver Spring, MD 20993-0002, 301-796-3161; or James Myers, Center for Biologics Evaluation and Research, Food and Drug Administration, 10903 New Hampshire Ave., Bldg. 71, Rm. 7301, Silver Spring, MD 20993-0002, 240-402-7911.
FDA is announcing the availability of a guidance for industry entitled “Real-World Data: Assessing Electronic Health Records and Medical Claims Data To Support Regulatory Decision-Making for Drug and Biological Products.” This guidance discusses the following topics related to the potential use of EHRs and medical claims in clinical studies to support regulatory decisions: selection of data sources that appropriately address the study question and sufficiently capture study populations, exposure, outcomes of interest, and key covariates; development and validation of definitions for study design elements ( e.g., exposure, outcomes, covariates); and data traceability and quality during data accrual, data curation, and incorporation into the final study-specific dataset.
Section 3022 of the 21st Century Cures Act (Cures Act) of 2016 amended the FD&C Act to add section 505F, Utilizing Real World Evidence ( 21 U.S.C. 355g ), which requires FDA to issue guidance about the use of RWE in regulatory decision making. In addition, under the Prescription Drug User Fee Amendments of 2017 (PDUFA VI), FDA committed to publish draft guidance on how RWE can contribute to the assessment of safety and effectiveness in regulatory submissions. In 2018, FDA created an RWE Framework and RWE Program to evaluate the potential use of RWE to help support the approval of a new indication for a drug already approved under the FD&C Act or to help support or satisfy postapproval study requirements. In late 2021, FDA utilized the program to issue draft guidances outlining considerations for the use of real-world data and RWE in regulatory decision making to, among other things, help satisfy the Cures Act mandate and the PDUFA VI commitment.
This guidance finalizes the draft guidance of the same title issued on September 30, 2021 ( 86 FR 54219 ). FDA considered comments received on the draft guidance as the guidance was finalized. Changes from the draft to the final guidance include: (1) clarifying that the selection of study variables for validation and the extent of effort required for validation depends on the necessary level of certainty and the implication of potential misclassification on study inference; (2) noting that choice of a reference standard for validation may vary by the study design and question, variable of interest, and the necessary level of certainty; (3) recommending the use of quantitative approaches, such as quantitative bias analyses, either a priori for feasibility assessment, or to facilitate interpretation of study results, or for both purposes, to demonstrate whether and how misclassification, if present, might impact study findings; and (4) removing defined terms that are generally understood and transferring other relevant definitions from a glossary to the text. In addition, editorial changes were made to improve clarity.
This guidance is being issued consistent with FDA's good guidance practices regulation ( 21 CFR 10.115 ). The guidance represents the current thinking of FDA on “Real-World Data: Assessing Electronic Health Records and Medical Claims Data To Support Regulatory Decision-Making for Drug and Biological Products.” It does not establish any rights for any person and is not binding on FDA or the public. You can use an alternative approach if it satisfies the requirements of the applicable statutes and regulations.
While this guidance contains no collection of information, it does refer to previously approved FDA collections of information. The previously approved collections of information are subject to review by the Office of Management and Budget (OMB) under the Paperwork Reduction Act of 1995 (PRA) ( 44 U.S.C. 3501-3521 ). The collections of information in 21 CFR part 11 have been approved under OMB control number 0910-0303; the collections of information in 21 CFR part 312 have been approved under OMB control number 0910-0014; the collections of information in 21 CFR part 314 have been approved under OMB control number 0910-0001; and the collections Start Printed Page 60432 of information in 21 CFR part 601 have been approved under OMB control number 0910-0338.
Persons with access to the internet may obtain the guidance at https://www.fda.gov/drugs/guidance-compliance-regulatory-information/guidances-drugs , https://www.fda.gov/vaccines-blood-biologics/guidance-compliance-regulatory-information-biologics/biologics-guidances , https://www.fda.gov/regulatory-information/search-fda-guidance-documents , or https://www.regulations.gov .
Dated: July 22, 2024.
Lauren K. Roth,
Associate Commissioner for Policy.
[ FR Doc. 2024-16338 Filed 7-24-24; 8:45 am]
BILLING CODE 4164-01-P
Information.
Importance The use of large language models (LLMs) in medicine is increasing, with potential applications in electronic health records (EHR) to create patient cohorts or identify patients who meet clinical trial recruitment criteria. However, significant barriers remain, including the extensive computer resources required, lack of performance evaluation, and challenges in implementation.
Objective This study aims to propose and test a framework to detect disease diagnosis using a recent light LLM on French-language EHR documents. Specifically, it focuses on detecting gout (“goutte” in French), a ubiquitous French term that have multiple meanings beyond the disease. The study will compare the performance of the LLM-based framework with traditional natural language processing techniques and test its dependence on the parameter used.
Design The framework was developed using a training and testing set of 700 paragraphs assessing “gout”, issued from a random selection of retrospective EHR documents. All paragraphs were manually reviewed and classified by two health-care professionals (HCP) into disease (true gout) and non-disease (gold standard). The LLM’s accuracy was tested using few-shot and chain-of-thought prompting and compared to a regular expression (regex)-based method, focusing on the effects of model parameters and prompt structure. The framework was further validated on 600 paragraphs assessing “Calcium Pyrophosphate Deposition Disease (CPPD)”.
Setting The documents were sampled from the electronic health-records of a tertiary university hospital in Geneva, Switzerland.
Participants Adults over 18 years of age.
Exposure Meta’s Llama 3 8B LLM or traditional method, against a gold standard.
Main Outcomes and Measures Positive and negative predictive value, as well as accuracy of tested models.
Results The LLM-based algorithm outperformed the regex method, achieving a 92.7% [88.7-95.4%] positive predictive value, a 96.6% [94.6-97.8%] negative predictive value, and an accuracy of 95.4% [93.6-96.7%] for gout. In the validation set on CPPD, accuracy was 94.1% [90.2-97.6%]. The LLM framework performed well over a wide range of parameter values.
Conclusions and Relevance LLMs were able to accurately detect disease diagnoses from EHRs, even in non-English languages. They could facilitate creating large disease registries in any language, improving disease care assessment and patient recruitment for clinical trials.
Question How accurate and efficient are large language models (LLMs) in detecting diseases from unstructured electronic health records (EHR) text compared to traditional natural language processing techniques?
Findings This study proposes a framework based on Meta’s Llama 3 8B, a recent public LLM, outperforming traditional natural language processing techniques in detecting gout and calcium pyrophosphate deposition disease in unstructured text. It achieves high positive and negative predictive values and accuracy. Performance was robust over a wide range of parameters.
Meaning The proposed framework can ease the use of LLMs in effectively detecting disease in EHR data for various clinical applications.
The authors have declared no competing interest.
This project was funded by the Private Foundation of the Geneva University Hospitals, a not-for-profit foundation.
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
This study involves human participants and the creation and use of the register for quality improvement programs has been approved by the Geneva ethics commission (CCER 2023-00129). The need for consent was waived by the Geneva Ethics Committee because this study qualifies as a quality improvement initiative.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
↵ * co-last authors
All prompts and code have been made available at the following gitlab repository: https://gitlab.unige.ch/goutte/llm_detection_of_diagnosis . Due to medical confidentiality, we are unable to share the sentences and document data. However, if authorization is obtained from the ethics committee, we may be able to provide access to the data
https://gitlab.unige.ch/goutte/llm_detection_of_diagnosis
View the discussion thread.
Supplementary Material
Thank you for your interest in spreading the word about medRxiv.
NOTE: Your email address is requested solely to identify you as the sender of this article.
BMC Medical Informatics and Decision Making volume 24 , Article number: 209 ( 2024 ) Cite this article
63 Accesses
1 Altmetric
Metrics details
The National Institute of Health and Social Care Research (NIHR) Health Informatics Collaborative (HIC) for Hearing Health has been established in the UK to curate routinely collected hearing health data to address research questions. This study defines priority research areas, outlines its aims, governance structure and demonstrates how hearing health data have been integrated into a common data model using pure tone audiometry (PTA) as a case study.
After identifying key research aims in hearing health, the governance structure for the NIHR HIC for Hearing Health is described. The Observational Medical Outcomes Partnership (OMOP) was chosen as our common data model to provide a case study example.
The NIHR HIC Hearing Health theme have developed a data architecture outlying the flow of data from all of the various siloed electronic patient record systems to allow the effective linkage of data from electronic patient record systems to research systems. Using PTAs as an example, OMOPification of hearing health data successfully collated a rich breadth of datapoints across multiple centres.
This study identified priority research areas where routinely collected hearing health data could be useful. It demonstrates integration and standardisation of such data into a common data model from multiple centres. By describing the process of data sharing across the HIC, we hope to invite more centres to contribute and utilise data to address research questions in hearing health. This national initiative has the power to transform UK hearing research and hearing care using routinely collected clinical data.
Peer Review reports
An estimated 12 million UK adults have hearing loss. This is set to rise to 14.2 million by 2035 [ 1 , 2 ]. The total cost in the UK of untreated, disabling hearing loss is estimated at £25.5 billion annually [ 3 ]. Hearing loss affects functioning, communication [ 4 ], social interactions [ 5 ] and employment opportunities [ 6 ].
Despite hearing loss ranking third for Years Lived with Disability [ 7 ] and being the commonest sensory disorder [ 8 ], it receives less than 1% of UK research funding [ 9 ]. Research funding spent for hearing loss per individual is only £1, compared to £11 for sight loss. Efforts are underway to raise awareness and increase the budget for hearing research [ 10 ].
This article highlights how the National Institute of Health and Social Care Research (NIHR) Health Informatics Collaborative (HIC) for Hearing Health [ 11 ] has been established. We first list priority research areas where data could be useful, then outline the NIHR HIC Hearing Health’s aims, governance structure and study adoption processes. We demonstrate how hearing health data have been integrated into a common data model, using pure tone audiometry (PTA) as a case study. Finally, we detail how health data from contributing centres are ingested and stored.
Most estimates on the prevalence of hearing loss are from the 1980s [ 12 ]. There is a need for up-to-date data on the burden of hearing loss, causes, risk factors and predictors of progression so that new and effective treatments can be developed.
Hearing loss is unequally distributed, with people from lower socioeconomic and ethnic minority backgrounds being at higher risk [ 13 , 14 ]. Since these groups are less likely to seek interventions and participate in research, true risk may be underestimated, calling for novel approaches to include data from these groups.
Hearing loss has recently been linked to other chronic conditions such as dementia [ 15 ], diabetes [ 16 ] and falls [ 17 ]. Data-driven approaches could disentangle these associations and provide possible models of causation.
Hearing devices are the most common treatment for adults with hearing loss. The NHS is the largest purchaser of hearing aids worldwide, procuring 1.2 million annually. However, most people with aidable hearing loss never receive a hearing aid [ 18 ] and those that do may not always use them. Identifying patient, disease and device characteristics that predict who is most unlikely to be offered or use hearing devices would help devise strategies to improve uptake and usage.
Recent insights in genetic and molecular mechanisms causing hearing loss [ 19 ] have allowed detection of therapeutic targets and development of therapies aimed at protecting or restoring hearing [ 20 ]. These highly targeted treatments call for large scale geno- and phenotyping efforts to improve patient selection for upcoming clinical trials.
National Institute of Health and Social Care Research Health Informatics Collaborative.
The UK offers a unique infrastructure for data-driven research because 80% of all healthcare is provided by the National Health Service (NHS). This creates an unparalleled flow of routine health data across diverse ethnic and socio-economic groups [ 21 ]. To standardise and combine data across NHS providers, NIHR established its HIC; a collaboration between NHS trusts and their partner universities, hosted by the Biomedical Research Centres (BRCs) [ 22 ]. The NIHR HIC brings together clinical, scientific, and informatics expertise to support the establishment and maintenance of catalogued, comparable, and comprehensive flows of patient data at each Trust, and to create a governance framework for data sharing and re-use across the trusts and partner organisations.
Recognising clinical need and opportunities for hearing health, the NIHR HIC Hearing Health theme was established in 2022. It aims to bring together the following routinely collected hearing health data and repurpose them for research. We interviewed NHS England, local commissioning groups, NHS procurement and NHS genomics to better understand hearing data collected through NHS organisations across England. Parameters included number of individuals undergoing hearing tests as part of the newborn hearing screen, volume of routine hearing consultations, number of audiometric assessments, number of hearing devices fitted and amount of requests for hearing panels. Based on responses received, the following was obtained:
This national programme uses automated oto-acoustic emissions (OAE) at birth and auditory brainstem response (ABR) for those who fail or have specific risk factors. Since 2017, the UK has had between 680,000 and 750,000 births annually and a coverage of the new-born hearing screen of > 95%. This equates to hearing data of over three million patients, of which 80,000 failed the initial OAE screen and had ABR, and an estimated 5,000 who had confirmed permanent hearing loss.
Across the UK, 355,000 new consultations by audiologists and ENT surgeons are undertaken for hearing loss annually. These provide data on demographics, hearing symptoms, risk factors and interventions. Data are documented in electronic hospital records.
Audiometric assessments inform the diagnosis of hearing loss and effects of interventions. Each of the 106 UK Clinical Commissioning Groups commissions 10,000–30,000 assessments annually, totalling 10 million hearing tests over five years, stored on NHS hospital or audiology clinic servers in codified format.
The NHS is the largest procurer of hearing aids in the world, fitting new hearing aids for the first time on 355,000 adults annually. Additionally, 1,000 NHS patients receive cochlear implants annually, with over 12,000 NHS patients with cochlear implants so far. Data on the provision, maintenance, and use of these devices, as well as user, environmental and impedance data are stored on NHS servers.
Since 2021, all children and adults in the UK with potential inherited causes of bilateral sensorineural hearing loss are eligible for genetic testing. There has been an increase in the number of hearing loss gene panels requested, with nearly 1000 requested in 2022.
Between 2nd December 2020 and 16th January 2021, a stakeholder consultation took place online using a newly developed questionnaire on the platform Select Surveys (Appendix 1). It was aimed at clinicians, academic scientists and industry partners. They were asked to list any questions they would like to see formally investigated relating to diagnosis/assessment, treatment/ intervention and follow-up support. A total of 74 stakeholders (34 clinicians, 33 academic scientists, 7 industry partners) responded. The responses were condensed into four key topics:
Exploration of effects of known and novel risk factors such as disease clustering for hearing loss.
Identification of genetic causes of hearing loss.
Definition of hearing loss sub-types.
Optimisation of benefit from individualised treatment strategies.
The NIHR HIC Hearing Health theme is co-led by the founding BRCs: University College London Hospital Trust (UCLH), Nottingham University Hospitals Trust (NUH) and Manchester Foundation Trust (MFT), and their academic partners. Patients are key to decision-making processes. Specifically, regulation around the national data opt-out was directly informed by patient representatives. Patient public involvement (PPI) support is offered to all researchers submitting study requests to ensure that their research question is in line with patient priorities, and that their research study includes proportionate and meaningful PPI.
All NHS hearing health providers are encouraged to join the NIHR HIC Hearing Health theme as contributing centres, and contribute their locally stored, de-identified datasets to the central data repository, which is stored in a secure server at University College London (UCL).
The NIHR HIC Hearing Health theme does not allow any data to be removed once it has arrived in the data repository. All researchers at contributing centres are welcome to submit a study request for approval by the steering committee. Once the request is approved, the study is added to the Hearing Health HIC’s portfolio, and researchers are allowed to undertake research within the secure server at UCL.
Further detail is available here, which describes the framework through which we created this resource ( https://github.com/uclh-criu/hic-hearing-health-docs ) and specific code that relates to a licensed version of their electronic patient record can be accessed upon request.
The protocol for the collection and management of data was approved by Central Bristol Research Ethics Committee (Reference Number: 21/SW/0139).
All NHS hearing health providers can contribute data to the NIHR HIC Hearing Health theme under the NIHR data sharing framework. This covers a range of data and research collaborations and must be signed by all contributing centres. The NIHR HIC data sharing framework addresses common requirements and considerations regarding data sharing between centres, contractual responsibilities, confidentiality, intellectual property and a publications policy. This general agreement will underpin individual agreements for research collaborations with third party academic, clinical and industry partners. Any collaboration with industry partners requires additional agreements, with additional governance checks by participating sites. Industry partners will only be allowed to participate in research by collaborating with a contributing centre.
The HIC has applications to other international healthcare systems where existing relationships between hospitals and academic institutes exist, however given that it focuses on the UK based on the NIHR data sharing agreement, this is a limitation of this paper that its application is not directly transferrable and inevitably obstacles may be encountered in other counties based on local frameworks.
The process that allows hearing health researchers from contributing centres to undertake research on the NIHR HIC Hearing Health theme’s dataset is summarised in Fig. 1 .
Process of requesting approval to use NIHR HIC Hearing Health data for research
Researchers from contributing centres can request to undertake an analysis on existing data, or can request new data fields to be extracted from contributing centres to be added to the central data repository for the purpose of their study.
The steering committee reviews study requests from researchers at contributing centres. At the quarterly meeting, each proposal is reviewed by the contributing centres and PPI group, and a decision is made, through consensus amongst the steering committee, as to whether the study should be adopted, rejected, or sent back for further refinement. Studies may be rejected if they do not align with the key topics described above or are unachievable.
For studies that are to be adopted by the NIHR HIC Hearing Health theme, each centre is given the opportunity to submit their de-identified hospital data for pooled analysis. This offers each centre continued autonomy over their local data, irrespective of where the data are housed. This policy has been introduced following patient and key stakeholder input into anxieties over loss of autonomy over local data.
Researchers from the approved study centre will be onboarded to the central data repository and given access to the environment to analyse an excerpt of the dataset that is relevant to their study questions. Whilst the results of their analyses can be extracted from the secure environment, no raw data will be sent out.
Hearing healthcare data are stored across multiple platforms on local servers. Since 2002, all providers of NHS hearing aids have moved to patient management systems as part of the Department of Health and Social Care Modernising Hearing Aid Services; however, these management systems preceded electronic patient records and are not integrated with Trust electronic care records or with each other. No healthcare recording system currently exists that collects and stores all NHS hearing healthcare information within a single database. Each hospital has its own data flow and software specific databases. This is because a multitude of proprietary audiology-specific hard- and software products are used.
The variety of electronic patient record systems means that assessment and management data are stored across multiple software databases:
Diagnostic hearing tests are undertaken using locally purchased proprietary hardware, each with its own software and data model (assessment).
Hearing aid fitting is undertaken through manufacturer specific hardware, each with its own software and data model (management).
Patient management system stores patient data.
Linkage software communicates between the above software systems that stores its own versions of datasets.
The integration of assessment and management data is undertaken by a third-party software called NOAH, developed by Hearing Instrument Manufacturers’ Software Association (HIMSA). NOAH’s primary function is to provide a unified system managing data collected during a diagnostic hearing test, which can then be used by manufacturer-specific hearing aid fitting systems. NOAH software is built into patient management software, which captures appointment level data and allows a single portal to link assessment and management.
Locally, databases exist for diagnostic hearing tests, hearing aid fitting software, NOAH and clinical management systems, each storing part of the dataset (see Fig. 2 ). In addition to hearing-related information, general operational data around patient referrals, waiting times and staffing capacity are stored on bespoke databases. Medical information, including surgical interventions is captured on general hospital medical records, whilst imaging data is stored on Picture Archiving and Communication Systems (PACS).
Local databases for auditory assessment (green) and hearing aid fitting (blue) and patient management (orange) all need to be combined in a meaningful way
Combining all these different types of data in a meaningful way is challenging. The NIHR HIC Hearing Health theme have developed a data architecture outlying the flow of data from all the various siloed electronic patient record systems to allow the effective linkage of data from electronic patient record to research systems.
Data extraction, linkage and standardisation, within contributing centres.
Whilst each software package allows patient by patient data review and occasionally extraction, very few allow wholesale extraction. Using Open Database Connectivity (ODBC), an application programming interface, we have been able to extract all diagnostic hearing tests and hearing device fitting data stored on audiology patient management software, which generally are all based on SQL databases.
The architecture of the database, as well as data structure within each database, varies between software packages. We pursued manufacturers to release internal database architecture and data keys. This information was used to identify and decode key data fields, such as patient identifiers and clinical fields such as hearing test results.
Using probabilistic linkage algorithms, based on national ID, hospital ID, name, date of birth, we linked hearing data to hospital medical records. We prioritised demographic data from patient medical records if they conflicted with patient management software. Patients who signed up for the national data opt-out were removed from the cohort.
The formats in which demographics, diagnoses, and treatments are stored within separate databases within and across hospitals do not always match. Therefore, we opted to convert all data into a model using common data terminologies. We chose the Observational Medical Outcomes Partnership (OMOP) as our common data model. This is an international data model that enables the capture of information (e.g., encounters, patients, providers, diagnoses, drugs, devices, measurements and procedures) in the same way across different institutions. Its usefulness has been demonstrated in multiple health themes [ 23 ]. This model is coding language agnostic and maps across multiple vocabularies. Additionally, OMOP does not require a specific technology. It can be realised in any relational database, such as Oracle, SQL Server etc. or as SAS analytical datasets.
The NIHR HIC Hearing Health OMOP data model outlines the structure of the dataset and the associations between the data fields. Local vocabularies are mapped onto standardised OMOP vocabularies and labelled with OMOP domains.
To provide details on how we standardised data with OMOP, we have used hearing test data as an example.
The challenge.
PTA is the standard test of hearing [ 24 ]. It measures the lowest level (in Decibels) that a pure tone can be reliably heard at multiple sound frequencies in each ear (Hertz). The sound can be presented to the ear canal (air conduction), or onto the bone behind the ear (bone conduction). The non-tested ear can be deliberately presented with noise (masking) to prevent it from hearing sound presented to the test ear.
There are already several PTA-related concepts imported from various clinical vocabularies into the OMOP framework (Table 1 ). However, there is no architecture that inter-relates these codes. As such, new classifications and an inter-relational architecture were required.
We mapped test outcomes from PTA to OMOP by having a [Procedure_Occurrence] record for each test performed. The most appropriate OMOP procedure concept (from the Systematized Nomenclature of Medicine - SNOMED - vocabulary) was used in each case (e.g. 4091134 = Pure tone audiometry, or 4091877 = Soundfield audiometry). Multiple records from the [Observation] table were then related to the [Procedure_Occurrence], to describe results for each frequency (and ear) tested. Concepts such as masking level, air conduction or bone conduction could then be related to the [Observation], whereas concepts such as headphone (worn over the ear) or inserts (worn in the ear) could be related to the [Procedure Occurrence]. Fact relationship tables were used to communicate these relations [ 25 ]. These tables allow relationships to be defined between concepts from the same table or different tables. Figure 3 shows the structure of how codes were linked using fact relationships, using a few common examples. Whilst codes exist for most variables, some local codes were developed when no concept was previously available, for example to assign noise status (see Fig. 3 ).
Examples of linking structure of codes using fact relationships
The process of data flow from contributing centres to the central data repository is summarised in Fig. 4 . The HIC Data Alchemist platform has been designed to import data from contributing centres within the NIHR HIC Hearing Health theme into a centralised OMOP database. This platform was established to manage data provided in its most raw form, as CSV files compliant with RFC 4180, placing as little burden as possible on individual hospitals. Data is provided in data bundles, each of a different data type, allowing the contributing centre to progressively build their data contribution. Each data bundle focuses on a different goal. This stepwise process facilitates incremental development and feedback. To progress to the next data bundle, the previous data bundle must be completed to a specific standard. Compliance will be measured strictly, as any issues at an early stage may create a risk for further data bundles.
The different stages of data standardisation and integration from different types of data across a variety of platforms and providers within the field of hearing health research. There are two main steps:1. Step 1: generate local view of the uploaded CSV files for the sites to inspect and ensures it has been possible to load data into the database2. Step 2: updates and merges site’s existing data with newly ingested data, ensuring there are valid foreign keys (a system of connecting different tables of data), duplicates have been removed and deletions and updates have been processed to obtain the latest local version of a site’s data
Each data bundle undergoes a data ingestion process that runs from handling raw CSV files to integration within the multisite database (see Fig. 4 ).
Further details about the HIC Data Alchemist can be found at https://uclh-criu.github.io/hic-hearing-health-docs/ .
Data is stored in the UCL Data Safe Haven (DSH). This is an ISO27001 certified and NHS Data Security Protection Toolkit compliant Trusted Research Environment (TRE). Processing, analysis and storage capabilities exceed what is available to hospital-based researchers and include a full High Performance Computing cluster, cutting-edge GPU computing nodes for AI research, end-user environments with the latest analytical software and the facility to host any database or application server on enterprise-grade virtual machine infrastructure.
The NIHR HIC Hearing Health theme Research Database has been developed and is in the process of OMOPification. Data ingestion is underway. The data collected fall into the following categories: (a) basic information (e.g., demographics, hospital visits, death, discharge and study sites), (b) laboratory data, (c) treatments, (d) diagnoses, (e) hearing test data, (f) device data and (g) other clinical information.
An NIHR HIC Hearing Health theme has been established, to bring together routinely collected hearing health data, to address urgent research questions in an efficient and cost-effective way. After identifying priority research areas where these data could be useful, the NIHR HIC Hearing Health’s aims, governance structure and study adoption processes were identified. Key hearing health data can be integrated into a common data model. Health data from contributing centres are ingested and stored on a research ready database. Data across a variety of platforms and providers within the field of hearing health research can be standardised and integrated. By detailing the process of sharing data and submitting research proposals we hope to inspire more hearing-health researchers and NHS trusts to contribute to the database and use the wealth of its data to address urgent questions in hearing-health research. This national initiative has the power to transform UK hearing research and hearing care using routinely collected clinical data.
All the raw data (including participants’ voice files and the texts of the interviews) will be confidential and will not be able to share publicly. However, the codes that emerged during the current study are available from the corresponding author upon reasonable request.
Royal National Institute for the Deaf. Facts and figures - RNID. https://rnid.org.uk/about-us/research-and-policy/facts-and-figures/ (accessed 12 Apr 2023).
Office of National Statistics. 2011 censuses UK comparability report. https://www.ons.gov.uk/census/planningforcensus2021/ukcensusdata (accessed 12 Apr 2023).
Shield B. Hearing Loss-Numbers and Costs EVALUATION OF THE SOCIAL AND ECONOMIC COSTS OF HEARING IMPAIRMENT A report for Hear-It AISBL. 2019.
Demorest ME, Erdman SA. Relationships among behavioral, environmental, and affective communication variables: a canonical analysis of the CPHI. J Speech Hear Disorders. 1989;54:180–8. https://doi.org/10.1044/jshd.5402.180 .
Article CAS Google Scholar
Meadow-Orlans KP. Social and psychological effects of hearing loss in adulthood: a literature review. Adjustment Adult Hear loss 1985;:35–57.
Shan A, Ting JS, Price C, et al. Hearing loss and employment: a systematic review of the association between hearing loss and employment among adults. J Laryngol Otol. 2020;134:387–97. https://doi.org/10.1017/S0022215120001012 .
Article CAS PubMed Google Scholar
Haile L, Orji A, Briant P, et al. Updates on hearing from the global burden of Disease Study. Innov Aging. 2020;4:808–808. https://doi.org/10.1093/geroni/igaa057.2934 .
Article PubMed Central Google Scholar
Raviv D, Dror AA, Avraham KB. Hearing loss: a common disorder caused by many rare alleles. Ann N Y Acad Sci. 2010;1214:168–79. https://doi.org/10.1111/j.1749-6632.2010.05868.x .
Article CAS PubMed PubMed Central Google Scholar
The Lancet. Calling for research into deafness. Lancet. 2007;370:1738. https://doi.org/10.1016/S0140-6736(07)61729-2 .
Article Google Scholar
Wilson BS, Tucci DL, O’Donoghue GM, et al. A Lancet Commission to address the global burden of hearing loss. Lancet. 2019;393:2106–8. https://doi.org/10.1016/S0140-6736(19)30484-2 .
Article PubMed Google Scholar
Health Informatics Collaborative – National Institute for Health Research. https://hic.nihr.ac.uk/ (accessed 12 Apr 2023).
Davis AC. The prevalence of hearing impairment and reported hearing disability among adults in Great Britain. 1389. https://academic.oup.com/ije/article/18/4/911/683421 .
Tsimpida D, Kontopantelis E, Ashcroft DM, et al. Regional patterns and trends of hearing loss in England: evidence from the English longitudinal study of ageing (ELSA) and implications for health policy. BMC Geriatr. 2020;20:536. https://doi.org/10.1186/s12877-020-01945-6 .
Article PubMed PubMed Central Google Scholar
Sawyer CS, Armitage CJ, Munro KJ, et al. Correlates of hearing Aid Use in UK adults. Ear Hear. 2019;40:1061–8. https://doi.org/10.1097/AUD.0000000000000695 .
Lin FR, Albert M. Hearing loss and dementia – who is listening? Aging Ment Health. 2014;18:671–3. https://doi.org/10.1080/13607863.2014.915924 .
Samocha-Bonet D, Wu B, Ryugo DK. Diabetes mellitus and hearing loss: a review. Ageing Res Rev. 2021;71:101423. https://doi.org/10.1016/j.arr.2021.101423 .
Lin FR. Hearing loss and Falls among older adults in the United States. Arch Intern Med. 2012;172:369. https://doi.org/10.1001/archinternmed.2011.728 .
Chien W. Prevalence of hearing Aid Use among older adults in the United States. Arch Intern Med. 2012;172:292. https://doi.org/10.1001/archinternmed.2011.1408 .
Melgar-Rojas P, Alvarado JC, Fuentes-Santamaría V et al. Cellular mechanisms of Age-related hearing loss. 2015. 305–33. https://doi.org/10.1007/978-3-319-13473-4_15 .
Schilder AGM, Su MP, Mandavia R, et al. Early phase trials of novel hearing therapeutics: avenues and opportunities. Hear Res. 2019;380:175–86. https://doi.org/10.1016/j.heares.2019.07.003 .
Rai T, Dixon S, Ziebland S. Shifting research culture to address the mismatch between where trials recruit and where populations with the most disease live: a qualitative study. BMC Med Res Methodol. 2021;21:80. https://doi.org/10.1186/s12874-021-01268-z .
Experimental medicine | NIHR. https://www.nihr.ac.uk/explore-nihr/support/experimental-medicine.htm (accessed 12 Apr 2023).
Reinecke I, Zoch M, Reich C et al. The Usage of OHDSI OMOP – A Scoping Review. 2021. https://doi.org/10.3233/SHTI210546 .
ISO 8253-1. 2010(en), Acoustics — Audiometric test methods — Part 1: Pure-tone air and bone conduction audiometry. https://www.iso.org/obp/ui/#iso:std:iso:8253:-1:ed-2:v1:en (accessed 12 Apr 2023).
documentation:cdm:single. -page [Observational Health Data Sciences and Informatics]. https://www.ohdsi.org/web/wiki/doku.php?id=documentation:cdm:single-page (accessed 12 Apr 2023).
Download references
NIHR BRC colleagues at University of Manchester, University College London and University of Nottingham.
Not applicable.
Authors and affiliations.
NIHR University College London Hospitals Biomedical Research Centre, London, UK
Nishchay Mehta, Baptiste Briot Ribeyre, Lilia Dimitrov, Louise J. English, Nikhil Joshi, Luis Romao, Anne Gm Schilder & Talisa Ross
Royal National ENT Hospital, UCLH Trust, London, UK
Nishchay Mehta, Lilia Dimitrov, Nikhil Joshi, Anne Gm Schilder & Talisa Ross
Institute of Health Informatics, Faculty of Population Health Sciences, University College London, London, UK
Baptiste Briot Ribeyre, Louise J. English & Luis Romao
NIHR Health Informatics Collaborative Hearing Health, Patient and Public Engagement Group, London, UK
Colleen Ewart
NIHR Manchester Biomedical Research Centre, Manchester, UK
Antje Heinrich, Kevin J. Munro, Ruth Norris & George Tilston
Manchester Centre for Audiology and Deafness (ManCAD), School of Health Sciences, The University of Manchester, Manchester, UK
Antje Heinrich & Kevin J. Munro
Nottingham Audiology Services, Nottingham University Hospitals, Nottingham, UK
Talisa Ross
Oxford University Hospitals NHS Foundation Trust: Oxford, Oxfordshire, UK
Gail Roadknight
NIHR Biomedical Research Centre, School of Medicine, University of Nottingham, Nottingham, UK
Ruth V. Spriggs
Mental Health and Clinical Neurosciences, School of Medicine, University of Nottingham, Nottingham, UK
Centre for Health Informatics, School of Health Sciences, The University of Manchester, Manchester, UK
Ruth Norris & George Tilston
You can also search for this author in PubMed Google Scholar
Planning – NM, BBR, LD, LJE, CE, AH, NJ, KM, GR, LR, AS, RVS, RN, TR, GTConception and design– NM, BBR, LD, LJE, CE, AH, NJ, KM, GR, LR, AS, RVS, RN, TR, GTAcquisition of data– NM, BBR, LD, LJE, CE, AH, NJ, KM, GR, LR, AS, RVS, RN, TR, GTAnalysis– NM, BBR, LD, LJE, CE, AH, NJ, KM, GR, LR, AS, RVS, RN, TR, GTWrite up– NM, BBR, LD, LJE, CE, AH, NJ, KM, GR, LR, AS, RVS, RN, TR, GT.
Correspondence to Talisa Ross .
Ethics approval and consent to participate.
The protocol for the collection and management of data was approved by Central Bristol Research Ethics Committee (Reference Number: 21/SW/0139) and carried out according to the Declaration of Helsinki. The lack of need for informed consent was confirmed by the ethics committee due to the lack of identifiable patient data being included.
Competing interests.
The authors declare no competing interests.
Publisher’s note.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Below is the link to the electronic supplementary material.
Rights and permissions.
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it.The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/ .
Reprints and permissions
Cite this article.
Mehta, N., Ribeyre, B.B., Dimitrov, L. et al. Creating a health informatics data resource for hearing health research. BMC Med Inform Decis Mak 24 , 209 (2024). https://doi.org/10.1186/s12911-024-02589-x
Download citation
Received : 19 November 2023
Accepted : 25 June 2024
Published : 29 July 2024
DOI : https://doi.org/10.1186/s12911-024-02589-x
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
ISSN: 1472-6947
Event sponsored by:
IMAGES
VIDEO
COMMENTS
Introduction. Electronic health record systems (EHRs) have become an integral part of healthcare within the past decade. As of 2015, 87% of all office-based providers in the United States had an EHR. 1 Adoption within Non-Federal Acute Care Hospitals is almost universal, with over 96% possessing a certified EHR. 2 While clinicians recognize the benefits of EHR usage 3-5, they have concerns ...
Background. Clinical documentation is the process of generating a structured record of a patient's medical care. This record of medical information assists in optimising patient care, enables communication between health professionals involved in the patient's care, supplies data for audits and research as well as serves as potential evidence for medico-legal cases.
Electronic Health Records (EHRs) are systemized collections of patient health information and documentation, collected in real-time, and stored in a digital format [ 1 ]. EHRs were originally designed to facilitate clinical decision-making regarding health care delivery for individual patients, and to improve the quality of care. EHRs have seen ...
Although regulations and guidelines define both source documentation and the medical record, integration of research documentation in the electronic health record is not clearly defined. At minimum, the signed informed consent (s), investigational drug or device usage, and research team contact information should be documented within the ...
The electronic medical record (EMR) and electronic health record (EHR) are platforms utilized within both clinical care and clinical research settings to collect data ultimately used by a clinical ...
Electronic health record systems have become common in the workplace. Research has shown that the EHR can increase efficiency, improve patient safety, and facilitate improved access to patient records [4, 5].Spending time up front to customize your EHR platform can streamline your workflow and allow you to more efficiently document your clinical care [].
Electronic patient records (EPR) have been gradually introduced to replace patient's paper notes with the aim of providing a more reliable record-keeping system. It is perceived that EPR improve the quality and efficiency of patient care. The paediatric department at Queen's Hospital Burton uses a mix of paper notes and computerised medical ...
INTRODUCTION. Rapid adoption of electronic health records (EHRs) following the passage of the Health Information Technology for Economic and Clinical Health (HITECH) Act has led to advances in both individual- and population-level health. 1 HITECH has improved healthcare quality, patient safety, and diagnostic accuracy through enhanced data management and timely reuse; interoperable systems ...
Clinical documentation has been defined as "information that is recorded about a person's care. The primary purpose of clinical documentation is to facilitate, safe, high quality and continuous care. . .and is stored within a health record" (Australian Commission on Safety and Quality in Healthcare, 2023).It needs to accurately reflect clinical events and decision-making for purposes of ...
Clinical documentation has dramatically changed since the implementation and use of electronic health records and electronic provider documentation. The purpose of this report is to review these changes and promote the development of standards and best practices for electronic documentation for pediatric patients. In this report, we evaluate the unique aspects of clinical documentation for ...
The electronic medical record (EMR) and electronic health record (EHR) are platforms utilized within both clinical care and clinical research settings to collect data ultimately used by a clinical research information system (CRIS) to support clinical research. This chapter will review the concepts of the EMR, EHR, CRIS; the architecture of an ...
To date, symptom documentation has mostly relied on clinical notes in electronic health records or patient-reported outcomes using disease-specific symptom inventories.
Print. Docket Number: FDA-2016-D-1224. Issued by: Center for Drug Evaluation and Research. Center for Biologics Evaluation and Research. Center for Devices and Radiological Health. Procedural ...
SCOPE. The recommendations outlined in this guidance apply to the use of EHR data in: Prospective clinical investigations of human drugs and biological products, medical devices, and combination ...
Takeaways: If not used properly, the electronic health record (EHR) can create communication gaps. The nursing process can be applied to electronic documentation to avoid workarounds and close gaps in communication. Effective use the EHR can improve patient safety and care outcomes. Clinical documentation supports patient care, improves ...
1. Introduction. In the early 1990s, a trend in the shift from paper-based health records to electronic records started; this was in response to advances in technology as well as the advocacy of the Institute of Medicine in the United States [1,2].As a result of the inadequacies of paper-based health records gradually becoming evident to the healthcare industry [], electronic records have ...
Sample view of an electronic health record. An electronic health record (EHR) is the systematized collection of patient and population electronically stored health information in a digital format. These records can be shared across different health care settings. Records are shared through network-connected, enterprise-wide information systems or other information networks and exchanges.
Background The enactment of the Health Information Technology for Economic and Clinical Health Act and the wide adoption of electronic health record (EHR) systems have ushered in increasing documentation burden, frequently cited as a key factor affecting the work experience of healthcare professionals and a contributor to burnout. This systematic review aims to identify and characterize ...
Improving clinical documentation: introduction of electronic health records in paediatrics Justin Koh, Mansoor Ahmed To cite: Koh J, Ahmed M. Improving clinical documentation: introduction of electronic health records in paediatrics. BMJ Open Quality 2021;10:e000918. doi:10.1136/ bmjoq-2020-000918 Received 4 February 2020 Revised 25 January 2021
The National Electronic Health Records Survey (NEHRS) is an annual survey that measures the progress U.S. physicians and their offices have made in adopting electronic health records (EHRs). The survey also monitors health information exchange, including interoperability, and the burden associated with EHRs.
Earlier this year, the technology was integrated into the Epic electronic health record system. As per studies cited by TGH, DAX Copilot slashes documentation time by 50%.
1. Accurate and complete clinical documentation. Proper documentation is crucial for value-based care reimbursement. Providers need to ensure that all relevant patient information, including diagnoses, procedures, and outcomes, are accurately and completely documented within electronic health records to support appropriate medical coding and ...
An electronic medical record (EMR) provides access to longitudinal patient data and clinical information in a timely and convenient manner, [] while allowing clinicians to prescribe, review and administer medications on a single digital platform, often with the assistance of clinical decision support.Although the use of EMR systems results in fewer medication errors, [] they can also create ...
Objective: Electronic health records (EHRs) are used for both clinical practice and research. Because mental health service users' views are underrepresented in perspectives on EHR use, the authors examined service users' awareness, attitudes, and opinions about EHR data storage and sharing. Methods: A mixed-methods, cross-sectional design was used to examine attitudes of 253 Norwegian ...
Background: Clinical coding allows for structured and standardised recording of data in patients electronic healthcare records. How clinical and non-clinical staff in general practice approach clinical coding is poorly understood. Aim: To explore primary care staff's experiences and views on clinical coding. Design and setting: Qualitative, semi-structured interview study with primary care ...
Abstract. Electronic health records (EHRs) provide opportunities to enhance patient care, embed performance measures in clinical practice, and facilitate clinical research. Concerns have been raised about the increasing recruitment challenges in trials, burdensome and obtrusive data collection, and uncertain generalizability of the results.
This guidance is intended to provide sponsors and other interested parties with considerations when proposing to use electronic health records (EHRs) or medical claims data in clinical studies to support a regulatory decision for effectiveness or safety. This guidance finalizes the draft guidance of the same title issued on September 30, 2021 ...
Importance: The use of large language models (LLMs) in medicine is increasing, with potential applications in electronic health records (EHR) to create patient cohorts or identify patients who meet clinical trial recruitment criteria. However, significant barriers remain, including the extensive computer resources required, lack of performance evaluation, and challenges in implementation.
The National Institute of Health and Social Care Research (NIHR) Health Informatics Collaborative (HIC) for Hearing Health has been established in the UK to curate routinely collected hearing health data to address research questions. This study defines priority research areas, outlines its aims, governance structure and demonstrates how hearing health data have been integrated into a common ...
Machine learning is also a popular topic in clinical medicine to implement analysis on electronic health records and medical image data, which traditional statistics model is not adequate for. However, we realize that machine learning is not panacea and its defects such as loss of interpretability and excess selection may restrict its application.