Resources


Database Credentialed Federated

EHRCon: Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records

Source: PhysioNet

Electronic Health Records (EHRs) are integral for storing comprehensive patient medical records, combining structured data (e.g., medications) with detailed clinical notes (e.g., physician notes). These elements are essential for straightforward dat…

Published: Aug. 19, 2024. Version: 1.0.0 | DOI: 10.13026/2nb5-nf74


Database Credentialed Federated

ReXPref-Prior: A MIMIC-CXR Preference Dataset for Reducing Hallucinated Prior Exams in Radiology Report Generation

Source: PhysioNet

Generative vision-language models have exciting potential implications for radiology report generation, but unfortunately such models are also known to produce hallucinations and other nonsensical statements. For example, radiology report generation…

hallucination, reinforcement learning, chest x-rays

Published: Aug. 14, 2024. Version: 1.0.0 | DOI: 10.13026/t13x-4r94


Database Credentialed Federated

A Brazilian Multilabel Ophthalmological Dataset (BRSET)

Source: PhysioNet

The Brazilian Multilabel Ophthalmological Dataset (BRSET) is a multi-labeled ophthalmological dataset designed to improve scientific community development and validate machine learning models. In ophthalmology, ancillary exams support medical decisi…

dataset, retina, ophthalmology

Published: Aug. 14, 2024. Version: 1.0.1 | DOI: 10.13026/1pht-2b69


Database Credentialed Federated

INSPIRE, a publicly available research dataset for perioperative medicine

Source: PhysioNet

We present the INSPIRE dataset, a publicly available research dataset in perioperative medicine, which includes approximately 130,000 cases (50% of all surgical cases) who underwent anesthesia for surgery at an academic institution in South Korea be…

surgery, open dataset, multi-center, perioperative medicine

Published: Aug. 12, 2024. Version: 1.3 | DOI: 10.13026/46m4-f655


Database Open Federated

SHDB-AF: a Japanese Holter ECG database of atrial fibrillation

Source: PhysioNet

Saitama Heart Database Atrial Fibrillation (SHDB-AF) is a novel open-sourced Holter ECG database from Japan, containing data from 100 unique patients with paroxysmal atrial fibrillation. The dataset contains raw ECG recordings with manually annotate…

ecg, atrial fibrillation, holters

Published: Aug. 12, 2024. Version: 1.0.0 | DOI: 10.13026/10mk-y852


Database Credentialed Federated

RadGraph2: Tracking Findings Over Time in Radiology Reports

Source: PhysioNet

RadGraph2 is a dataset of 800 chest radiology reports annotated using a fine-grained entity-relationship schema, which is an expanded version of the previously introduced RadGraph dataset. In contrast with the previous approaches and the original Ra…

chest x-rays, radiology reports, disease progression, relation extraction, named entity recognition, information extraction

Published: Aug. 8, 2024. Version: 1.0.0 | DOI: 10.13026/q65y-9688


Database Credentialed Federated

MIMIC-IV-ECG-Ext-ICD: Diagnostic labels for MIMIC-IV-ECG

Source: PhysioNet

The number of publicly available ECG datasets has increased tremendously in the past few years and several of these datasets have developed into widely used benchmarking datasets. However, most of them exhibit a common limitation, namely the relianc…

electrocardiography, machine learning, mimic

Published: July 30, 2024. Version: 1.0.0 | DOI: 10.13026/hdyc-1h77


Database Restricted Federated

OpenOximetry Repository

Source: PhysioNet

The OpenOximetry Repository is a structured database designed to store clinical and laboratory pulse oximetry data and allows for consolidation of data sets held by collaborating organizations. Matched or independent readings of oxygen saturations, …

Published: July 30, 2024. Version: 1.0.1 | DOI: 10.13026/2g7z-t345


Database Credentialed Federated

MIMIC-CXR Database

Source: PhysioNet

The MIMIC Chest X-ray (MIMIC-CXR) Database v2.0.0 is a large publicly available dataset of chest radiographs in DICOM format with free-text radiology reports. The dataset contains 377,110 images corresponding to 227,835 radiographic studies per…

machine learning, mimic, computer vision, chest x-rays, radiology, natural language processing

Published: July 23, 2024. Version: 2.1.0 | DOI: 10.13026/4jqj-jw95


Database Credentialed Federated

MIMIC-IV

Source: PhysioNet

Retrospectively collected medical data has the opportunity to improve patient care through knowledge discovery and algorithm development. Broad reuse of medical data is desirable for the greatest public good, but data sharing must be done in a manne…

mimic, critical care, intensive care unit, machine learning

Published: July 23, 2024. Version: 3.0 | DOI: 10.13026/hxp0-hg59