Resources
Database Credentialed Federated
EHRCon: Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records
Electronic Health Records (EHRs) are integral for storing comprehensive patient medical records, combining structured data (e.g., medications) with detailed clinical notes (e.g., physician notes). These elements are essential for straightforward dat…
Published: Aug. 19, 2024. Version: 1.0.0 | DOI: 10.13026/2nb5-nf74
Database Credentialed Federated
ReXPref-Prior: A MIMIC-CXR Preference Dataset for Reducing Hallucinated Prior Exams in Radiology Report Generation
Generative vision-language models have exciting potential implications for radiology report generation, but unfortunately such models are also known to produce hallucinations and other nonsensical statements. For example, radiology report generation…
hallucination reinforcement learning chest x-rays
Published: Aug. 14, 2024. Version: 1.0.0 | DOI: 10.13026/t13x-4r94
Database Credentialed Federated
A Brazilian Multilabel Ophthalmological Dataset (BRSET)
The Brazilian Multilabel Ophthalmological Dataset (BRSET) is a multi-labeled ophthalmological dataset designed to improve scientific community development and validate machine learning models. In ophthalmology, ancillary exams support medical decisi…
dataset retina ophthalmology
Published: Aug. 14, 2024. Version: 1.0.1 | DOI: 10.13026/1pht-2b69
Database Credentialed Federated
INSPIRE, a publicly available research dataset for perioperative medicine
We present the INSPIRE dataset, a publicly available research dataset in perioperative medicine, which includes approximately 130,000 cases (50% of all surgical cases) who underwent anesthesia for surgery at an academic institution in South Korea be…
surgery open dataset multi-center perioperative medicine
Published: Aug. 12, 2024. Version: 1.3 | DOI: 10.13026/46m4-f655
Database Open Federated
SHDB-AF: a Japanese Holter ECG database of atrial fibrillation
Saitama Heart Database Atrial Fibrillation (SHDB-AF) is a novel open-sourced Holter ECG database from Japan, containing data from 100 unique patients with paroxysmal atrial fibrillation. The dataset contains raw ECG recordings with manually annotate…
ecg atrial fibrillation holters
Published: Aug. 12, 2024. Version: 1.0.0 | DOI: 10.13026/10mk-y852
Database Credentialed Federated
RadGraph2: Tracking Findings Over Time in Radiology Reports
RadGraph2 is a dataset of 800 chest radiology reports annotated using a fine-grained entity-relationship schema, which is an expanded version of the previously introduced RadGraph dataset. In contrast with the previous approaches and the original Ra…
chest x-rays radiology reports disease progression relation extraction named entity recognition information extraction
Published: Aug. 8, 2024. Version: 1.0.0 | DOI: 10.13026/q65y-9688
Database Credentialed Federated
MIMIC-IV-ECG-Ext-ICD: Diagnostic labels for MIMIC-IV-ECG
The number of publicly available ECG datasets has increased tremendously in the past few years and several of these datasets have developed into widely used benchmarking datasets. However, most of them exhibit a common limitation, namely the relianc…
electrocardiography machine learning mimic
Published: July 30, 2024. Version: 1.0.0 | DOI: 10.13026/hdyc-1h77
Database Restricted Federated
OpenOximetry Repository
The OpenOximetry Repository is a structured database designed to store clinical and laboratory pulse oximetry data and allows for consolidation of data sets held by collaborating organizations. Matched or independent readings of oxygen saturations, …
Published: July 30, 2024. Version: 1.0.1 | DOI: 10.13026/2g7z-t345
Database Credentialed Federated
MIMIC-CXR Database
The MIMIC Chest X-ray (MIMIC-CXR) Database v2.0.0 is a large publicly available dataset of chest radiographs in DICOM format with free-text radiology reports. The dataset contains 377,110 images corresponding to 227,835 radiographic studies per…
machine learning mimic computer vision chest x-rays radiology natural language processing
Published: July 23, 2024. Version: 2.1.0 | DOI: 10.13026/4jqj-jw95
Database Credentialed Federated
MIMIC-IV
Retrospectively collected medical data has the opportunity to improve patient care through knowledge discovery and algorithm development. Broad reuse of medical data is desirable for the greatest public good, but data sharing must be done in a manne…
mimic critical care intensive care unit machine learning
Published: July 23, 2024. Version: 3.0 | DOI: 10.13026/hxp0-hg59