Resources


Database Restricted Federated

MIMIC-IV-Ext-Apixaban-Trial-Criteria-Questions

Source: Physionet

Large-language models (LLMs) show promise for extracting information from clinical notes. Deploying these models at scale can be challenging due to high computational costs, regulatory constraints, and privacy concerns. To address these challenges, …

clinical q and a evaluation set clinical trial eligibility

Published: April 30, 2025. Version: 1.0.0 | DOI: 10.13026/4p6q-vb04


Database Credentialed Federated

Medical Expert Annotations of Unsupported Facts in Doctor-Written and LLM-Generated Patient Summaries

Source: Physionet

Large language models in healthcare can generate informative patient summaries while reducing the documentation workload of healthcare professionals. However, these models are prone to producing hallucinations, that is, generating unsupported inform…

Published: April 30, 2025. Version: 1.0.1 | DOI: 10.13026/gedc-j464


Database Restricted Federated

Swiss-Mammo: A physician-written, synthetic dataset of German mammography reports

Source: Physionet

This dataset, Swiss-Mammo, contains 28 manually constructed German mammography reports, each paired with an English translation. The reports are stratified across BI-RADS categories 0 through 6, with three reports per category. All reports were manu…

mammography radiology structured reporting bi-rads

Published: April 30, 2025. Version: 1.0.0 | DOI: 10.13026/mrg5-ja22


Database Restricted Federated

DREAMT: Dataset for Real-time sleep stage EstimAtion using Multisensor wearable Technology

Source: Physionet

Sleep is an intrinsic part of human life, and recent advancements in wearable technology and machine learning have promised continuous and non-invasive methods of tracking sleep health and patterns, providing an important facet to a more holistic un…

sleep disorders wearable biomedical time series classification

Published: April 30, 2025. Version: 2.1.0 | DOI: 10.13026/7r9r-7r24


Database Credentialed Federated

Bridge2AI-Voice: An ethically-sourced, diverse voice dataset linked to health information

Source: Physionet

The human voice contains complex acoustic markers which have been linked to important health conditions including dementia, mood disorders, and cancer. When viewed as a biomarker, voice is a promising characteristic to measure as it is simple to col…

bridge2ai voice

Published: April 16, 2025. Version: 2.0.0 | DOI: 10.13026/3xt6-rf05


Database Open Federated

SHDB-AF: a Japanese Holter ECG database of atrial fibrillation

Source: Physionet

Saitama Heart Database Atrial Fibrillation (SHDB-AF) is a novel open-sourced Holter ECG database from Japan, containing data from 122 unique subjects with paroxysmal atrial fibrillation. Among the 128 recordings, 98 contain raw ECG data with rhythm …

atrial fibrillation ecg holters

Published: April 16, 2025. Version: 1.0.1 | DOI: 10.13026/n6yq-fq90


Database Credentialed Federated

A Temporal Dataset for Respiratory Support in Critically Ill Patients

Source: Physionet

We present a temporal benchmark dataset for clinical respiratory intervention tasks in intensive care unit (ICU) patients, derived from the MIMIC-IV v2.2 dataset. The data consists of 50,920 adult ICU patients and includes 90-day hourly ventilation …

oberservational data time-series

Published: April 15, 2025. Version: 1.1.0 | DOI: 10.13026/wewp-sj67


Database Credentialed Federated

AIPatient KG: MIMIC-III and CORAL Electronic Health Records based Patient Knowledge Graph

Source: Physionet

This study integrates the MIMIC-III and CORAL electronic health records into knowledge graphs to enhance their utility for advanced medical analysis and decision-making. MIMIC-III contains comprehensive data from over 40,000 patients, while CORAL fo…

Published: April 15, 2025. Version: 1.0.0 | DOI: 10.13026/vjrq-9328


Challenge Credentialed Federated

ArchEHR-QA: BioNLP at ACL 2025 Shared Task on Grounded Electronic Health Record Question Answering

Source: Physionet

Responding to patients’ medical inbox messages through patient portals is increasingly a contributor to clinician burden. To this end, automatically generating answers to questions from patients considering their medical records is important. …

electronic health record clinicians patient portals question answering

Published: April 11, 2025. Version: 1.2 | DOI: 10.13026/nrhg-f267


Database Restricted Federated

Electrocardiogram-Capable Smartwatches: Assessing Their Clinical Accuracy and Application

Source: Physionet

Wearable technology has progressed significantly so that today's smartwatches are able to offer advanced health monitoring functions such as electrocardiogram (ECG). This study contains data that can be used to clinically assess the accuracy of four…

ischemia fitbit sense ambulatory samsung galaxy watch st-segment apple watch withings scanwatch smartwatch

Published: April 9, 2025. Version: 1.0.0 | DOI: 10.13026/7018-y383