Resources


Database Credentialed Federated

CXR-PRO: MIMIC-CXR with Prior References Omitted

Source: Physionet

CXR-PRO is an adaptation of the MIMIC-CXR dataset that omits references to prior radiology reports. Consisting of 374,139 free-text radiology reports and associated chest radiographs, CXR-PRO addresses the issue of hallucinated references to priors …

generation large language models free-text radiology reports references to priors retrieval

Published: Nov. 23, 2022. Version: 1.0.0 | DOI: 10.13026/frag-yn96


Database Open Federated

PTB-XL, a large publicly available electrocardiography dataset

Source: Physionet

Electrocardiography (ECG) is a key diagnostic tool to assess the cardiac condition of a patient. Automatic ECG interpretation algorithms as diagnosis support systems promise large reliefs for the medical personnel - only on the basis of the number o…

ecg ptb-xl ptb electrocardiography

Published: Nov. 9, 2022. Version: 1.0.3 | DOI: 10.13026/kfzx-aw45


Database Credentialed Federated

GLOBEM Dataset: Multi-Year Datasets for Longitudinal Human Behavior Modeling Generalization

Source: Physionet

We present the first multi-year mobile sensing datasets. Our multi-year data collection studies span four years (10 weeks each year, from 2018 to 2021). The four datasets contain data collected from 705 person-years (497 unique participants) with di…

ubiquitous computing passive mobile sensing human behavior modeling well-being health

Published: Nov. 4, 2022. Version: 1.0 | DOI: 10.13026/jvtb-2d81


Database Open Federated

KINECAL

Source: Physionet

The field of human action recognition has made great strides in recent years, much helped by the availability of a wide variety of datasets that use Kinect to record human movement. Conversely, progress towards using Kinect in clinical practice has …

age-related changes falls-risk postural sway posturography balance clinical tests

Published: Oct. 31, 2022. Version: 1.0.1 | DOI: 10.13026/z037-6191


Database Credentialed Federated

Tasks 1 and 3 from Progress Note Understanding Suite of Tasks: SOAP Note Tagging and Problem List Summarization

Source: Physionet

Applying methods in natural language processing on electronic health records (EHR) data is a growing field. Existing corpus and annotation focus on modelling textual features and relation prediction [1] . However, there is a paucity of annotated cor…

Published: Sept. 30, 2022. Version: 1.0.0 | DOI: 10.13026/wks0-w041


Database Open Federated

VitalDB, a high-fidelity multi-parameter vital signs database in surgical patients

Source: Physionet

In modern anesthesia, multiple medical devices are used simultaneously to comprehensively monitor real-time vital signs to optimize patient care and improve surgical outcomes. However, interpreting the dynamic changes of time-series biosignals and t…

vitaldb waveform intraoperative ecg anesthesia biosignal

Published: Sept. 21, 2022. Version: 1.0.0 | DOI: 10.13026/czw8-9p62


Database Open Federated

Gesture Recognition and Biometrics ElectroMyogram (GRABMyo)

Source: Physionet

We present the Gesture Recognition and Biometrics ElectroMyogram (GRABMyo) dataset, an open-access dataset of electromyogram (EMG) recordings collected from the wrist and forearm muscles while performing hand gestures. Data were collected from 43 he…

Published: Sept. 21, 2022. Version: 1.0.1 | DOI: 10.13026/701k-gs64


Database Credentialed Federated

Nosocomial Risk Datasets from MIMIC-III

Source: Physionet

Reliable longitudinal risk prediction for hospitalized patients is needed to provide quality care. Our goal is to foster the development of generalizable models capable of leveraging clinical notes to predict healthcare-associated diseases 24–…

pressure injury natural language processing risk prediction acute kidney injury deep learning anemia forecasting

Published: Sept. 15, 2022. Version: 1.0 | DOI: 10.13026/pm60-0g49


Database Restricted Federated

Multimodal Physiological Monitoring During Virtual Reality Piloting Tasks

Source: Physionet

This dataset includes multimodal physiologic, flight performance, and user interaction data streams, collected as participants performed virtual flight tasks of varying difficulty. In virtual reality, individuals flew an "Instrument Landing Sys…

Published: Aug. 25, 2022. Version: 1.0.0 | DOI: 10.13026/azwa-ge48


Database Open Federated

A large scale 12-lead electrocardiogram database for arrhythmia study

Source: Physionet

This newly inaugurated research database for 12-lead electrocardiogram (ECG) signals was created under the auspices of Chapman University, Shaoxing People’s Hospital (Shaoxing Hospital Zhejiang University School of Medicine), and Ningbo First …

Published: Aug. 24, 2022. Version: 1.0.0 | DOI: 10.13026/wgex-er52