Resources


Database Restricted Federated

OpenOximetry Repository

Source: Physionet

The OpenOximetry Repository is a structured database designed to store clinical and laboratory pulse oximetry data and allows for consolidation of data sets held by collaborating organizations. Matched or independent readings of oxygen saturations, …

Published: Feb. 27, 2024. Version: 1.0.0 | DOI: 10.13026/cc78-ad74


Database Credentialed Federated

EchoNotes Structured Database derived from MIMIC-III (ECHO-NOTE2NUM)

Source: Physionet

The EchoNotes Structured Database derived from MIMIC-III (ECHO-NOTE2NUM) is a structured echocardiogram database derived from 43,472 observational notes obtained during echocardiogram studies conducted in the intensive care unit at the Beth Israel D…

Published: Feb. 24, 2024. Version: 1.0.0 | DOI: 10.13026/xhrz-ht59


Database Credentialed Federated

MIMIC-IV on FHIR

Source: Physionet

Fast Healthcare Interoperability Resources (FHIR) has emerged as a robust standard for healthcare data exchange. To explore the use of FHIR for the process of data harmonization, we converted the Medical Information Mart for Intensive Care…

fhir electronic health record mimic-iv

Published: Feb. 20, 2024. Version: 1.0 | DOI: 10.13026/cqt2-0b27


Database Credentialed Federated

CHIFIR: Cytology and Histopathology Invasive Fungal Infection Reports

Source: Physionet

Surveillance of invasive fungal infection (IFI) in clinical settings is a laborious process requiring a detailed review of patient medical history. One of the key sources of clinical information is cytology and histopathology reports: pathologist-pr…

information extraction nlp clinical documentation invasive fungal infections

Published: Feb. 20, 2024. Version: 1.0.2 | DOI: 10.13026/m1rk-ns13


Software Open Federated

Software for computing Heart Rate Fragmentation

Source: Physionet

Heart rate fragmentation (HRF) is a new method for assessing neuroautonomic integrity based on the analysis of short-term (high-frequency [HF]) heart rate dynamics. The code (in AWK) provided here is for the computation of three different metrics, P…

cardiovascular disease vagal tone time series analysis prediction of atrial fibrillation cardiac autonomic function prediction of cardiovascular events prediction of cognitive decline heart rate variability heart rate fragmentation aging

Published: Feb. 14, 2024. Version: 1.0.0 | DOI: 10.13026/0mzj-gn98


Database Credentialed Federated

CORAL: expert-Curated medical Oncology Reports to Advance Language model inference

Source: Physionet

Both medical care and observational studies in oncology require a thorough understanding of a patient's disease progression and treatment history, often elaborately documented within clinical notes. As large language models (LLMs) are becoming m…

oncology natural language processing information extraction artificial intelligence large language models electronic health records

Published: Feb. 7, 2024. Version: 1.0 | DOI: 10.13026/v69y-xa45


Database Open Federated

A Multi-Modal Satellite Imagery Dataset for Public Health Analysis in Colombia

Source: Physionet

We introduce a cost-effective public health analysis solution for low- and middle-income countries—the Multi-Modal Satellite Imagery Dataset in Colombia. By leveraging high-quality, spatiotemporally aligned satellite images and corresponding m…

satellite imagery multimodality

Published: Jan. 30, 2024. Version: 1.0.0 | DOI: 10.13026/xr5s-xe24


Model Credentialed Federated

Asclepius-R : Clinical Large Language Model Built On MIMIC-III Discharge Summaries

Source: Physionet

The development of large language models tailored for handling patients’ clinical notes is often hindered by the limited accessibility and usability of these notes due to strict privacy regulations. To address these challenges, we first create…

synthetic notes large language model asclepius synthetic clinical notes llm open-source clinical notes clinical llm

Published: Jan. 30, 2024. Version: 1.0.1 | DOI: 10.13026/s5rz-1j65


Database Credentialed Federated

RadCoref: Fine-tuning coreference resolution for different styles of clinical narratives

Source: Physionet

RadCoref is a small subset of MIMIC-CXR with manually annotated coreference mentions and clusters. The dataset is annotated by a panel of three cross-disciplinary experts with experience in clinical data processing following the i2b2 annotation sche…

natural language processing radiology coreference resolution

Published: Jan. 30, 2024. Version: 1.0.0 | DOI: 10.13026/z67q-xy65


Database Credentialed Federated

Annotation dataset of social determinants of health from MIMIC-III Clinical Care Database

Source: Physionet

Social determinants of health (SDoH) have an important impact on patient outcomes but are incompletely collected from the electronic health records (EHR). This study researched the ability of large language models to extract SDoH from free text in E…

social determinants of health natural language processing

Published: Jan. 25, 2024. Version: 1.0.1 | DOI: 10.13026/zsgv-8w31