Resources


Database Credentialed Federated

MIMIC-CXR-JPG - chest radiographs with structured labels

Source: PhysioNet

The MIMIC Chest X-ray JPG (MIMIC-CXR-JPG) Database v2.0.0 is a large publicly available dataset of chest radiographs in JPG format with structured labels derived from free-text radiology reports. The MIMIC-CXR-JPG dataset is wholly derived from MIMI…

computer vision chest x-ray mimic deep learning radiology

Published: March 12, 2024. Version: 2.1.0 | DOI: 10.13026/jsn5-t979


Database Open Federated

CheXmask Database: a large-scale dataset of anatomical segmentation masks for chest x-ray images

Source: PhysioNet

The CheXmask Database presents a comprehensive, uniformly annotated collection of chest radiographs, constructed from five public databases: ChestX-ray8, CheXpert, MIMIC-CXR-JPG, PadChest and VinDr-CXR. The database aggregates 657,566 anatomical seg…

chest x-ray segmentation medical image segmentation automatic quality assessment

Published: March 1, 2024. Version: 0.4 | DOI: 10.13026/pgag-by42


Database Restricted Federated

OpenOximetry Repository

Source: PhysioNet

The OpenOximetry Repository is a structured database designed to store clinical and laboratory pulse oximetry data and allows for consolidation of data sets held by collaborating organizations. Matched or independent readings of oxygen saturations, …

Published: Feb. 27, 2024. Version: 1.0.0 | DOI: 10.13026/cc78-ad74


Database Credentialed Federated

EchoNotes Structured Database derived from MIMIC-III (ECHO-NOTE2NUM)

Source: PhysioNet

The EchoNotes Structured Database derived from MIMIC-III (ECHO-NOTE2NUM) is a structured echocardiogram database derived from 43,472 observational notes obtained during echocardiogram studies conducted in the intensive care unit at the Beth Israel D…

Published: Feb. 24, 2024. Version: 1.0.0 | DOI: 10.13026/xhrz-ht59


Database Credentialed Federated

MIMIC-IV on FHIR

Source: PhysioNet

Fast Healthcare Interoperability Resources (FHIR) has emerged as a robust standard for healthcare data exchange. To explore the use of FHIR for the process of data harmonization, we converted the Medical Information Mart for Intensive Care…

fhir electronic health record mimic-iv

Published: Feb. 20, 2024. Version: 1.0 | DOI: 10.13026/cqt2-0b27


Database Credentialed Federated

CHIFIR: Cytology and Histopathology Invasive Fungal Infection Reports

Source: PhysioNet

Surveillance of invasive fungal infection (IFI) in clinical settings is a laborious process requiring a detailed review of patient medical history. One of the key sources of clinical information is cytology and histopathology reports: pathologist-pr…

information extraction nlp clinical documentation invasive fungal infections

Published: Feb. 20, 2024. Version: 1.0.2 | DOI: 10.13026/m1rk-ns13


Database Credentialed Federated

CORAL: expert-Curated medical Oncology Reports to Advance Language model inference

Source: PhysioNet

Both medical care and observational studies in oncology require a thorough understanding of a patient's disease progression and treatment history, often elaborately documented within clinical notes. As large language models (LLMs) are becoming m…

oncology natural language processing information extraction artificial intelligence large language models electronic health records

Published: Feb. 7, 2024. Version: 1.0 | DOI: 10.13026/v69y-xa45


Database Open Federated

A Multi-Modal Satellite Imagery Dataset for Public Health Analysis in Colombia

Source: PhysioNet

We introduce a cost-effective public health analysis solution for low- and middle-income countries—the Multi-Modal Satellite Imagery Dataset in Colombia. By leveraging high-quality, spatiotemporally aligned satellite images and corresponding m…

satellite imagery multimodality

Published: Jan. 30, 2024. Version: 1.0.0 | DOI: 10.13026/xr5s-xe24


Database Credentialed Federated

RadCoref: Fine-tuning coreference resolution for different styles of clinical narratives

Source: PhysioNet

RadCoref is a small subset of MIMIC-CXR with manually annotated coreference mentions and clusters. The dataset is annotated by a panel of three cross-disciplinary experts with experience in clinical data processing following the i2b2 annotation sche…

natural language processing radiology coreference resolution

Published: Jan. 30, 2024. Version: 1.0.0 | DOI: 10.13026/z67q-xy65


Database Credentialed Federated

Annotation dataset of social determinants of health from MIMIC-III Clinical Care Database

Source: PhysioNet

Social determinants of health (SDoH) have an important impact on patient outcomes but are incompletely captured in electronic health records (EHRs). This study investigated the ability of large language models to extract SDoH from free text in E…

social determinants of health natural language processing

Published: Jan. 25, 2024. Version: 1.0.1 | DOI: 10.13026/zsgv-8w31