Resources
Database Restricted Federated
OpenOximetry Repository
The OpenOximetry Repository is a structured database designed to store clinical and laboratory pulse oximetry data and to consolidate data sets held by collaborating organizations. Matched or independent readings of oxygen saturations, …
Published: Feb. 27, 2024. Version: 1.0.0 | DOI: 10.13026/cc78-ad74
Database Credentialed Federated
EchoNotes Structured Database derived from MIMIC-III (ECHO-NOTE2NUM)
The EchoNotes Structured Database derived from MIMIC-III (ECHO-NOTE2NUM) is a structured echocardiogram database derived from 43,472 observational notes obtained during echocardiogram studies conducted in the intensive care unit at the Beth Israel D…
Published: Feb. 24, 2024. Version: 1.0.0 | DOI: 10.13026/xhrz-ht59
Database Credentialed Federated
MIMIC-IV on FHIR
Fast Healthcare Interoperability Resources (FHIR) has emerged as a robust standard for healthcare data exchange. To explore the use of FHIR for the process of data harmonization, we converted the Medical Information Mart for Intensive Care…
fhir, electronic health record, mimic-iv
Published: Feb. 20, 2024. Version: 1.0 | DOI: 10.13026/cqt2-0b27
Database Credentialed Federated
CHIFIR: Cytology and Histopathology Invasive Fungal Infection Reports
Surveillance of invasive fungal infection (IFI) in clinical settings is a laborious process requiring a detailed review of patient medical history. One of the key sources of clinical information is cytology and histopathology reports: pathologist-pr…
information extraction, nlp, clinical documentation, invasive fungal infections
Published: Feb. 20, 2024. Version: 1.0.2 | DOI: 10.13026/m1rk-ns13
Software Open Federated
Software for computing Heart Rate Fragmentation
Heart rate fragmentation (HRF) is a new method for assessing neuroautonomic integrity based on the analysis of short-term (high-frequency [HF]) heart rate dynamics. The code (in AWK) provided here is for the computation of three different metrics, P…
cardiovascular disease, vagal tone, time series analysis, prediction of atrial fibrillation, cardiac autonomic function, prediction of cardiovascular events, prediction of cognitive decline, heart rate variability, heart rate fragmentation, aging
Published: Feb. 14, 2024. Version: 1.0.0 | DOI: 10.13026/0mzj-gn98
Database Credentialed Federated
CORAL: expert-Curated medical Oncology Reports to Advance Language model inference
Both medical care and observational studies in oncology require a thorough understanding of a patient's disease progression and treatment history, often elaborately documented within clinical notes. As large language models (LLMs) are becoming m…
oncology, natural language processing, information extraction, artificial intelligence, large language models, electronic health records
Published: Feb. 7, 2024. Version: 1.0 | DOI: 10.13026/v69y-xa45
Database Open Federated
A Multi-Modal Satellite Imagery Dataset for Public Health Analysis in Colombia
We introduce a cost-effective public health analysis solution for low- and middle-income countries—the Multi-Modal Satellite Imagery Dataset in Colombia. By leveraging high-quality, spatiotemporally aligned satellite images and corresponding m…
satellite imagery, multimodality
Published: Jan. 30, 2024. Version: 1.0.0 | DOI: 10.13026/xr5s-xe24
Model Credentialed Federated
Asclepius-R : Clinical Large Language Model Built On MIMIC-III Discharge Summaries
The development of large language models tailored for handling patients’ clinical notes is often hindered by the limited accessibility and usability of these notes due to strict privacy regulations. To address these challenges, we first create…
synthetic notes, large language model, asclepius, synthetic clinical notes, llm, open-source, clinical notes, clinical llm
Published: Jan. 30, 2024. Version: 1.0.1 | DOI: 10.13026/s5rz-1j65
Database Credentialed Federated
RadCoref: Fine-tuning coreference resolution for different styles of clinical narratives
RadCoref is a small subset of MIMIC-CXR with manually annotated coreference mentions and clusters. The dataset is annotated by a panel of three cross-disciplinary experts with experience in clinical data processing following the i2b2 annotation sche…
natural language processing, radiology, coreference resolution
Published: Jan. 30, 2024. Version: 1.0.0 | DOI: 10.13026/z67q-xy65
Database Credentialed Federated
Annotation dataset of social determinants of health from MIMIC-III Clinical Care Database
Social determinants of health (SDoH) have an important impact on patient outcomes but are incompletely captured in electronic health records (EHR). This study investigated the ability of large language models to extract SDoH from free text in E…
social determinants of health, natural language processing
Published: Jan. 25, 2024. Version: 1.0.1 | DOI: 10.13026/zsgv-8w31