Resources


Database Credentialed Federated

Annotation dataset of social determinants of health from MIMIC-III Clinical Care Database

Source: Physionet

Social determinants of health (SDoH) have an important impact on patient outcomes but are incompletely captured in electronic health records (EHR). This study examined the ability of large language models to extract SDoH from free text in E…

social determinants of health natural language processing

Published: Jan. 25, 2024. Version: 1.0.1 | DOI: 10.13026/zsgv-8w31


Database Open Federated

A Comprehensive Dataset of Pattern Electroretinograms for Ocular Electrophysiology Research: The PERG-IOBA Dataset

Source: Physionet

The pattern electroretinogram (PERG) is a valuable tool in ophthalmic electrophysiology, offering a non-invasive and objective method to evaluate central retinal function. By measuring electrical activity in the macula and retinal ganglion cells, PE…

Published: Jan. 19, 2024. Version: 1.0.0 | DOI: 10.13026/d24m-w054


Database Credentialed Federated

ODD: A Benchmark Dataset for the NLP-based Opioid Related Aberrant Behavior Detection

Source: Physionet

Opioid related aberrant behaviors (ORAB) present novel risk factors for opioid overdose. Previously, ORAB have been mainly assessed by survey results and by monitoring drug administrations. Such methods, however, cannot scale up and do not cover the …

natural language processing substance use opioid related aberrant behavior

Published: Jan. 11, 2024. Version: 1.0.0 | DOI: 10.13026/qrje-z188


Database Credentialed Federated

EHR-DS-QA: A Synthetic QA Dataset Derived from Medical Discharge Summaries for Enhanced Medical Information Retrieval Systems

Source: Physionet

This dataset was designed and created to enable advancements in healthcare-focused large language models, particularly in the context of retrieval-augmented clinical question-answering capabilities. Developed using a self-constructed pipeline based …

mimic-iv large language models clinical question-answering medical discharge summaries

Published: Jan. 11, 2024. Version: 1.0.0 | DOI: 10.13026/25fx-f706


Database Open Federated

CheXmask Database: a large-scale dataset of anatomical segmentation masks for chest x-ray images

Source: Physionet

The CheXmask Database presents a comprehensive, uniformly annotated collection of chest radiographs, constructed from six public databases: CANDID-PTX, ChestX-ray8, Chexpert, MIMIC-CXR-JPG, Padchest and VinDr-CXR. The database aggregates 676,803 ana…

chest x-ray segmentation medical image segmentation automatic quality assessment

Published: Jan. 9, 2024. Version: 0.3 | DOI: 10.13026/nv4g-fr21


Database Open Federated

Surface electromyographic signals collected during long-lasting ground walking of young able-bodied subjects

Source: Physionet

The present dataset is composed of long-lasting (around 5 minutes) surface electromyographic (sEMG) signals recorded between 2011 and 2018 during ground walking of 31 young (20 years < age < 30 years) able-bodied subjects in the Movement Analysis…

biomedical signals muscle recruitment surface emg signal walking gait analysis

Published: Jan. 9, 2024. Version: 1.0.1 | DOI: 10.13026/grdj-qx06


Database Open Federated

Patient-level dataset to study the effect of COVID-19 in people with Multiple Sclerosis

Source: Physionet

Multiple Sclerosis (MS) is an inflammatory autoimmune disease of the central nervous system, causing increased vulnerability to infections and disability among young adults. Ever since the coronavirus disease 2019 (COVID-19) outbreak, caused by seve…

Published: Jan. 2, 2024. Version: 1.0.1 | DOI: 10.13026/77ta-1866


Database Credentialed Federated

INSPIRE, a publicly available research dataset for perioperative medicine

Source: Physionet

We present the INSPIRE dataset, a publicly available research dataset in perioperative medicine, which includes approximately 130,000 cases (50% of all surgical cases) who underwent anesthesia for surgery at an academic institution in South Korea be…

multi-center surgery perioperative medicine open dataset

Published: Dec. 28, 2023. Version: 1.2 | DOI: 10.13026/4evs-wq50


Database Open Federated

Open Access Dataset and Toolbox of High-Density Surface Electromyogram Recordings

Source: Physionet

We provide an open access dataset of High densitY Surface Electromyogram (HD-sEMG) Recordings (named "Hyser"). We acquired data from 20 subjects with each subject participating in our experiment twice on separate days following the same ex…

Published: Dec. 28, 2023. Version: 2.0.0 | DOI: 10.13026/hxan-pe94


Challenge Credentialed Federated

SNOMED CT Entity Linking Challenge

Source: Physionet

This challenge, sponsored by SNOMED International, seeks to advance the development of Entity Linking models that operate on unstructured clinical texts. Participants in the challenge will train entity linking models using a subset of MIMIC-IV-Note …

snomed entity linking clinical annotation

Published: Dec. 19, 2023. Version: 1.0.0 | DOI: 10.13026/s48e-sp45