Health Data Nexus Index

Database Credentialed Federated

MIMIC-IV-Ext Clinical Decision Making: A MIMIC-IV Derived Dataset for Evaluation of Large Language Models on the Task of Clinical Decision Making for Abdominal Pathologies

Source: Physionet

Clinical decision making is one of the most impactful parts of a physician's responsibilities and stands to benefit greatly from AI solutions such as large language models (LLMs). However, while many datasets exist to test the performance of AI …

clinical decision making emergency room diagnosis abdominal pathologies large language models treatment plan

Published: May 17, 2024. Version: 1.0 | DOI: 10.13026/2pfq-5b68

Database Restricted Federated

DREAMT: Dataset for Real-time sleep stage EstimAtion using Multisensor wearable Technology

Source: Physionet

Sleep is an intrinsic part of human life, and recent advancements in wearable technology and machine learning have promised continuous and non-invasive methods of tracking sleep health and patterns, providing an important facet to a more holistic un…

wearable biomedical sleep disorders time series classification

Published: April 30, 2024. Version: 1.0.0 | DOI: 10.13026/62an-cb28

Database Credentialed Federated

Medical Expert Annotations of Unsupported Facts in Doctor-Written and LLM-Generated Patient Summaries

Source: Physionet

Large language models in healthcare can generate informative patient summaries while reducing the documentation workload of healthcare professionals. However, these models are prone to producing hallucinations, that is, generating unsupported inform…

Published: April 28, 2024. Version: 1.0.0 | DOI: 10.13026/a66y-aa53

Database Contributor Review Federated

CARMEN-I: A resource of anonymized electronic health records in Spanish and Catalan for training and testing NLP tools

Source: Physionet

The CARMEN-I corpus comprises 2,000 clinical records, encompassing discharge letters, referrals, and radiology reports from Hospital Clínic of Barcelona between March 2020 and March 2022. These reports, primarily in Spanish with some Catalan …

de-identification clinical ner anonymization

Published: April 20, 2024. Version: 1.0.1 | DOI: 10.13026/x7ed-9r91

Database Credentialed Federated

EHRNoteQA: A Patient-Specific Question Answering Benchmark for Evaluating Large Language Models in Clinical Settings

Source: Physionet

We introduce EHRNoteQA, a patient-specific question answering benchmark tailored for evaluating Large Language Models (LLMs) in clinical environments. Based on MIMIC-IV Electronic Health Record (EHR), a team of three medical professionals has curate…

Published: April 3, 2024. Version: 1.0.0 | DOI: 10.13026/kvca-f224

Database Open Federated

PADS - Parkinsons Disease Smartwatch dataset

Source: Physionet

Parkinson’s disease (PD) is the second-most common neurodegenerative disorder, while incidence and worldwide burden are further increasing. In the era of digital health transformation, smart devices and mobile sensors, including smartphones an…

movement disorders wearables parkinsons disease

Published: March 26, 2024. Version: 1.0.0 | DOI: 10.13026/m0w9-zx22

Database Open Federated

ScientISST MOVE: Annotated Wearable Multimodal Biosignals recorded during Everyday Life Activities in Naturalistic Environments

Source: Physionet

Existing datasets containing physiological data are mostly acquired at rest or in controlled scenarios. As a result, algorithms developed using such data may not perform as well as with biosignals acquired in dynamic and uncontrolled environments. S…

greet wearable multimodal lift uncontrolled environments run jump gesticulate walk

Published: March 26, 2024. Version: 1.0.1 | DOI: 10.13026/hyxq-r919

Database Credentialed Federated

RaDialog Instruct Dataset

Source: Physionet

Conversational AI tools that can generate and discuss clinically correct radiology reports for a given medical image have the potential to transform radiology. Such a human-in-the-loop radiology assistant could facilitate a collaborative diagnostic …

radiology report generation large vision-language models medical image understaning radiology assistant radiology chatbot

Published: March 26, 2024. Version: 1.0.0 | DOI: 10.13026/zecj-bh52

Database Open Federated

Respiratory and heart rate monitoring dataset from aeration study

Source: Physionet

A study was conducted to collect respiratory pressure and flow data for model-based assessment, alongside electrical impedance tomography (EIT) aeration, electrocardiogram (ECG), and heart-rate belt (HRB) data. A 20 subjects set was selected with no…

Published: March 20, 2024. Version: 1.0.0 | DOI: 10.13026/e4dt-f689

Database Restricted Federated

CheXchoNet: A Chest Radiograph Dataset with Gold Standard Echocardiography Labels

Source: Physionet

Existing chest radiograph datasets, such as CheXpert and ChestX-ray14, have driven the development of new machine learning approaches to achieve expert or near-expert level performance on a variety of tasks. The primary focus of models developed usi…

early detection heart failure cardiac structural abnormalties deep learning chest x-rays

Published: March 20, 2024. Version: 1.0.0 | DOI: 10.13026/kp08-ws25

Search

Resources

MIMIC-IV-Ext Clinical Decision Making: A MIMIC-IV Derived Dataset for Evaluation of Large Language Models on the Task of Clinical Decision Making for Abdominal Pathologies

DREAMT: Dataset for Real-time sleep stage EstimAtion using Multisensor wearable Technology

Medical Expert Annotations of Unsupported Facts in Doctor-Written and LLM-Generated Patient Summaries

CARMEN-I: A resource of anonymized electronic health records in Spanish and Catalan for training and testing NLP tools

EHRNoteQA: A Patient-Specific Question Answering Benchmark for Evaluating Large Language Models in Clinical Settings

PADS - Parkinsons Disease Smartwatch dataset

ScientISST MOVE: Annotated Wearable Multimodal Biosignals recorded during Everyday Life Activities in Naturalistic Environments

RaDialog Instruct Dataset

Respiratory and heart rate monitoring dataset from aeration study

CheXchoNet: A Chest Radiograph Dataset with Gold Standard Echocardiography Labels