Resources
Database Credentialed Federated
MIMIC-IV-Ext Clinical Decision Making: A MIMIC-IV Derived Dataset for Evaluation of Large Language Models on the Task of Clinical Decision Making for Abdominal Pathologies
Clinical decision making is one of the most impactful parts of a physician's responsibilities and stands to benefit greatly from AI solutions such as large language models (LLMs). However, while many datasets exist to test the performance of AI …
clinical decision making emergency room diagnosis abdominal pathologies large language models treatment plan
Published: May 17, 2024. Version: 1.0 | DOI: 10.13026/2pfq-5b68
Database Restricted Federated
DREAMT: Dataset for Real-time sleep stage EstimAtion using Multisensor wearable Technology
Sleep is an intrinsic part of human life, and recent advancements in wearable technology and machine learning have promised continuous and non-invasive methods of tracking sleep health and patterns, providing an important facet to a more holistic un…
wearable biomedical sleep disorders time series classification
Published: April 30, 2024. Version: 1.0.0 | DOI: 10.13026/62an-cb28
Database Credentialed Federated
Medical Expert Annotations of Unsupported Facts in Doctor-Written and LLM-Generated Patient Summaries
Large language models in healthcare can generate informative patient summaries while reducing the documentation workload of healthcare professionals. However, these models are prone to producing hallucinations, that is, generating unsupported inform…
Published: April 28, 2024. Version: 1.0.0 | DOI: 10.13026/a66y-aa53
Database Contributor Review Federated
CARMEN-I: A resource of anonymized electronic health records in Spanish and Catalan for training and testing NLP tools
The CARMEN-I corpus comprises 2,000 clinical records, encompassing discharge letters, referrals, and radiology reports from Hospital Clínic of Barcelona between March 2020 and March 2022. These reports, primarily in Spanish with some Catalan …
de-identification clinical ner anonymization
Published: April 20, 2024. Version: 1.0.1 | DOI: 10.13026/x7ed-9r91
Database Credentialed Federated
EHRNoteQA: A Patient-Specific Question Answering Benchmark for Evaluating Large Language Models in Clinical Settings
We introduce EHRNoteQA, a patient-specific question answering benchmark tailored for evaluating Large Language Models (LLMs) in clinical environments. Based on MIMIC-IV Electronic Health Record (EHR), a team of three medical professionals has curate…
Published: April 3, 2024. Version: 1.0.0 | DOI: 10.13026/kvca-f224
Database Open Federated
PADS - Parkinsons Disease Smartwatch dataset
Parkinson’s disease (PD) is the second-most common neurodegenerative disorder, and its incidence and worldwide burden continue to increase. In the era of digital health transformation, smart devices and mobile sensors, including smartphones an…
movement disorders wearables parkinsons disease
Published: March 26, 2024. Version: 1.0.0 | DOI: 10.13026/m0w9-zx22
Database Open Federated
ScientISST MOVE: Annotated Wearable Multimodal Biosignals recorded during Everyday Life Activities in Naturalistic Environments
Existing datasets containing physiological data are mostly acquired at rest or in controlled scenarios. As a result, algorithms developed using such data may not perform as well as with biosignals acquired in dynamic and uncontrolled environments. S…
greet wearable multimodal lift uncontrolled environments run jump gesticulate walk
Published: March 26, 2024. Version: 1.0.1 | DOI: 10.13026/hyxq-r919
Database Credentialed Federated
RaDialog Instruct Dataset
Conversational AI tools that can generate and discuss clinically correct radiology reports for a given medical image have the potential to transform radiology. Such a human-in-the-loop radiology assistant could facilitate a collaborative diagnostic …
radiology report generation large vision-language models medical image understanding radiology assistant radiology chatbot
Published: March 26, 2024. Version: 1.0.0 | DOI: 10.13026/zecj-bh52
Database Open Federated
Respiratory and heart rate monitoring dataset from aeration study
A study was conducted to collect respiratory pressure and flow data for model-based assessment, alongside electrical impedance tomography (EIT) aeration, electrocardiogram (ECG), and heart-rate belt (HRB) data. A set of 20 subjects was selected with no…
Published: March 20, 2024. Version: 1.0.0 | DOI: 10.13026/e4dt-f689
Database Restricted Federated
CheXchoNet: A Chest Radiograph Dataset with Gold Standard Echocardiography Labels
Existing chest radiograph datasets, such as CheXpert and ChestX-ray14, have driven the development of new machine learning approaches to achieve expert or near-expert level performance on a variety of tasks. The primary focus of models developed usi…
early detection heart failure cardiac structural abnormalities deep learning chest x-rays
Published: March 20, 2024. Version: 1.0.0 | DOI: 10.13026/kp08-ws25