Resources


Database Restricted Federated

A database of hand kinematics, high-density sEMG of forearm and wrist for motion intent recognition

Source: Physionet

Surface electromyography (sEMG) signals reflect spinal motor neuron activities and can be used as intuitive inputs for human-machine interaction (HMI) via movement intent recognition. The motor neuron potentials of far-field (wrist) and near-field (…

Published: Jan. 17, 2025. Version: 1.0.0 | DOI: 10.13026/ch3e-c195


Database Credentialed Federated

MIMIC-IV-Ext-GPT-3_5-Generated-Discharge-Summaries-for-Low-Resource-Codes

Source: Physionet

This dataset comprises 9,606 Synthetic Discharge Summaries generated by GPT-3.5 based on combinations of ICD-10-code descriptions associated with real discharge summaries in MIMIC-IV. As part of the generation process, GPT-3.5 was also tasked to cod…

icd coding data augmentation large language model

Published: Dec. 16, 2024. Version: 1.0.0 | DOI: 10.13026/09ng-2614


Database Restricted Federated

Endoscapes2023, A Critical View of Safety and Surgical Scene Segmentation Dataset for Laparoscopic Cholecystectomy

Source: Physionet

Minimally invasive image-guided surgery heavily relies on vision. Deep learning models for surgical video analysis can support surgeons in visual tasks such as assessing the critical view of safety (CVS) in laparoscopic cholecystectomy, potentially …

surgical safety computer assisted interventions semantic segmentation surgical data science medical imaging analysis

Published: Dec. 11, 2024. Version: 1.0.0 | DOI: 10.13026/czwq-jh81


Database Credentialed Federated

CovIdentify Dataset

Source: Physionet

This dataset supports the study "A method for intelligent allocation of diagnostic testing by leveraging data from commercial wearable devices: a case study on COVID-19," which developed an Intelligent Testing Allocation (ITA) method. The …

Published: Nov. 25, 2024. Version: 1.0.0 | DOI: 10.13026/ncq1-vp79


Database Credentialed Federated

Northwestern ICU (NWICU) database

Source: Physionet

Retrospective medical data collection is essential for advancing patient care, offering insights and supporting the development of health technology. The Medical Information Mart for Intensive Care (MIMIC)-III database has been instrumental in provi…

Published: Nov. 19, 2024. Version: 0.1.0 | DOI: 10.13026/s84w-1829


Database Credentialed Federated

MS-CXR: Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing

Source: Physionet

We release a new dataset, MS-CXR, with locally-aligned phrase grounding annotations by board-certified radiologists to facilitate the study of complex semantic modelling in biomedical vision–language processing. The MS-CXR dataset provides 116…

localization phrase grounding vision-language processing chest x-ray

Published: Nov. 15, 2024. Version: 1.1.0 | DOI: 10.13026/9g2z-jg61


Database Contributor Review Federated

Chest Computed Tomography for patients with sepsis in the Emergency Department

Source: Physionet

Sepsis is a systematic inflammatory response syndrome that can impact all vital organs. Lung is the most commonly involved organ that sepsis can cause lung injury. Lung injury can have a variety clinical presentations and can be captured by chest im…

sepsis

Published: Oct. 28, 2024. Version: 1.0.0 | DOI: 10.13026/zne5-qh18


Database Contributor Review Federated

COVID Data for Shared Learning (CDSL): A comprehensive, multimodal COVID-19 dataset from HM Hospitales

Source: Physionet

COVID Data for Shared Learning (CDSL) is a multimodal database comprising de-identified medical data from 4,479 patients who were hospitalized with confirmed or suspected COVID-19 in the Spanish 'HM Hospitales' group from 2019-12-26 to 2021-…

multimodal database covid-19 radiological images open data healthcare data machine learning and ai

Published: Oct. 25, 2024. Version: 1.0.0 | DOI: 10.13026/1176-6c44


Model Credentialed Federated

Shareable Artificial Intelligence to Extract Cancer Outcomes from Electronic Health Records for Precision Oncology Research

Source: Physionet

Databases that link molecular data to clinical outcomes can inform precision cancer research into novel prognostic and predictive biomarkers. However, outside of clinical trials, cancer outcomes are typically recorded only in text form within electr…

Published: Oct. 24, 2024. Version: 1.0.0 | DOI: 10.13026/h2nj-p344


Database Credentialed Federated

C-REACT: Contextualized Race and Ethnicity Annotations for Clinical Text

Source: Physionet

The Contextualized Race and Ethnicity Annotations for Clinical Text (C-REACT) dataset is a large publicly available corpus of sentences from clinical notes manually annotated for information related to race and ethnicity (RE). The corpus presented h…

patient country information race and ethnicity patient language information clinical notes

Published: Oct. 21, 2024. Version: 1.0.0 | DOI: 10.13026/z8tq-v658