Resources


Database Credentialed Federated

BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language

Source: Physionet

Computational medicine research requires clinical data for training and testing purposes, so the development of datasets composed of real hospital data is of utmost importance in this field. Most such data collections are in the English language, we…

exams natural language processing tertiary care prescriptions clinical notes

Published: May 13, 2022. Version: 1.0 | DOI: 10.13026/cmab-j041


Database Open Federated

The CirCor DigiScope Phonocardiogram Dataset

Source: Physionet

A total number of 5272 heart sound recordings were collected from the main four auscultation locations of 1568 subjects, aged between 0 and 21 years (mean ± STD = 6.1 ± 4.3 years), with a duration between 4.8 to 80.4 seconds (mean &plu…

signal processing murmur pitch george b moody physionet challenge 2022 murmur grading murmur location murmur timing phonocardiogram pregnant murmur shape pediatric murmur detection murmur intensity murmur quality

Published: May 10, 2022. Version: 1.0.3 | DOI: 10.13026/tshs-mw03


Database Open Federated

The CirCor DigiScope Phonocardiogram Dataset

Source: Physionet

A total number of 5272 heart sound recordings were collected from the main four auscultation locations of 1568 subjects, aged between 0 and 21 years (mean ± STD = 6.1 ± 4.3 years), with a duration between 4.8 to 80.4 seconds (mean &plu…

signal processing murmur quality murmur location phonocardiogram murmur grading murmur intensity pediatric murmur shape george b moody physionet challenge 2022 murmur timing murmur pitch pregnant murmur detection

Published: April 30, 2022. Version: 1.0.2 | DOI: 10.13026/qcpr-xg17


Database Credentialed Federated

DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries

Source: Physionet

Electronic Health Records (EHR) contain patient records, stored in structured tables as well as unstructured clinical notes. The information in structured and unstructured EHR records is not strictly disjoint: information may be duplicated, contradi…

question-answer qa

Published: April 12, 2022. Version: 1.0.0 | DOI: 10.13026/a849-cd06


Database Open Federated

Icentia11k Single Lead Continuous Raw Electrocardiogram Dataset

Source: Physionet

This is a dataset of continuous raw electrocardiogram (ECG) signals containing 11 thousand patients and 2 billion labelled beats. The signals were recorded with a 16-bit resolution at 250Hz with a fixed chest mounted single lead probe for up to 2 we…

ecg representation learning

Published: April 12, 2022. Version: 1.0 | DOI: 10.13026/kk0v-r952


Database Credentialed Federated

RuMedNLI: A Russian Natural Language Inference Dataset For The Clinical Domain

Source: Physionet

There is a shortage of text medical resources for the Russian language. This is a substantial obstacle in state-of-the-art NLP deep learning models research and development. To mitigate this issue we translated the MedNLI data from English to R…

natural language inference recognizing textual entailment russian language

Published: April 1, 2022. Version: 1.0.0 | DOI: 10.13026/gxzd-cf80


Database Open Federated

Surface electromyographic signals collected during long-lasting ground walking of young able-bodied subjects

Source: Physionet

The present dataset is composed of long-lasting (around 5 minutes) surface electromyographic (sEMG) signals recorded from 2011 and 2018 during ground walking of 31 young (20 years < age < 30 years) able-bodied subjects in the Movement Analysis…

surface emg signal walking biomedical signals gait analysis muscle recruitment

Published: March 31, 2022. Version: 1.0.0 | DOI: 10.13026/bwvb-ht51


Database Open Federated

CPAP Pressure and Flow Data from a Local Trial of 30 Adults at the University of Canterbury

Source: Physionet

A pressure and flow dataset for CPAP (continuous positive airway pressure) breathing obtained from 30 subjects for model-based identification of patient-specific lung mechanics using a specially designed sensor system comprising an array of differen…

peep biomedical engineering cpap respiratory mechanics pulmonary mechanics respiratory modelling

Published: March 24, 2022. Version: 1.0.1 | DOI: 10.13026/xfae-vv63


Database Restricted Federated

VinDr-PCXR: An open, large-scale pediatric chest X-ray dataset for interpretation of common thoracic diseases

Source: Physionet

Computer-aided diagnosis systems in adult chest radiography (CXR) have recently achieved great success thanks to the availability of large-scale, annotated datasets and the advent of high-performance supervised learning algorithms. However, the deve…

Published: March 21, 2022. Version: 1.0.0 | DOI: 10.13026/k8qc-na36


Database Restricted Federated

VinDr-Mammo: A large-scale benchmark dataset for computer-aided detection and diagnosis in full-field digital mammography

Source: Physionet

Breast cancer is one of the most prevalent types of cancer and the leading type of cancer death. Mammography is the recommended imaging modality for periodic breast cancer screening. A few datasets have been published to develop computer-aided tools…

Published: March 21, 2022. Version: 1.0.0 | DOI: 10.13026/br2v-7517