Resources
Database Credentialed Federated
BRATECA (Brazilian Tertiary Care Dataset): a Clinical Information Dataset for the Portuguese Language
Computational medicine research requires clinical data for training and testing purposes, so the development of datasets composed of real hospital data is of utmost importance in this field. Most such data collections are in the English language, we…
exams natural language processing tertiary care prescriptions clinical notes
Published: May 13, 2022. Version: 1.0 | DOI: 10.13026/cmab-j041
Database Open Federated
The CirCor DigiScope Phonocardiogram Dataset
A total of 5272 heart sound recordings were collected from the four main auscultation locations of 1568 subjects, aged between 0 and 21 years (mean ± STD = 6.1 ± 4.3 years), with a duration between 4.8 and 80.4 seconds (mean ±…
signal processing murmur pitch george b moody physionet challenge 2022 murmur grading murmur location murmur timing phonocardiogram pregnant murmur shape pediatric murmur detection murmur intensity murmur quality
Published: May 10, 2022. Version: 1.0.3 | DOI: 10.13026/tshs-mw03
Database Open Federated
The CirCor DigiScope Phonocardiogram Dataset
A total of 5272 heart sound recordings were collected from the four main auscultation locations of 1568 subjects, aged between 0 and 21 years (mean ± STD = 6.1 ± 4.3 years), with a duration between 4.8 and 80.4 seconds (mean ±…
signal processing murmur quality murmur location phonocardiogram murmur grading murmur intensity pediatric murmur shape george b moody physionet challenge 2022 murmur timing murmur pitch pregnant murmur detection
Published: April 30, 2022. Version: 1.0.2 | DOI: 10.13026/qcpr-xg17
Database Credentialed Federated
DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries
Electronic Health Records (EHR) contain patient records, stored in structured tables as well as unstructured clinical notes. The information in structured and unstructured EHR records is not strictly disjoint: information may be duplicated, contradi…
question-answer qa
Published: April 12, 2022. Version: 1.0.0 | DOI: 10.13026/a849-cd06
Database Open Federated
Icentia11k Single Lead Continuous Raw Electrocardiogram Dataset
This is a dataset of continuous raw electrocardiogram (ECG) signals containing 11 thousand patients and 2 billion labelled beats. The signals were recorded with a 16-bit resolution at 250 Hz with a fixed chest-mounted single-lead probe for up to 2 we…
ecg representation learning
Published: April 12, 2022. Version: 1.0 | DOI: 10.13026/kk0v-r952
Database Credentialed Federated
RuMedNLI: A Russian Natural Language Inference Dataset For The Clinical Domain
There is a shortage of medical text resources for the Russian language. This is a substantial obstacle to the research and development of state-of-the-art deep learning NLP models. To mitigate this issue, we translated the MedNLI data from English to R…
natural language inference recognizing textual entailment russian language
Published: April 1, 2022. Version: 1.0.0 | DOI: 10.13026/gxzd-cf80
Database Open Federated
Surface electromyographic signals collected during long-lasting ground walking of young able-bodied subjects
The present dataset is composed of long-lasting (around 5 minutes) surface electromyographic (sEMG) signals recorded between 2011 and 2018 during ground walking of 31 young (20 years < age < 30 years) able-bodied subjects in the Movement Analysis…
surface emg signal walking biomedical signals gait analysis muscle recruitment
Published: March 31, 2022. Version: 1.0.0 | DOI: 10.13026/bwvb-ht51
Database Open Federated
CPAP Pressure and Flow Data from a Local Trial of 30 Adults at the University of Canterbury
A pressure and flow dataset for CPAP (continuous positive airway pressure) breathing obtained from 30 subjects for model-based identification of patient-specific lung mechanics using a specially designed sensor system comprising an array of differen…
peep biomedical engineering cpap respiratory mechanics pulmonary mechanics respiratory modelling
Published: March 24, 2022. Version: 1.0.1 | DOI: 10.13026/xfae-vv63
Database Restricted Federated
VinDr-PCXR: An open, large-scale pediatric chest X-ray dataset for interpretation of common thoracic diseases
Computer-aided diagnosis systems in adult chest radiography (CXR) have recently achieved great success thanks to the availability of large-scale, annotated datasets and the advent of high-performance supervised learning algorithms. However, the deve…
Published: March 21, 2022. Version: 1.0.0 | DOI: 10.13026/k8qc-na36
Database Restricted Federated
VinDr-Mammo: A large-scale benchmark dataset for computer-aided detection and diagnosis in full-field digital mammography
Breast cancer is one of the most prevalent types of cancer and the leading type of cancer death. Mammography is the recommended imaging modality for periodic breast cancer screening. A few datasets have been published to develop computer-aided tools…
Published: March 21, 2022. Version: 1.0.0 | DOI: 10.13026/br2v-7517