Featured Resources


Database Credentialed Access

Bridge2AI-Voice: An ethically-sourced, diverse voice dataset linked to health information

Alistair Johnson, Jean-Christophe Bélisle-Pipon, David Dorr, Satrajit Ghosh, Philip Payne, Maria Powell, Anaïs Rameau, Vardit Ravitsky, Alexandros Sigaras, Olivier Elemento, Yael Bensoussan

A dataset of voice recordings and metadata to enable the development, benchmarking, and validation of clinically applicable machine-learning models for diagnosing a wide range of health conditions.

voice bridge2ai audio

Published: Nov. 27, 2024. Version: 1.0


Database Credentialed Access

GIM, a dataset for predicting patient deterioration in the General Internal Medicine ward

Sebnem Kuzulugil, Chloe Pou-Prom, Muhammad Mamdani, Joshua Murray, Amol Verma, Kaiyin Zhu, Michaelia Banning

The General Internal Medicine (GIM) dataset is comprised of de-identified health related data associated with over 22,000 patient encounters for 14,000 unique patients who were admitted under the GIM service at St. Michael’s Hospital.

Published: March 17, 2023. Version: 1.0.1


Latest Resources


Database Credentialed Access

Comprehensive Sleep Laboratory Data: August - October 2024

Sarah Berger, Mark Boulos, Dennis Tchoudnovski, Alana Byeon, Anu Tandon, Brian Murray

Data from Sunnybrook Sleep Laboratory that includes de-identified raw overnight signals, scored sleep metrics, sleep and health questionnaires, and medications/medical history.

Published: Dec. 9, 2024. Version: 1.0


Database Credentialed Access

Bridge2AI-Voice: An ethically-sourced, diverse voice dataset linked to health information

Alistair Johnson, Jean-Christophe Bélisle-Pipon, David Dorr, Satrajit Ghosh, Philip Payne, Maria Powell, Anaïs Rameau, Vardit Ravitsky, Alexandros Sigaras, Olivier Elemento, Yael Bensoussan

A dataset of voice recordings and metadata to enable the development, benchmarking, and validation of clinically applicable machine-learning models for diagnosing a wide range of health conditions.

voice bridge2ai audio

Published: Nov. 27, 2024. Version: 1.0


Database Credentialed Access

Trial Files: Keeping Physicians Up to Date on RCTs Using Large Language Models

Katarina Zorcic, Bryant Lim, Michael Fralick

The text-davinci-003 API was used to generate abstract summaries of randomized controlled trials published in high-impact journals (NEJM, JAMA, Annals of Internal Medicine, Lancet, BMJ, Nature Medicine) based on key extracted information.

Published: May 7, 2024. Version: 1.0.0


Database Credentialed Access

Canadian Community Health Survey Public Use Microdata File (CCHS-PUMF)

Yulric Sequeira

The CCHS is a cross-sectional survey that collects information related to health status, health care utilization and health determinants for the Canadian population. it has a large sample size that is representative of the Canadian population.

Published: April 24, 2024. Version: 1.1


Database Credentialed Access

COVID-19 Hospital Demographic, Clinical and Outcome Dataset

Farbod Abolhassani, Jonathan Ranisau, Morgan Lim, Alexander Bilbily, Mark Cicero, Benjamin Fine

Dataset containing key demographic, clinical and outcome parameters of hospitalized patients diagnosed with COVID-19 from Nov 1, 2020 – March 31, 2021.

Published: May 30, 2023. Version: 1.0.1


Database Credentialed Access

Canadian Heart Health Database

Philip Connelly

The Canadian Heart Health Data Base (CHHDB) is a compilation of data from ten Provincial Heart Health Surveys conducted between 1986 and 1992.

Published: April 19, 2023. Version: 1.0.0