July 4, 2025

"Health Data Nexus: an open data platform for AI research and education in medicine" has been published in GigaScience!

Check out the first publication about the development and utility of Health Data Nexus! We'd love for you to read and share this article with your academic networks.
Oct. 2, 2025

Collaborative workspaces are now live!

Collaborative Vertex AI Workspace functionality has been added to Health Data Nexus, allowing you to create a workspace and then share it with another user. This will allow you to work collaboratively with others on code simultaneously in the same research environment. Look forward to this feature at our next Health Data Nexus datathons and workshops!

Featured Resources

More Resources
Database Credentialed Access

Bridge2AI-Voice: An ethically-sourced, diverse voice dataset linked to health information

Alistair Johnson, Jean-Christophe Bélisle-Pipon, David Dorr, Satrajit Ghosh, Philip Payne, Maria Powell, Anaïs Rameau, Vardit Ravitsky, Alexandros Sigaras, Olivier Elemento, Yael Bensoussan

A dataset of voice recordings and metadata to enable the development, benchmarking, and validation of clinically applicable machine-learning models for diagnosing a wide range of health conditions. Published: Nov. 27, 2024. Version: 1.0
Database Credentialed Access

GIM, a dataset for predicting patient deterioration in the General Internal Medicine ward

Sebnem Kuzulugil, Chloe Pou-Prom, Muhammad Mamdani, Joshua Murray, Amol Verma, Kaiyin Zhu, Michaelia Banning

The General Internal Medicine (GIM) dataset is comprised of de-identified health related data associated with over 22,000 patient encounters for 14,000 unique patients who were admitted under the GIM service at St. Michael’s Hospital. Published: March 17, 2023. Version: 1.0.1

Latest Resources

More Resources
Database Credentialed Access

High-Resolution Digital Pathology Imaging of Breast Cancer

William Tran, Fang-I Lu, Katarzyna Jerzak

This dataset includes high-resolution digital slides and clinical data from 157 high-risk breast cancer patients treated with neoadjuvant chemotherapy, supporting AI research. Published: Oct. 2, 2025. Version: 1.0.0
Database Credentialed Access

Comprehensive Sleep Laboratory Data: August - October 2024

Sarah Berger, Mark Boulos, Dennis Tchoudnovski, Alana Byeon, Anu Tandon, Brian Murray

Data from Sunnybrook Sleep Laboratory that includes de-identified raw overnight signals, scored sleep metrics, sleep and health questionnaires, and medications/medical history. Published: Dec. 9, 2024. Version: 1.0
Database Credentialed Access

Bridge2AI-Voice: An ethically-sourced, diverse voice dataset linked to health information

Alistair Johnson, Jean-Christophe Bélisle-Pipon, David Dorr, Satrajit Ghosh, Philip Payne, Maria Powell, Anaïs Rameau, Vardit Ravitsky, Alexandros Sigaras, Olivier Elemento, Yael Bensoussan

A dataset of voice recordings and metadata to enable the development, benchmarking, and validation of clinically applicable machine-learning models for diagnosing a wide range of health conditions. Published: Nov. 27, 2024. Version: 1.0

News

More News
Oct. 2, 2025

'High-Resolution Digital Pathology Imaging of Breast Cancer' has been released!

This breast cancer tumor biopsy dataset, prepared by Dr. William Tran as part of the Health Data Nexus Dataset Grants, is now available on the platform! Take a look at this fascinating dataset!
Nov. 27, 2024

The Bridge2AI Voice dataset has been released!

The first release of the Bridge2AI Voice dataset is now available! For detailed information regarding the dataset, please see our documentation website: https://docs.b2ai-voice.org/

July 5, 2023

National access is now live!

The Health Data Nexus is now available for use across Canada! If you are a member of any Canadian university or research institution, you can log in using your single sign-on credentials. Stay tuned for international access and further developments on the platform.

March 29, 2023

Join the T-CAIREM Hive

The T-CAIREM Hive is the home for all discussions, tips, and questions related to the Health Data Nexus datasets. 

Joining the Hive is easy! Membership in the T-CAIREM Hive is currently available to anyone affiliated with our partner institutions.

Once you've signed up, you can access the Hive Hubs dataset discussion groups. See you on the Hive!

April 5, 2022

Contribute your dataset to the T-CAIREM online health data platform

The Temerty Centre for AI Research and Education in Medicine (T-CAIREM) is currently developing a unique online platform for health data that emphasizes patient privacy and security, provides transparent and speedy access, and simplifies data discovery and analysis.

We’re looking for promising datasets to host on our platform. There are several benefits for researchers who contribute data:
• Broader visibility, including targeted workshops and courses
• A cloud-native platform with no data egress possible, giving greater control over data governance
• Data-use statistics that provide useful reporting information for funding agencies
• A centralized model allowing the sharing of developed tools and curated derivations of the data
• A citable dataset that supports career advancement.

Dataset access follows a strict process including credentialing of researchers, training in research with human participants, and signing of a data-use agreement drafted according to specifications from the data contributor.

For more information about submitting a dataset, please contact:
January Adams
Data Governance and Quality Analyst 
Temerty Centre for AI Research and Education in Medicine (T-CAIREM)
contact@healthdatanexus.ai