March 26, 2026

Reminder: REB Approval

This note is a reminder and clarification on the role of REB (Research Ethics Board) approval for any research using Health Data Nexus data.

Studies involving human participant information using data from Health Data Nexus that meet definition of Research in TCPS 2 (i.e., an undertaking intended to extend knowledge through a disciplined inquiry and/or systematic investigation) require REB review. For studies that meet the definition of research led by someone at U of T, the lead of the study should submit a protocol through U of T’s Research Ethics Board. For studies that are led by TAHSN hospital partners or externals sites, the leads of these studies should submit their project for REB administrative review.

This holds regardless of which Zone the dataset is placed within. As a reminder, Zone 1 requires a credentialing application, all required trainings to be completed, and the corresponding Data Use Agreement to be signed. Zone 2 additionally requires the submission of a project plan which must be approved by the Data Holder. Zone 3 additionally requires the project plan submission to be accompanied by REB approval which is reviewed by the Data Holder. (This can be the same REB approval as that which is required for access to the data for research purposes.)

Use of human participant information using data from Health Data Nexus that do not meet the definition of Research in TCPS 2 do not require REB review. This includes preliminary review and data exploration, educational uses (such as courses and datathons/hackathons), and general purpose investigations into the data.

May 6, 2026

Federated Data Search is now available!

Health Data Nexus now supports a federated data site search! Check out the wealth of resources available on our partner platform PhysioNet without leaving HDN.

Featured Resources

More Resources
Database Credentialed Access

Bridge2AI-Voice: An ethically-sourced, diverse voice dataset linked to health information

Alistair Johnson, Jean-Christophe Bélisle-Pipon, David Dorr, et al.

A dataset of voice recordings and metadata to enable the development, benchmarking, and validation of clinically applicable machine-learning models for diagnosing a wide range of health conditions. Published: Nov. 27, 2024. Version: 1.0
Database Credentialed Access

GIM, a dataset for predicting patient deterioration in the General Internal Medicine ward

Sebnem Kuzulugil, Chloe Pou-Prom, Muhammad Mamdani, et al.

The General Internal Medicine (GIM) dataset is comprised of de-identified health related data associated with over 22,000 patient encounters for 14,000 unique patients who were admitted under the GIM service at St. Michael’s Hospital. Published: March 17, 2023. Version: 1.0.1

Latest Resources

More Resources
Database Credentialed Access

Immune Checkpoint Blockade (ICB) Data

Farnoosh Abbas Aghababazadeh, Matthew Boccalon

The immune checkpoint blockade (ICB) datasets are comprised of multimodal ExpressionSet R data objects containing molecular data from a multitude of clinical trials. Published: Nov. 26, 2025. Version: 1.0
Database Credentialed Access

High-Resolution Digital Pathology Imaging of Breast Cancer

William Tran, Fang-I Lu, Katarzyna Jerzak

This dataset includes high-resolution digital slides and clinical data from 157 high-risk breast cancer patients treated with neoadjuvant chemotherapy, supporting AI research. Published: Oct. 2, 2025. Version: 1.0.0
Database Credentialed Access

Comprehensive Sleep Laboratory Data: August - October 2024

Sarah Berger, Mark Boulos, Dennis Tchoudnovski, et al.

Data from Sunnybrook Sleep Laboratory that includes de-identified raw overnight signals, scored sleep metrics, sleep and health questionnaires, and medications/medical history. Published: Dec. 9, 2024. Version: 1.0

News

More News
Oct. 2, 2025

'High-Resolution Digital Pathology Imaging of Breast Cancer' has been released!

This breast cancer tumor biopsy dataset, prepared by Dr. William Tran as part of the Health Data Nexus Dataset Grants, is now available on the platform! Take a look at this fascinating dataset!
Oct. 2, 2025

Collaborative workspaces are now live!

Collaborative Vertex AI Workspace functionality has been added to Health Data Nexus, allowing you to create a workspace and then share it with another user. This will allow you to work collaboratively with others on code simultaneously in the same research environment. Look forward to this feature at our next Health Data Nexus datathons and workshops!

July 4, 2025

"Health Data Nexus: an open data platform for AI research and education in medicine" has been published in GigaScience!

Check out the first publication about the development and utility of Health Data Nexus! We'd love for you to read and share this article with your academic networks.

Nov. 27, 2024

The Bridge2AI Voice dataset has been released!

The first release of the Bridge2AI Voice dataset is now available! For detailed information regarding the dataset, please see our documentation website: https://docs.b2ai-voice.org/

July 5, 2023

National access is now live!

The Health Data Nexus is now available for use across Canada! If you are a member of any Canadian university or research institution, you can log in using your single sign-on credentials. Stay tuned for international access and further developments on the platform.