The Calgary Blood Stream Infection Cohort

A Comprehensive Multi-omics Dataset of More than 38 000 Bloodstream Infections

The Calgary BSI Cohort

There are more than 50,000 bloodstream infections in North America each year leading to 9,000 deaths. Despite the seriousness of these infections, there are few comprehensive datasets capturing the molecular profiles of these pathogens in large cohorts. To address this shortcoming, our group conducted a systematic multi-omics survey of the 12 most common Calgary-area BSI-causing pathogens collected between 2006-2022 (The Calgary BSI Cohort).

We have compiled extensive medical information and clinical laboratory records linked to these infections. Moreover, we isolated the pathogens responsible for each infection and collected detailed whole-genome sequencing, metabolomics, and quantitative proteomics data. This unique constellation of microbial datasets and human health records enables a wide range of scientific investigations that have not been possible previously and lays the foundation for significant new commercial ventures and clinical applications.

The Calgary BSI Cohort in Numbers

Genomics

Ilumina Whole Genome Sequencing

Proteomics

Quantitative TMT-LC-MSMS proteomics

Metabolomics

Semi-quantitative LC-MS metabolomics

Clinical Data

Extensive clinical data and detailed patient characteristics

Scientific Research and Commercial Ventures

Antimicrobial Susceptibility Prediction

Many bacteria are becoming increasingly resistant to antibiotics and diagnostic testing for antibiotic susceptibility is a $8B global market. The Calgary BSI Cohort could enable a new wave of next-generation sequencing-based diagnostic tools and we have been approached by two companies seeking to license the data for this purpose.


Infection Outbreak Monitoring

Our dataset includes geospatial information of patients, along with hospital and ward locations for infections. These data, along with our genomics records, allow us to build tools that automatically flag outbreaks, identify hospital wards where outbreaks originate, and precisely guide infection control procedures. In collaboration with our provincial infection control and prevention experts and a company that is seeking to develop this market, we received funding for researching and implementing this technology.


Virulence Based Diagnostics

Virulence factors produced by BSI pathogens have a direct impact on the clinical trajectory of infections. Despite this, there are no routine diagnostic tests for microbial virulence, and clinical decision-making is not guided by the unique risk profiles of microbes. Using machine learning, we are investigating the contribution of microbial factors to virulence for the development of new products for their detection.


Mapping of Microbial Gene Expression Regulons

Microbes (such as Escherichia coli) are routinely used to produce proteins and other biomolecules that are then used in foods, medicines, and a wide variety of other industrial products. Our dataset includes protein expression levels and metabolite production levels from every isolate observed and thereby captures a major transect of biological variability. The depth of information allows unprecedented insights into the connection between genome, proteome and metabolome. We have leveraged this to find previously unknown controllers of protein/metabolite production and have entered into early discussions about licensing these data for industrial applications.