Data overview

Over 19,000 genes across 20,000 patient samples

The MediSapiens reference data profiles over 19,000 genes across 20,000 patient samples, covering more than 40 tissue types and more than 70 cancer types. The samples are selected and expertly curated from ArrayExpress, GEO and proprietary sources. A significant and increasing proportion of the data has associated clinical outcomes.

The reference data is extended with quarterly updates based on the latest science and our customers’ needs. Our goal is to maintain our current position as the most comprehensive unified human gene expression reference data set, and to increase its accuracy and potential in supporting diagnosis, prognosis and treatment decisions.

The quarterly data update process is as follows:

1. Molecular profile data selection

The MediSapiens Science Board selects the most relevant and recent high-quality published and peer-reviewed gene expression data from GEO, ArrayExpress and its own research teams.

2. Clinical data curation

The qualified MediSapiens Data and Science Team curates all data with ICD-10- and ICD-O-compliant clinical terminology. Data annotations go through a systematic QA/QC process and validity analysis to ensure they are consistent with the described biological features.

3. Data unification

Data from different microarray generations is unified with proprietary MediSapiens algorithms that have been validated and published in peer-reviewed journals.
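
To give a feel for what unifying data from different microarray generations involves, here is a minimal sketch of quantile normalization, a generic and widely used technique. It is not the proprietary MediSapiens algorithm, which is not described here. The function name and the sample values are hypothetical and chosen only for illustration.

```python
def quantile_normalize(samples):
    """Give every sample the same value distribution while keeping
    each sample's own gene ranking.

    samples: list of samples, each a list of expression values
             for the same genes in the same order.
    """
    n_genes = len(samples[0])
    if any(len(s) != n_genes for s in samples):
        raise ValueError("all samples must cover the same genes")

    sorted_samples = [sorted(s) for s in samples]
    # Reference distribution: mean of the k-th smallest value across samples.
    reference = [
        sum(s[k] for s in sorted_samples) / len(samples)
        for k in range(n_genes)
    ]

    normalized = []
    for s in samples:
        # Gene indices ordered from lowest to highest value in this sample.
        order = sorted(range(n_genes), key=lambda g: s[g])
        out = [0.0] * n_genes
        for rank, gene in enumerate(order):
            out[gene] = reference[rank]
        normalized.append(out)
    return normalized


# Hypothetical values: two array generations measuring on very different scales.
older_array = [2.0, 4.0, 6.0, 8.0]
newer_array = [300.0, 100.0, 400.0, 200.0]
unified = quantile_normalize([older_array, newer_array])
```

After normalization both samples share one set of values, so a gene ranked highest in either sample receives the same value. Ties are broken by gene order in this sketch; production methods typically handle them more carefully.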

The MediSapiens reference data can be integrated with other data sources through flexible APIs to perform analyses across different types of biological data. These analyses can be visualized with the MediSapiens IST Online interface.