Correlation between datasets
With molecules expressed as signatures, it is easy to apply the similarity principle.
As of today, we measure the following types of correlation (only between the 25 exemplary datasets):
- Canonical correlation analysis.
- Shared pairs of similar molecules.
- Coincidence of similarity ranks.
These correlations greatly help analysis (here we show a consensus on exemplary datasets):
Such correlations, in a very simple manner, together with conditional probabilities, are used internally by the CC web app.
Signatures Type 3
Signatures Type 3 are the attemp to predict, for any given molecule (with any given information available for it), the signature corresponding to a certain data type.