@@ -15,6 +15,18 @@ In some datasets, this procedure may be of considerable complexity and we need t
Another important matter is the choice of distance. The CC works with *common* distance metrics, such as `cosine` or `euclidean`. Sometimes, connectivity may require other types of metrics (e.g. GSEA-like or overlap-based). We might consider training siamese networks that map the original distances onto the more standard ones, although this avenue remains unexplored.
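
To make the contrast between standard and non-standard metrics concrete, here is a minimal sketch in plain Python. The function names are hypothetical and are not part of the CC code base. The `overlap_distance` function uses the overlap coefficient, which is only one possible reading of "overlap"; a GSEA-like metric is not attempted here.

```python
import math


def euclidean_distance(u, v):
    """Euclidean distance between two equal-length numeric vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def cosine_distance(u, v):
    """Cosine distance (1 - cosine similarity); assumes non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def overlap_distance(s, t):
    """1 - |s & t| / min(|s|, |t|) for two non-empty feature sets.

    Assumption: "overlap" is interpreted as the overlap coefficient.
    """
    s, t = set(s), set(t)
    return 1.0 - len(s & t) / min(len(s), len(t))
```

The first two functions operate on numeric vectors, whereas the last one operates on sets of features. That mismatch is why a set-based metric cannot simply be swapped in for `cosine` or `euclidean` without some transformation.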
## Standard input files

|Type|Format|Description|
|---|---|---|
|Feature sets|GMT||
|Key-feature pairs|TSV||
|Key profiles|TSV||
|InChIKeys|TSV||

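
As an illustration of the first two input types, the sketch below reads a GMT file and a key-feature TSV file. It assumes the usual GMT layout (set name, description, then tab-separated members) and assumes that key-feature pairs are stored one per line, with the key in the first column and the feature in the second. The function names are hypothetical, and the actual CC readers may differ.

```python
import io


def read_gmt(handle):
    """Parse GMT lines into a dict mapping set name to its list of features."""
    feature_sets = {}
    for line in handle:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 3:
            continue  # skip blank lines and sets without any member
        name, _description, *features = fields
        feature_sets[name] = [f for f in features if f]
    return feature_sets


def read_key_feature_pairs(handle):
    """Parse a TSV of (key, feature) pairs; extra columns are ignored."""
    pairs = []
    for line in handle:
        fields = line.rstrip("\n").split("\t")
        if len(fields) >= 2:
            pairs.append((fields[0], fields[1]))
    return pairs


# Usage with in-memory data (all identifiers are made up for illustration).
gmt_text = "set1\tfirst set\tA\tB\nset2\tna\tC\n"
tsv_text = "key1\tA\nkey1\tB\nkey2\tC\n"
feature_sets = read_gmt(io.StringIO(gmt_text))
pairs = read_key_feature_pairs(io.StringIO(tsv_text))
```

Both readers accept any iterable of lines, so the same code works for open file handles and for in-memory strings.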
## Documentation
Every pre-processing script needs to have a `README` file.
## Chemistry