Shared Datasets§

Several shared dataset and software spaces are maintained by UCL researchers and made available for others to use.

Myriad§

On Myriad, these spaces are available:

Alphafold2§

Location: /myriadfs/projects/alphafold2

Contains Alphafold2 datasets.

This space is maintained centrally by rc-support@ucl.ac.uk.

BlastDB§

Location: /myriadfs/projects/blastdb

Contains weekly updated NCBI BLAST repositories for nucleotide (nt), protein (nr), and core nucleotide (core_nt) databases, as well as Diamond. Taxonomizr and fcs (Foreign Contamination Screen) data are downloaded when needed.

There is a README at /myriadfs/projects/blastdb/README.md.

If you use these databases, record the date on which you used them. For reproducibility, this date identifies which version of the databases you used and when it was downloaded.
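One way to keep such a record is to append a dated line to a small log file next to your results. The following is a minimal sketch, not an official Myriad tool: the function name `record_db_use` and the log file name are hypothetical and chosen only for illustration.

```shell
# Hypothetical helper: append today's date and the database location
# to a tab-separated log file so the run can be traced later.
record_db_use() {
    db_path="$1"    # database location, e.g. /myriadfs/projects/blastdb
    log_file="$2"   # log file of your choice (name is an assumption)
    printf '%s\t%s\n' "$(date +%Y-%m-%d)" "$db_path" >> "$log_file"
}

# Example call (the log file name here is illustrative):
#   record_db_use /myriadfs/projects/blastdb blastdb_usage.tsv
```

Each call adds one line containing the date and the path, so the log builds up a history of which weekly download each analysis relied on.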

Cancer§

Location: /myriadfs/projects/cancer

Software modules: module load blic-modules

This space provides modules for Nextflow and other tools. To make them visible, first load the blic-modules module and then type module avail.
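The two steps can be wrapped in a small shell function, sketched below. The helper name `show_blic_modules` is hypothetical; only the `module load blic-modules` and `module avail` commands come from the text above. The sketch assumes it is run in a Myriad login session where the `module` command is defined.

```shell
# Hypothetical helper: make the Cancer space modules visible, then list them.
show_blic_modules() {
    # Bail out early if the module command is not defined in this shell.
    if ! command -v module >/dev/null 2>&1; then
        echo "module command not found; run this on Myriad" >&2
        return 1
    fi
    module load blic-modules   # makes the Cancer space modules visible
    module avail               # lists the modules that are now available
}
```

Call `show_blic_modules` directly in your login shell (not in a subshell) so that the loaded module stays active afterwards.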

Contact ci.bioinfohub@ucl.ac.uk if you have any questions about the modules.

DTAdb§

Location: /myriadfs/projects/DTAdb

Contains datasets such as GWAS summary statistics and some core datasets such as 1000 Genomes and VEP. The GWAS data were normalised using this pipeline: https://cfinan.gitlab.io/gwas-norm/index.html