Skip to main content

dRep

File naming

The dRep format is a special case that requires two files:

  • Cdb.csv
  • Wdb.csv

Their names have to be exactly as above.

File format

tip

For more information on the dRep output files, visit the dRep documentation.

Cdb.tsv

This file informs the cluster of every MAG.

The file must follow the Tab Separated Values (TSV). It must have columns representing the following data, in that order and with a header:

Column nameColumn obligatorinessData typeData nullability
genomeMandatoryStringNot nullable
secondary_clusterMandatoryStringNullable
thresholdOptional (ignored)N/AN/A
cluster_methodOptional (ignored)N/AN/A
comparison_algorithmOptional (ignored)N/AN/A
primary_clusterOptional (ignored)N/AN/A

Wdb.tsv

This file informs the "winners" (i.e. best representatives) of each cluster.

The file must follow the Tab Separated Values (TSV). It must have columns representing the following data, in that order and with a header:

Column nameColumn obligatorinessData typeData nullability
genomeMandatoryStringNot nullable
scoreOptional (ignored)N/AN/A
clusterOptional (ignored)N/AN/A

Mapping to database

DrepDirectory

Original dataDrepDirectory fieldNotes
dRep directory pathpathThis is the path to the directory that contains both Cdb.csv and Wdb.csv

DrepEntry

Original dataDrepEntry fieldNotes
genome column of Wdb.csvwinnerMAGs whose names are in Wdb.csv are the winners of theirs clusters
genome column of Cdb.csvgenome_name
secondary_cluster column of Cdb.csvgenome_cluster_name