####################################################################### WARNING. WARNING. WARNING. This project has been automatically analyzed by the 'draftQD.pl' script, a crude zeroth order, project clean up script, designed to identify and remove low quality and contaminant reads from a project. The heuristics used by the script may both remove valid project data and fail to remove bona fide contaminants. The following sets of reads have been removed from the project fasta in this edit_dir only. The trace data of possible contaminants has not been removed from the project. Therefore, if you are using automated assembly procedures which recreate a project fasta from data in the partitions, then reads removed in this edit_dir will be present in new file. ####################################################################### Even after removal of 54000 (!!) problem reads, the GC plot shows a pronounced and distinct second peak at >about 70% GC. A GC plot of the singlets matches very well the second GC peak of the main assembly. Based on the calculated depth and genome size, the project still has far too many contigs which is consitent with a contaminant. There are about 6000 reads in the assembly and another several thousand in the singlets file with 0.7 GC fraction which corresponds to roughly 10% contaminant (after cleanup). Blast of singlets against nt produced many significant plasmid, transposon, vector, e coli, and chloroplast hits. reads removed from fasta: 6423 reads.2RdContigs 40356 reads.lowQual.q20lt100 494 reads.possible.eukaryota 54231 total