Table of Contents
	Project Information
		report description
		projects db info
		taxonomy summary
		genome size estimates
		contamination summary
		project base counts
	Libraries and Reads
		assembled average insert size estimates
		library / read quality summary
		reads2plates summary
		trimmed read length histograms
		library vector screening
		GC Content of reads histogram
	Contigs and Assemblies
		contig size and read count table
		depth summary
		depth histogram
		depth values
	Assembler Specific Info
		reads in assembly summary from assembler
		assembly parameters

Project Information

------------------------------------------------------------------- Assembly QC Report Date: 12-12-2005 Runby: Kerrie Barry Description: analysis of libraries completely sequenced (well, complete at this time) ------------------------------------------------------------------- ------------------------------------------------------------------- Project information from 'PROJECTS' db ------------------------------------------------------------------- Project Size(KB) TaxID GenusSpecies 3634510 3000 36861 Thiobacillus denitrificans ------------------------------------------------------------------- Taxonomy summary Command: /home/copeland/scripts/tax2tree.sh Thiobacillus_denitrificans ------------------------------------------------------------------- Hydrogenophilaceae, family, b-proteobacteria Hydrogenophilales, order, b-proteobacteria Thiobacillus denitrificans, species, b-proteobacteria Betaproteobacteria, class, b-proteobacteria Thiobacillus, genus, b-proteobacteria Proteobacteria (purple bacteria), phylum, proteobacteria Bacteria (eubacteria), superkingdom, eubacteria cellular organisms root ------------------------------------------------------------------- Genome size estimates ------------------------------------------------------------------- # contigs: 2910952 # phrap: 2974295 # db: altered. 3000000 2961749 +/- 37420 ------------------------------------------------------------------- Contam Summary with *.contigs: Command: /psf/QC/bin/sparc/summarizeCrossMatchHits -o ------------------------------------------------------------------- Number of reads with X's: 0 Number of reads with percent X's >= 20%: 0 = 0.0% Number of reads with percent X's >= 50%: 0 = 0.0% Number of reads with percent X's >= 80%: 0 = 0.0% Total reads in project: 45354 Total bp X'd : reads >= 20% >= 50% >= 80% screened ------------------------------------------------------------------- Contam Summary with *.singlets: Command: /psf/QC/bin/sparc/summarizeCrossMatchHits -o -s ------------------------------------------------------------------- ------------------------------------------------------------------- Base Count for Project: Command: /home/copeland/scripts/projectBaseCount.pl phrap.out ------------------------------------------------------------------- A = 8796809 C = 15871639 G = 15654463 T = 8717510 N = 211473 X = 511490 GC fraction = 0.63 Total = 49763384 ------------------------------------------------------------------- Base Count for contigs: Command: /psf/QC/bin/sparc/faCount 3634510_fasta.screen.contigs ------------------------------------------------------------------- A 494975 C 962795 G 963426 T 494175 N 885 fraction GC = 0.66 total bases = 2916256

Libraries and Reads

------------------------------------------------------------------- Histogram of Assembled Average Insert Sizes: Command: /home/copeland/scripts/phrapView2.pl -p phrap.out -C > reads.list ------------------------------------------------------------------- ------------------------------------------------------------------- Command: /usr/xpg4/bin/grep AIFS reads.list > grep.reads.list.AIFS Command: /home/copeland/scripts/histogram2.pl grep.reads.list.AIFS 4 500 ------------------------------------------------------------------- #Found 8617 total values totalling 51277110.0000. <5950.691656 +/- 32813.539397> #Range: [ 1114 - 853955 ] #Most likely bin: [ 2500 - 3000 ] 2604 counts #Median bin: [ 3000 - 3500 ] 2225 counts #Histogram Bins Count Fraction Cum_Fraction | 1000 - 1500 : [ 10 0.00 0.00 ] | 1500 - 2000 : [ 32 0.00 0.00 ] |XXXXXXXXXXXX 2000 - 2500 : [ 757 0.09 0.09 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2500 - 3000 : [ 2604 0.30 0.39 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 3000 - 3500 : [ 2225 0.26 0.65 ] |XXXXXXXXXXXXXXXXXXXXXXXX 3500 - 4000 : [ 1549 0.18 0.83 ] |XXXXXXXXXXXXXXX 4000 - 4500 : [ 990 0.11 0.95 ] |XXXXX 4500 - 5000 : [ 306 0.04 0.98 ] |X 5000 - 5500 : [ 50 0.01 0.99 ] | 5500 - 6000 : [ 1 0.00 0.99 ] #... | 14000 - 14500 : [ 1 0.00 0.99 ] #... | 17000 - 17500 : [ 1 0.00 0.99 ] #... | 19500 - 20000 : [ 1 0.00 0.99 ] | 20000 - 20500 : [ 1 0.00 0.99 ] #... | 26500 - 27000 : [ 2 0.00 0.99 ] #... | 29500 - 30000 : [ 1 0.00 0.99 ] #... | 35500 - 36000 : [ 1 0.00 0.99 ] | 36000 - 36500 : [ 1 0.00 0.99 ] #... | 45000 - 45500 : [ 1 0.00 0.99 ] #... | 46500 - 47000 : [ 1 0.00 0.99 ] #... | 47500 - 48000 : [ 1 0.00 0.99 ] #... | 50500 - 51000 : [ 1 0.00 0.99 ] #... | 55000 - 55500 : [ 1 0.00 0.99 ] #... | 56500 - 57000 : [ 1 0.00 0.99 ] #... | 61500 - 62000 : [ 1 0.00 0.99 ] #... | 63500 - 64000 : [ 1 0.00 0.99 ] #... | 64500 - 65000 : [ 1 0.00 0.99 ] #... | 68500 - 69000 : [ 1 0.00 0.99 ] #... | 74500 - 75000 : [ 1 0.00 0.99 ] #... | 76000 - 76500 : [ 3 0.00 0.99 ] #... | 79000 - 79500 : [ 2 0.00 0.99 ] #... | 80500 - 81000 : [ 1 0.00 0.99 ] #... | 94000 - 94500 : [ 1 0.00 0.99 ] #... | 95500 - 96000 : [ 1 0.00 0.99 ] #... | 97000 - 97500 : [ 1 0.00 0.99 ] #... | 99000 - 99500 : [ 1 0.00 0.99 ] #... | 114000 - 114500 : [ 1 0.00 0.99 ] #... | 119500 - 120000 : [ 1 0.00 0.99 ] | 120000 - 120500 : [ 1 0.00 0.99 ] #... | 131500 - 132000 : [ 1 0.00 0.99 ] #... | 139500 - 140000 : [ 1 0.00 0.99 ] #... | 141000 - 141500 : [ 1 0.00 0.99 ] #... | 151000 - 151500 : [ 1 0.00 0.99 ] #... | 161000 - 161500 : [ 1 0.00 0.99 ] #... | 163500 - 164000 : [ 1 0.00 0.99 ] #... | 171000 - 171500 : [ 1 0.00 0.99 ] #... | 176500 - 177000 : [ 1 0.00 0.99 ] #... | 181500 - 182000 : [ 1 0.00 0.99 ] #... | 200000 - 200500 : [ 1 0.00 0.99 ] #... | 204500 - 205000 : [ 1 0.00 0.99 ] #... | 205500 - 206000 : [ 1 0.00 0.99 ] #... | 206500 - 207000 : [ 1 0.00 0.99 ] #... | 211000 - 211500 : [ 1 0.00 0.99 ] #... | 213000 - 213500 : [ 1 0.00 0.99 ] #... | 219500 - 220000 : [ 1 0.00 0.99 ] #... | 225000 - 225500 : [ 1 0.00 1.00 ] #... | 226000 - 226500 : [ 1 0.00 1.00 ] #... | 236000 - 236500 : [ 1 0.00 1.00 ] #... | 256500 - 257000 : [ 1 0.00 1.00 ] #... | 266000 - 266500 : [ 1 0.00 1.00 ] #... | 270500 - 271000 : [ 1 0.00 1.00 ] #... | 276000 - 276500 : [ 1 0.00 1.00 ] | 276500 - 277000 : [ 1 0.00 1.00 ] | 277000 - 277500 : [ 1 0.00 1.00 ] #... | 280000 - 280500 : [ 1 0.00 1.00 ] #... | 284500 - 285000 : [ 1 0.00 1.00 ] #... | 301500 - 302000 : [ 2 0.00 1.00 ] #... | 307500 - 308000 : [ 1 0.00 1.00 ] #... | 316000 - 316500 : [ 1 0.00 1.00 ] #... | 329000 - 329500 : [ 1 0.00 1.00 ] #... | 332500 - 333000 : [ 1 0.00 1.00 ] #... | 349000 - 349500 : [ 1 0.00 1.00 ] #... | 351000 - 351500 : [ 1 0.00 1.00 ] | 351500 - 352000 : [ 1 0.00 1.00 ] #... | 366500 - 367000 : [ 1 0.00 1.00 ] #... | 369500 - 370000 : [ 1 0.00 1.00 ] #... | 380500 - 381000 : [ 1 0.00 1.00 ] #... | 390000 - 390500 : [ 1 0.00 1.00 ] #... | 404500 - 405000 : [ 1 0.00 1.00 ] #... | 410500 - 411000 : [ 1 0.00 1.00 ] #... | 413000 - 413500 : [ 1 0.00 1.00 ] #... | 427000 - 427500 : [ 1 0.00 1.00 ] #... | 452500 - 453000 : [ 1 0.00 1.00 ] #... | 469000 - 469500 : [ 1 0.00 1.00 ] #... | 470500 - 471000 : [ 1 0.00 1.00 ] #... | 479000 - 479500 : [ 1 0.00 1.00 ] #... | 504000 - 504500 : [ 1 0.00 1.00 ] #... | 517500 - 518000 : [ 1 0.00 1.00 ] #... | 531500 - 532000 : [ 1 0.00 1.00 ] #... | 542500 - 543000 : [ 1 0.00 1.00 ] #... | 583000 - 583500 : [ 1 0.00 1.00 ] #... | 595500 - 596000 : [ 1 0.00 1.00 ] #... | 599000 - 599500 : [ 1 0.00 1.00 ] #... | 628000 - 628500 : [ 1 0.00 1.00 ] #... | 665000 - 665500 : [ 1 0.00 1.00 ] #... | 776500 - 777000 : [ 1 0.00 1.00 ] #... | 787500 - 788000 : [ 1 0.00 1.00 ] #... | 853500 - 854000 : [ 1 0.00 1.00 ] ------------------------------------------------------------------- Command: /usr/xpg4/bin/grep AIFT reads.list > grep.reads.list.AIFT Command: /home/copeland/scripts/histogram2.pl grep.reads.list.AIFT 4 500 ------------------------------------------------------------------- #Found 8617 total values totalling 66726031.0000. <7743.533828 +/- 19478.816208> #Range: [ 1131 - 916153 ] #Most likely bin: [ 6500 - 7000 ] 1750 counts #Median bin: [ 6500 - 7000 ] 1750 counts #Histogram Bins Count Fraction Cum_Fraction | 1000 - 1500 : [ 11 0.00 0.00 ] | 1500 - 2000 : [ 19 0.00 0.00 ] | 2000 - 2500 : [ 12 0.00 0.00 ] | 2500 - 3000 : [ 17 0.00 0.01 ] |X 3000 - 3500 : [ 28 0.00 0.01 ] | 3500 - 4000 : [ 13 0.00 0.01 ] | 4000 - 4500 : [ 21 0.00 0.01 ] |X 4500 - 5000 : [ 28 0.00 0.02 ] |XXX 5000 - 5500 : [ 123 0.01 0.03 ] |XXXXXXXXXXXXXXXXXXXX 5500 - 6000 : [ 868 0.10 0.13 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 6000 - 6500 : [ 1636 0.19 0.32 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 6500 - 7000 : [ 1750 0.20 0.53 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 7000 - 7500 : [ 1683 0.20 0.72 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 7500 - 8000 : [ 1354 0.16 0.88 ] |XXXXXXXXXXXXXXXXXXX 8000 - 8500 : [ 843 0.10 0.98 ] |XXXX 8500 - 9000 : [ 157 0.02 0.99 ] | 9000 - 9500 : [ 8 0.00 0.99 ] | 9500 - 10000 : [ 2 0.00 0.99 ] | 10000 - 10500 : [ 6 0.00 1.00 ] | 10500 - 11000 : [ 1 0.00 1.00 ] | 11000 - 11500 : [ 4 0.00 1.00 ] | 11500 - 12000 : [ 3 0.00 1.00 ] | 12000 - 12500 : [ 2 0.00 1.00 ] | 12500 - 13000 : [ 1 0.00 1.00 ] #... | 20500 - 21000 : [ 1 0.00 1.00 ] #... | 41000 - 41500 : [ 1 0.00 1.00 ] #... | 63500 - 64000 : [ 1 0.00 1.00 ] #... | 66500 - 67000 : [ 1 0.00 1.00 ] #... | 84000 - 84500 : [ 1 0.00 1.00 ] #... | 87000 - 87500 : [ 1 0.00 1.00 ] | 87500 - 88000 : [ 1 0.00 1.00 ] #... | 92000 - 92500 : [ 1 0.00 1.00 ] #... | 104500 - 105000 : [ 1 0.00 1.00 ] #... | 109000 - 109500 : [ 1 0.00 1.00 ] #... | 111000 - 111500 : [ 1 0.00 1.00 ] #... | 133500 - 134000 : [ 1 0.00 1.00 ] #... | 157500 - 158000 : [ 1 0.00 1.00 ] #... | 195500 - 196000 : [ 1 0.00 1.00 ] #... | 210500 - 211000 : [ 1 0.00 1.00 ] #... | 226500 - 227000 : [ 1 0.00 1.00 ] | 227000 - 227500 : [ 1 0.00 1.00 ] #... | 262000 - 262500 : [ 1 0.00 1.00 ] #... | 359500 - 360000 : [ 1 0.00 1.00 ] #... | 437500 - 438000 : [ 1 0.00 1.00 ] #... | 445000 - 445500 : [ 1 0.00 1.00 ] #... | 449500 - 450000 : [ 1 0.00 1.00 ] #... | 472500 - 473000 : [ 1 0.00 1.00 ] #... | 517000 - 517500 : [ 1 0.00 1.00 ] #... | 669500 - 670000 : [ 1 0.00 1.00 ] #... | 701000 - 701500 : [ 1 0.00 1.00 ] #... | 916000 - 916500 : [ 1 0.00 1.00 ] ------------------------------------------------------------------- Command: /usr/xpg4/bin/grep AIFU reads.list > grep.reads.list.AIFU Command: /home/copeland/scripts/histogram2.pl grep.reads.list.AIFU 4 500 ------------------------------------------------------------------- #Found 1215 total values totalling 45995741.0000. <37856.576955 +/- 3278.313939> #Range: [ 26080 - 45950 ] #Most likely bin: [ 37500 - 38000 ] 78 counts #Median bin: [ 37500 - 38000 ] 78 counts #Histogram Bins Count Fraction Cum_Fraction |X 26000 - 26500 : [ 1 0.00 0.00 ] |X 26500 - 27000 : [ 1 0.00 0.00 ] |X 27000 - 27500 : [ 1 0.00 0.00 ] |X 27500 - 28000 : [ 2 0.00 0.00 ] |X 28000 - 28500 : [ 1 0.00 0.00 ] |XX 28500 - 29000 : [ 4 0.00 0.01 ] |XX 29000 - 29500 : [ 3 0.00 0.01 ] |XX 29500 - 30000 : [ 3 0.00 0.01 ] |X 30000 - 30500 : [ 2 0.00 0.01 ] |XXX 30500 - 31000 : [ 6 0.00 0.02 ] |XXXXX 31000 - 31500 : [ 9 0.01 0.03 ] |XXXXXX 31500 - 32000 : [ 12 0.01 0.04 ] |XXXXXXX 32000 - 32500 : [ 14 0.01 0.05 ] |XXXXXXXX 32500 - 33000 : [ 16 0.01 0.06 ] |XXXXXXXXXXXX 33000 - 33500 : [ 24 0.02 0.08 ] |XXXXXXXXXXXXXXXX 33500 - 34000 : [ 31 0.03 0.11 ] |XXXXXXXXXXXXXXXXXXXXXXX 34000 - 34500 : [ 44 0.04 0.14 ] |XXXXXXXXXXXXXXXXXXXXXXXX 34500 - 35000 : [ 46 0.04 0.18 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 35000 - 35500 : [ 56 0.05 0.23 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 35500 - 36000 : [ 77 0.06 0.29 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36000 - 36500 : [ 64 0.05 0.34 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 36500 - 37000 : [ 73 0.06 0.40 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 37000 - 37500 : [ 77 0.06 0.47 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 37500 - 38000 : [ 78 0.06 0.53 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 38000 - 38500 : [ 68 0.06 0.59 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 38500 - 39000 : [ 71 0.06 0.65 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 39000 - 39500 : [ 56 0.05 0.69 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 39500 - 40000 : [ 62 0.05 0.74 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXX 40000 - 40500 : [ 53 0.04 0.79 ] |XXXXXXXXXXXXXXXXXXX 40500 - 41000 : [ 37 0.03 0.82 ] |XXXXXXXXXXXXXXXXXXXXXX 41000 - 41500 : [ 43 0.04 0.85 ] |XXXXXXXXXXXXXXXXXXX 41500 - 42000 : [ 38 0.03 0.88 ] |XXXXXXXXXXXXXXXXX 42000 - 42500 : [ 34 0.03 0.91 ] |XXXXXXXXXXXXXXXXXX 42500 - 43000 : [ 35 0.03 0.94 ] |XXXXXXXXXX 43000 - 43500 : [ 20 0.02 0.96 ] |XXXXXXXX 43500 - 44000 : [ 16 0.01 0.97 ] |XXXXXXX 44000 - 44500 : [ 14 0.01 0.98 ] |XXXXXX 44500 - 45000 : [ 12 0.01 0.99 ] |XX 45000 - 45500 : [ 4 0.00 0.99 ] |XXXX 45500 - 46000 : [ 7 0.01 1.00 ] ------------------------------------------------------------------- Estimated Assembled Average Insert Sizes: Command: /home/copeland/scripts/estInsertSize.pl -f phrap.out ------------------------------------------------------------------- # AIFU 37962 +- 3361 (n=602) # AIFS 3197 +- 1348 (n=4404) # AIFT 6852 +- 1179 (n=4278) ------------------------------------------------------------------- Library / Read Quality summary extracted from 3634510_fasta.screen.trimQ15.SaF and database (md_run table) * note J15 is Jazz trimmed length, Q20 is count of Quality 20+ bases ------------------------------------------------------------------- DB |----J15---| |----Q20---| Fasta |----J15---| |----Q20---| LIB Reads %pass AvgLen %pass AvgNum Reads %pass AvgLen %pass AvgNun AIFS 23808 -1 -1 98 752 22995 100 713 100 723 AIFT 21504 -1 -1 97 708 19505 99 745 100 738 AIFU 3072 -1 -1 96 607 2854 98 635 100 647 FWD |----J15---| |----Q20---| REV |----J15---| |----Q20---| LIB Reads %pass AvgLen %pass AvgNum Reads %pass AvgLen %pass AvgNun AIFS 11481 100 712 100 726 11514 100 714 100 720 AIFT 9781 99 759 100 751 9724 99 731 100 725 AIFU 1425 98 638 100 653 1429 98 633 100 642 ------------------------------------------------------------------- reads2plates summary extracted from file: 3634510_fasta.screen.r2p [ from "READ/CLONE COUNT SUMMARY" to end ] ------------------------------------------------------------------- plate(s) reads clones LIBRARY @ ] 45354 22163 92.35 cumulative total @@ Only indicates plates present in input file. Make no assumption regarding plates (not) present in project that do not appear above. ------------------------------------------------------------------- trimt JAZZ trim 15 readlength histogram: Command: /home/copeland/scripts/histogram2.pl 3634510_fasta.screen.trimQ15.SaF 4 50 ------------------------------------------------------------------- #Found 45354 total values totalling 32556608.0000. <717.833223 +/- 143.842748> #Range: [ 18 - 1034 ] #Most likely bin: [ 750 - 800 ] 12017 counts #Median bin: [ 750 - 800 ] 12017 counts #Histogram Bins Count Fraction Cum_Fraction | 0 - 50 : [ 80 0.00 0.00 ] |X 50 - 100 : [ 212 0.00 0.01 ] |X 100 - 150 : [ 215 0.00 0.01 ] |X 150 - 200 : [ 294 0.01 0.02 ] |X 200 - 250 : [ 274 0.01 0.02 ] |X 250 - 300 : [ 370 0.01 0.03 ] |X 300 - 350 : [ 344 0.01 0.04 ] |XX 350 - 400 : [ 452 0.01 0.05 ] |XX 400 - 450 : [ 570 0.01 0.06 ] |XX 450 - 500 : [ 690 0.02 0.08 ] |XXX 500 - 550 : [ 966 0.02 0.10 ] |XXXXX 550 - 600 : [ 1435 0.03 0.13 ] |XXXXXXXX 600 - 650 : [ 2335 0.05 0.18 ] |XXXXXXXXXXXXXXX 650 - 700 : [ 4559 0.10 0.28 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXX 700 - 750 : [ 8514 0.19 0.47 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 750 - 800 : [ 12017 0.26 0.73 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 800 - 850 : [ 9022 0.20 0.93 ] |XXXXXXXX 850 - 900 : [ 2421 0.05 0.99 ] |X 900 - 950 : [ 442 0.01 1.00 ] | 950 - 1000 : [ 132 0.00 1.00 ] | 1000 - 1050 : [ 10 0.00 1.00 ] trimt JAZZ trim 15 readlength histogram for AIFS ------------------------------------------------------------------- Command: /usr/xpg4/bin/grep AIFS 3634510_fasta.screen.trimQ15.SaF > reads.trim15.AIFS.rl Command: /home/copeland/scripts/histogram2.pl reads.trim15.AIFS.rl 2 50 ------------------------------------------------------------------- #Found 22995 total values totalling 16336953.0000. <710.456751 +/- 141.714634> #Range: [ 25 - 1034 ] #Most likely bin: [ 750 - 800 ] 5507 counts #Median bin: [ 700 - 750 ] 4891 counts #Histogram Bins Count Fraction Cum_Fraction | 0 - 50 : [ 22 0.00 0.00 ] |X 50 - 100 : [ 74 0.00 0.00 ] |X 100 - 150 : [ 96 0.00 0.01 ] |X 150 - 200 : [ 114 0.00 0.01 ] |X 200 - 250 : [ 164 0.01 0.02 ] |X 250 - 300 : [ 189 0.01 0.03 ] |X 300 - 350 : [ 192 0.01 0.04 ] |XX 350 - 400 : [ 278 0.01 0.05 ] |XX 400 - 450 : [ 329 0.01 0.06 ] |XXX 450 - 500 : [ 418 0.02 0.08 ] |XXXX 500 - 550 : [ 515 0.02 0.10 ] |XXXXXX 550 - 600 : [ 828 0.04 0.14 ] |XXXXXXXXXX 600 - 650 : [ 1391 0.06 0.20 ] |XXXXXXXXXXXXXXXXXXXXX 650 - 700 : [ 2868 0.12 0.33 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 700 - 750 : [ 4891 0.21 0.54 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 750 - 800 : [ 5507 0.24 0.78 ] |XXXXXXXXXXXXXXXXXXXXXXXX 800 - 850 : [ 3337 0.15 0.92 ] |XXXXXXXXX 850 - 900 : [ 1214 0.05 0.98 ] |XXX 900 - 950 : [ 426 0.02 0.99 ] |X 950 - 1000 : [ 132 0.01 1.00 ] | 1000 - 1050 : [ 10 0.00 1.00 ] trimt JAZZ trim 15 readlength histogram for AIFT ------------------------------------------------------------------- Command: /usr/xpg4/bin/grep AIFT 3634510_fasta.screen.trimQ15.SaF > reads.trim15.AIFT.rl Command: /home/copeland/scripts/histogram2.pl reads.trim15.AIFT.rl 2 50 ------------------------------------------------------------------- #Found 19505 total values totalling 14443895.0000. <740.522686 +/- 133.633146> #Range: [ 18 - 925 ] #Most likely bin: [ 750 - 800 ] 5995 counts #Median bin: [ 750 - 800 ] 5995 counts #Histogram Bins Count Fraction Cum_Fraction | 0 - 50 : [ 41 0.00 0.00 ] |X 50 - 100 : [ 89 0.00 0.01 ] |X 100 - 150 : [ 89 0.00 0.01 ] |X 150 - 200 : [ 148 0.01 0.02 ] |X 200 - 250 : [ 79 0.00 0.02 ] |X 250 - 300 : [ 87 0.00 0.03 ] |X 300 - 350 : [ 93 0.00 0.03 ] |X 350 - 400 : [ 128 0.01 0.04 ] |X 400 - 450 : [ 180 0.01 0.05 ] |X 450 - 500 : [ 191 0.01 0.06 ] |XX 500 - 550 : [ 265 0.01 0.07 ] |XX 550 - 600 : [ 357 0.02 0.09 ] |XXXXX 600 - 650 : [ 690 0.04 0.12 ] |XXXXXXXXX 650 - 700 : [ 1306 0.07 0.19 ] |XXXXXXXXXXXXXXXXXXXX 700 - 750 : [ 3020 0.15 0.35 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 750 - 800 : [ 5995 0.31 0.65 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 800 - 850 : [ 5530 0.28 0.94 ] |XXXXXXXX 850 - 900 : [ 1201 0.06 1.00 ] | 900 - 950 : [ 16 0.00 1.00 ] trimt JAZZ trim 15 readlength histogram for AIFU ------------------------------------------------------------------- Command: /usr/xpg4/bin/grep AIFU 3634510_fasta.screen.trimQ15.SaF > reads.trim15.AIFU.rl Command: /home/copeland/scripts/histogram2.pl reads.trim15.AIFU.rl 2 50 ------------------------------------------------------------------- #Found 2854 total values totalling 1775760.0000. <622.200420 +/- 178.453231> #Range: [ 25 - 870 ] #Most likely bin: [ 700 - 750 ] 603 counts #Median bin: [ 650 - 700 ] 385 counts #Histogram Bins Count Fraction Cum_Fraction |X 0 - 50 : [ 17 0.01 0.01 ] |XXX 50 - 100 : [ 49 0.02 0.02 ] |XX 100 - 150 : [ 30 0.01 0.03 ] |XX 150 - 200 : [ 32 0.01 0.04 ] |XX 200 - 250 : [ 31 0.01 0.06 ] |XXXXXX 250 - 300 : [ 94 0.03 0.09 ] |XXXX 300 - 350 : [ 59 0.02 0.11 ] |XXX 350 - 400 : [ 46 0.02 0.13 ] |XXXX 400 - 450 : [ 61 0.02 0.15 ] |XXXXX 450 - 500 : [ 81 0.03 0.18 ] |XXXXXXXXXXXX 500 - 550 : [ 186 0.07 0.24 ] |XXXXXXXXXXXXXXXXX 550 - 600 : [ 250 0.09 0.33 ] |XXXXXXXXXXXXXXXXX 600 - 650 : [ 254 0.09 0.42 ] |XXXXXXXXXXXXXXXXXXXXXXXXXX 650 - 700 : [ 385 0.13 0.55 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 700 - 750 : [ 603 0.21 0.76 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 750 - 800 : [ 515 0.18 0.94 ] |XXXXXXXXXX 800 - 850 : [ 155 0.05 1.00 ] | 850 - 900 : [ 6 0.00 1.00 ] trimt JAZZ trim 15 readlength histogram for AOOY trimt JAZZ trim 15 readlength histogram for ATGY ------------------------------------------------------------------- Library vector screening Command: /home/copeland/scripts/checkScreen.sh 3634510 ------------------------------------------------------------------- AIFS.000001.000100 pUC18.fa LRS.fasta AIFS.000101.000200 pUC18.fa LRS.fasta AIFT.000001.000100 pMCL200.fa LRS.fasta AIFT.000101.000200 pMCL200.fa LRS.fasta AIFU.000001.000100 pCC1Fos.fa LRS.fasta AIFS.000001.000100 pUC18.fa LRS.fasta AIFS.000101.000200 pUC18.fa LRS.fasta AIFT.000001.000100 pMCL200.fa LRS.fasta AIFT.000101.000200 pMCL200.fa LRS.fasta AIFU.000001.000100 pCC1Fos.fa LRS.fasta ------------------------------------------------------------------- GC content histogram: Command: /bin/nawk '{print $5+$6}' GC.3634510_fasta.screen.trimQ20 | /home/copeland/scripts/histogram2.pl - 1 0.005 ------------------------------------------------------------------- # GC.3634510_fasta.screen.trimQ20 | nawk 'NR>1 {print $5+$6}' | /home/jchapman/perlscripts/histogram2.pl - 1 0.005 #Found 45262 total values totalling 29847.1936. <0.659432 +/- 0.042571> #Range: [ 0.2592 - 0.9131 ] #Most likely bin: [ 0.66 - 0.665 ] 2546 counts #Median bin: [ 0.66 - 0.665 ] 2546 counts | 0.255 - 0.26 : [ 1 0.00 0.00 ] #... | 0.325 - 0.33 : [ 1 0.00 0.00 ] #... | 0.345 - 0.35 : [ 1 0.00 0.00 ] #... | 0.405 - 0.41 : [ 1 0.00 0.00 ] #... | 0.415 - 0.42 : [ 3 0.00 0.00 ] #... | 0.425 - 0.43 : [ 1 0.00 0.00 ] | 0.43 - 0.435 : [ 4 0.00 0.00 ] | 0.435 - 0.44 : [ 6 0.00 0.00 ] | 0.44 - 0.445 : [ 1 0.00 0.00 ] | 0.445 - 0.45 : [ 4 0.00 0.00 ] | 0.45 - 0.455 : [ 7 0.00 0.00 ] | 0.455 - 0.46 : [ 6 0.00 0.00 ] | 0.46 - 0.465 : [ 12 0.00 0.00 ] | 0.465 - 0.47 : [ 15 0.00 0.00 ] | 0.47 - 0.475 : [ 8 0.00 0.00 ] | 0.475 - 0.48 : [ 18 0.00 0.00 ] | 0.48 - 0.485 : [ 20 0.00 0.00 ] | 0.485 - 0.49 : [ 18 0.00 0.00 ] | 0.49 - 0.495 : [ 15 0.00 0.00 ] | 0.495 - 0.5 : [ 23 0.00 0.00 ] |X 0.5 - 0.505 : [ 45 0.00 0.00 ] |X 0.505 - 0.51 : [ 48 0.00 0.01 ] |X 0.51 - 0.515 : [ 42 0.00 0.01 ] |X 0.515 - 0.52 : [ 62 0.00 0.01 ] |X 0.52 - 0.525 : [ 60 0.00 0.01 ] |X 0.525 - 0.53 : [ 87 0.00 0.01 ] |XX 0.53 - 0.535 : [ 96 0.00 0.01 ] |XX 0.535 - 0.54 : [ 133 0.00 0.02 ] |XX 0.54 - 0.545 : [ 128 0.00 0.02 ] |XX 0.545 - 0.55 : [ 123 0.00 0.02 ] |XXX 0.55 - 0.555 : [ 162 0.00 0.03 ] |XXX 0.555 - 0.56 : [ 174 0.00 0.03 ] |XXX 0.56 - 0.565 : [ 214 0.00 0.03 ] |XXX 0.565 - 0.57 : [ 210 0.00 0.04 ] |XXXX 0.57 - 0.575 : [ 227 0.01 0.04 ] |XXXX 0.575 - 0.58 : [ 253 0.01 0.05 ] |XXXXX 0.58 - 0.585 : [ 307 0.01 0.06 ] |XXXXX 0.585 - 0.59 : [ 315 0.01 0.06 ] |XXXXX 0.59 - 0.595 : [ 323 0.01 0.07 ] |XXXXXX 0.595 - 0.6 : [ 412 0.01 0.08 ] |XXXXXXXX 0.6 - 0.605 : [ 527 0.01 0.09 ] |XXXXXXXXXX 0.605 - 0.61 : [ 612 0.01 0.10 ] |XXXXXXXXXXXX 0.61 - 0.615 : [ 777 0.02 0.12 ] |XXXXXXXXXXXXXXX 0.615 - 0.62 : [ 942 0.02 0.14 ] |XXXXXXXXXXXXXXXXX 0.62 - 0.625 : [ 1104 0.02 0.17 ] |XXXXXXXXXXXXXXXXXXXX 0.625 - 0.63 : [ 1265 0.03 0.19 ] |XXXXXXXXXXXXXXXXXXXXXXX 0.63 - 0.635 : [ 1465 0.03 0.23 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXX 0.635 - 0.64 : [ 1734 0.04 0.27 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 0.64 - 0.645 : [ 1923 0.04 0.31 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 0.645 - 0.65 : [ 2034 0.04 0.35 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 0.65 - 0.655 : [ 2289 0.05 0.40 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 0.655 - 0.66 : [ 2416 0.05 0.46 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 0.66 - 0.665 : [ 2546 0.06 0.51 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 0.665 - 0.67 : [ 2450 0.05 0.57 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 0.67 - 0.675 : [ 2531 0.06 0.62 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 0.675 - 0.68 : [ 2501 0.06 0.68 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 0.68 - 0.685 : [ 2383 0.05 0.73 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 0.685 - 0.69 : [ 2257 0.05 0.78 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXX 0.69 - 0.695 : [ 1874 0.04 0.82 ] |XXXXXXXXXXXXXXXXXXXXXXXXXX 0.695 - 0.7 : [ 1627 0.04 0.86 ] |XXXXXXXXXXXXXXXXXXXXXXX 0.7 - 0.705 : [ 1485 0.03 0.89 ] |XXXXXXXXXXXXXXXXXX 0.705 - 0.71 : [ 1129 0.02 0.92 ] |XXXXXXXXXXXXXX 0.71 - 0.715 : [ 910 0.02 0.94 ] |XXXXXXXXXXX 0.715 - 0.72 : [ 726 0.02 0.95 ] |XXXXXXXXX 0.72 - 0.725 : [ 574 0.01 0.96 ] |XXXXXXX 0.725 - 0.73 : [ 458 0.01 0.97 ] |XXXXX 0.73 - 0.735 : [ 327 0.01 0.98 ] |XXXX 0.735 - 0.74 : [ 231 0.01 0.99 ] |XXX 0.74 - 0.745 : [ 177 0.00 0.99 ] |XX 0.745 - 0.75 : [ 131 0.00 0.99 ] |X 0.75 - 0.755 : [ 84 0.00 1.00 ] |X 0.755 - 0.76 : [ 42 0.00 1.00 ] | 0.76 - 0.765 : [ 22 0.00 1.00 ] |X 0.765 - 0.77 : [ 33 0.00 1.00 ] | 0.77 - 0.775 : [ 20 0.00 1.00 ] | 0.775 - 0.78 : [ 9 0.00 1.00 ] | 0.78 - 0.785 : [ 10 0.00 1.00 ] | 0.785 - 0.79 : [ 10 0.00 1.00 ] | 0.79 - 0.795 : [ 6 0.00 1.00 ] | 0.795 - 0.8 : [ 4 0.00 1.00 ] | 0.8 - 0.805 : [ 7 0.00 1.00 ] | 0.805 - 0.81 : [ 2 0.00 1.00 ] | 0.81 - 0.815 : [ 1 0.00 1.00 ] | 0.815 - 0.82 : [ 3 0.00 1.00 ] | 0.82 - 0.825 : [ 3 0.00 1.00 ] | 0.825 - 0.83 : [ 5 0.00 1.00 ] | 0.83 - 0.835 : [ 2 0.00 1.00 ] | 0.835 - 0.84 : [ 1 0.00 1.00 ] | 0.84 - 0.845 : [ 1 0.00 1.00 ] | 0.845 - 0.85 : [ 1 0.00 1.00 ] | 0.85 - 0.855 : [ 2 0.00 1.00 ] #... | 0.88 - 0.885 : [ 1 0.00 1.00 ] #... | 0.895 - 0.9 : [ 1 0.00 1.00 ] #... | 0.91 - 0.915 : [ 1 0.00 1.00 ]

Contigs and Assemblies

------------------------------------------------------------------- Command: /usr/local/bin/contig > contig.out [ final 30 lines ] ------------------------------------------------------------------- -------------------------------------------------------------- Contig 1. 1 read; 618 bp (untrimmed), 349 (trimmed). Contig 2. 1 read; 988 bp (untrimmed), 132 (trimmed). Contig 3. 1 read; 558 bp (untrimmed), 0 (trimmed). Contig 4. 1 read; 676 bp (untrimmed), 250 (trimmed). Contig 5. 2 reads; 517 bp (untrimmed), 461 (trimmed). Contig 6. 2 reads; 232 bp (untrimmed), 213 (trimmed). Contig 7. 4 reads; 1463 bp (untrimmed), 1147 (trimmed). Contig 8. 5 reads; 146 bp (untrimmed), 146 (trimmed). Contig 9. 8 reads; 154 bp (untrimmed), 154 (trimmed). Contig 10. 9 reads; 1356 bp (untrimmed), 1352 (trimmed). Contig 11. 10 reads; 1800 bp (untrimmed), 1769 (trimmed). Contig 12. 66 reads; 2195 bp (untrimmed), 1269 (trimmed). Contig 13. 113 reads; 10705 bp (untrimmed), 10705 (trimmed). Contig 14. 456 reads; 34390 bp (untrimmed), 34373 (trimmed). Contig 15. 482 reads; 40002 bp (untrimmed), 39953 (trimmed). Contig 16. 1823 reads; 110259 bp (untrimmed), 109897 (trimmed). Contig 17. 4490 reads; 307135 bp (untrimmed), 306776 (trimmed). Contig 18. 4608 reads; 281556 bp (untrimmed), 281151 (trimmed). Contig 19. 5988 reads; 375289 bp (untrimmed), 375210 (trimmed). Contig 20. 12434 reads; 755205 bp (untrimmed), 755109 (trimmed). Contig 21. 14829 reads; 991012 bp (untrimmed), 990536 (trimmed). -------------------------------------------------------------- Totals 45333 reads; 2916256 bp (untrimmed), 2910952 (trimmed). ------------------------------------------------------------------- Depth Summary Command: /home/copeland/scripts/depth_summary.pl depth.out ------------------------------------------------------------------- depth.out contains 2912884 bases = 14.98 +- 4.75 = 0.10 +- 4.61 ------------------------------------------------------------------- Histogram of All Contig Depth Values: Command: /home/copeland/scripts/histogram2.pl depth.out 9 0.5 ------------------------------------------------------------------- #Found 14 total values totalling 148.4600. <10.604286 +/- 4.971251> #Range: [ 1.94 - 15.91 ] #Most likely bin: [ 15.5 - 16 ] 3 counts #Median bin: [ 11.5 - 12 ] 1 counts #Histogram Bins Count Fraction Cum_Fraction |XXXXXXXXXXXXX 1.5 - 2 : [ 1 0.07 0.07 ] |XXXXXXXXXXXXX 2 - 2.5 : [ 1 0.07 0.14 ] #... |XXXXXXXXXXXXX 5.5 - 6 : [ 1 0.07 0.21 ] |XXXXXXXXXXXXX 6 - 6.5 : [ 1 0.07 0.29 ] |XXXXXXXXXXXXX 6.5 - 7 : [ 1 0.07 0.36 ] #... |XXXXXXXXXXXXX 9.5 - 10 : [ 1 0.07 0.43 ] #... |XXXXXXXXXXXXX 11.5 - 12 : [ 1 0.07 0.50 ] #... |XXXXXXXXXXXXX 12.5 - 13 : [ 1 0.07 0.57 ] #... |XXXXXXXXXXXXX 13.5 - 14 : [ 1 0.07 0.64 ] |XXXXXXXXXXXXX 14 - 14.5 : [ 1 0.07 0.71 ] #... |XXXXXXXXXXXXX 15 - 15.5 : [ 1 0.07 0.79 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 15.5 - 16 : [ 3 0.21 1.00 ] ------------------------------------------------------------------- Histogram of Major Contig Depth Values: Command: /home/copeland/scripts/histogram2.pl depth.out 9 0.5 3 10 10000000 5 2000 10000000 ------------------------------------------------------------------- #Found 10 total values totalling 132.5000. <13.250000 +/- 2.913441> #Range: [ 6.55 - 15.91 ] #Most likely bin: [ 15.5 - 16 ] 3 counts #Median bin: [ 13.5 - 14 ] 1 counts #Histogram Bins Count Fraction Cum_Fraction |XXXXXXXXXXXXX 6.5 - 7 : [ 1 0.10 0.10 ] #... |XXXXXXXXXXXXX 9.5 - 10 : [ 1 0.10 0.20 ] #... |XXXXXXXXXXXXX 11.5 - 12 : [ 1 0.10 0.30 ] #... |XXXXXXXXXXXXX 12.5 - 13 : [ 1 0.10 0.40 ] #... |XXXXXXXXXXXXX 13.5 - 14 : [ 1 0.10 0.50 ] |XXXXXXXXXXXXX 14 - 14.5 : [ 1 0.10 0.60 ] #... |XXXXXXXXXXXXX 15 - 15.5 : [ 1 0.10 0.70 ] |XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 15.5 - 16 : [ 3 0.30 1.00 ] ------------------------------------------------------------------- Sorted Depth Values: Command: sort -n -k 9 depth.out > sorted.depth.out [first and last 20 lines included] ------------------------------------------------------------------- Contig 5 2 reads 517 bases = 1.94 +- 0.23 = 1.94 +- 0.23 Contig 7 4 reads 1463 bases = 2.05 +- 0.86 = 2.05 +- 0.86 Contig 11 10 reads 1800 bases = 5.68 +- 3.19 = 1.26 +- 1.93 Contig 10 9 reads 1356 bases = 6.29 +- 2.31 = 6.29 +- 2.31 Contig 12 66 reads 2195 bases = 6.55 +- 17.67 = 0.11 +- 2.72 Contig 13 113 reads 10705 bases = 9.98 +- 4.13 = 0.44 +- 5.58 Contig 15 482 reads 40002 bases = 11.78 +- 5.32 = 0.22 +- 4.60 Contig 14 456 reads 34390 bases = 12.79 +- 4.66 = 1.15 +- 4.68 Contig 17 4490 reads 307135 bases = 13.96 +- 4.68 = 0.06 +- 4.19 Contig 21 14829 reads 991012 bases = 14.48 +- 4.47 = 0.09 +- 4.50 Contig 19 5988 reads 375289 bases = 15.47 +- 4.74 = 0.01 +- 4.84 Contig 18 4608 reads 281556 bases = 15.76 +- 4.51 = 0.28 +- 4.82 Contig 20 12434 reads 755205 bases = 15.82 +- 4.72 = 0.00 +- 4.73 Contig 16 1823 reads 110259 bases = 15.91 +- 4.64 = 0.39 +- 4.46

Assembler Specific Info

------------------------------------------------------------------- Reads in assembly summary ------------------------------------------------------------------- Small Inserts = 1 HQ Discrepant reads = 7 Chimeric reads = 67 Suspect alignments = 241 ------------------------------------------------------------------- Assembly parameters ------------------------------------------------------------------- phrap version SPS - 3.57 SUN/Ultra-2/3 Equivalent to Phil Green's version 0.990329 Score matrix (set by value of penalty: -2) A C G T N X A 1 -2 -2 -2 0 -3 C -2 1 -2 -2 0 -3 G -2 -2 1 -2 0 -3 T -2 -2 -2 1 0 -3 N 0 0 0 0 0 0 X -3 -3 -3 -3 0 -3 gap_init: -4 gap_ext: -3 ins_gap_ext: -3 del_gap_ext: -3 Using complexity-adjusted scores. Assumed background frequencies: A: 0.250 C: 0.250 G: 0.250 T: 0.250 N: 0.000 X: 0.000 minmatch: 30 maxmatch: 55 max_group_size: 20 minscore: 55 bandwidth: 14 indexwordsize: 10 vector_bound: 20 word_raw: 0 trim_penalty: -2 trim_score: 20 trim_qual: 13 maxgap: 30 repeat_stringency: 0.950000 qual_show: 20 confirm_length: 8 confirm_trim: 1 confirm_penalty: -5 confirm_score: 30 node_seg: 8 node_space: 4 forcelevel: 0 bypasslevel: 1 max_subclone_size: 50000 File generated in /psf/project/finished2/microbe/3634510/edit_dir.25Feb04.QD