Methanobrevibacter smithii: Difference between revisions
Jump to navigation
Jump to search
Line 131: | Line 131: | ||
TS95-5.16 1259 45 63 85 285 43875 1576 12739 1984835 209415 | TS95-5.16 1259 45 63 85 285 43875 1576 12739 1984835 209415 | ||
== velvet.Illumina == | == velvet.Illumina (CBCB) == | ||
#ctgs min q1 q2 q3 max mean n50 sum 0cgv | #ctgs min q1 q2 q3 max mean n50 sum 0cgv | ||
Line 148: | Line 148: | ||
TS95-4.15 751 45 63 98 887 51451 2455 15583 1844453 259266 | TS95-4.15 751 45 63 98 887 51451 2455 15583 1844453 259266 | ||
TS95-5.16 881 45 65 76 350 47906 2226 15468 1961518 212351 | TS95-5.16 881 45 65 76 350 47906 2226 15468 1961518 212351 | ||
== velvet.Illumina (WUSTL) == | |||
#ctgs | |||
FR1LH1.1 403 | |||
FR1LH3.2 274 | |||
FR1LH6.3 291 | |||
TS145A.4 1127 | |||
TS145B.5 445 | |||
TS94-3.11 449 | |||
TS94-5.12 468 | |||
TS95-2.13 4476 | |||
TS95-3.14 484 | |||
TS95-4.15 508 | |||
TS95-5.16 469 | |||
== newbler.454 (CBCB) == | == newbler.454 (CBCB) == |
Revision as of 16:09, 4 January 2010
Data
NCBI
1 complete + 2 draft assembly strains:
Methanobrevibacter smithii ATCC 35061
- complete: NC_009515.1
- 1,853,160bp
- Ref: /fs/szdata/ncbi/ftp.ncbi.nih.gov/genomes/Bacteria/Methanobrevibacter_smithii_ATCC_35061/*fna
- Published: Genomic and metabolic adaptations of Methanobrevibacter smithii to the human gut PNAS
- Assembled: Phrap and PCAP
- Repeats.png
min%identity elem min q1 q2 q3 max mean n50 sum 94 70 1102 1360 1383 1393 1408 1373 1383 96142 99 66 1358 1363 1383 1394 1408 1381 1383 91155
Methanobrevibacter smithii DSM 2374
- draft: NZ_ABYV02000000
- 1,727,775 bp
- 25 contigs: NZ_ABYV02000001 .. NZ_ABYV02000025
Methanobrevibacter smithii DSM 2375
- draft: NZ_ABYW00000000
- 1,704,865 bp
- 24 contigs: NZ_ABYW01000001 .. NZ_ABYW01000024
Methanobrevibacter smithii DSM 11975
- progress
WUSTL (Gordon Lab)
/fs/szattic-asmg4/methanobrevibacter_smithii/Data
- 22 strains
- half-way through sequencing (should have all the data by early to mid-January)
- right now (--Dpuiu 15:19, 15 December 2009 (EST)):
- 10 strains sequenced by GAII Illumina (36mers) with 3-8 million reads per strain (coverage is 50-150x),
- 12 strains sequenced by 454-Titanium sequencing, with 20,000 to 90,000 reads per strain (coverage is ~5-20x).
- 7 strains sequenced by Illumina and 454
Illumina: 36 bp single reads
nl strain reads bases cvg 1 FR1LH1 5186537 186715332 100 2 FR1LH3 7211080 259598880 140 3 FR1LH6 4968262 178857432 96 4 TS145A 6536457 235312452 126 5 TS145B 8277390 297986040 160 6 TS94-3 4886376 175909536 94 7 TS94-5 4785200 172267200 92 8 TS95-2 2896065 104258340 56 9 TS95-3 5064150 182309400 98 10 TS95-4 3557512 128070432 69 11 TS95-5 4559830 164153880 88
454: avg 342 bp single reads
nl strain reads bases cvg 1 TS145A 83667 28587141 15 2 TS145B 45203 15720997 8 3 TS146-3 49854 17608862 10 4 TS146e4 58633 18306825 10 5 TS146e5A 27844 8311560 4 6 TS146e5B 73182 23547825 13 7 TS147e8 68487 20662109 11 8 TS94-3 449545 157306990 85 9 TS94-5 76513 26734802 14 10 TS95-2 73255 25779806 14 11 TS95-4 85737 29231201 16 12 TS95-5 96757 35000794 19
Illumina & 454 : 7 strands have both Illumina & 454 reads
nl strand #reads #bases cvg #read #bases cvg avg%idFinishedGenome 1 FR1LH1 5186537 186715332 100 . . . 2 FR1LH3 7211080 259598880 140 . . . 3 FR1LH6 4968262 178857432 96 . . . 4* TS145A 6536457 235312452 126 83667 28587141 15 98 5 TS145B 8277390 297986040 160 45203 15720997 8 6 TS146-3 . . . 49854 17608862 10 7 TS146e4 . . . 58633 18306825 10 8* TS146e5A . . . 27844 8311560 4 98 9 TS146e5B . . . 73182 23547825 13 10 TS147e8 . . . 68487 20662109 11 11* TS94-3 4886376 175909536 94 449545 157306990 85 92 12 TS94-5 4785200 172267200 92 76513 26734802 14 13 TS95-2 2896065 104258340 56 73255 25779806 14 14 TS95-3 5064150 182309400 98 . . . 15 TS95-4 3557512 128070432 69 85737 29231201 16 16 TS95-5 4559830 164153880 88 96757 35000794 19
Based on nucmer alignments of finished genome and newbler contigs looks like
- TS145A.4 & TS146e5A.8 99% id
- TS145A.4 & TS94-3.11 92% id
Assembly
AMOScmp.Illumina
#ctgs min q1 q2 q3 max mean n50 sum 0cgv FR1LH1.1 1684 36 55 109 650 33114 958 5106 1614205 275309 FR1LH3.2 1642 36 55 106 631 33115 984 5331 1616391 273301 FR1LH6.3 1649 36 55 108 660 26387 979 5180 1615744 273654 TS145A.4 1918 36 59 124 805 28850 838 3234 1608620 284953 TS145B.5 1723 36 58 110 647 33114 936 4463 1612823 277233 TS94-5.12 8859 36 56 85 155 10840 159 242 1408957 658124 TS95-2.13 9535 36 55 82 147 7508 144 209 1380166 695098 TS95-3.14 9152 36 56 85 154 10159 156 238 1436137 642175 TS95-4.15 8859 36 56 85 155 10840 159 242 1408957 658124 TS95-5.16 9238 36 55 84 152 10160 155 235 1435894 645655
SOAPdenovo.Illumina
#ctgs min q1 q2 q3 max mean n50 sum 0cgv FR1LH1.1 774 45 75 137 2158 50285 2353 10320 1821758 218968 FR1LH3.2 767 45 73 108 977 56788 2371 14018 1818948 221732 FR1LH6.3 757 45 74 129 1396 50013 2402 12294 1818674 219385 TS145A.4 1798 45 127 459 1381 13456 991 2225 1781905 230976 TS145B.5 1098 45 83 306 2231 20523 1631 4869 1791302 216380 TS94-3.11 1405 45 59 98 634 35690 1349 7940 1895871 249447 TS94-5.12 1562 45 58 101 706 25048 1216 6456 1899512 251003 TS95-2.13 6611 45 131 215 364 2994 281 375 1863491 404797 TS95-3.14 1687 45 58 85 388 25368 1184 6931 1997443 213660 TS95-4.15 1562 45 58 101 706 25048 1216 6456 1899512 251003 TS95-5.16 1259 45 63 85 285 43875 1576 12739 1984835 209415
velvet.Illumina (CBCB)
#ctgs min q1 q2 q3 max mean n50 sum 0cgv FR1LH1.1 518 45 73 158 3131 67401 3491 16101 1808646 219880 FR1LH3.2 532 45 72 105 729 111372 3382 27042 1799368 227816 FR1LH6.3 547 45 72 111 992 85069 3309 20751 1810235 220469 TS145A.4 1002 45 398 1154 2228 14317 1575 2707 1578999 355737 TS145B.5 687 45 105 773 3424 31627 2567 6873 1763953 223089 TS94-3.11 738 45 63 102 755 99478 2526 17184 1864601 250197 TS94-5.12 751 45 63 98 887 51451 2455 15583 1844453 259266 TS95-2.13 7235 45 107 198 349 2547 265 370 1922674 382074 TS95-3.14 1018 45 68 85 341 60536 1934 15001 1969398 211351 TS95-4.15 751 45 63 98 887 51451 2455 15583 1844453 259266 TS95-5.16 881 45 65 76 350 47906 2226 15468 1961518 212351
velvet.Illumina (WUSTL)
#ctgs FR1LH1.1 403 FR1LH3.2 274 FR1LH6.3 291 TS145A.4 1127 TS145B.5 445 TS94-3.11 449 TS94-5.12 468
TS95-2.13 4476 TS95-3.14 484 TS95-4.15 508 TS95-5.16 469
newbler.454 (CBCB)
#ctgs min q1 q2 q3 max mean n50 sum 0cgv TS145A.4 48 131 883 9320 56233 210376 37056 112225 1778716 196515 TS145B.5 117 104 2185 7850 22256 79935 15147 34389 1772299 203614 TS146-3.6 441 100 248 398 497 106331 4295 37197 1894514 202108 TS146e4.7 99 135 926 13013 24440 147105 17974 36272 1779476 198936 TS146e5A.8 1183 98 326 770 1953 19970 1446 2812 1710725 350637 TS146e5B.9 432 100 191 325 462 212980 4411 106361 1905607 195632 TS147e8.10 86 132 1662 8952 35506 122338 22610 55697 1944471 221774 TS94-3.11 723 96 238 361 456 395484 2934 91958 2121818 224389 TS94-5.12 67 103 778 5001 46879 173674 28164 82955 1887011 226423 TS95-2.13 78 108 626 5843 45099 140027 25376 88829 1979337 187754 TS95-4.15 58 111 387 11498 59591 169655 34241 89436 1986032 186252 TS95-5.16 58 103 286 5198 59620 188214 34081 115574 1976737 186181
newbler.454 (WUSTL)
#ctgs max mean n50 sum TS145A.4 65 166070 41269 82515 1779098 TS145B.5 138 79979 15635 34155 1771201 TS146-3.6 391 83020 13810 29243 1876413 TS146e4.7 152 73674 13939 25059 1774990 TS146e5A.8 1216 7108 1365 1612 1439275 worse TS146e5B.9 115 129166 22375 52376 1801557 TS147e8.10 145 140484 17133 41840 1942004 TS94-3.11 598 284245 18746 106861 2066217 TS94-5.12 93 189466 30334 71682 1886916 TS95-2.13 85 136894 34695 73086 1983043 TS95-4.15 69 200840 50706 115852 1983803 TS95-5.16 54 191648 53368 91183 1978099
best
#ctgs min q1 q2 q3 max mean n50 sum 0cgv FR1LH1.1 169 47 307 3469 13513 113154 10746 29975 1816093 210847 FR1LH3.2 159 47 162 2932 13757 113158 11288 37473 1794914 221522 FR1LH6.3 157 50 269 3785 15911 126755 11603 29272 1821689 211256 TS145A.4 48 131 883 9320 56233 210376 37056 112225 1778716 196515 TS145B.5 53 71 1018 11425 57301 147750 33590 83876 1780320 198279 TS146-3.6 441 100 248 398 497 106331 4295 37197 1894514 202108 TS146e4.7 99 135 926 13013 24440 147105 17974 36272 1779476 198936 TS146e5A.8 1183 98 326 770 1953 19970 1446 2812 1710725 350637 TS146e5B.9 432 100 191 325 462 212980 4411 106361 1905607 195632 TS147e8.10 86 132 1662 8952 35506 122338 22610 55697 1944471 221774 TS94-3.11 38 55 764 9841 66968 395587 49738 144195 1890074 223341 TS94-5.12 49 65 766 7044 51688 285237 38534 111850 1888200 225957 TS95-2.13 57 110 1383 9455 50972 165660 34754 94352 1981005 182973 TS95-3.14 408 47 141 419 5698 52172 4663 16369 1902753 202507 TS95-4.15 79 47 147 2317 16017 200685 26493 138013 2092967 179552 TS95-5.16 47 69 311 14482 77588 188308 42099 115630 1978688 184590
Strains
TS145A.4
- 126x Illumina & 15x 454
- Read assemblies:
#ctgs min q1 q2 q3 max mean n50 sum 0cgv AMOScmp.Illumina 1918 36 59 124 805 28850 838 3234 1608620 284953 SOAPdenovo.Illumina* 1798 45 127 459 1381 13456 991 2225 1781905 230976 velvet.Illumina 1002 45 398 1154 2228 14317 1575 2707 1578999 355737 AMOScmp.454 135 114 1012 4414 12554 110913 12255 36870 1654428 203524 CA.454 203 1002 2802 6431 12196 41845 8646 13047 1755208 245426 newbler.454** 48 131 883 9320 56233 210376 37056 112225 1778716 196515 SOAPdenovo.454 6531 45 103 179 306 1806 237 321 1553899 478469 velvet.454 21165 45 53 65 83 548 73 74 1561277 821841 SOAPdenovo.Illumina_454 5795 45 75 184 425 3983 317 588 1840810 243391 velvet.Illumina_454 4229 45 135 269 507 3529 384 584 1626062 396716
Contig assemblies(no singletons):
#ctgs min q1 q2 q3 max mean n50 sum 0cgv minimus2.SOAPdenovo.Illumina-newbler.454 41 46 883 17898 61580 287497 43446 114143 1781287 197555 # 3 contigs don't contain any newbler original ctg minimus2.velvet.Illumina-newbler.454 36 289 1802 38243 70184 210459 49402 113849 1778497 197611 # all contigs contain at least 1 newbler original ctg
Contig assemblies(include singletons):
#ctgs min q1 q2 q3 max mean n50 sum 0cgv minimus2.SOAPdenovo.Illumina-newbler.454 83 45 97 425 17898 287497 21689 114143 1800192 196885 minimus2.velvet.Illumina-newbler.454 61 45 677 2956 43814 210459 29821 113849 1819092 197424
TS146e5A.8
- 4x 454
- Read assemblies:
#ctgs min q1 q2 q3 max mean n50 sum 0cgv AMOScmp.454 736 56 675 1452 2902 17360 2092 3319 1540404 324145 newbler.454* 1183 98 326 770 1953 19970 1446 2812 1710725 350637
TS94-3.11
- 94x Illumina & 85x 454
- Read assemblies:
#ctgs min q1 q2 q3 max mean n50 sum 0cgv SOAPdenovo.Illumina 1405 45 59 98 634 35690 1349 7940 1895871 249447 velvet.Illumina 738 45 63 102 755 99478 2526 17184 1864601 250197 AMOScmp.454 255 52 585 2580 8321 61854 6531 16349 1665535 225050 newbler.454 723 96 238 361 456 395484 2934 91958 2121818 224389