Bos taurus: Difference between revisions
Revision as of 14:16, 9 December 2008
BCM
NCBI
SPECIES_CODE = "BOS TAURUS" : 37,788,710 traces
SPECIES_CODE = "BOS TAURUS" and CENTER_NAME = "BCM" : 35,596,825 traces
SPECIES_CODE = "BOS TAURUS" and CENTER_NAME = "BCM" and TRACE_TYPE_CODE = "WGS" : 24,863,627 traces
SPECIES_CODE = "BOS TAURUS" and CENTER_NAME = "BCM" and TRACE_TYPE_CODE = "SHOTGUN" : 10,716,306 traces
SPECIES_CODE = "BOS TAURUS" and CENTER_NAME = "BCM" and TRACE_TYPE_CODE = "CLONEEND" : 16,892 traces
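The trace counts above can be cross-checked with a few lines of arithmetic. The sketch below is only a sanity check on the numbers quoted on this page; the variable and function names are made up for illustration, and it does not query the NCBI Trace Archive.

```python
# Trace counts copied from the queries listed above (no archive lookup is done).
ALL_CENTERS_TOTAL = 37_788_710   # SPECIES_CODE = "BOS TAURUS"
BCM_TOTAL = 35_596_825           # ... and CENTER_NAME = "BCM"
BCM_BY_TYPE = {                  # ... and TRACE_TYPE_CODE = <key>
    "WGS": 24_863_627,
    "SHOTGUN": 10_716_306,
    "CLONEEND": 16_892,
}


def trace_count_check():
    """Return the per-type sum for BCM and the count left for other centers."""
    by_type_sum = sum(BCM_BY_TYPE.values())
    return {
        "bcm_by_type_sum": by_type_sum,
        "bcm_unaccounted": BCM_TOTAL - by_type_sum,
        "other_centers": ALL_CENTERS_TOTAL - BCM_TOTAL,
    }


if __name__ == "__main__":
    for key, value in trace_count_check().items():
        print(f"{key}: {value:,}")
```

With the numbers as quoted, the three BCM trace types add up to the BCM total, which suggests (but does not prove) that no other trace type was submitted by BCM.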
Submission
- Genome Project
- WGS id (GPID): 32899
- Title: "A whole-genome assembly of the cow, Bos taurus"
- Authors:
Steven Salzberg, Aleksey Zimin, Arthur Delcher, Liliana Florea, David Kelley, Finian Hanrahan, Guillaume Marcais, Geo Pertea, Michael Roberts, Michael Schatz, Curt Van Tassell, James Yorke, Poorani S.
- Assembler:
Celera Assembler and UMD Overlapper.
- Sequencing Center :
Baylor College of Medicine.
- Source of DNA used for sequencing:
The source of the BAC library DNA was Hereford bull L1 Domino 99375, registration number 41170496. Dr. Michael MacNeil's laboratory, USDA-ARS, Miles City, MT provided the blood. The DNA for the whole genome shotgun sequences was provided by Dr. Timothy Smith's laboratory, U.S. Meat Animal Research Center, Clay Center, NE from white blood cells from L1 Dominette 01449, American Hereford Association registration number 42190680 (a daughter of L1 Domino 99375). A skin cell fibroblast cell line from the same animal is available from Dr. Carol Chitko-McKown's laboratory, although there is no sequence from that cell line.
- Sequence modifiers:
[organism=Bos taurus][breed=Hereford][tech=wgs][chromosome=...]
- Submission: Use sequin
/nfshomes/dpuiu/szdevel/sequin.8.10/sequin
- Sequence:
Contig length summary:
          #seqs   min  max     mean   median  n50    sum
all       210657  71   840370  13709  1523    78511  2887902366
placed    75775   88   840370  34512  13416   88287  2615171268
unplaced  134882  71   166670  2022   1322    1742   272731098
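For reference, a summary with these columns can be computed from a list of contig lengths roughly as follows. This is a minimal sketch, not the script that produced the table: the function name `length_summary` is hypothetical, N50 is taken here as the length of the contig at which the running sum of lengths (longest first) reaches half the total, and the mean is rounded to the nearest integer, which matches the values shown but is an assumption about the original tool.

```python
from statistics import median


def length_summary(lengths):
    """Summarize contig lengths: count, min, max, mean, median, N50, sum."""
    if not lengths:
        raise ValueError("no lengths given")
    ordered = sorted(lengths, reverse=True)  # longest contig first
    total = sum(ordered)

    # N50: walk down from the longest contig until half the bases are covered.
    running = 0
    n50 = ordered[-1]
    for length in ordered:
        running += length
        if 2 * running >= total:
            n50 = length
            break

    return {
        "#seqs": len(ordered),
        "min": ordered[-1],
        "max": ordered[0],
        "mean": round(total / len(ordered)),  # rounding rule is an assumption
        "median": median(ordered),
        "n50": n50,
        "sum": total,
    }


if __name__ == "__main__":
    # Toy input; real use would read lengths from the contig FASTA files.
    print(length_summary([2, 3, 4, 5, 6]))
```

Sorting once and scanning from the longest contig keeps the N50 computation linear after the sort, which is enough for a few hundred thousand contigs.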
Problems
Duplicates
- Duplicate contigs : 111+2 : one copy was removed
deg0003136509,7180003440308 : both unplaced
deg0003084562,7180002954167 : both unplaced
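The page does not say how the duplicate contigs were found. One simple way to look for them, shown here only as an illustrative sketch, is to group contigs whose sequences are identical either directly or as reverse complements. The names `canonical` and `duplicate_groups` are hypothetical, and the input is assumed to be (name, sequence) pairs already parsed from FASTA.

```python
from collections import defaultdict

# Complement table for the plain DNA alphabet (N maps to itself).
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def canonical(seq):
    """Return the smaller of a sequence and its reverse complement."""
    seq = seq.upper()
    revcomp = seq.translate(_COMPLEMENT)[::-1]
    return min(seq, revcomp)


def duplicate_groups(records):
    """Group contig names that share the same canonical sequence.

    records: iterable of (name, sequence) pairs.
    Returns a list of name lists, one per group with two or more members.
    """
    groups = defaultdict(list)
    for name, seq in records:
        groups[canonical(seq)].append(name)
    return [names for names in groups.values() if len(names) > 1]


if __name__ == "__main__":
    toy = [("ctgA", "ACGTT"), ("ctgB", "AACGT"), ("ctgC", "GGGG")]
    print(duplicate_groups(toy))
```

For a genome-sized contig set, storing a hash of the canonical sequence instead of the sequence itself would reduce memory use; this sketch only detects exact duplicates, not near-identical or contained contigs.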
Contaminants
All:
                            #elem  min  max     mean  median  n50    sum       #unplaced
list.exclude_contigs(ctgs)  4809   316  16661   1511  1485    1514   7266315   4797
list.trim_contigs(ctgs)     19049  457  433546  4424  1204    70673  84270091  16631
list.trim_contigs(regions)  30279  48   2479    354   319     445    10724357
7266315 + 10724357 = 17,990,672 ≈ 18M bp to trim
Exclude:
               #elem  min   max    mean  median  n50   sum
all            4809   316   16661  1511  1485    1514  7266315
mitochondrion  69     1003  2081   1369  1299    1365  94490
degenerates    43     316   1098   842   872     889   36201
contaminants   4697   861   16661  1519  1487    1517  7135624
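The Exclude breakdown and the trimming total can be checked against each other. The snippet below is a small consistency check on the figures quoted on this page, with hypothetical names; it assumes the three categories (mitochondrion, degenerates, contaminants) are meant to partition the "all" row.

```python
# (#elem, sum) pairs copied from the Exclude table above.
EXCLUDE_ALL = (4809, 7266315)
EXCLUDE_BY_CATEGORY = {
    "mitochondrion": (69, 94490),
    "degenerates": (43, 36201),
    "contaminants": (4697, 7135624),
}
# Sum of list.trim_contigs(regions) from the All table.
TRIM_REGIONS_SUM = 10724357


def exclude_check():
    """Recompute category totals and the combined exclude + trim length."""
    elems = sum(count for count, _ in EXCLUDE_BY_CATEGORY.values())
    bases = sum(length for _, length in EXCLUDE_BY_CATEGORY.values())
    return {
        "elems_match": elems == EXCLUDE_ALL[0],
        "bases_match": bases == EXCLUDE_ALL[1],
        "total_bp": EXCLUDE_ALL[1] + TRIM_REGIONS_SUM,
    }


if __name__ == "__main__":
    print(exclude_check())
```

Both the element counts and the base sums of the three categories add up to the "all" row, and the combined length of excluded contigs plus trimmed regions is the 17,990,672 bp quoted above.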
Local files
- Freeze dir files
/fs/szasmg3/bos_taurus/Bos_taurus_UMD_2.0/contigs.unplaced.fa : sequences
/fs/szasmg3/bos_taurus/Bos_taurus_UMD_2.0/bos_taurus.agp : all scaffolds
- Files uploaded
Ftp server: ftp-private.ncbi.nlm.nih.gov
Account: cbcb_trc
Dir: uploads/
Local files: /fs/szasmg3/dpuiu/bos_taurus/submission/ftp/ : 22 *sqn + 1 agp