Bos taurus

From Cbcb
Revision as of 15:06, 28 December 2008 by Dpuiu (talk | contribs) (→‎Contaminants)
Jump to navigation Jump to search

BCM

         #elem   min             max             mean            median          n50             sum
 contig  131620  91              326010          20755           10365           44270           2731814362
 placed  101579  91              250125          24286           13928           47485           2466971326
 chrom   30      44060403        161106243       87813777        84419198        106383598       2634413324

NCBI

 SPECIES_CODE = "BOS TAURUS"                                                          37,788,710 traces
 SPECIES_CODE = "BOS TAURUS" and CENTER_NAME = "BCM"                                  35,596,825 traces
 
 SPECIES_CODE = "BOS TAURUS" and CENTER_NAME = "BCM" and TRACE_TYPE_CODE = "WGS"      24,863,627 traces
 SPECIES_CODE = "BOS TAURUS" and CENTER_NAME = "BCM" and TRACE_TYPE_CODE = "SHOTGUN"  10,716,306 traces
 SPECIES_CODE = "BOS TAURUS" and CENTER_NAME = "BCM" and TRACE_TYPE_CODE = "CLONEEND"     16,892 traces

Submission

  • Title: "A whole-genome assembly of the cow, Bos taurus"
  • Authors:
 Steven Salzberg
 Aleksey Zimin
 Arthur Delcher
 Liliana Florea
 David Kelley
 Finian Hanrahan
 Guillaume Marcais
 Geo Pertea
 Michael Roberts
 Michael Schatz
 Curt Van Tassell
 James Yorke
 Poorani S.
  • Assembler:
 Celera Assembler and UMD Overlapper.
  • Sequencing Center :
 Baylor College of Medicine. 
  • Source of DNA used for sequencing:

The source of the BAC library DNA was Hereford bull L1 Domino 99375, registration number 41170496. Dr. Michael MacNeil's laboratory, USDA-ARS, Miles City, MT provided the blood. The DNA for the whole genome shotgun sequences was provided by Dr. Timothy Smith's laboratory, U.S. Meat Animal Research Center, Clay Center, NE from white blood cells from L1 Dominette 01449, American Hereford Association registration number 42190680 (a daughter of L1 Domino 99375). A skin cell fibroblast cell line from the same animal is available from Dr. Carol Chitko-McKown's laboratory, although there is no sequence from that cell line.

  • Sequence modifiers:
 [organism=Bos taurus][breed=Hereford][tech=wgs][chromosome=...]
  • Submission: Use sequin
 /nfshomes/dpuiu/szdevel/sequin.8.10/sequin
  • Sequence:
 Contig length summary:
           #seqs   min     max     mean    median  n50     sum
 all       210657  71      840370  13709   1523    78511   2887902366
 placed    75775   88      840370  34512   13416   88287   2615171268
 unplaced  134882  71      166670  2022    1322    1742    272731098


Duplicates

 deg0003136509,7180003440308 : both unplaced
 deg0003084562,7180002954167 : both unplaced

Contaminants

Detected by NCBI (1st batch)

Detected by NCBI (2nd batch)

                             #ctgs   min     max     mean    median  n50     total_bp
 UMD_Freeze2.0_contam        210657  71      840370  13709   1523    78508   2887902366
 UMD_Freeze2.0               187683  71      840370  15225   1609    79580   2857554998


Files: /fs/szasmg3/dpuiu/bos_taurus/submission/decontam

Notes:

  • The 4,814 ctgs were aligned to UniVec (-c 20 ; delta-filter -q)
    • 4,247 ctgs aligned to 89 vecto seqs
    • top ref hits:
 gnl|uv|J01749.1   Cloning vector pBR322
 gnl|uv|J01636.1   E.coli lactose operon with lacI, lacZ, lacY and lacA genes
 gnl|uv|AF102576.1 Cloning vector pSOS
 gnl|uv|L08959.1   pUC8 cloning vector 
 gnl|uv|L08931.1   pMAC7-8 cloning vector for site-directed mutagenesis
 gnl|uv|L09145.1   pUR222 cloning vector
 gnl|uv|U47102.2   Cloning vector pALTER<R>-Ex1
 ...
  • The 4,814 ctgs were aligned to EcoliK12
    • 4,299 aligned 200bp+ to Ecoli
    • 3,877 aligned 100% to region 365521_365744 (224bp)
 >NC_000913.2_365521_365744 Escherichia coli K12, complete genome
 CATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACAACATAC
 GAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTAA
 TTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAAT
 GAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGC

Local files

  • Freeze dir files
 /fs/szasmg3/bos_taurus/Bos_taurus_UMD_2.0/contigs.unplaced.fa  : sequences
 /fs/szasmg3/bos_taurus/Bos_taurus_UMD_2.0/bos_taurus.agp       : all scaffolds
  • Files uploaded
 Ftp server: ftp-private.ncbi.nlm.nih.gov
 Account: cbcb_trc
 Dir: uploads/
 Local files: /fs/szasmg3/dpuiu/bos_taurus/submission/ftp/   : 22 *sqn + 1 agp