Dpuiu Assemblathon: Difference between revisions

From Cbcb
Jump to navigation Jump to search
No edit summary
 
Line 4: Line 4:


= CBCB genomes =
= CBCB genomes =
#. a bacterial genome.  Instead of E. coli, we can use S. aureus USA300, which has sequence data in SRA from 454 and Illumina, paired and unpaired.  Daniela has already assemblied it using CA, Newbler, Velvet, SOAPdenovo, and Maq (using its comparative assembly mode, where it aligns to a reference).
# a bacterial genome.  Instead of E. coli, we can use S. aureus USA300, which has sequence data in SRA from 454 and Illumina, paired and unpaired.  Daniela has already assemblied it using CA, Newbler, Velvet, SOAPdenovo, and Maq (using its comparative assembly mode, where it aligns to a reference).
#. A medium-sized eukaryote.  I'd like to use the Argentine ant or the Bombus impatiens bee - I've just written to Gene Robinson to ask about the bee.
# A medium-sized eukaryote.  I'd like to use the Argentine ant or the Bombus impatiens bee - I've just written to Gene Robinson to ask about the bee.
#. Another eukaryote, ideally a larger one.  Human would be great, but we just don't have enough time to do multiple human assemblies.  So maybe another insect, or perhaps a plant if we can find one for which data is available.
# Another eukaryote, ideally a larger one.  Human would be great, but we just don't have enough time to do multiple human assemblies.  So maybe another insect, or perhaps a plant if we can find one for which data is available.


If we can agree on the data sets, then the next step would be to design the experiment - decide in advance which assemblers to run and how many ways to try each one.  I'm thinking we should also trim all the data with Quake.
If we can agree on the data sets, then the next step would be to design the experiment - decide in advance which assemblers to run and how many ways to try each one.  I'm thinking we should also trim all the data with Quake.

Revision as of 04:28, 11 December 2010

Links

CBCB genomes

  1. a bacterial genome. Instead of E. coli, we can use S. aureus USA300, which has sequence data in SRA from 454 and Illumina, paired and unpaired. Daniela has already assemblied it using CA, Newbler, Velvet, SOAPdenovo, and Maq (using its comparative assembly mode, where it aligns to a reference).
  2. A medium-sized eukaryote. I'd like to use the Argentine ant or the Bombus impatiens bee - I've just written to Gene Robinson to ask about the bee.
  3. Another eukaryote, ideally a larger one. Human would be great, but we just don't have enough time to do multiple human assemblies. So maybe another insect, or perhaps a plant if we can find one for which data is available.

If we can agree on the data sets, then the next step would be to design the experiment - decide in advance which assemblers to run and how many ways to try each one. I'm thinking we should also trim all the data with Quake.