<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.umiacs.umd.edu/cbcb/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Mschatz</id>
	<title>Cbcb - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.umiacs.umd.edu/cbcb/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Mschatz"/>
	<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php/Special:Contributions/Mschatz"/>
	<updated>2026-05-25T17:28:53Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.43.7</generator>
	<entry>
		<id>https://wiki.umiacs.umd.edu/cbcb/index.php?title=Short_read_sequencing&amp;diff=3003</id>
		<title>Short read sequencing</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php?title=Short_read_sequencing&amp;diff=3003"/>
		<updated>2008-06-20T18:23:10Z</updated>

		<summary type="html">&lt;p&gt;Mschatz: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Articles =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== 06/27/2008 Consensus calling and Structural variation ==&lt;br /&gt;
&lt;br /&gt;
* [http://www.ncbi.nlm.nih.gov/pubmed/18321888 Consensus generation and variant detection by Celera Assembler.]&lt;br /&gt;
* [http://compbio.cs.toronto.edu/structvar/ Detecting Structural Variations, Brudno et al. ]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== 06/20/2008 Read Mapping Software ==&lt;br /&gt;
&lt;br /&gt;
=== BLAT ===&lt;br /&gt;
* [http://www.genome.org/cgi/reprint/GR-2292Rv1 BLAT—The BLAST-Like Alignment Tool, Genome Research 2002]&lt;br /&gt;
* [http://genome.ucsc.edu/FAQ/FAQblat FAQ]&lt;br /&gt;
* Can align any type of reads&lt;br /&gt;
* Can do nt:aa translation&lt;br /&gt;
* Command: blat &lt;br /&gt;
&lt;br /&gt;
=== MAQ ===&lt;br /&gt;
* [http://maq.sourceforge.net/ Maq Sourceforge]&lt;br /&gt;
* [http://www.sanger.ac.uk/Users/lh3/maq-poster.pdf Maq Poster from Sanger] &lt;br /&gt;
* Illumina-Solexa/AB-SOLiD , not 454 or capillary reads&lt;br /&gt;
* Uses FASTQ format&lt;br /&gt;
* Command: maq map ...&lt;br /&gt;
&lt;br /&gt;
=== RMAP ===&lt;br /&gt;
* [http://rulai.cshl.edu/rmap/ RMAP] : designed for Illumina-Solexa&lt;br /&gt;
* Command: rmap &lt;br /&gt;
&lt;br /&gt;
=== SHRiMP ===&lt;br /&gt;
* [http://compbio.cs.toronto.edu/shrimp/ Web site]&lt;br /&gt;
* Commands: rmapper-cs , rmapper-ls, ...&lt;br /&gt;
&lt;br /&gt;
=== SOAP ===&lt;br /&gt;
* [http://soap.genomics.org.cn/ Web site (China)]&lt;br /&gt;
* [http://soap.genomics.org.cn/SOAP_paper.pdf SOAP: short oligonucleotide alignment program, Bioinformatics Jan 2008]&lt;br /&gt;
* Commands: soap, soap.contig, soap_dealign, soap.huge, soap.short&lt;br /&gt;
* can use qualities, do read trimming, use pair ends, RNA alignments&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== 06/13/2008 Genome Resequencing ==&lt;br /&gt;
&lt;br /&gt;
* [http://www.nature.com/nature/journal/v452/n7189/pdf/nature06884.pdf The complete genome of an individual by massively parallel DNA sequencing (J.Watson&#039;s genome) Nature April 2008]&lt;br /&gt;
* [http://www.nature.com/nature/journal/v452/n7189/extref/nature06884-s1.pdf J.Watson&#039;s genome (supplementary info) ]&lt;br /&gt;
&lt;br /&gt;
* [http://www.nature.com/nmeth/journal/v5/n2/full/nmeth.1179.html;jsessionid=DC518BCD8B2CACAE8AFFF7F70DD46902 Whole-genome sequencing and variant discovery in C. elegans Nature Jan 2008]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Links =&lt;br /&gt;
&lt;br /&gt;
* [http://www.1000genomes.org/page.php?page=home 1000 genomes]&lt;br /&gt;
&lt;br /&gt;
* [http://www.cbcb.umd.edu/~langmead/solexa_1000genomes.html Ben&#039;s web site 1]&lt;br /&gt;
* [http://www.cbcb.umd.edu/~langmead/solexa_format.html Ben&#039;s web site 2]&lt;br /&gt;
&lt;br /&gt;
* [http://en.wikipedia.org/wiki/Chip-Sequencing Chip-Seq @ Wikipedia]&lt;br /&gt;
&lt;br /&gt;
* [http://www.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?cmd=table&amp;amp;f=run&amp;amp;m=data&amp;amp;s=run SRA]&lt;br /&gt;
* [ftp://ftp.ncbi.nih.gov/pub/TraceDB/ShortRead SRA FTP]&lt;br /&gt;
&lt;br /&gt;
= Data =&lt;br /&gt;
&lt;br /&gt;
== Solexa ==&lt;br /&gt;
&lt;br /&gt;
* [ftp://ftp.sanger.ac.uk/pub/PRODUCTION_SOFTWARE/data_sets/suis_solexa/ Strep suis Solexa at Sanger] 36bp, ~49X coverage&lt;br /&gt;
* [http://www.genomic.ch/edena/mw2Reads.seq.gz  Staphylococcus aureus strain MW2 (edena paper)] 35bp, ~47X coverage&lt;br /&gt;
* Pseudomonas aeruginosa: 33bp, ~43X coverage&lt;br /&gt;
* Pseudomonas syringae:   32bp, ~31X coverage&lt;br /&gt;
* 1000 Genomes (June 14th 2008): 47bp&lt;br /&gt;
&lt;br /&gt;
  Accession       #Runs   Instrument                      Center  Study                          [Individual]&lt;br /&gt;
  SRA000303       41      Solexa 1G Genome Analyzer       BI      1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
  SRA000304       49      Solexa 1G Genome Analyzer       BI      1000Genomes Project Pilot 2     NA12891&lt;br /&gt;
  SRA000305       56      Solexa 1G Genome Analyzer       BI      1000Genomes Project Pilot 2     NA12892&lt;br /&gt;
  SRA000307       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA10851&lt;br /&gt;
  SRA000308       2       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA11993&lt;br /&gt;
  SRA000309       3       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA11995&lt;br /&gt;
  SRA000310       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12006&lt;br /&gt;
  SRA000311       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12044&lt;br /&gt;
  SRA000312       2       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12156&lt;br /&gt;
  SRA000313       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12414&lt;br /&gt;
  SRA000314       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12776&lt;br /&gt;
  SRA000315       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12828&lt;br /&gt;
  SRA000316       12      Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
  SRA000317       8       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 2     NA12891&lt;br /&gt;
  SRA000318       14      Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 2     NA12892&lt;br /&gt;
  SRA000319       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12004&lt;br /&gt;
&lt;br /&gt;
  /fs/szdata/Solexa/1000genomes&lt;br /&gt;
&lt;br /&gt;
== 454 ==&lt;br /&gt;
&lt;br /&gt;
* 1000 Genomes (June 14th 2008)&lt;br /&gt;
&lt;br /&gt;
  Accession       #Runs   Instrument      Center  Study                           [Individual]&lt;br /&gt;
  SRA000302       121     454 GS FLX      BCM     1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
  SRA001032       2       454 GS FLX      BCM     1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
  SRA001036       1       454 GS FLX      BCM     1000Genomes Project Pilot 1     NA12812&lt;br /&gt;
  SRA001094       1       454 GS FLX      BCM     1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
&lt;br /&gt;
== Refseq ==&lt;br /&gt;
&lt;br /&gt;
* /fs/szdata/genomes/human_ncbi_build36/ NCBI build36.1 May 2006 (Current build is 36.3 March 2008)&lt;br /&gt;
* /fs/szdata/genomes/human_celera_2001_Orig/&lt;br /&gt;
&lt;br /&gt;
= Software @ CBCB =&lt;br /&gt;
&lt;br /&gt;
Under /fs/sz-user-supported/Linux-x86_64/bin/&lt;br /&gt;
&lt;br /&gt;
== Denovo assembly ==&lt;br /&gt;
&lt;br /&gt;
* edena&lt;br /&gt;
* ssake&lt;br /&gt;
* velveth,velvetg&lt;br /&gt;
&lt;br /&gt;
== Read mapping ==&lt;br /&gt;
&lt;br /&gt;
* blat&lt;br /&gt;
* maq&lt;br /&gt;
* soap&lt;/div&gt;</summary>
		<author><name>Mschatz</name></author>
	</entry>
	<entry>
		<id>https://wiki.umiacs.umd.edu/cbcb/index.php?title=Short_read_sequencing&amp;diff=3002</id>
		<title>Short read sequencing</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php?title=Short_read_sequencing&amp;diff=3002"/>
		<updated>2008-06-20T18:22:18Z</updated>

		<summary type="html">&lt;p&gt;Mschatz: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Articles =&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== 06/27/2008 (Consensus calling and structural variation)==&lt;br /&gt;
&lt;br /&gt;
* [http://www.ncbi.nlm.nih.gov/pubmed/18321888 Consensus generation and variant detection by Celera Assembler.]&lt;br /&gt;
* [http://compbio.cs.toronto.edu/structvar/ Detecting Structural Variations, Brudno et al. ]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== 06/20/2008 (Read Mapping Software)==&lt;br /&gt;
&lt;br /&gt;
=== BLAT ===&lt;br /&gt;
* [http://www.genome.org/cgi/reprint/GR-2292Rv1 BLAT—The BLAST-Like Alignment Tool, Genome Research 2002]&lt;br /&gt;
* [http://genome.ucsc.edu/FAQ/FAQblat FAQ]&lt;br /&gt;
* Can align any type of reads&lt;br /&gt;
* Can do nt:aa translation&lt;br /&gt;
* Command: blat &lt;br /&gt;
&lt;br /&gt;
=== MAQ ===&lt;br /&gt;
* [http://maq.sourceforge.net/ Maq Sourceforge]&lt;br /&gt;
* [http://www.sanger.ac.uk/Users/lh3/maq-poster.pdf Maq Poster from Sanger] &lt;br /&gt;
* Illumina-Solexa/AB-SOLiD , not 454 or capillary reads&lt;br /&gt;
* Uses FASTQ format&lt;br /&gt;
* Command: maq map ...&lt;br /&gt;
&lt;br /&gt;
=== RMAP ===&lt;br /&gt;
* [http://rulai.cshl.edu/rmap/ RMAP] : designed for Illumina-Solexa&lt;br /&gt;
* Command: rmap &lt;br /&gt;
&lt;br /&gt;
=== SHRiMP ===&lt;br /&gt;
* [http://compbio.cs.toronto.edu/shrimp/ Web site]&lt;br /&gt;
* Commands: rmapper-cs , rmapper-ls, ...&lt;br /&gt;
&lt;br /&gt;
=== SOAP ===&lt;br /&gt;
* [http://soap.genomics.org.cn/ Web site (China)]&lt;br /&gt;
* [http://soap.genomics.org.cn/SOAP_paper.pdf SOAP: short oligonucleotide alignment program, Bioinformatics Jan 2008]&lt;br /&gt;
* Commands: soap, soap.contig, soap_dealign, soap.huge, soap.short&lt;br /&gt;
* can use qualities, do read trimming, use pair ends, RNA alignments&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== 06/13/2008 (Genome Resequencing) ==&lt;br /&gt;
&lt;br /&gt;
* [http://www.nature.com/nature/journal/v452/n7189/pdf/nature06884.pdf The complete genome of an individual by massively parallel DNA sequencing (J.Watson&#039;s genome) Nature April 2008]&lt;br /&gt;
* [http://www.nature.com/nature/journal/v452/n7189/extref/nature06884-s1.pdf J.Watson&#039;s genome (supplementary info) ]&lt;br /&gt;
&lt;br /&gt;
* [http://www.nature.com/nmeth/journal/v5/n2/full/nmeth.1179.html;jsessionid=DC518BCD8B2CACAE8AFFF7F70DD46902 Whole-genome sequencing and variant discovery in C. elegans Nature Jan 2008]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
= Links =&lt;br /&gt;
&lt;br /&gt;
* [http://www.1000genomes.org/page.php?page=home 1000 genomes]&lt;br /&gt;
&lt;br /&gt;
* [http://www.cbcb.umd.edu/~langmead/solexa_1000genomes.html Ben&#039;s web site 1]&lt;br /&gt;
* [http://www.cbcb.umd.edu/~langmead/solexa_format.html Ben&#039;s web site 2]&lt;br /&gt;
&lt;br /&gt;
* [http://en.wikipedia.org/wiki/Chip-Sequencing Chip-Seq @ Wikipedia]&lt;br /&gt;
&lt;br /&gt;
* [http://www.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?cmd=table&amp;amp;f=run&amp;amp;m=data&amp;amp;s=run SRA]&lt;br /&gt;
* [ftp://ftp.ncbi.nih.gov/pub/TraceDB/ShortRead SRA FTP]&lt;br /&gt;
&lt;br /&gt;
= Data =&lt;br /&gt;
&lt;br /&gt;
== Solexa ==&lt;br /&gt;
&lt;br /&gt;
* [ftp://ftp.sanger.ac.uk/pub/PRODUCTION_SOFTWARE/data_sets/suis_solexa/ Strep suis Solexa at Sanger] 36bp, ~49X coverage&lt;br /&gt;
* [http://www.genomic.ch/edena/mw2Reads.seq.gz  Staphylococcus aureus strain MW2 (edena paper)] 35bp, ~47X coverage&lt;br /&gt;
* Pseudomonas aeruginosa: 33bp, ~43X coverage&lt;br /&gt;
* Pseudomonas syringae:   32bp, ~31X coverage&lt;br /&gt;
* 1000 Genomes (June 14th 2008): 47bp&lt;br /&gt;
&lt;br /&gt;
  Accession       #Runs   Instrument                      Center  Study                          [Individual]&lt;br /&gt;
  SRA000303       41      Solexa 1G Genome Analyzer       BI      1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
  SRA000304       49      Solexa 1G Genome Analyzer       BI      1000Genomes Project Pilot 2     NA12891&lt;br /&gt;
  SRA000305       56      Solexa 1G Genome Analyzer       BI      1000Genomes Project Pilot 2     NA12892&lt;br /&gt;
  SRA000307       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA10851&lt;br /&gt;
  SRA000308       2       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA11993&lt;br /&gt;
  SRA000309       3       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA11995&lt;br /&gt;
  SRA000310       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12006&lt;br /&gt;
  SRA000311       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12044&lt;br /&gt;
  SRA000312       2       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12156&lt;br /&gt;
  SRA000313       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12414&lt;br /&gt;
  SRA000314       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12776&lt;br /&gt;
  SRA000315       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12828&lt;br /&gt;
  SRA000316       12      Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
  SRA000317       8       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 2     NA12891&lt;br /&gt;
  SRA000318       14      Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 2     NA12892&lt;br /&gt;
  SRA000319       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12004&lt;br /&gt;
&lt;br /&gt;
  /fs/szdata/Solexa/1000genomes&lt;br /&gt;
&lt;br /&gt;
== 454 ==&lt;br /&gt;
&lt;br /&gt;
* 1000 Genomes (June 14th 2008)&lt;br /&gt;
&lt;br /&gt;
  Accession       #Runs   Instrument      Center  Study                           [Individual]&lt;br /&gt;
  SRA000302       121     454 GS FLX      BCM     1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
  SRA001032       2       454 GS FLX      BCM     1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
  SRA001036       1       454 GS FLX      BCM     1000Genomes Project Pilot 1     NA12812&lt;br /&gt;
  SRA001094       1       454 GS FLX      BCM     1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
&lt;br /&gt;
== Refseq ==&lt;br /&gt;
&lt;br /&gt;
* /fs/szdata/genomes/human_ncbi_build36/ NCBI build36.1 May 2006 (Current build is 36.3 March 2008)&lt;br /&gt;
* /fs/szdata/genomes/human_celera_2001_Orig/&lt;br /&gt;
&lt;br /&gt;
= Software @ CBCB =&lt;br /&gt;
&lt;br /&gt;
Under /fs/sz-user-supported/Linux-x86_64/bin/&lt;br /&gt;
&lt;br /&gt;
== Denovo assembly ==&lt;br /&gt;
&lt;br /&gt;
* edena&lt;br /&gt;
* ssake&lt;br /&gt;
* velveth,velvetg&lt;br /&gt;
&lt;br /&gt;
== Read mapping ==&lt;br /&gt;
&lt;br /&gt;
* blat&lt;br /&gt;
* maq&lt;br /&gt;
* soap&lt;/div&gt;</summary>
		<author><name>Mschatz</name></author>
	</entry>
	<entry>
		<id>https://wiki.umiacs.umd.edu/cbcb/index.php?title=Short_read_sequencing&amp;diff=3001</id>
		<title>Short read sequencing</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php?title=Short_read_sequencing&amp;diff=3001"/>
		<updated>2008-06-20T18:21:38Z</updated>

		<summary type="html">&lt;p&gt;Mschatz: /* 06/27/2008 (Consensus calling and structural variation) */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Articles =&lt;br /&gt;
&lt;br /&gt;
== 06/13/2008 (Genome Resequencing) ==&lt;br /&gt;
&lt;br /&gt;
* [http://www.nature.com/nature/journal/v452/n7189/pdf/nature06884.pdf The complete genome of an individual by massively parallel DNA sequencing (J.Watson&#039;s genome) Nature April 2008]&lt;br /&gt;
* [http://www.nature.com/nature/journal/v452/n7189/extref/nature06884-s1.pdf J.Watson&#039;s genome (supplementary info) ]&lt;br /&gt;
&lt;br /&gt;
* [http://www.nature.com/nmeth/journal/v5/n2/full/nmeth.1179.html;jsessionid=DC518BCD8B2CACAE8AFFF7F70DD46902 Whole-genome sequencing and variant discovery in C. elegans Nature Jan 2008]&lt;br /&gt;
&lt;br /&gt;
== 06/20/2008 (Read Mapping Software)==&lt;br /&gt;
&lt;br /&gt;
=== BLAT ===&lt;br /&gt;
* [http://www.genome.org/cgi/reprint/GR-2292Rv1 BLAT—The BLAST-Like Alignment Tool, Genome Research 2002]&lt;br /&gt;
* [http://genome.ucsc.edu/FAQ/FAQblat FAQ]&lt;br /&gt;
* Can align any type of reads&lt;br /&gt;
* Can do nt:aa translation&lt;br /&gt;
* Command: blat &lt;br /&gt;
&lt;br /&gt;
=== MAQ ===&lt;br /&gt;
* [http://maq.sourceforge.net/ Maq Sourceforge]&lt;br /&gt;
* [http://www.sanger.ac.uk/Users/lh3/maq-poster.pdf Maq Poster from Sanger] &lt;br /&gt;
* Illumina-Solexa/AB-SOLiD , not 454 or capillary reads&lt;br /&gt;
* Uses FASTQ format&lt;br /&gt;
* Command: maq map ...&lt;br /&gt;
&lt;br /&gt;
=== RMAP ===&lt;br /&gt;
* [http://rulai.cshl.edu/rmap/ RMAP] : designed for Illumina-Solexa&lt;br /&gt;
* Command: rmap &lt;br /&gt;
&lt;br /&gt;
=== SHRiMP ===&lt;br /&gt;
* [http://compbio.cs.toronto.edu/shrimp/ Web site]&lt;br /&gt;
* Commands: rmapper-cs , rmapper-ls, ...&lt;br /&gt;
&lt;br /&gt;
=== SOAP ===&lt;br /&gt;
* [http://soap.genomics.org.cn/ Web site (China)]&lt;br /&gt;
* [http://soap.genomics.org.cn/SOAP_paper.pdf SOAP: short oligonucleotide alignment program, Bioinformatics Jan 2008]&lt;br /&gt;
* Commands: soap, soap.contig, soap_dealign, soap.huge, soap.short&lt;br /&gt;
* can use qualities, do read trimming, use pair ends, RNA alignments&lt;br /&gt;
&lt;br /&gt;
== 06/27/2008 (Consensus calling and structural variation)==&lt;br /&gt;
&lt;br /&gt;
* [http://www.ncbi.nlm.nih.gov/pubmed/18321888 Consensus generation and variant detection by Celera Assembler.]&lt;br /&gt;
* [http://compbio.cs.toronto.edu/structvar/ Detecting Structural Variations, Brudno et al. ]&lt;br /&gt;
&lt;br /&gt;
= Links =&lt;br /&gt;
&lt;br /&gt;
* [http://www.1000genomes.org/page.php?page=home 1000 genomes]&lt;br /&gt;
&lt;br /&gt;
* [http://www.cbcb.umd.edu/~langmead/solexa_1000genomes.html Ben&#039;s web site 1]&lt;br /&gt;
* [http://www.cbcb.umd.edu/~langmead/solexa_format.html Ben&#039;s web site 2]&lt;br /&gt;
&lt;br /&gt;
* [http://en.wikipedia.org/wiki/Chip-Sequencing Chip-Seq @ Wikipedia]&lt;br /&gt;
&lt;br /&gt;
* [http://www.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?cmd=table&amp;amp;f=run&amp;amp;m=data&amp;amp;s=run SRA]&lt;br /&gt;
* [ftp://ftp.ncbi.nih.gov/pub/TraceDB/ShortRead SRA FTP]&lt;br /&gt;
&lt;br /&gt;
= Data =&lt;br /&gt;
&lt;br /&gt;
== Solexa ==&lt;br /&gt;
&lt;br /&gt;
* [ftp://ftp.sanger.ac.uk/pub/PRODUCTION_SOFTWARE/data_sets/suis_solexa/ Strep suis Solexa at Sanger] 36bp, ~49X coverage&lt;br /&gt;
* [http://www.genomic.ch/edena/mw2Reads.seq.gz  Staphylococcus aureus strain MW2 (edena paper)] 35bp, ~47X coverage&lt;br /&gt;
* Pseudomonas aeruginosa: 33bp, ~43X coverage&lt;br /&gt;
* Pseudomonas syringae:   32bp, ~31X coverage&lt;br /&gt;
* 1000 Genomes (June 14th 2008): 47bp&lt;br /&gt;
&lt;br /&gt;
  Accession       #Runs   Instrument                      Center  Study                          [Individual]&lt;br /&gt;
  SRA000303       41      Solexa 1G Genome Analyzer       BI      1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
  SRA000304       49      Solexa 1G Genome Analyzer       BI      1000Genomes Project Pilot 2     NA12891&lt;br /&gt;
  SRA000305       56      Solexa 1G Genome Analyzer       BI      1000Genomes Project Pilot 2     NA12892&lt;br /&gt;
  SRA000307       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA10851&lt;br /&gt;
  SRA000308       2       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA11993&lt;br /&gt;
  SRA000309       3       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA11995&lt;br /&gt;
  SRA000310       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12006&lt;br /&gt;
  SRA000311       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12044&lt;br /&gt;
  SRA000312       2       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12156&lt;br /&gt;
  SRA000313       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12414&lt;br /&gt;
  SRA000314       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12776&lt;br /&gt;
  SRA000315       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12828&lt;br /&gt;
  SRA000316       12      Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
  SRA000317       8       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 2     NA12891&lt;br /&gt;
  SRA000318       14      Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 2     NA12892&lt;br /&gt;
  SRA000319       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12004&lt;br /&gt;
&lt;br /&gt;
  /fs/szdata/Solexa/1000genomes&lt;br /&gt;
&lt;br /&gt;
== 454 ==&lt;br /&gt;
&lt;br /&gt;
* 1000 Genomes (June 14th 2008)&lt;br /&gt;
&lt;br /&gt;
  Accession       #Runs   Instrument      Center  Study                           [Individual]&lt;br /&gt;
  SRA000302       121     454 GS FLX      BCM     1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
  SRA001032       2       454 GS FLX      BCM     1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
  SRA001036       1       454 GS FLX      BCM     1000Genomes Project Pilot 1     NA12812&lt;br /&gt;
  SRA001094       1       454 GS FLX      BCM     1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
&lt;br /&gt;
== Refseq ==&lt;br /&gt;
&lt;br /&gt;
* /fs/szdata/genomes/human_ncbi_build36/ NCBI build36.1 May 2006 (Current build is 36.3 March 2008)&lt;br /&gt;
* /fs/szdata/genomes/human_celera_2001_Orig/&lt;br /&gt;
&lt;br /&gt;
= Software @ CBCB =&lt;br /&gt;
&lt;br /&gt;
Under /fs/sz-user-supported/Linux-x86_64/bin/&lt;br /&gt;
&lt;br /&gt;
== Denovo assembly ==&lt;br /&gt;
&lt;br /&gt;
* edena&lt;br /&gt;
* ssake&lt;br /&gt;
* velveth,velvetg&lt;br /&gt;
&lt;br /&gt;
== Read mapping ==&lt;br /&gt;
&lt;br /&gt;
* blat&lt;br /&gt;
* maq&lt;br /&gt;
* soap&lt;/div&gt;</summary>
		<author><name>Mschatz</name></author>
	</entry>
	<entry>
		<id>https://wiki.umiacs.umd.edu/cbcb/index.php?title=Short_read_sequencing&amp;diff=3000</id>
		<title>Short read sequencing</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php?title=Short_read_sequencing&amp;diff=3000"/>
		<updated>2008-06-20T18:14:56Z</updated>

		<summary type="html">&lt;p&gt;Mschatz: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Articles =&lt;br /&gt;
&lt;br /&gt;
== 06/13/2008 (Genome Resequencing) ==&lt;br /&gt;
&lt;br /&gt;
* [http://www.nature.com/nature/journal/v452/n7189/pdf/nature06884.pdf The complete genome of an individual by massively parallel DNA sequencing (J.Watson&#039;s genome) Nature April 2008]&lt;br /&gt;
* [http://www.nature.com/nature/journal/v452/n7189/extref/nature06884-s1.pdf J.Watson&#039;s genome (supplementary info) ]&lt;br /&gt;
&lt;br /&gt;
* [http://www.nature.com/nmeth/journal/v5/n2/full/nmeth.1179.html;jsessionid=DC518BCD8B2CACAE8AFFF7F70DD46902 Whole-genome sequencing and variant discovery in C. elegans Nature Jan 2008]&lt;br /&gt;
&lt;br /&gt;
== 06/20/2008 (Read Mapping Software)==&lt;br /&gt;
&lt;br /&gt;
=== BLAT ===&lt;br /&gt;
* [http://www.genome.org/cgi/reprint/GR-2292Rv1 BLAT—The BLAST-Like Alignment Tool, Genome Research 2002]&lt;br /&gt;
* [http://genome.ucsc.edu/FAQ/FAQblat FAQ]&lt;br /&gt;
* Can align any type of reads&lt;br /&gt;
* Can do nt:aa translation&lt;br /&gt;
* Command: blat &lt;br /&gt;
&lt;br /&gt;
=== MAQ ===&lt;br /&gt;
* [http://maq.sourceforge.net/ Maq Sourceforge]&lt;br /&gt;
* [http://www.sanger.ac.uk/Users/lh3/maq-poster.pdf Maq Poster from Sanger] &lt;br /&gt;
* Illumina-Solexa/AB-SOLiD , not 454 or capillary reads&lt;br /&gt;
* Uses FASTQ format&lt;br /&gt;
* Command: maq map ...&lt;br /&gt;
&lt;br /&gt;
=== RMAP ===&lt;br /&gt;
* [http://rulai.cshl.edu/rmap/ RMAP] : designed for Illumina-Solexa&lt;br /&gt;
* Command: rmap &lt;br /&gt;
&lt;br /&gt;
=== SHRiMP ===&lt;br /&gt;
* [http://compbio.cs.toronto.edu/shrimp/ Web site]&lt;br /&gt;
* Commands: rmapper-cs , rmapper-ls, ...&lt;br /&gt;
&lt;br /&gt;
=== SOAP ===&lt;br /&gt;
* [http://soap.genomics.org.cn/ Web site (China)]&lt;br /&gt;
* [http://soap.genomics.org.cn/SOAP_paper.pdf SOAP: short oligonucleotide alignment program, Bioinformatics Jan 2008]&lt;br /&gt;
* Commands: soap, soap.contig, soap_dealign, soap.huge, soap.short&lt;br /&gt;
* can use qualities, do read trimming, use pair ends, RNA alignments&lt;br /&gt;
&lt;br /&gt;
== 06/27/2008 (Consensus calling and structural variation)==&lt;br /&gt;
&lt;br /&gt;
* [http://www.ncbi.nlm.nih.gov/pubmed/18321888 Consensus generation and variant detection by Celera Assembler.]&lt;br /&gt;
* In contact with Brudno&lt;br /&gt;
&lt;br /&gt;
= Links =&lt;br /&gt;
&lt;br /&gt;
* [http://www.1000genomes.org/page.php?page=home 1000 genomes]&lt;br /&gt;
&lt;br /&gt;
* [http://www.cbcb.umd.edu/~langmead/solexa_1000genomes.html Ben&#039;s web site 1]&lt;br /&gt;
* [http://www.cbcb.umd.edu/~langmead/solexa_format.html Ben&#039;s web site 2]&lt;br /&gt;
&lt;br /&gt;
* [http://en.wikipedia.org/wiki/Chip-Sequencing Chip-Seq @ Wikipedia]&lt;br /&gt;
&lt;br /&gt;
* [http://www.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?cmd=table&amp;amp;f=run&amp;amp;m=data&amp;amp;s=run SRA]&lt;br /&gt;
* [ftp://ftp.ncbi.nih.gov/pub/TraceDB/ShortRead SRA FTP]&lt;br /&gt;
&lt;br /&gt;
= Data =&lt;br /&gt;
&lt;br /&gt;
== Solexa ==&lt;br /&gt;
&lt;br /&gt;
* [ftp://ftp.sanger.ac.uk/pub/PRODUCTION_SOFTWARE/data_sets/suis_solexa/ Strep suis Solexa at Sanger] 36bp, ~49X coverage&lt;br /&gt;
* [http://www.genomic.ch/edena/mw2Reads.seq.gz  Staphylococcus aureus strain MW2 (edena paper)] 35bp, ~47X coverage&lt;br /&gt;
* Pseudomonas aeruginosa: 33bp, ~43X coverage&lt;br /&gt;
* Pseudomonas syringae:   32bp, ~31X coverage&lt;br /&gt;
* 1000 Genomes (June 14th 2008): 47bp&lt;br /&gt;
&lt;br /&gt;
  Accession       #Runs   Instrument                      Center  Study                          [Individual]&lt;br /&gt;
  SRA000303       41      Solexa 1G Genome Analyzer       BI      1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
  SRA000304       49      Solexa 1G Genome Analyzer       BI      1000Genomes Project Pilot 2     NA12891&lt;br /&gt;
  SRA000305       56      Solexa 1G Genome Analyzer       BI      1000Genomes Project Pilot 2     NA12892&lt;br /&gt;
  SRA000307       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA10851&lt;br /&gt;
  SRA000308       2       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA11993&lt;br /&gt;
  SRA000309       3       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA11995&lt;br /&gt;
  SRA000310       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12006&lt;br /&gt;
  SRA000311       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12044&lt;br /&gt;
  SRA000312       2       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12156&lt;br /&gt;
  SRA000313       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12414&lt;br /&gt;
  SRA000314       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12776&lt;br /&gt;
  SRA000315       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12828&lt;br /&gt;
  SRA000316       12      Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
  SRA000317       8       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 2     NA12891&lt;br /&gt;
  SRA000318       14      Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 2     NA12892&lt;br /&gt;
  SRA000319       1       Solexa 1G Genome Analyzer       SC      1000Genomes Project Pilot 1     NA12004&lt;br /&gt;
&lt;br /&gt;
  /fs/szdata/Solexa/1000genomes&lt;br /&gt;
&lt;br /&gt;
== 454 ==&lt;br /&gt;
&lt;br /&gt;
* 1000 Genomes (June 14th 2008)&lt;br /&gt;
&lt;br /&gt;
  Accession       #Runs   Instrument      Center  Study                           [Individual]&lt;br /&gt;
  SRA000302       121     454 GS FLX      BCM     1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
  SRA001032       2       454 GS FLX      BCM     1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
  SRA001036       1       454 GS FLX      BCM     1000Genomes Project Pilot 1     NA12812&lt;br /&gt;
  SRA001094       1       454 GS FLX      BCM     1000Genomes Project Pilot 2     NA12878&lt;br /&gt;
&lt;br /&gt;
== Refseq ==&lt;br /&gt;
&lt;br /&gt;
* /fs/szdata/genomes/human_ncbi_build36/ NCBI build36.1 May 2006 (Current build is 36.3 March 2008)&lt;br /&gt;
* /fs/szdata/genomes/human_celera_2001_Orig/&lt;br /&gt;
&lt;br /&gt;
= Software @ CBCB =&lt;br /&gt;
&lt;br /&gt;
Under /fs/sz-user-supported/Linux-x86_64/bin/&lt;br /&gt;
&lt;br /&gt;
== Denovo assembly ==&lt;br /&gt;
&lt;br /&gt;
* edena&lt;br /&gt;
* ssake&lt;br /&gt;
* velveth,velvetg&lt;br /&gt;
&lt;br /&gt;
== Read mapping ==&lt;br /&gt;
&lt;br /&gt;
* blat&lt;br /&gt;
* maq&lt;br /&gt;
* soap&lt;/div&gt;</summary>
		<author><name>Mschatz</name></author>
	</entry>
</feed>