<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.umiacs.umd.edu/cbcb/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Sergek</id>
	<title>Cbcb - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.umiacs.umd.edu/cbcb/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Sergek"/>
	<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php/Special:Contributions/Sergek"/>
	<updated>2026-04-13T10:29:16Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.43.7</generator>
	<entry>
		<id>https://wiki.umiacs.umd.edu/cbcb/index.php?title=Cbcb:Pop-Lab:How_do_I_run_the_new_Bambus&amp;diff=5986</id>
		<title>Cbcb:Pop-Lab:How do I run the new Bambus</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php?title=Cbcb:Pop-Lab:How_do_I_run_the_new_Bambus&amp;diff=5986"/>
		<updated>2009-10-31T21:19:42Z</updated>

		<summary type="html">&lt;p&gt;Sergek: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The new Bambus (aka Bambus 2) actually consists of fourexecutables that run in order using a supplied AMOS bank. An important note, Bambus 2 is still an early beta. As such, it is advisable to back up&lt;br /&gt;
the bnk directory before using. Program documentation is also available on the command line by typing &amp;lt;command&amp;gt; -h.&lt;br /&gt;
&lt;br /&gt;
The first program is clk. This program finds all mated reads within contigs and converts the mate distances to be relative to contigs rather than reads. The second program is Bundler. Bundler joins together&lt;br /&gt;
the contig link messages generated by clk together when they can be to create consensus links between contigs. It will output multiple contig links for a pair of contigs. The third program is MarkRepeats. This program identifies repeats using two methods, the first is shortest paths and the second is by examining a-stat on a component by component basis. The final program is OrientContigs.&lt;br /&gt;
OrientContigs uses the contig links to orient and order the contigs into scaffolds, as well as performing some simplification by joining contigs. Each of the programs is covered in more detail below. To get more&lt;br /&gt;
help on running any program use -h.&lt;br /&gt;
&lt;br /&gt;
1. clk&lt;br /&gt;
  - Modifies the bank to create contig edges.&amp;lt;br&amp;gt;&lt;br /&gt;
  - Example: clk -b[ank] data.bnk&lt;br /&gt;
&lt;br /&gt;
2. Bundler&lt;br /&gt;
  - Bundle together contig edges to create contig links.&amp;lt;br&amp;gt;&lt;br /&gt;
  - Example: Bundler -b[ank] data.bnk [-t[ype] comma separated list of edge types]&lt;br /&gt;
     - The -t[ype] option allows only certain contig edges to be processed. ALL means use any type. The types are defined in src/AMOS/Link_AMOS.hh&lt;br /&gt;
&lt;br /&gt;
3. MarkRepeats&lt;br /&gt;
 - Run shortest paths and connected component repeat detection algorithms. This requires AMOS to be built with the Boost graph library available to it.&amp;lt;br&amp;gt;&lt;br /&gt;
 - Example: MarkRepeats -b[and] data.bnk [-redundancy X -aggressive]&lt;br /&gt;
    - The -redundancy ignores linka containing fewer than X edges.&lt;br /&gt;
    - The -aggressive option marks contigs as repetitive based on global astat calculation rather than a connected component one. &lt;br /&gt;
&lt;br /&gt;
4. OrientContigs&lt;br /&gt;
 - Orient and order the contigs based on the links. This program uses a greedy algorithm to orient and order contigs relative to an arbitrary start contig. Edges that contradict the current scaffold are marked bad&lt;br /&gt;
   and ignored for the rest of the analysis. They are still output but don&#039;t affect any subsequent calculations.&lt;br /&gt;
   &lt;br /&gt;
   The output includes a a dot-formatted file, NCBI AGP scaffold formatted file, and xml files formatted to be compatible with Bambus 1 tools.&lt;br /&gt;
   &lt;br /&gt;
   Note that this program does not currently linearize the scaffold but maintains them as a graph. This program also recursively simplifies common patters in the graph. Currently the patterns are&lt;br /&gt;
   bubbles or straight lines. For example, contigs A-&amp;gt;B-&amp;gt;C will be simplified to just A. Also A-&amp;gt;B-&amp;gt;D will become A as well. This simplification is performed recursively until the graph is stable. Note that the   &lt;br /&gt;
   												\&amp;gt;C/&amp;gt; &lt;br /&gt;
   simplification updates the bank in a destructive way by removing contigs and replacing them (as well as their edges) with updated contigs. &lt;br /&gt;
   The marking of the edges as BAD or GOOD also destructively updates the bank. Therefore it is necessary to make a backup of the bank before running this program.&amp;lt;br&amp;gt;&lt;br /&gt;
 - Example: OrientContigs -b[ank] &amp;lt;bank_name&amp;gt; -prefix asm [-all -noreduce -redundancy X -repeats Y -aggressive]. &lt;br /&gt;
    - The - prefix option specifies the prefix to use for all output files.&lt;br /&gt;
    - The -all option specifies whether disconnected contigs should be output as their own scaffolds or if they should be skipped. &lt;br /&gt;
    - The -noreduce option turns off the graph simplification described above. &lt;br /&gt;
    - The -redundancy option ignores links containing fewer than X edges.&lt;br /&gt;
    - The -repeats option reads a file of repeats (Y) which specify one contig ID per line. Repeat contigs and their links are not used for odering/orienting any other data in the graph. Repeats are currently not resolved and are output as single-contig scaffolds. If known, these may be specified or the repeats identified by MarkRepeats (above) may be used.&lt;br /&gt;
    - The -aggressive option will not mark edges that move a contig more than 3 STDEVS away as bad and will try to reconcile the positions.&lt;/div&gt;</summary>
		<author><name>Sergek</name></author>
	</entry>
	<entry>
		<id>https://wiki.umiacs.umd.edu/cbcb/index.php?title=Cbcb:Pop-Lab:How_do_I_run_the_new_Bambus&amp;diff=5985</id>
		<title>Cbcb:Pop-Lab:How do I run the new Bambus</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php?title=Cbcb:Pop-Lab:How_do_I_run_the_new_Bambus&amp;diff=5985"/>
		<updated>2009-10-31T21:19:17Z</updated>

		<summary type="html">&lt;p&gt;Sergek: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The new Bambus (aka Bambus 2) actually consists of fourexecutables that run in order using a supplied AMOS bank. An important note, Bambus 2 is still an early beta. As such, it is advisable to back up&lt;br /&gt;
the bnk directory before using. Program documentation is also available on the command line by typing &amp;lt;command&amp;gt; -h.&lt;br /&gt;
&lt;br /&gt;
The first program is clk. This program finds all mated reads within contigs and converts the mate distances to be relative to contigs rather than reads. The second program is Bundler. Bundler joins together&lt;br /&gt;
the contig link messages generated by clk together when they can be to create consensus links between contigs. It will output multiple contig links for a pair of contigs. The third program is MarkRepeats. This program identifies repeats using two methods, the first is shortest paths and the second is by examining a-stat on a component by component basis. The final program is OrientContigs.&lt;br /&gt;
OrientContigs uses the contig links to orient and order the contigs into scaffolds, as well as performing some simplification by joining contigs. Each of the programs is covered in more detail below. To get more&lt;br /&gt;
help on running any program use -h.&lt;br /&gt;
&lt;br /&gt;
# clk&lt;br /&gt;
  - Modifies the bank to create contig edges.&amp;lt;br&amp;gt;&lt;br /&gt;
  - Example: clk -b[ank] data.bnk&lt;br /&gt;
&lt;br /&gt;
# Bundler&lt;br /&gt;
  - Bundle together contig edges to create contig links.&amp;lt;br&amp;gt;&lt;br /&gt;
  - Example: Bundler -b[ank] data.bnk [-t[ype] comma separated list of edge types]&lt;br /&gt;
     - The -t[ype] option allows only certain contig edges to be processed. ALL means use any type. The types are defined in src/AMOS/Link_AMOS.hh&lt;br /&gt;
&lt;br /&gt;
3. MarkRepeats&lt;br /&gt;
 - Run shortest paths and connected component repeat detection algorithms. This requires AMOS to be built with the Boost graph library available to it.&amp;lt;br&amp;gt;&lt;br /&gt;
 - Example: MarkRepeats -b[and] data.bnk [-redundancy X -aggressive]&lt;br /&gt;
    - The -redundancy ignores linka containing fewer than X edges.&lt;br /&gt;
    - The -aggressive option marks contigs as repetitive based on global astat calculation rather than a connected component one. &lt;br /&gt;
&lt;br /&gt;
4. OrientContigs&lt;br /&gt;
 - Orient and order the contigs based on the links. This program uses a greedy algorithm to orient and order contigs relative to an arbitrary start contig. Edges that contradict the current scaffold are marked bad&lt;br /&gt;
   and ignored for the rest of the analysis. They are still output but don&#039;t affect any subsequent calculations.&lt;br /&gt;
   &lt;br /&gt;
   The output includes a a dot-formatted file, NCBI AGP scaffold formatted file, and xml files formatted to be compatible with Bambus 1 tools.&lt;br /&gt;
   &lt;br /&gt;
   Note that this program does not currently linearize the scaffold but maintains them as a graph. This program also recursively simplifies common patters in the graph. Currently the patterns are&lt;br /&gt;
   bubbles or straight lines. For example, contigs A-&amp;gt;B-&amp;gt;C will be simplified to just A. Also A-&amp;gt;B-&amp;gt;D will become A as well. This simplification is performed recursively until the graph is stable. Note that the   &lt;br /&gt;
   												\&amp;gt;C/&amp;gt; &lt;br /&gt;
   simplification updates the bank in a destructive way by removing contigs and replacing them (as well as their edges) with updated contigs. &lt;br /&gt;
   The marking of the edges as BAD or GOOD also destructively updates the bank. Therefore it is necessary to make a backup of the bank before running this program.&amp;lt;br&amp;gt;&lt;br /&gt;
 - Example: OrientContigs -b[ank] &amp;lt;bank_name&amp;gt; -prefix asm [-all -noreduce -redundancy X -repeats Y -aggressive]. &lt;br /&gt;
    - The - prefix option specifies the prefix to use for all output files.&lt;br /&gt;
    - The -all option specifies whether disconnected contigs should be output as their own scaffolds or if they should be skipped. &lt;br /&gt;
    - The -noreduce option turns off the graph simplification described above. &lt;br /&gt;
    - The -redundancy option ignores links containing fewer than X edges.&lt;br /&gt;
    - The -repeats option reads a file of repeats (Y) which specify one contig ID per line. Repeat contigs and their links are not used for odering/orienting any other data in the graph. Repeats are currently not resolved and are output as single-contig scaffolds. If known, these may be specified or the repeats identified by MarkRepeats (above) may be used.&lt;br /&gt;
    - The -aggressive option will not mark edges that move a contig more than 3 STDEVS away as bad and will try to reconcile the positions.&lt;/div&gt;</summary>
		<author><name>Sergek</name></author>
	</entry>
	<entry>
		<id>https://wiki.umiacs.umd.edu/cbcb/index.php?title=Cbcb:Pop-Lab:Serge-Report&amp;diff=4550</id>
		<title>Cbcb:Pop-Lab:Serge-Report</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php?title=Cbcb:Pop-Lab:Serge-Report&amp;diff=4550"/>
		<updated>2009-02-12T23:50:22Z</updated>

		<summary type="html">&lt;p&gt;Sergek: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= February =&lt;br /&gt;
&amp;lt;b&amp;gt;February 16th, 2009:&amp;lt;/b&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
* I have been working on the materials for my preliminary exam. My preliminary paper is is available [[Media:PaperList.pdf|here]] and the preliminary paper is [[Media:Paper.pdf|here]].&lt;br /&gt;
* I have also met with Dr. Alan Sussman, a member of my preliminary committee, to discuss parallel option for graph algorithms. He pointed me to several papers on graph and hyper-graph partitioning. I had been thinking of doing randomized graph algorithms for speedup but using efficient partitioning, the standard graph algorithms can be distributed while minimizing communication. The other option is to use shared-memory machines which are more expensive and less available than distributed memory machines.&lt;br /&gt;
* I also implemented two methods for repeat detection in Bambus 2. &lt;br /&gt;
** The first splits the graph into strongly connected components using the SeqAn library. The astat is computed on each connected component separately. The motivation is to subdivide metagenomic samples into their constituent organisms and compute astat only on one organism at a time to avoid inadvertently marking abundant organisms as repeats&lt;br /&gt;
** The second computes all-to-all shortest paths and the number of shortest paths each node is on. The nodes that are on a large number of shortest paths (more than 1.5 stdev from mean) are marked as repetitive.&lt;br /&gt;
** Using the combination of these two approaches Brucella_suis_1330 correctly identifies repeats that were previously causing mis-assembly. I also tried the methods on Acid Mine but SeqAn was crashing. &lt;br /&gt;
* My next steps are to continue writing the preliminary paper, focusing on the proposed work section. I also plan to look at the results on Brucella_suis_1330 in detail to evaluate correctness of contig placement given my repeat identification.&lt;/div&gt;</summary>
		<author><name>Sergek</name></author>
	</entry>
	<entry>
		<id>https://wiki.umiacs.umd.edu/cbcb/index.php?title=File:Paper.pdf&amp;diff=4549</id>
		<title>File:Paper.pdf</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php?title=File:Paper.pdf&amp;diff=4549"/>
		<updated>2009-02-12T23:21:12Z</updated>

		<summary type="html">&lt;p&gt;Sergek: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Sergek</name></author>
	</entry>
	<entry>
		<id>https://wiki.umiacs.umd.edu/cbcb/index.php?title=File:PaperList.pdf&amp;diff=4548</id>
		<title>File:PaperList.pdf</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php?title=File:PaperList.pdf&amp;diff=4548"/>
		<updated>2009-02-12T23:20:56Z</updated>

		<summary type="html">&lt;p&gt;Sergek: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Sergek</name></author>
	</entry>
	<entry>
		<id>https://wiki.umiacs.umd.edu/cbcb/index.php?title=Cbcb:Pop-Lab:Serge-Report&amp;diff=4547</id>
		<title>Cbcb:Pop-Lab:Serge-Report</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php?title=Cbcb:Pop-Lab:Serge-Report&amp;diff=4547"/>
		<updated>2009-02-12T23:11:57Z</updated>

		<summary type="html">&lt;p&gt;Sergek: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;----&lt;br /&gt;
February 16th, 2009&lt;/div&gt;</summary>
		<author><name>Sergek</name></author>
	</entry>
	<entry>
		<id>https://wiki.umiacs.umd.edu/cbcb/index.php?title=Cbcb:Pop-Lab&amp;diff=4546</id>
		<title>Cbcb:Pop-Lab</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php?title=Cbcb:Pop-Lab&amp;diff=4546"/>
		<updated>2009-02-12T23:10:17Z</updated>

		<summary type="html">&lt;p&gt;Sergek: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Important pages =&lt;br /&gt;
* Group members [[Cbcb:Pop-Lab:Members]] &lt;br /&gt;
* Blog: [http://genomes.blogspot.com Genomes.blogspot.com]&lt;br /&gt;
* How-tos: [[Cbcb:Pop-Lab:How-to]] - List of documentation on how to perform certain analyses, use different software, etc.&lt;br /&gt;
* Software developed by us: [[Cbcb:Pop-Lab:Software]]&lt;br /&gt;
&lt;br /&gt;
= Meetings/seminars =&lt;br /&gt;
* [[Pop_group_meeting|Group meeting]]: every Monday from 2:30pm in corner conference room (3120 C)&lt;br /&gt;
* [[seminars|CBCB seminar]]: Thursdays 2-3pm during the semester in main conference room&lt;br /&gt;
&lt;br /&gt;
= Progress Reports =&lt;br /&gt;
* [[Cbcb:Pop-Lab:Niranjan-Report|Niranjan]]&lt;br /&gt;
* [[Cbcb:Pop-Lab:Chris-Report|Chris]]&lt;br /&gt;
* [[Cbcb:Pop-Lab:James-Report|James]]&lt;br /&gt;
* [[Cbcb:Pop-Lab:Serge-Report|Serge]]&lt;br /&gt;
&lt;br /&gt;
= Pop Lab Presentations =&lt;br /&gt;
* [[Media:Xoo_Cripsr.odg|Xanthomonas Crispr Slides (openoffice format)]] &lt;br /&gt;
* Listeria Monocytogenes Slides (powerpoint)&lt;br /&gt;
* Metagenome pipelines overview [[Media:Pipeline_outline.pdf|Presentation(pdf)]]&lt;/div&gt;</summary>
		<author><name>Sergek</name></author>
	</entry>
	<entry>
		<id>https://wiki.umiacs.umd.edu/cbcb/index.php?title=Pop_group_meeting&amp;diff=3885</id>
		<title>Pop group meeting</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php?title=Pop_group_meeting&amp;diff=3885"/>
		<updated>2008-11-17T19:58:47Z</updated>

		<summary type="html">&lt;p&gt;Sergek: /* 11/17/2008 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= 09/29/2008 =&lt;br /&gt;
&lt;br /&gt;
# Benjamin Langmead [http://www.plosone.org/article/info:doi%2F10.1371%2Fjournal.pone.0003263 Metagenomic Analysis of Lysogeny in Tampa Bay: Implications for Prophage Gene Expression] (PLoS one)&lt;br /&gt;
# Bo Liu [http://genomebiology.com/2008/9/9/R139 Systematic bioinformatic analysis of expression levels of 17,330 human genes across 9,783 samples from 175 types of healthy and pathological tissues] (Genome Biology)&lt;br /&gt;
# Daniela Puiu [http://genome.cshlp.org/cgi/reprint/gr.080200.108v1 Sequencing of natural strains of Arabidopsis thaliana with short reads] (Genome Research)&lt;br /&gt;
# Dan Sommer, Niranjan Nagarajan [http://www.nature.com/nature/journal/v455/n7212/edsumm/e080925-04.html Metagenomics: questions to answer] (Nature)&lt;br /&gt;
# James Robert White [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6V73-4SHVST0-1&amp;amp;_user=961305&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=961305&amp;amp;md5=4115e7b8186a36db60f6b11a217a8798 Microbial population dynamics during aerobic sludge granulation at different organic loading rates] (ScienceDirect)&lt;br /&gt;
# Mohammad Reza Ghodsi [http://nar.oxfordjournals.org/cgi/content/short/36/16/5180 A comparison of random sequence reads versus 16S rDNA sequences for estimating the biodiversity of a metagenomic library] (NAR)&lt;br /&gt;
# Sergey Koren [http://www.biomedcentral.com/1471-2105/9/390 R/parallel - speeding up bioinformatics analysis with R] (BMC Bioinformatics)&lt;br /&gt;
# Theodore Gibbons [http://www.ncbi.nlm.nih.gov/pubmed/18711340 High-resolution metagenomics targets specific functional types in complex microbial communities] (Nature Biotechnology) &amp;lt;BR&amp;gt; [http://www.nature.com/nrmicro/journal/v6/n9/abs/nrmicro1935.html Molecular eco-systems biology: towards an understanding of community function] (Systems Microbiology)&lt;br /&gt;
&lt;br /&gt;
= 10/06/2008 =&lt;br /&gt;
# Bo Liu [http://www.pnas.org/content/105/39/15076.short The convergence of carbohydrate active gene repertoires in human gut microbes.] PNAS&lt;br /&gt;
# Daniela Puiu [http://bioinformatics.oxfordjournals.org/cgi/content/full/24/20/2395 SeqMap: mapping massive amount of oligonucleotides to the genome] Bioinformatics&lt;br /&gt;
# James Robert White [http://www.pnas.org/content/104/21/8918.full Artificial selection of simulated microbial ecosystems.] PNAS&lt;br /&gt;
# MohammadReza Ghodsi [http://aem.asm.org/cgi/content/abstract/74/5/1453 Metagenomics: Read Length Matters] Applied Environmental Microbiology&lt;br /&gt;
# Sergey Koren [http://www.ploscompbiol.org/article/info:doi%2F10.1371%2Fjournal.pcbi.1000186 Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads] PLoS Computational Biology&lt;br /&gt;
&lt;br /&gt;
= 10/13/2008 =&lt;br /&gt;
# Daniela Puiu, Daniel Sommer [http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0003373 MetaSim—A Sequencing Simulator for Genomics and Metagenomics] PLoS ONE&lt;br /&gt;
# Bo Liu, James Robert White [http://www.sciencemag.org/cgi/content/full/322/5899/275 Environmental Genomics Reveals a Single-Species Ecosystem Deep Within Earth.] Science&lt;br /&gt;
# MohammadReza Ghodsi [http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2557142 Probing Metagenomics by Rapid Cluster Analysis of Very Large Datasets] (PLoS one)&lt;br /&gt;
&lt;br /&gt;
MetaSim software:&lt;br /&gt;
      Location:         /fs/sz-user-supported/common/packages/MetaSim/ &lt;br /&gt;
      Executable(Java): /fs/sz-user-supported/common/packages/MetaSim/metasim/MetaSim &lt;br /&gt;
      Database:         /fs/sz-user-supported/common/packages/MetaSim/database           &lt;br /&gt;
      create a symlink to it from your working directory, otherwise you have to import it &lt;br /&gt;
      NCBI databases: [ftp://ftp.ncbi.nlm.nih.gov/genomes/Bacteria/all.fna.tar.gz Complete bacteria];   [ftp://ftp.ncbi.nlm.nih.gov/pub/taxonomy/gi_taxid_nucl.dmp.gz Taxonomy]&lt;br /&gt;
&lt;br /&gt;
= 10/20/2008 =&lt;br /&gt;
# Bo Liu [http://www.pnas.org/content/105/37/13977.short?rss=1 Accelerated evolution of resistance in multidrug environments] PNAS &amp;lt;br&amp;gt; [http://www.pnas.org/content/105/39/14918.short?rss=1 Drug interactions modulate the potential for evolution of resistance] PNAS&lt;br /&gt;
# Daniel Sommer [http://www.ncbi.nlm.nih.gov/pubmed/18927115?dopt=Abstract NCBI Reference Sequences: current status, policy and new initiatives] NAR&lt;br /&gt;
# Daniela Puiu [http://genomebiology.com/content/pdf/gb-2008-9-10-r151.pdf A simple, fast, and accurate method of phylogenomic inference] Genome Biology&lt;br /&gt;
# Niranjan Nagarajan [http://www.springerlink.com/content/cn3551154w232l73/fulltext.pdf The Stastical Power of Phylogenetic Motif Models] RECOMB 2008&lt;br /&gt;
# Sergey Koren [http://www.nature.com/nmeth/journal/v5/n10/full/nmeth1008-903.html Scientific software: seeing the SNPs between us] Nature Methods&lt;br /&gt;
# Mohammadreza Ghodsi [http://www.biomedcentral.com/1471-2105/9/386 The metagenomics RAST server – a public resource for the automatic phylogenetic and functional analysis of metagenomes] BMC Bioinformatics&lt;br /&gt;
&lt;br /&gt;
= 10/27/2008 =&lt;br /&gt;
# Bo Liu [http://www.pnas.org/content/105/37/14130.short?rss=1 Frequent emergence and limited geographic dispersal of methicillin-resistant Staphylococcus aureus] PNAS&lt;br /&gt;
# Sergey Koren [http://bioinformatics.oxfordjournals.org/cgi/content/abstract/btn548 Aggressive Assembly of Pyrosequencing Reads with Mates] Bioinformatics&lt;br /&gt;
# Daniela Puiu [http://bioinformatics.oxfordjournals.org/cgi/content/full/24/21/2431 ZOOM! Zillions of oligos mapped] Bioinformatics&lt;br /&gt;
# Mohammadreza Ghodsi [http://www.ncbi.nlm.nih.gov/pubmed/18692931 Biodiversity of the microbial community in a Spanish farmhouse cheese as revealed by culture-dependent and culture-independent methods]&lt;br /&gt;
&lt;br /&gt;
= 11/03/2008 = &lt;br /&gt;
# Daniela Puiu [http://genome.cshlp.org/cgi/content/abstract/gr.7088808v1 Short read fragment assembly of bacterial genomes] (Genome Research)&lt;br /&gt;
&lt;br /&gt;
= 11/10/2008 = &lt;br /&gt;
# Bo Liu [http://www.pnas.org/content/early/2008/11/04/0802782105.abstract Metagenome analysis of an extreme microbial symbiosis reveals eurythermal adaptation and metabolic flexibility] PNAS&lt;br /&gt;
&lt;br /&gt;
= 11/17/2008 =&lt;br /&gt;
# Daniela Puiu [http://bioinformatics.oxfordjournals.org/cgi/content/abstract/btn582 Human genomes as email attachments] Bioinformatics&lt;br /&gt;
# Mohammad Ghodsi [http://nar.oxfordjournals.org/cgi/content/short/36/18/e120 Accurate taxonomy assignments from 16S rRNA sequences produced by highly parallel pyrosequencers] Nucleic Acids Research&lt;br /&gt;
# Sergey Koren [http://www.biomedcentral.com/1471-2105/9/420/abstract XplorSeq: A software environment for integrated management and phylogenetic analysis of metagenomic sequence data] (BMC Bioinformatics)&lt;/div&gt;</summary>
		<author><name>Sergek</name></author>
	</entry>
	<entry>
		<id>https://wiki.umiacs.umd.edu/cbcb/index.php?title=Pop_group_meeting&amp;diff=3884</id>
		<title>Pop group meeting</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php?title=Pop_group_meeting&amp;diff=3884"/>
		<updated>2008-11-17T19:58:36Z</updated>

		<summary type="html">&lt;p&gt;Sergek: /* 11/10/2008 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= 09/29/2008 =&lt;br /&gt;
&lt;br /&gt;
# Benjamin Langmead [http://www.plosone.org/article/info:doi%2F10.1371%2Fjournal.pone.0003263 Metagenomic Analysis of Lysogeny in Tampa Bay: Implications for Prophage Gene Expression] (PLoS one)&lt;br /&gt;
# Bo Liu [http://genomebiology.com/2008/9/9/R139 Systematic bioinformatic analysis of expression levels of 17,330 human genes across 9,783 samples from 175 types of healthy and pathological tissues] (Genome Biology)&lt;br /&gt;
# Daniela Puiu [http://genome.cshlp.org/cgi/reprint/gr.080200.108v1 Sequencing of natural strains of Arabidopsis thaliana with short reads] (Genome Research)&lt;br /&gt;
# Dan Sommer, Niranjan Nagarajan [http://www.nature.com/nature/journal/v455/n7212/edsumm/e080925-04.html Metagenomics: questions to answer] (Nature)&lt;br /&gt;
# James Robert White [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6V73-4SHVST0-1&amp;amp;_user=961305&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=961305&amp;amp;md5=4115e7b8186a36db60f6b11a217a8798 Microbial population dynamics during aerobic sludge granulation at different organic loading rates] (ScienceDirect)&lt;br /&gt;
# Mohammad Reza Ghodsi [http://nar.oxfordjournals.org/cgi/content/short/36/16/5180 A comparison of random sequence reads versus 16S rDNA sequences for estimating the biodiversity of a metagenomic library] (NAR)&lt;br /&gt;
# Sergey Koren [http://www.biomedcentral.com/1471-2105/9/390 R/parallel - speeding up bioinformatics analysis with R] (BMC Bioinformatics)&lt;br /&gt;
# Theodore Gibbons [http://www.ncbi.nlm.nih.gov/pubmed/18711340 High-resolution metagenomics targets specific functional types in complex microbial communities] (Nature Biotechnology) &amp;lt;BR&amp;gt; [http://www.nature.com/nrmicro/journal/v6/n9/abs/nrmicro1935.html Molecular eco-systems biology: towards an understanding of community function] (Systems Microbiology)&lt;br /&gt;
&lt;br /&gt;
= 10/06/2008 =&lt;br /&gt;
# Bo Liu [http://www.pnas.org/content/105/39/15076.short The convergence of carbohydrate active gene repertoires in human gut microbes.] PNAS&lt;br /&gt;
# Daniela Puiu [http://bioinformatics.oxfordjournals.org/cgi/content/full/24/20/2395 SeqMap: mapping massive amount of oligonucleotides to the genome] Bioinformatics&lt;br /&gt;
# James Robert White [http://www.pnas.org/content/104/21/8918.full Artificial selection of simulated microbial ecosystems.] PNAS&lt;br /&gt;
# MohammadReza Ghodsi [http://aem.asm.org/cgi/content/abstract/74/5/1453 Metagenomics: Read Length Matters] Applied Environmental Microbiology&lt;br /&gt;
# Sergey Koren [http://www.ploscompbiol.org/article/info:doi%2F10.1371%2Fjournal.pcbi.1000186 Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads] PLoS Computational Biology&lt;br /&gt;
&lt;br /&gt;
= 10/13/2008 =&lt;br /&gt;
# Daniela Puiu, Daniel Sommer [http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0003373 MetaSim—A Sequencing Simulator for Genomics and Metagenomics] PLoS ONE&lt;br /&gt;
# Bo Liu, James Robert White [http://www.sciencemag.org/cgi/content/full/322/5899/275 Environmental Genomics Reveals a Single-Species Ecosystem Deep Within Earth.] Science&lt;br /&gt;
# MohammadReza Ghodsi [http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2557142 Probing Metagenomics by Rapid Cluster Analysis of Very Large Datasets] (PLoS one)&lt;br /&gt;
&lt;br /&gt;
MetaSim software:&lt;br /&gt;
      Location:         /fs/sz-user-supported/common/packages/MetaSim/ &lt;br /&gt;
      Executable(Java): /fs/sz-user-supported/common/packages/MetaSim/metasim/MetaSim &lt;br /&gt;
      Database:         /fs/sz-user-supported/common/packages/MetaSim/database           &lt;br /&gt;
      create a symlink to it from your working directory, otherwise you have to import it &lt;br /&gt;
      NCBI databases: [ftp://ftp.ncbi.nlm.nih.gov/genomes/Bacteria/all.fna.tar.gz Complete bacteria];   [ftp://ftp.ncbi.nlm.nih.gov/pub/taxonomy/gi_taxid_nucl.dmp.gz Taxonomy]&lt;br /&gt;
&lt;br /&gt;
= 10/20/2008 =&lt;br /&gt;
# Bo Liu [http://www.pnas.org/content/105/37/13977.short?rss=1 Accelerated evolution of resistance in multidrug environments] PNAS &amp;lt;br&amp;gt; [http://www.pnas.org/content/105/39/14918.short?rss=1 Drug interactions modulate the potential for evolution of resistance] PNAS&lt;br /&gt;
# Daniel Sommer [http://www.ncbi.nlm.nih.gov/pubmed/18927115?dopt=Abstract NCBI Reference Sequences: current status, policy and new initiatives] NAR&lt;br /&gt;
# Daniela Puiu [http://genomebiology.com/content/pdf/gb-2008-9-10-r151.pdf A simple, fast, and accurate method of phylogenomic inference] Genome Biology&lt;br /&gt;
# Niranjan Nagarajan [http://www.springerlink.com/content/cn3551154w232l73/fulltext.pdf The Stastical Power of Phylogenetic Motif Models] RECOMB 2008&lt;br /&gt;
# Sergey Koren [http://www.nature.com/nmeth/journal/v5/n10/full/nmeth1008-903.html Scientific software: seeing the SNPs between us] Nature Methods&lt;br /&gt;
# Mohammadreza Ghodsi [http://www.biomedcentral.com/1471-2105/9/386 The metagenomics RAST server – a public resource for the automatic phylogenetic and functional analysis of metagenomes] BMC Bioinformatics&lt;br /&gt;
&lt;br /&gt;
= 10/27/2008 =&lt;br /&gt;
# Bo Liu [http://www.pnas.org/content/105/37/14130.short?rss=1 Frequent emergence and limited geographic dispersal of methicillin-resistant Staphylococcus aureus] PNAS&lt;br /&gt;
# Sergey Koren [http://bioinformatics.oxfordjournals.org/cgi/content/abstract/btn548 Aggressive Assembly of Pyrosequencing Reads with Mates] Bioinformatics&lt;br /&gt;
# Daniela Puiu [http://bioinformatics.oxfordjournals.org/cgi/content/full/24/21/2431 ZOOM! Zillions of oligos mapped] Bioinformatics&lt;br /&gt;
# Mohammadreza Ghodsi [http://www.ncbi.nlm.nih.gov/pubmed/18692931 Biodiversity of the microbial community in a Spanish farmhouse cheese as revealed by culture-dependent and culture-independent methods]&lt;br /&gt;
&lt;br /&gt;
= 11/03/2008 = &lt;br /&gt;
# Daniela Puiu [http://genome.cshlp.org/cgi/content/abstract/gr.7088808v1 Short read fragment assembly of bacterial genomes] (Genome Research)&lt;br /&gt;
&lt;br /&gt;
= 11/10/2008 = &lt;br /&gt;
# Bo Liu [http://www.pnas.org/content/early/2008/11/04/0802782105.abstract Metagenome analysis of an extreme microbial symbiosis reveals eurythermal adaptation and metabolic flexibility] PNAS&lt;br /&gt;
&lt;br /&gt;
= 11/17/2008 =&lt;br /&gt;
# Daniela Puiu [http://bioinformatics.oxfordjournals.org/cgi/content/abstract/btn582 Human genomes as email attachments] Bioinformatics&lt;br /&gt;
# Mohammad Ghodsi [http://nar.oxfordjournals.org/cgi/content/short/36/18/e120 Accurate taxonomy assignments from 16S rRNA sequences produced by highly parallel pyrosequencers] Nucleic Acids Research&lt;/div&gt;</summary>
		<author><name>Sergek</name></author>
	</entry>
	<entry>
		<id>https://wiki.umiacs.umd.edu/cbcb/index.php?title=Pop_group_meeting&amp;diff=3868</id>
		<title>Pop group meeting</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php?title=Pop_group_meeting&amp;diff=3868"/>
		<updated>2008-11-10T00:34:43Z</updated>

		<summary type="html">&lt;p&gt;Sergek: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= 09/29/2008 =&lt;br /&gt;
&lt;br /&gt;
# Benjamin Langmead [http://www.plosone.org/article/info:doi%2F10.1371%2Fjournal.pone.0003263 Metagenomic Analysis of Lysogeny in Tampa Bay: Implications for Prophage Gene Expression] (PLoS one)&lt;br /&gt;
# Bo Liu [http://genomebiology.com/2008/9/9/R139 Systematic bioinformatic analysis of expression levels of 17,330 human genes across 9,783 samples from 175 types of healthy and pathological tissues] (Genome Biology)&lt;br /&gt;
# Daniela Puiu [http://genome.cshlp.org/cgi/reprint/gr.080200.108v1 Sequencing of natural strains of Arabidopsis thaliana with short reads] (Genome Research)&lt;br /&gt;
# Dan Sommer, Niranjan Nagarajan [http://www.nature.com/nature/journal/v455/n7212/edsumm/e080925-04.html Metagenomics: questions to answer] (Nature)&lt;br /&gt;
# James Robert White [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6V73-4SHVST0-1&amp;amp;_user=961305&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=961305&amp;amp;md5=4115e7b8186a36db60f6b11a217a8798 Microbial population dynamics during aerobic sludge granulation at different organic loading rates] (ScienceDirect)&lt;br /&gt;
# Mohammad Reza Ghodsi [http://nar.oxfordjournals.org/cgi/content/short/36/16/5180 A comparison of random sequence reads versus 16S rDNA sequences for estimating the biodiversity of a metagenomic library] (NAR)&lt;br /&gt;
# Sergey Koren [http://www.biomedcentral.com/1471-2105/9/390 R/parallel - speeding up bioinformatics analysis with R] (BMC Bioinformatics)&lt;br /&gt;
# Theodore Gibbons [http://www.ncbi.nlm.nih.gov/pubmed/18711340 High-resolution metagenomics targets specific functional types in complex microbial communities] (Nature Biotechnology) &amp;lt;BR&amp;gt; [http://www.nature.com/nrmicro/journal/v6/n9/abs/nrmicro1935.html Molecular eco-systems biology: towards an understanding of community function] (Systems Microbiology)&lt;br /&gt;
&lt;br /&gt;
= 10/06/2008 =&lt;br /&gt;
# Bo Liu [http://www.pnas.org/content/105/39/15076.short The convergence of carbohydrate active gene repertoires in human gut microbes.] PNAS&lt;br /&gt;
# Daniela Puiu [http://bioinformatics.oxfordjournals.org/cgi/content/full/24/20/2395 SeqMap: mapping massive amount of oligonucleotides to the genome] Bioinformatics&lt;br /&gt;
# James Robert White [http://www.pnas.org/content/104/21/8918.full Artificial selection of simulated microbial ecosystems.] PNAS&lt;br /&gt;
# MohammadReza Ghodsi [http://aem.asm.org/cgi/content/abstract/74/5/1453 Metagenomics: Read Length Matters] Applied Environmental Microbiology&lt;br /&gt;
# Sergey Koren [http://www.ploscompbiol.org/article/info:doi%2F10.1371%2Fjournal.pcbi.1000186 Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads] PLoS Computational Biology&lt;br /&gt;
&lt;br /&gt;
= 10/13/2008 =&lt;br /&gt;
# Daniela Puiu, Daniel Sommer [http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0003373 MetaSim—A Sequencing Simulator for Genomics and Metagenomics] PLoS ONE&lt;br /&gt;
# Bo Liu, James Robert White [http://www.sciencemag.org/cgi/content/full/322/5899/275 Environmental Genomics Reveals a Single-Species Ecosystem Deep Within Earth.] Science&lt;br /&gt;
# MohammadReza Ghodsi [http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2557142 Probing Metagenomics by Rapid Cluster Analysis of Very Large Datasets] (PLoS one)&lt;br /&gt;
&lt;br /&gt;
MetaSim software:&lt;br /&gt;
      Location:         /fs/sz-user-supported/common/packages/MetaSim/ &lt;br /&gt;
      Executable(Java): /fs/sz-user-supported/common/packages/MetaSim/metasim/MetaSim &lt;br /&gt;
      Database:         /fs/sz-user-supported/common/packages/MetaSim/database           &lt;br /&gt;
      create a symlink to it from your working directory, otherwise you have to import it &lt;br /&gt;
      NCBI databases: [ftp://ftp.ncbi.nlm.nih.gov/genomes/Bacteria/all.fna.tar.gz Complete bacteria];   [ftp://ftp.ncbi.nlm.nih.gov/pub/taxonomy/gi_taxid_nucl.dmp.gz Taxonomy]&lt;br /&gt;
&lt;br /&gt;
= 10/20/2008 =&lt;br /&gt;
# Bo Liu [http://www.pnas.org/content/105/37/13977.short?rss=1 Accelerated evolution of resistance in multidrug environments] PNAS &amp;lt;br&amp;gt; [http://www.pnas.org/content/105/39/14918.short?rss=1 Drug interactions modulate the potential for evolution of resistance] PNAS&lt;br /&gt;
# Daniel Sommer [http://www.ncbi.nlm.nih.gov/pubmed/18927115?dopt=Abstract NCBI Reference Sequences: current status, policy and new initiatives] NAR&lt;br /&gt;
# Daniela Puiu [http://genomebiology.com/content/pdf/gb-2008-9-10-r151.pdf A simple, fast, and accurate method of phylogenomic inference] Genome Biology&lt;br /&gt;
# Niranjan Nagarajan [http://www.springerlink.com/content/cn3551154w232l73/fulltext.pdf The Stastical Power of Phylogenetic Motif Models] RECOMB 2008&lt;br /&gt;
# Sergey Koren [http://www.nature.com/nmeth/journal/v5/n10/full/nmeth1008-903.html Scientific software: seeing the SNPs between us] Nature Methods&lt;br /&gt;
# Mohammadreza Ghodsi [http://www.biomedcentral.com/1471-2105/9/386 The metagenomics RAST server – a public resource for the automatic phylogenetic and functional analysis of metagenomes] BMC Bioinformatics&lt;br /&gt;
&lt;br /&gt;
= 10/27/2008 =&lt;br /&gt;
# Bo Liu [http://www.pnas.org/content/105/37/14130.short?rss=1 Frequent emergence and limited geographic dispersal of methicillin-resistant Staphylococcus aureus] PNAS&lt;br /&gt;
# Sergey Koren [http://bioinformatics.oxfordjournals.org/cgi/content/abstract/btn548 Aggressive Assembly of Pyrosequencing Reads with Mates] Bioinformatics&lt;br /&gt;
# Daniela Puiu [http://bioinformatics.oxfordjournals.org/cgi/content/full/24/21/2431 ZOOM! Zillions of oligos mapped] Bioinformatics&lt;br /&gt;
# Mohammadreza Ghodsi [http://www.ncbi.nlm.nih.gov/pubmed/18692931 Biodiversity of the microbial community in a Spanish farmhouse cheese as revealed by culture-dependent and culture-independent methods]&lt;br /&gt;
&lt;br /&gt;
= 11/03/2008 = &lt;br /&gt;
# Daniela Puiu [http://genome.cshlp.org/cgi/content/abstract/gr.7088808v1 Short read fragment assembly of bacterial genomes] (Genome Research)&lt;br /&gt;
&lt;br /&gt;
= 11/10/2008 = &lt;br /&gt;
# Sergey Koren [http://www.biomedcentral.com/1471-2105/9/420/abstract XplorSeq: A software environment for integrated management and phylogenetic analysis of metagenomic sequence data] (BMC Bioinformatics)&lt;/div&gt;</summary>
		<author><name>Sergek</name></author>
	</entry>
	<entry>
		<id>https://wiki.umiacs.umd.edu/cbcb/index.php?title=Pop_group_meeting&amp;diff=3779</id>
		<title>Pop group meeting</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php?title=Pop_group_meeting&amp;diff=3779"/>
		<updated>2008-10-27T14:31:52Z</updated>

		<summary type="html">&lt;p&gt;Sergek: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= 09/29/2008 =&lt;br /&gt;
&lt;br /&gt;
# Benjamin Langmead [http://www.plosone.org/article/info:doi%2F10.1371%2Fjournal.pone.0003263 Metagenomic Analysis of Lysogeny in Tampa Bay: Implications for Prophage Gene Expression] (PLoS one)&lt;br /&gt;
# Bo Liu [http://genomebiology.com/2008/9/9/R139 Systematic bioinformatic analysis of expression levels of 17,330 human genes across 9,783 samples from 175 types of healthy and pathological tissues] (Genome Biology)&lt;br /&gt;
# Daniela Puiu [http://genome.cshlp.org/cgi/reprint/gr.080200.108v1 Sequencing of natural strains of Arabidopsis thaliana with short reads] (Genome Research)&lt;br /&gt;
# Dan Sommer, Niranjan Nagarajan [http://www.nature.com/nature/journal/v455/n7212/edsumm/e080925-04.html Metagenomics: questions to answer] (Nature)&lt;br /&gt;
# James Robert White [http://www.sciencedirect.com/science?_ob=ArticleURL&amp;amp;_udi=B6V73-4SHVST0-1&amp;amp;_user=961305&amp;amp;_rdoc=1&amp;amp;_fmt=&amp;amp;_orig=search&amp;amp;_sort=d&amp;amp;view=c&amp;amp;_version=1&amp;amp;_urlVersion=0&amp;amp;_userid=961305&amp;amp;md5=4115e7b8186a36db60f6b11a217a8798 Microbial population dynamics during aerobic sludge granulation at different organic loading rates] (ScienceDirect)&lt;br /&gt;
# Mohammad Reza Ghodsi [http://nar.oxfordjournals.org/cgi/content/short/36/16/5180 A comparison of random sequence reads versus 16S rDNA sequences for estimating the biodiversity of a metagenomic library] (NAR)&lt;br /&gt;
# Sergey Koren [http://www.biomedcentral.com/1471-2105/9/390 R/parallel - speeding up bioinformatics analysis with R] (BMC Bioinformatics)&lt;br /&gt;
# Theodore Gibbons [http://www.ncbi.nlm.nih.gov/pubmed/18711340 High-resolution metagenomics targets specific functional types in complex microbial communities] (Nature Biotechnology) &amp;lt;BR&amp;gt; [http://www.nature.com/nrmicro/journal/v6/n9/abs/nrmicro1935.html Molecular eco-systems biology: towards an understanding of community function] (Systems Microbiology)&lt;br /&gt;
&lt;br /&gt;
= 10/06/2008 =&lt;br /&gt;
# Bo Liu [http://www.pnas.org/content/105/39/15076.short The convergence of carbohydrate active gene repertoires in human gut microbes.] PNAS&lt;br /&gt;
# Daniela Puiu [http://bioinformatics.oxfordjournals.org/cgi/content/full/24/20/2395 SeqMap: mapping massive amount of oligonucleotides to the genome] Bioinformatics&lt;br /&gt;
# James Robert White [http://www.pnas.org/content/104/21/8918.full Artificial selection of simulated microbial ecosystems.] PNAS&lt;br /&gt;
# MohammadReza Ghodsi [http://aem.asm.org/cgi/content/abstract/74/5/1453 Metagenomics: Read Length Matters] Applied Environmental Microbiology&lt;br /&gt;
# Sergey Koren [http://www.ploscompbiol.org/article/info:doi%2F10.1371%2Fjournal.pcbi.1000186 Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads] PLoS Computational Biology&lt;br /&gt;
&lt;br /&gt;
= 10/13/2008 =&lt;br /&gt;
# Daniela Puiu, Daniel Sommer [http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0003373 MetaSim—A Sequencing Simulator for Genomics and Metagenomics] PLoS ONE&lt;br /&gt;
# Bo Liu, James Robert White [http://www.sciencemag.org/cgi/content/full/322/5899/275 Environmental Genomics Reveals a Single-Species Ecosystem Deep Within Earth.] Science&lt;br /&gt;
# MohammadReza Ghodsi [http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2557142 Probing Metagenomics by Rapid Cluster Analysis of Very Large Datasets] (PLoS one)&lt;br /&gt;
&lt;br /&gt;
MetaSim software:&lt;br /&gt;
      Location:         /fs/sz-user-supported/common/packages/MetaSim/ &lt;br /&gt;
      Executable(Java): /fs/sz-user-supported/common/packages/MetaSim/metasim/MetaSim &lt;br /&gt;
      Database:         /fs/sz-user-supported/common/packages/MetaSim/database           &lt;br /&gt;
      create a symlink to it from your working directory, otherwise you have to import it &lt;br /&gt;
      NCBI databases: [ftp://ftp.ncbi.nlm.nih.gov/genomes/Bacteria/all.fna.tar.gz Complete bacteria];   [ftp://ftp.ncbi.nlm.nih.gov/pub/taxonomy/gi_taxid_nucl.dmp.gz Taxonomy]&lt;br /&gt;
&lt;br /&gt;
= 10/20/2008 =&lt;br /&gt;
# Bo Liu [http://www.pnas.org/content/105/37/13977.short?rss=1 Accelerated evolution of resistance in multidrug environments] PNAS &amp;lt;br&amp;gt; [http://www.pnas.org/content/105/39/14918.short?rss=1 Drug interactions modulate the potential for evolution of resistance] PNAS&lt;br /&gt;
# Daniel Sommer [http://www.ncbi.nlm.nih.gov/pubmed/18927115?dopt=Abstract NCBI Reference Sequences: current status, policy and new initiatives] NAR&lt;br /&gt;
# Daniela Puiu [http://genomebiology.com/content/pdf/gb-2008-9-10-r151.pdf A simple, fast, and accurate method of phylogenomic inference] Genome Biology&lt;br /&gt;
# Niranjan Nagarajan [http://www.springerlink.com/content/cn3551154w232l73/fulltext.pdf The Stastical Power of Phylogenetic Motif Models] RECOMB 2008&lt;br /&gt;
# Sergey Koren [http://www.nature.com/nmeth/journal/v5/n10/full/nmeth1008-903.html Scientific software: seeing the SNPs between us] Nature Methods&lt;br /&gt;
&lt;br /&gt;
= 10/27/2008 =&lt;br /&gt;
&lt;br /&gt;
# Sergey Koren [http://bioinformatics.oxfordjournals.org/cgi/content/abstract/btn548 Aggressive Assembly of Pyrosequencing Reads with Mates] Bioinformatics&lt;br /&gt;
# Daniela Puiu [http://bioinformatics.oxfordjournals.org/cgi/content/full/24/21/2431 ZOOM! Zillions of oligos mapped] Bioinformatics&lt;/div&gt;</summary>
		<author><name>Sergek</name></author>
	</entry>
	<entry>
		<id>https://wiki.umiacs.umd.edu/cbcb/index.php?title=Cbcb:Pop-Lab:How-to&amp;diff=3350</id>
		<title>Cbcb:Pop-Lab:How-to</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php?title=Cbcb:Pop-Lab:How-to&amp;diff=3350"/>
		<updated>2008-09-13T23:14:27Z</updated>

		<summary type="html">&lt;p&gt;Sergek: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==How-To repository==&lt;br /&gt;
[http://www.cbcb.umd.edu/intranet/resources.shtml Getting started at the CBCB] &amp;lt;br&amp;gt;&lt;br /&gt;
[[Cbcb:Pop-Lab:AMOS-CVS How to use AMOS through CVS]] &amp;lt;br&amp;gt;&lt;br /&gt;
[[Cbcb:Pop-Lab:AMOScmp-SR How to use AMOScmp with short read data ]] &amp;lt;br&amp;gt;&lt;br /&gt;
How do I annotate a genome at the CBCB?&amp;lt;br&amp;gt;&lt;br /&gt;
[[Cbcb:Pop-Lab:How do I run the new Bambus | How do I run the new Bambus?]] &amp;lt;br&amp;gt;&lt;br /&gt;
How do I use the antibiotic resistance database?&amp;lt;br&amp;gt;&lt;br /&gt;
How do I use the antibiotic resistance database locally? &amp;lt;br&amp;gt;&lt;br /&gt;
How do I run jobs on the grid? &amp;lt;br&amp;gt;&lt;br /&gt;
How do I compare metagenomic datasets through the metastats website?&amp;lt;br&amp;gt;&lt;br /&gt;
How do I compare metagenomic datasets using R directly? &amp;lt;br&amp;gt;&lt;br /&gt;
How do I find CRISPRs in a new genome?&amp;lt;br&amp;gt;&lt;br /&gt;
[[Cbcb:Pop-Lab:SOMA How do I scaffold a genome using optical maps (both locally and through the web) ]] &amp;lt;br&amp;gt;&lt;br /&gt;
How do I generate graph information out of Minimus?&amp;lt;br&amp;gt;&lt;br /&gt;
What tools are available for doing &amp;lt;i&amp;gt;in silico&amp;lt;/i&amp;gt; finishing at the CBCB? &amp;lt;br&amp;gt;&lt;br /&gt;
How do I generate a scaffold graph starting from a 454 .ace file?&amp;lt;br&amp;gt;&lt;br /&gt;
How do I draw a pretty picture of a scaffold stored in an AMOS bank?&amp;lt;br&amp;gt;&lt;/div&gt;</summary>
		<author><name>Sergek</name></author>
	</entry>
	<entry>
		<id>https://wiki.umiacs.umd.edu/cbcb/index.php?title=Cbcb:Pop-Lab:How-to&amp;diff=3349</id>
		<title>Cbcb:Pop-Lab:How-to</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php?title=Cbcb:Pop-Lab:How-to&amp;diff=3349"/>
		<updated>2008-09-13T23:13:07Z</updated>

		<summary type="html">&lt;p&gt;Sergek: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==How-To repository==&lt;br /&gt;
[http://www.cbcb.umd.edu/intranet/resources.shtml Getting started at the CBCB] &amp;lt;br&amp;gt;&lt;br /&gt;
[[Cbcb:Pop-Lab:AMOS-CVS How to use AMOS through CVS]] &amp;lt;br&amp;gt;&lt;br /&gt;
[[Cbcb:Pop-Lab:AMOScmp-SR How to use AMOScmp with short read data ]] &amp;lt;br&amp;gt;&lt;br /&gt;
How do I annotate a genome at the CBCB?&amp;lt;br&amp;gt;&lt;br /&gt;
[[Cbcb:Pop-Lab:How do I run the new Bambus]] &amp;lt;br&amp;gt;&lt;br /&gt;
How do I use the antibiotic resistance database?&amp;lt;br&amp;gt;&lt;br /&gt;
How do I use the antibiotic resistance database locally? &amp;lt;br&amp;gt;&lt;br /&gt;
How do I run jobs on the grid? &amp;lt;br&amp;gt;&lt;br /&gt;
How do I compare metagenomic datasets through the metastats website?&amp;lt;br&amp;gt;&lt;br /&gt;
How do I compare metagenomic datasets using R directly? &amp;lt;br&amp;gt;&lt;br /&gt;
How do I find CRISPRs in a new genome?&amp;lt;br&amp;gt;&lt;br /&gt;
[[Cbcb:Pop-Lab:SOMA How do I scaffold a genome using optical maps (both locally and through the web) ]] &amp;lt;br&amp;gt;&lt;br /&gt;
How do I generate graph information out of Minimus?&amp;lt;br&amp;gt;&lt;br /&gt;
What tools are available for doing &amp;lt;i&amp;gt;in silico&amp;lt;/i&amp;gt; finishing at the CBCB? &amp;lt;br&amp;gt;&lt;br /&gt;
How do I generate a scaffold graph starting from a 454 .ace file?&amp;lt;br&amp;gt;&lt;br /&gt;
How do I draw a pretty picture of a scaffold stored in an AMOS bank?&amp;lt;br&amp;gt;&lt;/div&gt;</summary>
		<author><name>Sergek</name></author>
	</entry>
	<entry>
		<id>https://wiki.umiacs.umd.edu/cbcb/index.php?title=Cbcb:Pop-Lab:How_do_I_run_the_new_Bambus&amp;diff=3348</id>
		<title>Cbcb:Pop-Lab:How do I run the new Bambus</title>
		<link rel="alternate" type="text/html" href="https://wiki.umiacs.umd.edu/cbcb/index.php?title=Cbcb:Pop-Lab:How_do_I_run_the_new_Bambus&amp;diff=3348"/>
		<updated>2008-09-13T23:12:54Z</updated>

		<summary type="html">&lt;p&gt;Sergek: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The new Bambus (aka Bambus 2) actually consists of three executables that run in order using a supplied AMOS bank. An important note, Bambus 2 is still an early beta. As such, it is advisable to back up&lt;br /&gt;
the bnk directory before using.&lt;br /&gt;
&lt;br /&gt;
The first program is clk. This program finds all mated reads within contigs and converts the mate distances to be relative to contigs rather than reads. The second program is Bundler. Bundler joins together&lt;br /&gt;
the contig link messages generated by clk together when they can be to create consensus links between contigs. It will output multiple contig links for a pair of contigs. The final program is OrientContigs.&lt;br /&gt;
OrientContigs uses the contig links to orient and order the contigs into scaffolds, as well as performing some simplification by joining contigs. Each of the programs is covered in more detail below. To get more&lt;br /&gt;
help on running any program use -h.&lt;br /&gt;
&lt;br /&gt;
1. clk&lt;br /&gt;
  - Modifies the bank to create contig edges.&lt;br /&gt;
  - Example: clk -b[ank] data.bnk&lt;br /&gt;
&lt;br /&gt;
2.Bundler&lt;br /&gt;
  - Bundle together contig edges to create contig links.&lt;br /&gt;
  - Example: Bundler -b[ank] data.bnk [-t[ype] comma separated list of edge types]&lt;br /&gt;
  - The -t[ype] option allows only certain contig edges to be processed. ALL means use any type. The types are defined in src/AMOS/Link_AMOS.hh&lt;br /&gt;
&lt;br /&gt;
3. OrientContigs&lt;br /&gt;
 - Orient and order the contigs based on the links. This program uses a greedy algorithm to orient and order contigs relative to an arbitrary start contig. Edges that contradict the current scaffold are marked bad&lt;br /&gt;
   and ignored for the rest of the analysis. They are still output but don&#039;t affect any subsequent calculations.&lt;br /&gt;
   &lt;br /&gt;
   The output is a dot-formatted file as well as an NCBI AGP scaffold format. &lt;br /&gt;
   &lt;br /&gt;
   Note that this program does not currently linearize the scaffold but maintains them as a graph. This program also recursively simplifies common patters in the graph. Currently the patterns are&lt;br /&gt;
   bubbles or straight lines. For example, contigs A-&amp;gt;B-&amp;gt;C will be simplified to just A. Also A-&amp;gt;B-&amp;gt;D will become A as well. This simplification is performed recursively until the graph is stable. Note that the   &lt;br /&gt;
   												\&amp;gt;C/&amp;gt; &lt;br /&gt;
   simplification updates the bank in a destructive way by removing contigs and replacing them (as well as their edges) with updated contigs. Therefore it is necessary to make a backup of the bank before running&lt;br /&gt;
&lt;br /&gt;
   Example: OrientContigs -b[ank] &amp;lt;bank_name&amp;gt; [-a[ll] -[r]noreduce -[n]noagressive]. The -all option specifies whether disconnected contigs should be output as their own scaffolds or if they should be ignored. The -noreduce option&lt;br /&gt;
   turns off the simplification described above. Finally, the -noaggressive option will mark edges that move a contig more than 3 STDEVS away as bad instead of attempting to reconcile the positions.&lt;/div&gt;</summary>
		<author><name>Sergek</name></author>
	</entry>
</feed>