<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Wikis for genome (re)annotation</title>
	<atom:link href="http://fungalgenomes.org/blog/2007/02/wikis-for-genome-reannotation/feed/" rel="self" type="application/rss+xml" />
	<link>http://fungalgenomes.org/blog/2007/02/wikis-for-genome-reannotation/</link>
	<description>Digesting the fungal genomes</description>
	<lastBuildDate>Tue, 23 Feb 2010 18:31:48 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=abc</generator>
	<item>
		<title>By: Hilmar Lapp</title>
		<link>http://fungalgenomes.org/blog/2007/02/wikis-for-genome-reannotation/comment-page-1/#comment-21</link>
		<dc:creator>Hilmar Lapp</dc:creator>
		<pubDate>Thu, 22 Feb 2007 17:04:29 +0000</pubDate>
		<guid isPermaLink="false">http://fungalgenomes.org/blog/2007/02/wikis-for-genome-reannotation/#comment-21</guid>
		<description>I agree, content aggregation is going to be key. The other component that I think will be critical is formal semantics. These need to be captured from user input without exposing the user to the underlying complexity, yet they need to be integrated into a formal framework (such as RDF) that is computable, i.e., can be reasoned over, for example using &lt;a href=&quot;http://www.sparql.org/sparql.html&quot; rel=&quot;nofollow&quot;&gt;semantic queries formulated in SPARQL&lt;/a&gt; (there are &lt;a href=&quot;http://esw.w3.org/topic/DawgShows&quot; rel=&quot;nofollow&quot;&gt;many examples on the web&lt;/a&gt;), or using UIs such as &lt;a href=&quot;http://swoogle.umbc.edu/&quot; rel=&quot;nofollow&quot;&gt;Swoogle&lt;/a&gt;. There are efforts underway to marry semantic web to Wiki, such as &lt;a href=&quot;http://wiki.ontoworld.org/wiki/Semantic_MediaWiki&quot; rel=&quot;nofollow&quot;&gt;Semantic MediaWiki&lt;/a&gt;, which aim to have the best of both worlds, i.e., a wiki interface for editing data or annotation, and an RDF interface for machines (check out the Export RDF feature). One of the things I noted in the genome wiki examples is that the user is not supposed to edit the (machine-generated) formally structured parts, but instead uses &quot;Notes&quot; sections. I believe this needs to be integrated, not separated. The &lt;a href=&quot;http://www.wikiprofessional.info/&quot; rel=&quot;nofollow&quot;&gt;WikiProteins&lt;/a&gt; demo that apparently has been mentioned in a news article in Nature may provide some inspiration as to how this might work. Exposing obvious metadata as RDF that is now embedded in HTML, using &lt;a href=&quot;http://microformats.org/&quot; rel=&quot;nofollow&quot;&gt;microformats&lt;/a&gt; or not, may become less and less difficult; for example using browser plugins such as &lt;a href=&quot;http://simile.mit.edu/wiki/Piggy_Bank&quot; rel=&quot;nofollow&quot;&gt;PiggyBank&lt;/a&gt; that use generic and custom screen scrapers to enable faceted browsing on (almost) ordinary web pages (anyone up for writing a custom PiggyBank screen scraper for a genome or taxonomic database?). In addition, there are emerging W3C standards such as &lt;a href=&quot;http://www.w3.org/2004/01/rdxh/spec&quot; rel=&quot;nofollow&quot;&gt;GRDDL&lt;/a&gt; that might help in the transition. Just yesterday I&#039;ve just listened to an &lt;a href=&quot;http://www.ibiblio.org/hhalpin/homepage/presentations/isis/&quot; rel=&quot;nofollow&quot;&gt;interesting talk&lt;/a&gt; in this regard by &lt;a href=&quot;http://www.ibiblio.org/hhalpin/&quot; rel=&quot;nofollow&quot;&gt;Harry Halpin&lt;/a&gt;, a co-chair of the GRDDL working group.

BTW interestingly very similar ideas in the area of communal content editing and portals that enable that are being discussed in the Biodiversity field, for example in the highly ambitious &lt;a href=&quot;http://eolinformatics.mbl.edu/&quot; rel=&quot;nofollow&quot;&gt;Encyclopedia of Life project&lt;/a&gt; (see, for example, the &lt;a href=&quot;http://eolinformatics.mbl.edu/Documents/draft_technicalvision.html&quot; rel=&quot;nofollow&quot;&gt;draft technical vision&lt;/a&gt;).
 
As an idea of where this might all lead to, imagine ordinary users could program the web of genome annotations, the tree of life, and ontologies of gene function, anatomy, and disease, using an interface like &lt;a href=&quot;http://pipes.yahoo.com/&quot; rel=&quot;nofollow&quot;&gt;Yahoo Pipes&lt;/a&gt;, just as it allows them now to program the RSS-enabled web.

As for community annotation of genomes, has anyone who reads this tried to poke around on &lt;a href=&quot;http://theseed.uchicago.edu/FIG/index.cgi&quot; rel=&quot;nofollow&quot;&gt;SEED&lt;/a&gt;? I&#039;ve used this extensively when I reconstructed (part of) the metabolic network of T. maritima in the context of a graduate class at UCSD taught by &lt;a href=&quot;http://gcrg.ucsd.edu/personnel/palsson.htm&quot; rel=&quot;nofollow&quot;&gt;B. Palsson&lt;/a&gt;, and I found it the most useful of all genome annotation sources for Thermotoga.</description>
		<content:encoded><![CDATA[<p>I agree, content aggregation is going to be key. The other component that I think will be critical is formal semantics. These need to be captured from user input without exposing the user to the underlying complexity, yet they need to be integrated into a formal framework (such as RDF) that is computable, i.e., can be reasoned over, for example using <a href="http://www.sparql.org/sparql.html" rel="nofollow">semantic queries formulated in SPARQL</a> (there are <a href="http://esw.w3.org/topic/DawgShows" rel="nofollow">many examples on the web</a>), or using UIs such as <a href="http://swoogle.umbc.edu/" rel="nofollow">Swoogle</a>. There are efforts underway to marry semantic web to Wiki, such as <a href="http://wiki.ontoworld.org/wiki/Semantic_MediaWiki" rel="nofollow">Semantic MediaWiki</a>, which aim to have the best of both worlds, i.e., a wiki interface for editing data or annotation, and an RDF interface for machines (check out the Export RDF feature). One of the things I noted in the genome wiki examples is that the user is not supposed to edit the (machine-generated) formally structured parts, but instead uses &#8220;Notes&#8221; sections. I believe this needs to be integrated, not separated. The <a href="http://www.wikiprofessional.info/" rel="nofollow">WikiProteins</a> demo that apparently has been mentioned in a news article in Nature may provide some inspiration as to how this might work. Exposing obvious metadata as RDF that is now embedded in HTML, using <a href="http://microformats.org/" rel="nofollow">microformats</a> or not, may become less and less difficult; for example using browser plugins such as <a href="http://simile.mit.edu/wiki/Piggy_Bank" rel="nofollow">PiggyBank</a> that use generic and custom screen scrapers to enable faceted browsing on (almost) ordinary web pages (anyone up for writing a custom PiggyBank screen scraper for a genome or taxonomic database?). In addition, there are emerging W3C standards such as <a href="http://www.w3.org/2004/01/rdxh/spec" rel="nofollow">GRDDL</a> that might help in the transition. Just yesterday I&#8217;ve just listened to an <a href="http://www.ibiblio.org/hhalpin/homepage/presentations/isis/" rel="nofollow">interesting talk</a> in this regard by <a href="http://www.ibiblio.org/hhalpin/" rel="nofollow">Harry Halpin</a>, a co-chair of the GRDDL working group.</p>
<p>BTW interestingly very similar ideas in the area of communal content editing and portals that enable that are being discussed in the Biodiversity field, for example in the highly ambitious <a href="http://eolinformatics.mbl.edu/" rel="nofollow">Encyclopedia of Life project</a> (see, for example, the <a href="http://eolinformatics.mbl.edu/Documents/draft_technicalvision.html" rel="nofollow">draft technical vision</a>).</p>
<p>As an idea of where this might all lead to, imagine ordinary users could program the web of genome annotations, the tree of life, and ontologies of gene function, anatomy, and disease, using an interface like <a href="http://pipes.yahoo.com/" rel="nofollow">Yahoo Pipes</a>, just as it allows them now to program the RSS-enabled web.</p>
<p>As for community annotation of genomes, has anyone who reads this tried to poke around on <a href="http://theseed.uchicago.edu/FIG/index.cgi" rel="nofollow">SEED</a>? I&#8217;ve used this extensively when I reconstructed (part of) the metabolic network of T. maritima in the context of a graduate class at UCSD taught by <a href="http://gcrg.ucsd.edu/personnel/palsson.htm" rel="nofollow">B. Palsson</a>, and I found it the most useful of all genome annotation sources for Thermotoga.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steven Salzberg</title>
		<link>http://fungalgenomes.org/blog/2007/02/wikis-for-genome-reannotation/comment-page-1/#comment-12</link>
		<dc:creator>Steven Salzberg</dc:creator>
		<pubDate>Sat, 17 Feb 2007 19:01:27 +0000</pubDate>
		<guid isPermaLink="false">http://fungalgenomes.org/blog/2007/02/wikis-for-genome-reannotation/#comment-12</guid>
		<description>This is an excellent discussion and I&#039;m glad to see these points being pursued further.  The wiki sites of some organism databases are a good start, and I hope this will spread.  Jason Stajich makes a good point with the argument that users shouldn&#039;t have to be bioinformaticists in order to improve a gene set.  This has been a repeated problem over the years for many genomes: users who are experts on a particular gene (but not necessarily on bioinformatics methods) cannot fix &quot;their&quot; gene when a new genome appears with that gene mis-annotated.  
  In addition to fixing annotation though, my article was pointing out that we need a way to maintain and distribute genome annotation.  Today (I would argue) that a large majority of scientists use Genbank/EMBL/DDBJ as their &quot;go to&quot; resource for genes for any species - I go there myself, all the time.  I&#039;m hoping that a new type of resource - perhaps a Wiki - will emerge to provide up-to-date annotation.  Perhaps it would need a gatekeeper, but Wikipedia has worked pretty amazingly well with very minimal gatekeeping.</description>
		<content:encoded><![CDATA[<p>This is an excellent discussion and I&#8217;m glad to see these points being pursued further.  The wiki sites of some organism databases are a good start, and I hope this will spread.  Jason Stajich makes a good point with the argument that users shouldn&#8217;t have to be bioinformaticists in order to improve a gene set.  This has been a repeated problem over the years for many genomes: users who are experts on a particular gene (but not necessarily on bioinformatics methods) cannot fix &#8220;their&#8221; gene when a new genome appears with that gene mis-annotated.<br />
  In addition to fixing annotation though, my article was pointing out that we need a way to maintain and distribute genome annotation.  Today (I would argue) that a large majority of scientists use Genbank/EMBL/DDBJ as their &#8220;go to&#8221; resource for genes for any species &#8211; I go there myself, all the time.  I&#8217;m hoping that a new type of resource &#8211; perhaps a Wiki &#8211; will emerge to provide up-to-date annotation.  Perhaps it would need a gatekeeper, but Wikipedia has worked pretty amazingly well with very minimal gatekeeping.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jason Stajich</title>
		<link>http://fungalgenomes.org/blog/2007/02/wikis-for-genome-reannotation/comment-page-1/#comment-7</link>
		<dc:creator>Jason Stajich</dc:creator>
		<pubDate>Thu, 15 Feb 2007 21:28:26 +0000</pubDate>
		<guid isPermaLink="false">http://fungalgenomes.org/blog/2007/02/wikis-for-genome-reannotation/#comment-7</guid>
		<description>These are all nice things, but setting up a wiki with downloadable data is not the same as a wiki for annotation of a genome.  I don&#039;t want users to have to be experts in informatics to be able to make an improvement to the gene set.  But I&#039;m not really designing or developing a genome wiki so I&#039;ll stop complaining!  

The wikis that the MODs have setup are nice starts, but I don&#039;t think that just installing mediawiki doesn&#039;t really change the way that the information flows from researchers into a centralized DB and out back to the community.  I realize we are just at the beginning here, but I&#039;d like to see ways that gene and genome pages are made up of dynamic content that comes from databases of published data, curated content, and automated analyses.

While I&#039;m dreaming, what happens when we want to move away from single-gene oriented focus.  What if we want to annotate function or attributes to a gene cluster.  How can we build systems that aggregate in all the associated information from the different underlying gene informations.  Maybe everything emits RSS and we build giant aggregators.  &lt;a href=&quot;http://lsid.sourceforge.net/&quot; rel=&quot;nofollow&quot;&gt;LSID&lt;/a&gt; enabled of course... =)</description>
		<content:encoded><![CDATA[<p>These are all nice things, but setting up a wiki with downloadable data is not the same as a wiki for annotation of a genome.  I don&#8217;t want users to have to be experts in informatics to be able to make an improvement to the gene set.  But I&#8217;m not really designing or developing a genome wiki so I&#8217;ll stop complaining!  </p>
<p>The wikis that the MODs have setup are nice starts, but I don&#8217;t think that just installing mediawiki doesn&#8217;t really change the way that the information flows from researchers into a centralized DB and out back to the community.  I realize we are just at the beginning here, but I&#8217;d like to see ways that gene and genome pages are made up of dynamic content that comes from databases of published data, curated content, and automated analyses.</p>
<p>While I&#8217;m dreaming, what happens when we want to move away from single-gene oriented focus.  What if we want to annotate function or attributes to a gene cluster.  How can we build systems that aggregate in all the associated information from the different underlying gene informations.  Maybe everything emits RSS and we build giant aggregators.  <a href="http://lsid.sourceforge.net/" rel="nofollow">LSID</a> enabled of course&#8230; =)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: RPM</title>
		<link>http://fungalgenomes.org/blog/2007/02/wikis-for-genome-reannotation/comment-page-1/#comment-6</link>
		<dc:creator>RPM</dc:creator>
		<pubDate>Wed, 14 Feb 2007 20:57:00 +0000</pubDate>
		<guid isPermaLink="false">http://fungalgenomes.org/blog/2007/02/wikis-for-genome-reannotation/#comment-6</guid>
		<description>The analysis of the 12 Drosophila genomes was done (is being done?) by &lt;a href=&quot;http://rana.lbl.gov/drosophila/wiki/index.php/Main_Page&quot; rel=&quot;nofollow&quot;&gt;wiki&lt;/a&gt;. I&#039;m not sure if this wiki will be used now that the genome papers have been submitted for publication, but it could be a useful place to submit reannotations by different groups.</description>
		<content:encoded><![CDATA[<p>The analysis of the 12 Drosophila genomes was done (is being done?) by <a href="http://rana.lbl.gov/drosophila/wiki/index.php/Main_Page" rel="nofollow">wiki</a>. I&#8217;m not sure if this wiki will be used now that the genome papers have been submitted for publication, but it could be a useful place to submit reannotations by different groups.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jim Hu</title>
		<link>http://fungalgenomes.org/blog/2007/02/wikis-for-genome-reannotation/comment-page-1/#comment-5</link>
		<dc:creator>Jim Hu</dc:creator>
		<pubDate>Wed, 14 Feb 2007 00:00:02 +0000</pubDate>
		<guid isPermaLink="false">http://fungalgenomes.org/blog/2007/02/wikis-for-genome-reannotation/#comment-5</guid>
		<description>Several organism databases are working on using wikis for annotation. The link on my name is for EcoliWiki, a prototype of a community annotation system we are setting up for E. coli as part of the new EcoliHub project.  SGD is adding a wiki to replace it&#039;s previous community annotation system, DictyBase, Xanthusbase, and Wormbase all have them.

We also have created a wiki-based browser for the Gene Ontology at http://gowiki.tamu.edu.

As noted in the post, there are a number of challenges to be met in using wikis for annotation.  What will be interesting is to see how the wikis themselves can be used as a mechanism to create and refine protocols for community annotation via these new web media.</description>
		<content:encoded><![CDATA[<p>Several organism databases are working on using wikis for annotation. The link on my name is for EcoliWiki, a prototype of a community annotation system we are setting up for E. coli as part of the new EcoliHub project.  SGD is adding a wiki to replace it&#8217;s previous community annotation system, DictyBase, Xanthusbase, and Wormbase all have them.</p>
<p>We also have created a wiki-based browser for the Gene Ontology at <a href="http://gowiki.tamu.edu" rel="nofollow">http://gowiki.tamu.edu</a>.</p>
<p>As noted in the post, there are a number of challenges to be met in using wikis for annotation.  What will be interesting is to see how the wikis themselves can be used as a mechanism to create and refine protocols for community annotation via these new web media.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
