<?xml version="1.0" encoding="utf-8"?><?xml-stylesheet title="XSL formatting" type="text/xsl" href="http://blog.isavoir.com/feed/rss2/xslt" ?><rss version="2.0"
  xmlns:dc="http://purl.org/dc/elements/1.1/"
  xmlns:wfw="http://wellformedweb.org/CommentAPI/"
  xmlns:content="http://purl.org/rss/1.0/modules/content/">
<channel>
  <title>DNA MANIA - Natural language processing</title>
  <link>http://blog.isavoir.com/</link>
  <description>Bioinformatics, Text Mining, Biological Text Mining, Named entity recognition, Genomics, Systems Biology, Semantics, Computational Biology, Semantic Web, Knowledge management, Biomedicine, Ontology, Thesaurus, Terminology, Corpora, Content management</description>
  <language>en</language>
  <pubDate>Sat, 05 Jul 2008 13:58:56 +0200</pubDate>
  <copyright>iSavoir @ 2007 copyright reserved</copyright>
  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
  <generator>Dotclear</generator>
  
    
  <item>
    <title>Is Information Extraction from the scientific literature ready for Life science?</title>
    <link>http://blog.isavoir.com/post/2007/03/19/Is-Information-Extraction-IE-from-the-scientific-litterature-ready-for-Life-science</link>
    <guid isPermaLink="false">urn:md5:c391034b37a9022837287421fb3c64d0</guid>
    <pubDate>Mon, 19 Mar 2007 13:30:00 +0100</pubDate>
    <dc:creator>Frédéric</dc:creator>
        <category>Text Mining</category>
        <category>information Extraction</category><category>Natural language processing</category><category>NLP</category><category>Ontology</category><category>Ontology-driven information extraction</category>    
    <description>&lt;p&gt;&amp;quot;For the average biologist, hands-on literature mining currently means a
keyword search in PubMed. However, methods for extracting biomedical facts from
the scientific literature have improved considerably, and the associated tools
will probably soon be used in many laboratories to automatically annotate and
analyse the growing number of system-wide experimental data sets.&amp;quot;&lt;br /&gt;&lt;/p&gt;
&lt;p&gt;Extract from Nature Reviews Genetics: Literature mining for the biologist:
from information retrieval to biological discovery by Peer Bork et al. 2006&lt;/p&gt;
&lt;p&gt;Simply put, &lt;a href=&quot;http://blog.isavoir.com/tag/Information%20extraction&quot;&gt;Information
extraction&lt;/a&gt; (IE) accomplishes these tasks:&lt;br /&gt;&lt;/p&gt;
&lt;p&gt;* Take natural language text from a document source and extract the
essential facts about one or more predefined fact types.&lt;br /&gt;&lt;/p&gt;
&lt;p&gt;* Represent each fact as a template whose slots are filled on the basis of
what is found in the text.&lt;br /&gt;&lt;/p&gt;
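&lt;p&gt;The two tasks above can be illustrated with a minimal Python sketch. The
fact type (BindingFact), its slot names, the single hand-written pattern and the
example sentences are all assumptions made up for this illustration; real IE
systems rely on much richer linguistic analysis than one regular expression.&lt;/p&gt;
&lt;pre&gt;
```python
import re
from dataclasses import dataclass


# Hypothetical fact type: a "binding" template with two slots.
@dataclass
class BindingFact:
    agent: str   # the entity that binds
    target: str  # the entity that is bound


# One hand-written pattern, for illustration only.
BINDING_PATTERN = re.compile(r"\b(\w+) binds (?:to )?(?:the )?(\w+)")


def extract_binding_facts(text):
    """Return one filled template per pattern match found in the text."""
    return [
        BindingFact(agent=match.group(1), target=match.group(2))
        for match in BINDING_PATTERN.finditer(text)
    ]


# Made-up example sentences.
facts = extract_binding_facts("GAL4 binds to the UAS element. LexA binds lexO.")
for fact in facts:
    print(fact)
```
&lt;/pre&gt;
&lt;p&gt;Each match fills the slots of one template, so the output is a list of
structured records rather than a list of documents.&lt;/p&gt;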
&lt;p&gt;IE is typically carried out in support of other tasks and usually forms part of
an application or a pipeline of processes. The results of IE are stored in
databases and then queried or mined, integrated into knowledge bases to allow
reasoning, or presented to users for annotation or curation
tasks.&lt;br /&gt;&lt;/p&gt;
&lt;p&gt;Thus, IE is an application of &lt;a href=&quot;http://blog.isavoir.com/tag/Natural%20language%20processing&quot;&gt;Natural language processing&lt;/a&gt;
(&lt;a href=&quot;http://blog.isavoir.com/tag/NLP&quot;&gt;NLP&lt;/a&gt;). As the term implies, the goal is to extract
information from text, and the aim is to do so without requiring the end user
to read the text. In contrast, Information Retrieval (IR), as performed by a search engine, is
the activity of finding documents that answer an information need with the help
of an index.&lt;/p&gt;
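&lt;p&gt;To make the contrast concrete, here is a minimal sketch of the IR side: a
tiny inverted index that returns documents, not facts. The function names, the
document ids and the two example texts are assumptions for illustration, and the
tokenisation is deliberately naive.&lt;/p&gt;
&lt;pre&gt;
```python
from collections import defaultdict


def build_index(documents):
    """Map each lowercased word to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for word in text.lower().split():
            # Naive tokenisation: strip common punctuation around each word.
            index[word.strip(".,;:()")].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every word of the query."""
    words = query.lower().split()
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result = result.intersection(index.get(word, set()))
    return result


# Made-up example documents.
docs = {
    "doc1": "The repressor binds the promoter.",
    "doc2": "Promoter sequences regulate gene expression.",
}
index = build_index(docs)
hits = search(index, "promoter")
print(sorted(hits))
```
&lt;/pre&gt;
&lt;p&gt;The search only tells the user which documents to read; extracting what
those documents say is left to the reader, which is exactly the step IE tries to
automate.&lt;/p&gt;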
&lt;p&gt;IE has dealt primarily with news sources and, more recently, with
scientific publications. In the sciences, general-language grammars and
dictionaries are not enough. Scientific fields use many technical terms, only a
few of which are found in common discourse. To some extent, such terms can be listed
in auxiliary terminologies. However, automatic term recognition (ATR) is
useful for IE to extract named entities on the basis of their internal
structure.&lt;/p&gt;
&lt;p&gt;Regardless of which IE approaches were used in the past, scientific fields,
especially biology and biomedicine, are not well served by IE systems that
do not make use of &lt;a href=&quot;http://blog.isavoir.com/tag/Ontology&quot;&gt;ontologies&lt;/a&gt; and linguistic
lexicons. The best example is &lt;a href=&quot;http://www.ims.uni-stuttgart.de/projekte/GenIE/&quot; hreflang=&quot;eng&quot;&gt;GenIE &amp;quot;Genome
Information Extraction&amp;quot;&lt;/a&gt; from the Institute for Computational Linguistics at
the University of Stuttgart. It uses ontology-driven information extraction
technologies that go beyond extracting simple facts from sentences. Its aim
is to deal with cases where an anaphoric reference must be resolved, where
information from several sentences must be merged, or where a relation must be
established between events.&lt;/p&gt;
&lt;p&gt;For instance, if a sentence refers explicitly to a binding action, and the
following sentence points to the regulation of gene expression due to the
interaction between binding factors and promoter sequences, then the
dependency between the two events should be captured.&lt;/p&gt;
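&lt;p&gt;As a rough illustration of what capturing such a dependency could look
like in a data structure, the sketch below links a regulation event to a binding
event found in the preceding sentence. This is not how GenIE works; the Event
class, the event labels and the adjacent-sentence heuristic are all assumptions
chosen to keep the example short.&lt;/p&gt;
&lt;pre&gt;
```python
from dataclasses import dataclass, field


@dataclass
class Event:
    kind: str            # hypothetical labels: "binding" or "regulation"
    sentence_index: int  # position of the sentence the event was found in
    caused_by: list = field(default_factory=list)


def link_adjacent_events(events):
    """Attach each regulation event to binding events from the preceding sentence.

    Naive heuristic for illustration only: a real system would resolve the
    anaphoric reference instead of relying on sentence adjacency.
    """
    for event in events:
        if event.kind != "regulation":
            continue
        for other in events:
            is_previous = other.sentence_index == event.sentence_index - 1
            if other.kind == "binding" and is_previous:
                event.caused_by.append(other)
    return events


# Sentence 0 mentions a binding action, sentence 1 the resulting regulation.
events = link_adjacent_events([Event("binding", 0), Event("regulation", 1)])
print(events[1].caused_by)
```
&lt;/pre&gt;
&lt;p&gt;The point is only that the output keeps an explicit link between the two
events instead of two unrelated facts.&lt;/p&gt;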
&lt;p&gt;A must-read: &lt;a href=&quot;http://www.nature.com/nrg/journal/v7/n2/abs/nrg1768.html;jsessionid=C3EA31280579A569ED0ED327B540FA2F&quot; hreflang=&quot;eng&quot;&gt;&amp;quot;Literature mining for the biologist: from information
retrieval to biological discovery&amp;quot; by Peer Bork et al., Nature Reviews Genetics,
2006.&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;DNA MANIA&lt;/p&gt;</description>
    
    
    
          <comments>http://blog.isavoir.com/post/2007/03/19/Is-Information-Extraction-IE-from-the-scientific-litterature-ready-for-Life-science#comment-form</comments>
      <wfw:comment>http://blog.isavoir.com/post/2007/03/19/Is-Information-Extraction-IE-from-the-scientific-litterature-ready-for-Life-science#comment-form</wfw:comment>
      <wfw:commentRss>http://blog.isavoir.com/feed/rss2/comments/89581</wfw:commentRss>
      </item>
    
</channel>
</rss>