<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Computational Linguistics &#8211; News update for Oct 9, 2006</title>
	<atom:link href="http://alwayslearning.wordpress.com/2006/10/09/computational-linguistics-news-update-for-oct-9-2006/feed/" rel="self" type="application/rss+xml" />
	<link>http://alwayslearning.wordpress.com/2006/10/09/computational-linguistics-news-update-for-oct-9-2006/</link>
	<description>The world through the prism of my mind</description>
	<lastBuildDate>Wed, 25 Mar 2009 02:20:22 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Computational Linguistics - News update for Nov 15, 2006 &#171; Always Learning!</title>
		<link>http://alwayslearning.wordpress.com/2006/10/09/computational-linguistics-news-update-for-oct-9-2006/#comment-1040</link>
		<dc:creator>Computational Linguistics - News update for Nov 15, 2006 &#171; Always Learning!</dc:creator>
		<pubDate>Thu, 16 Nov 2006 04:43:35 +0000</pubDate>
		<guid isPermaLink="false">http://alwayslearning.wordpress.com/2006/10/09/computational-linguistics-news-update-for-oct-9-2006/#comment-1040</guid>
		<description>[...] Lots of new sightings of CL/NLP technologies since the last update: [...]</description>
		<content:encoded><![CDATA[<p>[...] Lots of new sightings of CL/NLP technologies since the last update: [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Alexandre Rafalovitch</title>
		<link>http://alwayslearning.wordpress.com/2006/10/09/computational-linguistics-news-update-for-oct-9-2006/#comment-923</link>
		<dc:creator>Alexandre Rafalovitch</dc:creator>
		<pubDate>Wed, 01 Nov 2006 15:21:48 +0000</pubDate>
		<guid isPermaLink="false">http://alwayslearning.wordpress.com/2006/10/09/computational-linguistics-news-update-for-oct-9-2006/#comment-923</guid>
		<description>Francesco,
This is not really a good place to announce new software/service. Nobody will find it.

You will have much better luck at any of the repositories listed at &lt;a href=&quot;http://aclweb.org/aclwiki/index.php?title=Software&quot; rel=&quot;nofollow&quot;&gt;ACL Wiki&lt;/a&gt;</description>
		<content:encoded><![CDATA[<p>Francesco,<br />
This is not really a good place to announce new software/service. Nobody will find it.</p>
<p>You will have much better luck at any of the repositories listed at <a href="http://aclweb.org/aclwiki/index.php?title=Software" rel="nofollow">ACL Wiki</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Francesco Sclano</title>
		<link>http://alwayslearning.wordpress.com/2006/10/09/computational-linguistics-news-update-for-oct-9-2006/#comment-922</link>
		<dc:creator>Francesco Sclano</dc:creator>
		<pubDate>Wed, 01 Nov 2006 15:01:20 +0000</pubDate>
		<guid isPermaLink="false">http://alwayslearning.wordpress.com/2006/10/09/computational-linguistics-news-update-for-oct-9-2006/#comment-922</guid>
		<description>Hi everybody,
TermExtractor, my master thesis, is online at the address http://lcl2.di.uniroma1.it !!!

TermExtractor is a software package for automatic
building, validation and maintenance of glossaries in
english language.

TermExtractor extracts terminology consensually
referred in a specific application domain. The package
takes as input a corpus of domain documents, parses
the documents, and extracts a list of &quot;syntactically
plausible&quot; terms (e.g. compounds, adjective-nouns,
etc.). Documents parsing assigns a greater importance
to terms with text layouts (title, bold, italic,
underlined, etc.). Two entropy-based measures, called
Domain Relevance and Domain Consensus, are then used.
Domain Consensus is used to select only the terms
which are consensually referred throughout the corpus
documents. Domain Relevance to select only the terms
which are relevant to the domain of interest, Domain
Relevance is computed with reference to a set of
contrastive terminologies from different domains.
Finally, extracted terms are further filtered using
Lexical Cohesion, that measures the degree of
association of all the words in a terminological
string. Accept files formats are: txt, pdf, ps, dvi,
tex, doc, rtf, ppt, xls, xml, html/htm, chm, wpd and
also zip archives.</description>
		<content:encoded><![CDATA[<p>Hi everybody,<br />
TermExtractor, my master thesis, is online at the address <a href="http://lcl2.di.uniroma1.it" rel="nofollow">http://lcl2.di.uniroma1.it</a> !!!</p>
<p>TermExtractor is a software package for automatic<br />
building, validation and maintenance of glossaries in<br />
english language.</p>
<p>TermExtractor extracts terminology consensually<br />
referred in a specific application domain. The package<br />
takes as input a corpus of domain documents, parses<br />
the documents, and extracts a list of &#8220;syntactically<br />
plausible&#8221; terms (e.g. compounds, adjective-nouns,<br />
etc.). Documents parsing assigns a greater importance<br />
to terms with text layouts (title, bold, italic,<br />
underlined, etc.). Two entropy-based measures, called<br />
Domain Relevance and Domain Consensus, are then used.<br />
Domain Consensus is used to select only the terms<br />
which are consensually referred throughout the corpus<br />
documents. Domain Relevance to select only the terms<br />
which are relevant to the domain of interest, Domain<br />
Relevance is computed with reference to a set of<br />
contrastive terminologies from different domains.<br />
Finally, extracted terms are further filtered using<br />
Lexical Cohesion, that measures the degree of<br />
association of all the words in a terminological<br />
string. Accept files formats are: txt, pdf, ps, dvi,<br />
tex, doc, rtf, ppt, xls, xml, html/htm, chm, wpd and<br />
also zip archives.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
