| | |
| | |
| |
|
| | <HTML> |
| | <HEAD> |
| | <TITLE>CNTLIST(5WN) manual page</TITLE> |
| | </HEAD> |
| | <BODY> |
| | <A HREF="#toc">Table of Contents</A><P> |
| | |
| | <H2><A NAME="sect0" HREF="#toc0">NAME </A></H2> |
| | cntlist - file listing number of times each tagged sense occurs |
| | in a semantic concordance, sorted most to least frequently tagged <P> |
| | cntlist.rev |
| | - file listing number of times each tagged sense occurs in a semantic concordance, |
| | sorted by sense key |
| | <H2><A NAME="sect1" HREF="#toc1">DESCRIPTION </A></H2> |
| | A cntlist file for a semantic concordance |
| | lists the number of times each semantically tagged sense occurs in the |
| | concordance and its sense number in the WordNet database. Each line in |
| | the file corresponds to a sense in the WordNet database to which at least |
| | one semantic tag points. Only senses that are tagged in a concordance |
| | are in the concordance's cntlist file. <P> |
| | |
| | <H3><A NAME="sect2" HREF="#toc2">WordNet Database <I>cntlist </I> File |
| | </A></H3> |
| | In the WordNet database, words are assigned sense numbers based on frequency |
| | of use in semantically tagged corpora. The cntlist file used by <A HREF="grind.1WN.html"><B>grind</B>(1WN)</A> |
| | |
| | to build the WordNet database and assign the sense numbers is a union |
| | of the cntlist files from the various semantic concordances that were |
| | formerly released by Princeton University. This combined cntlist file |
| | is provided with the WordNet package and is found in the <B>WNSEARCHDIR </B> |
| | directory. <P> |
| | The <I>cntlist.rev </I> file is used at run-time by the WordNet library |
| | code and browser interfaces to print in the output display the number |
| | of times each sense has been tagged. |
| | <H3><A NAME="sect3" HREF="#toc3">File Format </A></H3> |
| | Each line in a cntlist |
| | file contains information for one sense. The file is ordered from most |
| | to least frequently tagged sense. The fields are separated by one space, |
| | and each line is terminated with a newline character. Senses having the |
| | same <I>tag_cnt </I> value are listed in reverse alphabetical order of the <I>lemma |
| | </I> field of the <I>sense_key </I>. <P> |
| | Each line in <B>cntlist </B> is of the form: <P> |
| | <blockquote><I>tag_cnt sense_key sense_number |
| | </I> </blockquote> |
| | <P> |
| | where <I>tag_cnt </I> is the decimal number of times the sense is tagged in |
| | the corresponding semantic concordance. <I>sense_key </I> is a WordNet sense |
| | encoding and <I>sense_number </I> is a WordNet sense number as described in <P> |
| | |
| | The <I>cntlist.rev </I> file contains the same fields described above, in the |
| | following order: <P> |
| | <blockquote><I>sense_key sense_number tag_cnt </I> </blockquote> |
| | <P> |
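| | The two layouts above can be read with a few lines of code. The following |
| | is a minimal sketch, not part of the WordNet distribution: the names |
| | <I>SenseCount</I>, <I>parse_cntlist_line</I> and <I>parse_cntlist_rev_line</I> are |
| | hypothetical, and the sample sense key and counts in the usage lines are |
| | made up for illustration. It assumes the sense key itself contains no spaces. <P> |

```python
from typing import NamedTuple


class SenseCount(NamedTuple):
    """One record from a cntlist or cntlist.rev file (hypothetical helper type)."""
    tag_cnt: int        # times the sense is tagged in the concordance
    sense_key: str      # WordNet sense encoding
    sense_number: int   # WordNet sense number


def parse_cntlist_line(line: str) -> SenseCount:
    """Parse a cntlist line: tag_cnt sense_key sense_number."""
    tag_cnt, sense_key, sense_number = line.split()
    return SenseCount(int(tag_cnt), sense_key, int(sense_number))


def parse_cntlist_rev_line(line: str) -> SenseCount:
    """Parse a cntlist.rev line: sense_key sense_number tag_cnt."""
    sense_key, sense_number, tag_cnt = line.split()
    return SenseCount(int(tag_cnt), sense_key, int(sense_number))


# Illustrative input only; the key and numbers are invented.
record = parse_cntlist_line("12 example%1:09:00:: 1\n")
same_record = parse_cntlist_rev_line("example%1:09:00:: 1 12\n")
```

| | Both functions return the fields in the same order, so records read from |
| | either file can be compared directly. <P> |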
| | |
| | <H2><A NAME="sect4" HREF="#toc4">NOTES </A></H2> |
| | Princeton |
| | no longer maintains or releases the Semantic Concordance files. The <I>cntlist |
| | </I> file used to order the senses in WordNet 3.0 was generated from the Semantic |
| | Concordance files as they stood when last updated in 2001. The order |
| | of senses usually reflects what the user would expect; however, sense |
| | ordering is now less reliable than in prior releases and should not be |
| | construed as an accurate indicator of frequency of use. |
| | <H2><A NAME="sect5" HREF="#toc5">ENVIRONMENT VARIABLES (UNIX) </A></H2> |
| |
|
| | <DL> |
| |
|
| | <DT><B>WNHOME</B> </DT> |
| | <DD>Base directory for WordNet. |
| | Default is <B>/usr/local/WordNet-3.0 </B>. </DD> |
| |
|
| | <DT><B>WNSEARCHDIR</B> </DT> |
| | <DD>Directory in which the |
| | WordNet database has been installed. Default is <B>WNHOME/dict </B>. </DD> |
| | </DL> |
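| | The defaults listed above can be resolved as follows. This is only a sketch |
| | of the lookup order implied by this page, not the WordNet library's own code; |
| | the function name <I>wordnet_search_dir</I> and its <I>env</I> parameter are |
| | hypothetical. <P> |

```python
import os
import posixpath


def wordnet_search_dir(env=None):
    """Return the database directory using the defaults documented on this page.

    env is a mapping of environment variables; os.environ is used when omitted.
    """
    env = os.environ if env is None else env
    # WNHOME falls back to the documented default base directory.
    wnhome = env.get("WNHOME", "/usr/local/WordNet-3.0")
    # WNSEARCHDIR falls back to WNHOME/dict.
    return env.get("WNSEARCHDIR", posixpath.join(wnhome, "dict"))


# Assumed location of the combined file, based on the statement that it is
# found in the WNSEARCHDIR directory.
cntlist_path = posixpath.join(wordnet_search_dir({}), "cntlist")
```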
| | |
| | <H2><A NAME="sect6" HREF="#toc6">REGISTRY |
| | (WINDOWS) </A></H2> |
| |
|
| | <DL> |
| |
|
| | <DT><B>HKEY_LOCAL_MACHINE\SOFTWARE\WordNet\3.0\WNHome</B> </DT> |
| | <DD>Base directory for |
| | WordNet. Default is <B>C:\Program Files\WordNet\3.0 </B>. </DD> |
| |
|
| | <DT><B>HKEY_CURRENT_USER\SOFTWARE\WordNet\3.0\wnres</B> |
| | </DT> |
| | <DD>User's default browser options. </DD> |
| | </DL> |
| | |
| | <H2><A NAME="sect7" HREF="#toc7">FILES </A></H2> |
| |
|
| | <DL> |
| |
|
| | <DT><B>cntlist, cntlist.rev</B> </DT> |
| | <DD>Combined semantic concordance <B>cntlist </B> files, used to assign |
| | sense numbers in the WordNet database. </DD> |
| | </DL> |
| | |
| | <H2><A NAME="sect8" HREF="#toc8">SEE ALSO </A></H2> |
| | <A HREF="grind.1WN.html"><B>grind</B>(1WN)</A> |
| | , <A HREF="wnintro.5WN.html"><B>wnintro</B>(5WN)</A> |
| | , <A HREF="senseidx.5WN.html"><B>senseidx</B>(5WN)</A> |
| | . <P> |
| |
|
| | <HR><P> |
| | <A NAME="toc"><B>Table of Contents</B></A><P> |
| | <UL> |
| | <LI><A NAME="toc0" HREF="#sect0">NAME</A></LI> |
| | <LI><A NAME="toc1" HREF="#sect1">DESCRIPTION</A></LI> |
| | <UL> |
| | <LI><A NAME="toc2" HREF="#sect2">WordNet Database cntlist File</A></LI> |
| | <LI><A NAME="toc3" HREF="#sect3">File Format</A></LI> |
| | </UL> |
| | <LI><A NAME="toc4" HREF="#sect4">NOTES</A></LI> |
| | <LI><A NAME="toc5" HREF="#sect5">ENVIRONMENT VARIABLES (UNIX)</A></LI> |
| | <LI><A NAME="toc6" HREF="#sect6">REGISTRY (WINDOWS)</A></LI> |
| | <LI><A NAME="toc7" HREF="#sect7">FILES</A></LI> |
| | <LI><A NAME="toc8" HREF="#sect8">SEE ALSO</A></LI> |
| | </UL> |
| | </BODY></HTML> |
| |
|