Package nltk_lite :: Package corpora :: Module senseval
[hide private]
[frames] | no frames]

Module senseval

source code

Read from the Senseval 2 Corpus.

SENSEVAL [http://www.senseval.org/] Evaluation exercises for Word Sense Disambiguation. Organized by ACL-SIGLEX [http://www.siglex.org/]

Prepared by Ted Pedersen <tpederse@umn.edu>, University of Minnesota, http://www.d.umn.edu/~tpederse/data.html Distributed with permission.

The NLTK version of the Senseval 2 files uses well-formed XML. Each instance of the ambiguous words "hard", "interest", "line", and "serve" is tagged with a sense identifier, and supplied with context.

Classes [hide private]
  SensevalParser
Functions [hide private]
 
_to_ascii(text) source code
iterator over tuple
raw(files=['hard', 'interest', 'line', 'serve']) source code
 
demo() source code
Variables [hide private]
  items = ['hard', 'interest', 'line', 'serve']
Function Details [hide private]

raw(files=['hard', 'interest', 'line', 'serve'])

source code 
Parameters:
  • files (string or tuple(string)) - One or more Senseval files to be processed
Returns: iterator over tuple