Plucene::Analysis::Analyzer.3pm

Langue: en

Version: 2008-03-01 (debian - 07/07/09)

Section: 3 (Bibliothèques de fonctions)

NAME

Plucene::Analysis::Analyzer - base class for Analyzers

SYNOPSIS

         my $analyzer = Plucene::Analysis::Analyzer::Subclass->new;
 
 

DESCRIPTION

This is an abstract base class of Analyzers.

An Analyzer builds TokenStreams, which analyze text. It thus represents a policy for extracting index terms from text.

Typical implementations first build a Tokenizer, which breaks the stream of characters from the Reader into raw Tokens. One or more TokenFilters may then be applied to the output of the Tokenizer.

METHODS


new

         my $analyzer = Plucene::Analysis::Analyzer::Subclass->new;
 
 

tokenstream

This must be defined in a subclass