Encode::Detect::Detector.3pm

Langue: en

Version: 2009-04-15 (debian - 07/07/09)

Section: 3 (Bibliothèques de fonctions)

NAME

Encode::Detect::Detector - Detects the encoding of data

SYNOPSIS

   use Encode::Detect::Detector;
   my $charset = detect($octets);
 
   my $d = new Encode::Detect::Detector;
   $d->handle($octets);
   $d->handle($more_octets);
   $d->end;
   my $charset = $d->getresult;
 
 

DESCRIPTION

This module provides an interface to Mozilla's universal charset detector, which detects the charset used to encode data.

METHODS

$charset = Encode::Detect::Detector->detect($octets)

Detect the charset used to encode the data in $octets and return the charset's name. Returns undef if the charset cannot be determined with sufficient confidence.

$d = Encode::Detect::Detector->new()

Creates a new "Encode::Detect::Detector" object and returns it.

$d->handle($octets)

Provides an additional chunk of data to be examined by the detector. May be called multiple times.

Returns zero on success, nonzero if a memory allocation failed.

$d->eof

Informs the detector that there is no more data to be examined. In many cases, this is necessary in order for the detector to make a decision on the charset.

$d->reset

Resets the detector to its initial state.

$d->getresult

Returns the name of the detected charset or "undef" if no charset has (yet) been decided upon. May be called at any time.

SEE ALSO

Encode::Detect

AUTHOR

John Gardiner Myers <jgmyers@proofpoint.com>

SUPPORT

For help and thank you notes, e-mail the author directly. To report a bug, submit a patch, or add to the wishlist please visit the CPAN bug manager at: http://rt.cpan.org