tr_tagcontentgrep

Language: en

Version: 2008-01-15 (mandriva - 01/05/08)

Section: 1 (User commands)

NAME

tr_tagcontentgrep -- grep for an XML/SGML/HTML tag

SYNOPSIS

  tr_tagcontentgrep [-hl] regexp-pattern [file ...]
 
 

DESCRIPTION

tr_tagcontentgrep opens all files provided on the command line and searches for the given pattern in the tags. The search is not case sensitive.

tr_tagcontentgrep is part of the HTML::TagReader package, but it also serves as an example that 'reading by tag' is possible without HTML::TagReader: it uses plain Perl and sets the $/ variable (the input record separator). Working without HTML::TagReader does, however, cause problems with faulty HTML code in which a single '<' character appears somewhere in the text.
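The 'reading by tag' idea can be sketched as follows. This is an illustration in Python, not the script's Perl code. It assumes the record separator is the '>' character, which is one plausible use of $/ here; the actual script may differ. The function name iter_tags and the sample strings are hypothetical.

```python
def iter_tags(text):
    """Yield tag-like strings by reading records that end at '>'."""
    records = text.split(">")
    # The last piece has no closing '>' and so cannot hold a complete tag.
    for record in records[:-1]:
        start = record.find("<")
        if start != -1:
            # Everything from the first '<' up to the closing '>' is
            # treated as one tag, whether or not it really is one.
            yield record[start:] + ">"


clean = "<p>hello <b>world</b></p>"
faulty = "<p>1 < 2 is <b>true</b></p>"

print(list(iter_tags(clean)))
print(list(iter_tags(faulty)))
```

In the second sample the stray '<' is taken as the start of a tag, so the sketch reports '< 2 is <b>' as a single tag. This is the kind of problem that faulty HTML causes for this simple approach.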

Whitespace in the tags is reduced to at most one space, so you can search for "a href" even if the original tag had multiple spaces between "a" and "href".
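A minimal Python sketch of this normalization combined with the case-insensitive search might look like the following. The helper names normalize_tag and tag_matches are hypothetical, and treating newlines and tabs as whitespace to collapse is an assumption; the script itself is written in Perl.

```python
import re


def normalize_tag(tag):
    """Collapse each run of whitespace (assumed to include newlines) to one space."""
    return re.sub(r"\s+", " ", tag)


def tag_matches(pattern, tag):
    """Case-insensitive regexp search against the normalized tag."""
    return re.search(pattern, normalize_tag(tag), re.IGNORECASE) is not None


print(normalize_tag('<A    HREF="x.html">'))
print(tag_matches("a href", '<A \n   HREF="x.html">'))
print(tag_matches("a href", "<img src='a.gif'>"))
```

Normalizing before matching is what lets a pattern with a single space match a tag whose attributes are spread over several spaces or lines.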

OPTIONS

-h this help

-l list filename and line number

EXAMPLE

tr_tagcontentgrep -l img file.html

would print something like:

index.html:53: <IMG src="../images/transpix.gif" alt="">
index.html:257: <IMG SRC="../Logo.gif" width=128 height=53>
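The "filename:line: tag" format of the -l option can be imitated with a short Python sketch. It finds tags with a regular expression rather than the $/ technique the script uses, and it assumes the reported line number is the line on which the tag begins; the man page does not say which line is reported. The function name grep_tags and the sample input are hypothetical.

```python
import re


def grep_tags(pattern, text, filename):
    """Return 'file:line: tag' strings for tags that match pattern."""
    hits = []
    for m in re.finditer(r"<[^>]*>", text):
        # Reduce whitespace runs to one space before matching.
        tag = re.sub(r"\s+", " ", m.group(0))
        if re.search(pattern, tag, re.IGNORECASE):
            # Assumption: report the line on which the tag starts.
            line = text.count("\n", 0, m.start()) + 1
            hits.append(f"{filename}:{line}: {tag}")
    return hits


sample = '<html>\n<body>\n<IMG  SRC="../Logo.gif"\n width=128 height=53>\n</body>\n'
for hit in grep_tags("img", sample, "index.html"):
    print(hit)
```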

AUTHOR

tr_tagcontentgrep is part of the HTML::TagReader package and was written by Guido Socher [guido(at)linuxfocus.org]