HTML::TreeBuilder - Parser that builds a HTML syntax tree


SYNOPSIS

        $h = new HTML::TreeBuilder;
        $h->parse($document);
        #...

        print $h->as_HTML;  # or any other HTML::Element method



DESCRIPTION

       This is a parser that builds (and actually itself is) a
       HTML syntax tree.

       Objects of this class inherit the methods of both
       HTML::Parser and HTML::Element.  After parsing has taken
       place it can be regarded as the syntax tree itself.

       The following method all control how parsing takes place.
       You can set the attributes by passing a TRUE or FALSE
       value as argument.

       $p->implicit_tags
           Setting this attribute to true will instruct the
           parser to try to deduce implicit elements and implicit
           end tags.  If it is false you get a parse tree that
           just reflects the text as it stands.  Might be useful
           for quick & dirty parsing.  Default is true.

           Implicit elements have the implicit() attribute set.

       $p->ignore_unknown
           This attribute controls whether unknown tags should be
           represented as elements in the parse tree.  Default is
           true.

       $p->ignore_text
           Do not represent the text content of elements.  This
           saves space if all you want is to examine the
           structure of the document.  Default is false.

       $p->warn
           Call warn() with an appropriate message for syntax
           errors.  Default is false.


SEE ALSO

       the HTML::Parser manpage, the HTML::Element manpage


COPYRIGHT

       Copyright 1995-1996 Gisle Aas. All rights reserved.

       This library is free software; you can redistribute it
       Gisle Aas <aas@sn.no>