wok diff perl-html-parser/description.txt @ rev 25037

Up glza (0.11.4)
author Pascal Bellard <pascal.bellard@slitaz.org>
date Sat May 21 21:38:29 2022 +0000 (2022-05-21)
parents
children
line diff
     1.1 --- /dev/null	Thu Jan 01 00:00:00 1970 +0000
     1.2 +++ b/perl-html-parser/description.txt	Sat May 21 21:38:29 2022 +0000
     1.3 @@ -0,0 +1,22 @@
     1.4 +Objects of the HTML::Parser class will recognize markup and
     1.5 +separate it from plain text (alias data content) in HTML
     1.6 +documents.
     1.7 +As different kinds of markup and text are recognized, the
     1.8 +corresponding event handlers are invoked.
     1.9 +
    1.10 +HTML::Parser is not a generic SGML parser. We have tried to
    1.11 +make it able to deal with the HTML that is actually "out there",
    1.12 +and it normally parses as closely as possible to the way the
    1.13 +popular web browsers do it instead of strictly following one
    1.14 +of the many HTML specifications from W3C.
    1.15 +Where there is disagreement, there is often an option that
    1.16 +you can enable to get the official behaviour.
    1.17 +
    1.18 +The document to be parsed may be supplied in arbitrary chunks.
    1.19 +This makes on-the-fly parsing as documents are received from
    1.20 +the network possible.
    1.21 +
    1.22 +If event driven parsing does not feel right for your application,
    1.23 +you might want to use HTML::PullParser.
    1.24 +This is an HTML::Parser subclass that allows a more conventional
    1.25 +program structure.