wok-current diff perl-html-parser/description.txt @ rev 24942
BootProg: clear cmdline
author | Pascal Bellard <pascal.bellard@slitaz.org> |
---|---|
date | Wed Apr 20 15:10:50 2022 +0000 (2022-04-20) |
--- /dev/null Thu Jan 01 00:00:00 1970 +0000
+++ b/perl-html-parser/description.txt Wed Apr 20 15:10:50 2022 +0000
@@ -0,0 +1,22 @@
+Objects of the HTML::Parser class will recognize markup and
+separate it from plain text (alias data content) in HTML
+documents.
+As different kinds of markup and text are recognized, the
+corresponding event handlers are invoked.
+
+HTML::Parser is not a generic SGML parser. We have tried to
+make it able to deal with the HTML that is actually "out there",
+and it normally parses as closely as possible to the way the
+popular web browsers do it instead of strictly following one
+of the many HTML specifications from W3C.
+Where there is disagreement, there is often an option that
+you can enable to get the official behaviour.
+
+The document to be parsed may be supplied in arbitrary chunks.
+This makes on-the-fly parsing as documents are received from
+the network possible.
+
+If event driven parsing does not feel right for your application,
+you might want to use HTML::PullParser.
+This is an HTML::Parser subclass that allows a more conventional
+program structure.
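For readers unfamiliar with the module, the event-driven, chunked parsing that the added description mentions can be sketched roughly as below. This is an illustration only and is not part of the changeset: the `@events` array, the handler bodies, and the sample chunks are hypothetical, and the script assumes the HTML::Parser module from CPAN is installed.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use HTML::Parser ();

# Hypothetical collector for the events the parser reports.
my @events;

# Register one handler per event type (version 3 API).
# The second element of each pair is the argspec: what gets passed to the handler.
my $parser = HTML::Parser->new(
    api_version => 3,
    start_h     => [ sub { push @events, "start:$_[0]" }, 'tagname' ],
    end_h       => [ sub { push @events, "end:$_[0]" },   'tagname' ],
    text_h      => [ sub { push @events, "text:$_[0]" },  'dtext'   ],
);

# Ask for each run of text to be reported as a single event,
# even when it straddles a chunk boundary.
$parser->unbroken_text(1);

# Feed the document in arbitrary pieces, as if it arrived from the network.
for my $chunk ('<p>Hel', 'lo <b>wor', 'ld</b></p>') {
    $parser->parse($chunk);
}

# Signal end of document so any buffered text is flushed.
$parser->eof;

print "$_\n" for @events;
```

Because the handlers fire as soon as each tag or text run is recognized, the caller never has to hold the whole document in memory. HTML::PullParser, named in the description, turns this around and lets the program request tokens one at a time.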