COPYRIGHT000664001750001750 746214116150214 14460 0ustar00taitai000000000000HTML-HTML5-Parser-0.992Format: http://www.debian.org/doc/packaging-manuals/copyright-format/1.0/ Upstream-Name: HTML-HTML5-Parser Upstream-Contact: Toby Inkster Source: https://metacpan.org/release/HTML-HTML5-Parser Files: INSTALL LICENSE NEWS TODO examples/charsets.pl examples/html/iso-8859-15.html examples/html/utf-16.html examples/html/utf-8.html t/data/rt-96399-1.html t/data/rt-96399-2.html t/html5lib-fail/domjs-unsafe.dat t/html5lib-fail/entities02.dat t/html5lib-fail/plain-text-unsafe.dat t/html5lib-fail/tests1.dat t/html5lib-fail/tests11.dat t/html5lib-fail/tests14.dat t/html5lib-fail/tests16.dat t/html5lib-fail/tests21.dat t/html5lib-fail/tests22.dat t/html5lib-fail/tests23.dat t/html5lib-fail/tests5.dat t/html5lib-fail/tests6.dat t/html5lib-fail/tests9.dat t/html5lib-fail/tests_innerHTML_1.dat t/html5lib-pass/adoption01.dat t/html5lib-pass/adoption02.dat t/html5lib-pass/comments01.dat t/html5lib-pass/doctype01.dat t/html5lib-pass/entities01.dat t/html5lib-pass/html5test-com.dat t/html5lib-pass/inbody01.dat t/html5lib-pass/isindex.dat t/html5lib-pass/pending-spec-changes-plain-text-unsafe.dat t/html5lib-pass/pending-spec-changes.dat t/html5lib-pass/scriptdata01.dat t/html5lib-pass/scripted/adoption01.dat t/html5lib-pass/scripted/ark.dat t/html5lib-pass/scripted/webkit01.dat t/html5lib-pass/tables01.dat t/html5lib-pass/tests10.dat t/html5lib-pass/tests12.dat t/html5lib-pass/tests15.dat t/html5lib-pass/tests17.dat t/html5lib-pass/tests18.dat t/html5lib-pass/tests19.dat t/html5lib-pass/tests2.dat t/html5lib-pass/tests20.dat t/html5lib-pass/tests24.dat t/html5lib-pass/tests25.dat t/html5lib-pass/tests26.dat t/html5lib-pass/tests3.dat t/html5lib-pass/tests4.dat t/html5lib-pass/tests7.dat t/html5lib-pass/tests8.dat t/html5lib-pass/tricky01.dat t/html5lib-pass/webkit01.dat t/html5lib-pass/webkit02.dat Copyright: Unknown License: Unknown Files: Changes META.json META.yml dist.ini doap.ttl lib/HTML/HTML5/Parser/Charset/DecodeHandle.pm lib/HTML/HTML5/Parser/Charset/Info.pm lib/HTML/HTML5/Parser/Charset/UnicodeChecker.pm lib/HTML/HTML5/Parser/Charset/UniversalCharDet.pm lib/HTML/HTML5/Parser/Charset/WebLatin1.pm lib/HTML/HTML5/Parser/Charset/WebThai.pm lib/HTML/HTML5/Parser/TagSoupParser.pm lib/HTML/HTML5/Parser/Tokenizer.pm Copyright: Copyright 2021 Toby Inkster. License: GPL-1.0+ or Artistic-1.0 Files: lib/HTML/HTML5/Parser/UA.pm t/02html5.t t/03html4.t t/04fragments.t t/05origins.t t/06xlxdsln.t t/07ua.t t/08ua-lwp.t t/99html5lib.t t/rt-79019.t Copyright: Copyright (C) 2012 by Toby Inkster. License: GPL-1.0+ or Artistic-1.0 Files: COPYRIGHT CREDITS SIGNATURE Copyright: None License: public-domain Files: README lib/HTML/HTML5/Parser.pm Copyright: Copyright (C) 2007-2011 by Wakaba. Copyright (C) 2009-2012 by Toby Inkster. License: GPL-1.0+ or Artistic-1.0 Files: bin/html2xhtml bin/html5debug Copyright: Copyright 2013 Toby Inkster. License: GPL-1.0+ or Artistic-1.0 Files: t/01basic.t Copyright: Copyright (C) 2010-2012 by Toby Inkster. License: GPL-1.0+ or Artistic-1.0 Files: lib/HTML/HTML5/Parser/Error.pm Copyright: Copyright (C) 2011-2012 by Toby Inkster. License: GPL-1.0+ or Artistic-1.0 Files: Makefile.PL Copyright: Copyright 2020 Toby Inkster. License: GPL-1.0+ or Artistic-1.0 Files: t/lib/Test/HTTP/Server.pm Copyright: Copyright 2012 Toby Inkster. License: GPL-1.0+ or Artistic-1.0 Files: t/rt-96399.t Copyright: Copyright (C) 2017 by Gregory Todd Williams. License: GPL-1.0+ or Artistic-1.0 License: Artistic-1.0 This software is Copyright (c) 2021 by the copyright holder(s). This is free software, licensed under: The Artistic License 1.0 License: GPL-1.0 This software is Copyright (c) 2021 by the copyright holder(s). This is free software, licensed under: The GNU General Public License, Version 1, February 1989 CREDITS000664001750001750 52114116150214 14152 0ustar00taitai000000000000HTML-HTML5-Parser-0.992Maintainer: - Toby Inkster Contributor: - AUDREYT - Alex Vandiver - Dorian Taylor - Gregory Williams - Kjetil Kjernsmo - SHELLAC - TOBYINK - Wakaba Thanks: - - - Fitz Elliott - GWILLIAMS - Luben Karavelov Changes000664001750001750 1325614116150214 14476 0ustar00taitai000000000000HTML-HTML5-Parser-0.992HTML-HTML5-Parser ================= Created: 2009-11-26 Home page: Bug tracker: Maintainer: Toby Inkster 0.992 2021-09-08 [ Packaging ] - Point issue tracker at Github. - Point repo at Github. 0.991 2021-09-08 [ Bug Fixes ] - Better regexp for differentiating between filenames and URIs. - Don't assume @INC contains . - Don't install a global __WARN__ handler in BEGIN. Alex Vandiver++ - Fix encoding issues. Gregory Williams++ Kjetil Kjernsmo++ [ Packaging ] - Modernize packaging in general. 0.301 2013-07-08 [ Bug Fixes ] - Fix t/99html5lib.t still using Mo in one place. Fitz Elliott++ 0.300 2013-07-06 [ Bug Fixes ] - Fix many major memory leaks. Dorian Taylor++ [ Packaging ] - t/99html5lib.t now uses Moo instead of Mo. [ Other ] - Memory leak fix necessitated some API changes; in particular some methods which were available as class methods are now object methods only. 0.208 2012-12-06 [ Bug Fixes ] - If two tags were in the same file, attributes on the second element could cause crashes. Fixes RT#79019. Luben Karavelov++ - Minor fix re LWP-independence. 0.206 2012-06-30 [ Bug Fixes ] - Disable t/07ua.t and t/08ua-lwp.t on Windows because Test::HTTP::Server doesn't work on that platform. - Remove 'use 5.010' from t/07ua.t. 0.204 2012-06-29 [ Bug Fixes ] - Stop using defined-or operator in HTML::HTML5::Parser::UA, as it doesn't work in Perl 5.8. 0.202 2012-06-27 - Drop dependency on LWP::UserAgent in favour of HTTP::Tiny which is part of core since 5.14. 0.200 2012-06-13 - Drop dependency on Error.pm; use Try::Tiny instead. - Drop dependency on HTML::Encoding; use IO::HTML instead. - Passing a couple more of the html5lib test suite files. - Suppress warnings about invalid Unicode code points. 0.110 2012-03-20 [ Bug Fixes ] - Removed a use of smart match which was breaking Perl 5.8.x compatibility. Fixes RT#75809. [ Other ] - Comment nodes should now keep their line numbers and column numbers. - Text nodes should keep line numbers and column numbers too. - use XML::LibXML::Devel::SetLineNumber if it is available. 0.109 2012-03-15 Yes, they are come, but they are not past. [ Bug Fixes ] - Fix several fatal errors on infrequently used code paths in TagSoupParser.pm. - Use correct case for MathML's definitionURL attribute. [ Packaging ] - Bundle test cases from html5lib. [ Other ] - Added: Nodes now have an implictness flag (returned by source_line called in a list context). - Added: Provide dtd_element method to get more information about the DTD. - Added: Provide parse_balanced_chunk to parse HTML fragments. - Make source_line method work more reliably. This requires XML::LibXML::Devel, and thus a fairly recent version of XML-LibXML. 0.108 2012-01-18 [ Bug Fixes ] - Crashed when generating an error message about some missing closing tags. audreyt++ [ Documentation ] - Copyright 2012. [ Other ] - Updated: Provide load_xml and load_html methods for compatibility with XML::LibXML 1.70 and newer. 0.107 2011-10-20 [ Bug Fixes ] - parse_file wasn't accepting relative file names [ Other ] - Added: html2xhtml can output to a file. - Added: html2xhtml now reads from STDIN by default. 0.106 2011-10-10 [ Bug Fixes ] - Tokenizer.pm was still trying to require NamedEntityList.pm. 0.105 2011-10-07 [ Packaging ] - Module::Package::RDF. [ Other ] - Added: Bundle 'html5debug' script. - HTML::HTML5::Parser::Error overloads stringification. - use HTML::HTML5::Entities 0.104 2011-09-22 - Added: Some error handling stuff. - Added: Support element. - Updated: Catch up to revision d81fcb920a1a3c351149cd66a64bf1b8ae14a172 (2011-08-21) upstream. 0.103 2011-02-09 [ Bug Fixes ] - TagSoupParser.pm called a method that is renamed between this distribution and upstream using its upstream name. [ Documentation ] - Copyright 2011. 0.102 2011-01-19 [ Bug Fixes ] - Fix source_line method. [ Other ] - Added: Allow element to appear in if document has an HTML4 doctype. This is a willful violation of the HTML5 parsing algorithm. (The may have elements as children, as well as any children that would normally be allowed in the of the document, such as ; any other content is treated as the beginning of the , and thus closes and . That's slightly looser than the HTML 4 spec which says only should be used, but stricter than the HTML 4 DTD which allows pretty much anything in there!) - Added: Support
element. - Added: Support element. - Updated: Catch up to revision f2c921a886ab0b3dfb8d21b82525e98a4a921ad4 (2010-10-11) upstream. 0.101 2010-06-30 [ Bug Fixes ] - UTF-8 fix. 0.100 2010-06-23 [ Bug Fixes ] - Minor bugfixes. 0.04 2010-04-21 - Updated: Catch up to revision cf2c0df8a6dfb50fee923dfb21b14c83f282ccdc (2010-02-28) upstream. 0.03 2010-01-15 [ Bug Fixes ] - Module didn't use URI::file properly. shellac++ [ Documentation ] - Copyright 2010. [ Packaging ] - Upgrade distribution to my new packaging regime (auto-generated changelogs, etc) 0.02 2009-12-16 [ Packaging ] - Bundle the html2xhtml tool. [ Other ] - Replace Inline::Python encoding detection with weaker, but native Perl HTML::Encoding package. 0.01 2009-12-03 Original version 0.00_01 2009-12-01 Developer preview INSTALL000664001750001750 172614116150214 14213 0ustar00taitai000000000000HTML-HTML5-Parser-0.992 Installing HTML-HTML5-Parser should be straightforward. INSTALLATION WITH CPANMINUS If you have cpanm, you only need one line: % cpanm HTML::HTML5::Parser If you are installing into a system-wide directory, you may need to pass the "-S" flag to cpanm, which uses sudo to install the module: % cpanm -S HTML::HTML5::Parser INSTALLATION WITH THE CPAN SHELL Alternatively, if your CPAN shell is set up, you should just be able to do: % cpan HTML::HTML5::Parser MANUAL INSTALLATION As a last resort, you can manually install it. Download the tarball and unpack it. Consult the file META.json for a list of pre-requisites. Install these first. To build HTML-HTML5-Parser: % perl Makefile.PL % make && make test Then install it: % make install If you are installing into a system-wide directory, you may need to run: % sudo make install LICENSE000664001750001750 4365514116150214 14216 0ustar00taitai000000000000HTML-HTML5-Parser-0.992This software is copyright (c) 2021 by Toby Inkster. This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself. Terms of the Perl programming language system itself a) the GNU General Public License as published by the Free Software Foundation; either version 1, or (at your option) any later version, or b) the "Artistic License" --- The GNU General Public License, Version 1, February 1989 --- This software is Copyright (c) 2021 by Toby Inkster. This is free software, licensed under: The GNU General Public License, Version 1, February 1989 GNU GENERAL PUBLIC LICENSE Version 1, February 1989 Copyright (C) 1989 Free Software Foundation, Inc. 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA Everyone is permitted to copy and distribute verbatim copies of this license document, but changing it is not allowed. Preamble The license agreements of most software companies try to keep users at the mercy of those companies. By contrast, our General Public License is intended to guarantee your freedom to share and change free software--to make sure the software is free for all its users. The General Public License applies to the Free Software Foundation's software and to any other program whose authors commit to using it. You can use it for your programs, too. When we speak of free software, we are referring to freedom, not price. Specifically, the General Public License is designed to make sure that you have the freedom to give away or sell copies of free software, that you receive source code or can get it if you want it, that you can change the software or use pieces of it in new free programs; and that you know you can do these things. To protect your rights, we need to make restrictions that forbid anyone to deny you these rights or to ask you to surrender the rights. These restrictions translate to certain responsibilities for you if you distribute copies of the software, or if you modify it. For example, if you distribute copies of a such a program, whether gratis or for a fee, you must give the recipients all the rights that you have. You must make sure that they, too, receive or can get the source code. And you must tell them their rights. We protect your rights with two steps: (1) copyright the software, and (2) offer you this license which gives you legal permission to copy, distribute and/or modify the software. Also, for each author's protection and ours, we want to make certain that everyone understands that there is no warranty for this free software. If the software is modified by someone else and passed on, we want its recipients to know that what they have is not the original, so that any problems introduced by others will not reflect on the original authors' reputations. The precise terms and conditions for copying, distribution and modification follow. GNU GENERAL PUBLIC LICENSE TERMS AND CONDITIONS FOR COPYING, DISTRIBUTION AND MODIFICATION 0. This License Agreement applies to any program or other work which contains a notice placed by the copyright holder saying it may be distributed under the terms of this General Public License. The "Program", below, refers to any such program or work, and a "work based on the Program" means either the Program or any work containing the Program or a portion of it, either verbatim or with modifications. Each licensee is addressed as "you". 1. You may copy and distribute verbatim copies of the Program's source code as you receive it, in any medium, provided that you conspicuously and appropriately publish on each copy an appropriate copyright notice and disclaimer of warranty; keep intact all the notices that refer to this General Public License and to the absence of any warranty; and give any other recipients of the Program a copy of this General Public License along with the Program. You may charge a fee for the physical act of transferring a copy. 2. You may modify your copy or copies of the Program or any portion of it, and copy and distribute such modifications under the terms of Paragraph 1 above, provided that you also do the following: a) cause the modified files to carry prominent notices stating that you changed the files and the date of any change; and b) cause the whole of any work that you distribute or publish, that in whole or in part contains the Program or any part thereof, either with or without modifications, to be licensed at no charge to all third parties under the terms of this General Public License (except that you may choose to grant warranty protection to some or all third parties, at your option). c) If the modified program normally reads commands interactively when run, you must cause it, when started running for such interactive use in the simplest and most usual way, to print or display an announcement including an appropriate copyright notice and a notice that there is no warranty (or else, saying that you provide a warranty) and that users may redistribute the program under these conditions, and telling the user how to view a copy of this General Public License. d) You may charge a fee for the physical act of transferring a copy, and you may at your option offer warranty protection in exchange for a fee. Mere aggregation of another independent work with the Program (or its derivative) on a volume of a storage or distribution medium does not bring the other work under the scope of these terms. 3. You may copy and distribute the Program (or a portion or derivative of it, under Paragraph 2) in object code or executable form under the terms of Paragraphs 1 and 2 above provided that you also do one of the following: a) accompany it with the complete corresponding machine-readable source code, which must be distributed under the terms of Paragraphs 1 and 2 above; or, b) accompany it with a written offer, valid for at least three years, to give any third party free (except for a nominal charge for the cost of distribution) a complete machine-readable copy of the corresponding source code, to be distributed under the terms of Paragraphs 1 and 2 above; or, c) accompany it with the information you received as to where the corresponding source code may be obtained. (This alternative is allowed only for noncommercial distribution and only if you received the program in object code or executable form alone.) Source code for a work means the preferred form of the work for making modifications to it. For an executable file, complete source code means all the source code for all modules it contains; but, as a special exception, it need not include source code for modules which are standard libraries that accompany the operating system on which the executable file runs, or for standard header files or definitions files that accompany that operating system. 4. You may not copy, modify, sublicense, distribute or transfer the Program except as expressly provided under this General Public License. Any attempt otherwise to copy, modify, sublicense, distribute or transfer the Program is void, and will automatically terminate your rights to use the Program under this License. However, parties who have received copies, or rights to use copies, from you under this General Public License will not have their licenses terminated so long as such parties remain in full compliance. 5. By copying, distributing or modifying the Program (or any work based on the Program) you indicate your acceptance of this license to do so, and all its terms and conditions. 6. Each time you redistribute the Program (or any work based on the Program), the recipient automatically receives a license from the original licensor to copy, distribute or modify the Program subject to these terms and conditions. You may not impose any further restrictions on the recipients' exercise of the rights granted herein. 7. The Free Software Foundation may publish revised and/or new versions of the General Public License from time to time. Such new versions will be similar in spirit to the present version, but may differ in detail to address new problems or concerns. Each version is given a distinguishing version number. If the Program specifies a version number of the license which applies to it and "any later version", you have the option of following the terms and conditions either of that version or of any later version published by the Free Software Foundation. If the Program does not specify a version number of the license, you may choose any version ever published by the Free Software Foundation. 8. If you wish to incorporate parts of the Program into other free programs whose distribution conditions are different, write to the author to ask for permission. For software which is copyrighted by the Free Software Foundation, write to the Free Software Foundation; we sometimes make exceptions for this. Our decision will be guided by the two goals of preserving the free status of all derivatives of our free software and of promoting the sharing and reuse of software generally. NO WARRANTY 9. BECAUSE THE PROGRAM IS LICENSED FREE OF CHARGE, THERE IS NO WARRANTY FOR THE PROGRAM, TO THE EXTENT PERMITTED BY APPLICABLE LAW. EXCEPT WHEN OTHERWISE STATED IN WRITING THE COPYRIGHT HOLDERS AND/OR OTHER PARTIES PROVIDE THE PROGRAM "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. THE ENTIRE RISK AS TO THE QUALITY AND PERFORMANCE OF THE PROGRAM IS WITH YOU. SHOULD THE PROGRAM PROVE DEFECTIVE, YOU ASSUME THE COST OF ALL NECESSARY SERVICING, REPAIR OR CORRECTION. 10. IN NO EVENT UNLESS REQUIRED BY APPLICABLE LAW OR AGREED TO IN WRITING WILL ANY COPYRIGHT HOLDER, OR ANY OTHER PARTY WHO MAY MODIFY AND/OR REDISTRIBUTE THE PROGRAM AS PERMITTED ABOVE, BE LIABLE TO YOU FOR DAMAGES, INCLUDING ANY GENERAL, SPECIAL, INCIDENTAL OR CONSEQUENTIAL DAMAGES ARISING OUT OF THE USE OR INABILITY TO USE THE PROGRAM (INCLUDING BUT NOT LIMITED TO LOSS OF DATA OR DATA BEING RENDERED INACCURATE OR LOSSES SUSTAINED BY YOU OR THIRD PARTIES OR A FAILURE OF THE PROGRAM TO OPERATE WITH ANY OTHER PROGRAMS), EVEN IF SUCH HOLDER OR OTHER PARTY HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. END OF TERMS AND CONDITIONS Appendix: How to Apply These Terms to Your New Programs If you develop a new program, and you want it to be of the greatest possible use to humanity, the best way to achieve this is to make it free software which everyone can redistribute and change under these terms. To do so, attach the following notices to the program. It is safest to attach them to the start of each source file to most effectively convey the exclusion of warranty; and each file should have at least the "copyright" line and a pointer to where the full notice is found. Copyright (C) 19yy This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 1, or (at your option) any later version. This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details. You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston MA 02110-1301 USA Also add information on how to contact you by electronic and paper mail. If the program is interactive, make it output a short notice like this when it starts in an interactive mode: Gnomovision version 69, Copyright (C) 19xx name of author Gnomovision comes with ABSOLUTELY NO WARRANTY; for details type `show w'. This is free software, and you are welcome to redistribute it under certain conditions; type `show c' for details. The hypothetical commands `show w' and `show c' should show the appropriate parts of the General Public License. Of course, the commands you use may be called something other than `show w' and `show c'; they could even be mouse-clicks or menu items--whatever suits your program. You should also get your employer (if you work as a programmer) or your school, if any, to sign a "copyright disclaimer" for the program, if necessary. Here a sample; alter the names: Yoyodyne, Inc., hereby disclaims all copyright interest in the program `Gnomovision' (a program to direct compilers to make passes at assemblers) written by James Hacker. , 1 April 1989 Ty Coon, President of Vice That's all there is to it! --- The Artistic License 1.0 --- This software is Copyright (c) 2021 by Toby Inkster. This is free software, licensed under: The Artistic License 1.0 The Artistic License Preamble The intent of this document is to state the conditions under which a Package may be copied, such that the Copyright Holder maintains some semblance of artistic control over the development of the package, while giving the users of the package the right to use and distribute the Package in a more-or-less customary fashion, plus the right to make reasonable modifications. Definitions: - "Package" refers to the collection of files distributed by the Copyright Holder, and derivatives of that collection of files created through textual modification. - "Standard Version" refers to such a Package if it has not been modified, or has been modified in accordance with the wishes of the Copyright Holder. - "Copyright Holder" is whoever is named in the copyright or copyrights for the package. - "You" is you, if you're thinking about copying or distributing this Package. - "Reasonable copying fee" is whatever you can justify on the basis of media cost, duplication charges, time of people involved, and so on. (You will not be required to justify it to the Copyright Holder, but only to the computing community at large as a market that must bear the fee.) - "Freely Available" means that no fee is charged for the item itself, though there may be fees involved in handling the item. It also means that recipients of the item may redistribute it under the same conditions they received it. 1. You may make and give away verbatim copies of the source form of the Standard Version of this Package without restriction, provided that you duplicate all of the original copyright notices and associated disclaimers. 2. You may apply bug fixes, portability fixes and other modifications derived from the Public Domain or from the Copyright Holder. A Package modified in such a way shall still be considered the Standard Version. 3. You may otherwise modify your copy of this Package in any way, provided that you insert a prominent notice in each changed file stating how and when you changed that file, and provided that you do at least ONE of the following: a) place your modifications in the Public Domain or otherwise make them Freely Available, such as by posting said modifications to Usenet or an equivalent medium, or placing the modifications on a major archive site such as ftp.uu.net, or by allowing the Copyright Holder to include your modifications in the Standard Version of the Package. b) use the modified Package only within your corporation or organization. c) rename any non-standard executables so the names do not conflict with standard executables, which must also be provided, and provide a separate manual page for each non-standard executable that clearly documents how it differs from the Standard Version. d) make other distribution arrangements with the Copyright Holder. 4. You may distribute the programs of this Package in object code or executable form, provided that you do at least ONE of the following: a) distribute a Standard Version of the executables and library files, together with instructions (in the manual page or equivalent) on where to get the Standard Version. b) accompany the distribution with the machine-readable source of the Package with your modifications. c) accompany any non-standard executables with their corresponding Standard Version executables, giving the non-standard executables non-standard names, and clearly documenting the differences in manual pages (or equivalent), together with instructions on where to get the Standard Version. d) make other distribution arrangements with the Copyright Holder. 5. You may charge a reasonable copying fee for any distribution of this Package. You may charge any fee you choose for support of this Package. You may not charge a fee for this Package itself. However, you may distribute this Package in aggregate with other (possibly commercial) programs as part of a larger (possibly commercial) software distribution provided that you do not advertise this Package as a product of your own. 6. The scripts and library files supplied as input to or produced as output from the programs of this Package do not automatically fall under the copyright of this Package, but belong to whomever generated them, and may be sold commercially, and may be aggregated with this Package. 7. C or perl subroutines supplied by you and linked into this Package shall not be considered part of this Package. 8. The name of the Copyright Holder may not be used to endorse or promote products derived from this software without specific prior written permission. 9. THIS PACKAGE IS PROVIDED "AS IS" AND WITHOUT ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES OF MERCHANTIBILITY AND FITNESS FOR A PARTICULAR PURPOSE. The End MANIFEST000664001750001750 443714116150214 14315 0ustar00taitai000000000000HTML-HTML5-Parser-0.992COPYRIGHT CREDITS Changes INSTALL LICENSE MANIFEST META.json META.yml Makefile.PL NEWS README SIGNATURE TODO bin/html2xhtml bin/html5debug dist.ini doap.ttl examples/charsets.pl examples/html/iso-8859-15.html examples/html/utf-16.html examples/html/utf-8.html lib/HTML/HTML5/Parser.pm lib/HTML/HTML5/Parser/Charset/DecodeHandle.pm lib/HTML/HTML5/Parser/Charset/Info.pm lib/HTML/HTML5/Parser/Charset/UnicodeChecker.pm lib/HTML/HTML5/Parser/Charset/UniversalCharDet.pm lib/HTML/HTML5/Parser/Charset/WebLatin1.pm lib/HTML/HTML5/Parser/Charset/WebThai.pm lib/HTML/HTML5/Parser/Error.pm lib/HTML/HTML5/Parser/TagSoupParser.pm lib/HTML/HTML5/Parser/Tokenizer.pm lib/HTML/HTML5/Parser/UA.pm t/01basic.t t/02html5.t t/03html4.t t/04fragments.t t/05origins.t t/06xlxdsln.t t/07ua.t t/08ua-lwp.t t/99html5lib.t t/data/rt-96399-1.html t/data/rt-96399-2.html t/html5lib-fail/domjs-unsafe.dat t/html5lib-fail/entities02.dat t/html5lib-fail/plain-text-unsafe.dat t/html5lib-fail/tests1.dat t/html5lib-fail/tests11.dat t/html5lib-fail/tests14.dat t/html5lib-fail/tests16.dat t/html5lib-fail/tests21.dat t/html5lib-fail/tests22.dat t/html5lib-fail/tests23.dat t/html5lib-fail/tests5.dat t/html5lib-fail/tests6.dat t/html5lib-fail/tests9.dat t/html5lib-fail/tests_innerHTML_1.dat t/html5lib-pass/adoption01.dat t/html5lib-pass/adoption02.dat t/html5lib-pass/comments01.dat t/html5lib-pass/doctype01.dat t/html5lib-pass/entities01.dat t/html5lib-pass/html5test-com.dat t/html5lib-pass/inbody01.dat t/html5lib-pass/isindex.dat t/html5lib-pass/pending-spec-changes-plain-text-unsafe.dat t/html5lib-pass/pending-spec-changes.dat t/html5lib-pass/scriptdata01.dat t/html5lib-pass/scripted/adoption01.dat t/html5lib-pass/scripted/ark.dat t/html5lib-pass/scripted/webkit01.dat t/html5lib-pass/tables01.dat t/html5lib-pass/tests10.dat t/html5lib-pass/tests12.dat t/html5lib-pass/tests15.dat t/html5lib-pass/tests17.dat t/html5lib-pass/tests18.dat t/html5lib-pass/tests19.dat t/html5lib-pass/tests2.dat t/html5lib-pass/tests20.dat t/html5lib-pass/tests24.dat t/html5lib-pass/tests25.dat t/html5lib-pass/tests26.dat t/html5lib-pass/tests3.dat t/html5lib-pass/tests4.dat t/html5lib-pass/tests7.dat t/html5lib-pass/tests8.dat t/html5lib-pass/tricky01.dat t/html5lib-pass/webkit01.dat t/html5lib-pass/webkit02.dat t/lib/Test/HTTP/Server.pm t/rt-79019.t t/rt-96399.t META.json000664001750001750 1213114116150214 14613 0ustar00taitai000000000000HTML-HTML5-Parser-0.992{ "abstract" : "parse HTML reliably", "author" : [ "Toby Inkster " ], "dynamic_config" : 0, "generated_by" : "Dist::Inkt::Profile::TOBYINK version 0.024, CPAN::Meta::Converter version 2.150010", "keywords" : [ "HTML", "HTML5", "Web" ], "license" : [ "perl_5" ], "meta-spec" : { "url" : "http://search.cpan.org/perldoc?CPAN::Meta::Spec", "version" : 2 }, "name" : "HTML-HTML5-Parser", "no_index" : { "directory" : [ "eg", "examples", "inc", "t", "xt" ] }, "optional_features" : {}, "prereqs" : { "configure" : { "requires" : { "ExtUtils::MakeMaker" : "6.17" } }, "develop" : { "recommends" : { "Dist::Inkt" : "0.001" } }, "runtime" : { "requires" : { "HTML::HTML5::Entities" : "0.002", "HTTP::Tiny" : "0", "IO::HTML" : "0", "Scalar::Util" : "0", "Try::Tiny" : "0", "URI::file" : "0", "XML::LibXML" : "1.94", "XML::LibXML::Devel" : "0", "perl" : "5.008001" } }, "test" : { "recommends" : { "LWP::UserAgent" : "0" }, "requires" : { "FindBin" : "0", "Test::More" : "0.61", "Test::Requires" : "0" } } }, "provides" : { "HTML::HTML5::Parser" : { "file" : "lib/HTML/HTML5/Parser.pm", "version" : "0.992" }, "HTML::HTML5::Parser::Charset::DecodeHandle" : { "file" : "lib/HTML/HTML5/Parser/Charset/DecodeHandle.pm", "version" : "0.992" }, "HTML::HTML5::Parser::Charset::DecodeHandle::ByteBuffer" : { "file" : "lib/HTML/HTML5/Parser/Charset/DecodeHandle.pm" }, "HTML::HTML5::Parser::Charset::DecodeHandle::CharString" : { "file" : "lib/HTML/HTML5/Parser/Charset/DecodeHandle.pm" }, "HTML::HTML5::Parser::Charset::DecodeHandle::Encode" : { "file" : "lib/HTML/HTML5/Parser/Charset/DecodeHandle.pm" }, "HTML::HTML5::Parser::Charset::DecodeHandle::ISO2022JP" : { "file" : "lib/HTML/HTML5/Parser/Charset/DecodeHandle.pm" }, "HTML::HTML5::Parser::Charset::Info" : { "file" : "lib/HTML/HTML5/Parser/Charset/Info.pm", "version" : "0.992" }, "HTML::HTML5::Parser::Charset::USASCII" : { "file" : "lib/HTML/HTML5/Parser/Charset/WebLatin1.pm" }, "HTML::HTML5::Parser::Charset::UnicodeChecker" : { "file" : "lib/HTML/HTML5/Parser/Charset/UnicodeChecker.pm", "version" : "0.992" }, "HTML::HTML5::Parser::Charset::UniversalCharDet" : { "file" : "lib/HTML/HTML5/Parser/Charset/UniversalCharDet.pm", "version" : "0.992" }, "HTML::HTML5::Parser::Charset::WebLatin1" : { "file" : "lib/HTML/HTML5/Parser/Charset/WebLatin1.pm", "version" : "0.992" }, "HTML::HTML5::Parser::Charset::WebLatin5" : { "file" : "lib/HTML/HTML5/Parser/Charset/WebLatin1.pm" }, "HTML::HTML5::Parser::Charset::WebThai" : { "file" : "lib/HTML/HTML5/Parser/Charset/WebThai.pm", "version" : "0.992" }, "HTML::HTML5::Parser::Charset::WebThai::WebTIS620" : { "file" : "lib/HTML/HTML5/Parser/Charset/WebThai.pm" }, "HTML::HTML5::Parser::Error" : { "file" : "lib/HTML/HTML5/Parser/Error.pm", "version" : "0.992" }, "HTML::HTML5::Parser::TagSoupParser" : { "file" : "lib/HTML/HTML5/Parser/TagSoupParser.pm", "version" : "0.992" }, "HTML::HTML5::Parser::TagSoupParser::RestartParser" : { "file" : "lib/HTML/HTML5/Parser/TagSoupParser.pm" }, "HTML::HTML5::Parser::Tokenizer" : { "file" : "lib/HTML/HTML5/Parser/Tokenizer.pm", "version" : "0.992" }, "HTML::HTML5::Parser::UA" : { "file" : "lib/HTML/HTML5/Parser/UA.pm", "version" : "0.992" } }, "release_status" : "stable", "resources" : { "bugtracker" : { "web" : "https://github.com/tobyink/p5-html-html5-parser/issues" }, "homepage" : "https://metacpan.org/release/HTML-HTML5-Parser", "license" : [ "http://dev.perl.org/licenses/" ], "repository" : { "type" : "git", "url" : "git://github.com/tobyink/p5-html-html5-parser.git", "web" : "https://github.com/tobyink/p5-html-html5-parser" }, "x_identifier" : "http://purl.org/NET/cpan-uri/dist/HTML-HTML5-Parser/project" }, "version" : "0.992", "x_contributors" : [ "Wakaba", "SHELLAC", "AUDREYT ", "TOBYINK", "Dorian Taylor", "Gregory Williams", "Kjetil Kjernsmo", "Alex Vandiver" ], "x_provides_scripts" : { "html2xhtml" : { "file" : "bin/html2xhtml" }, "html5debug" : { "file" : "bin/html5debug" } }, "x_serialization_backend" : "JSON::PP version 4.06" } META.yml000664001750001750 663214116150214 14434 0ustar00taitai000000000000HTML-HTML5-Parser-0.992--- abstract: 'parse HTML reliably' author: - 'Toby Inkster ' build_requires: FindBin: '0' Test::More: '0.61' Test::Requires: '0' configure_requires: ExtUtils::MakeMaker: '6.17' dynamic_config: 0 generated_by: 'Dist::Inkt::Profile::TOBYINK version 0.024, CPAN::Meta::Converter version 2.150010' keywords: - HTML - HTML5 - Web license: perl meta-spec: url: http://module-build.sourceforge.net/META-spec-v1.4.html version: '1.4' name: HTML-HTML5-Parser no_index: directory: - eg - examples - inc - t - xt optional_features: {} provides: HTML::HTML5::Parser: file: lib/HTML/HTML5/Parser.pm version: '0.992' HTML::HTML5::Parser::Charset::DecodeHandle: file: lib/HTML/HTML5/Parser/Charset/DecodeHandle.pm version: '0.992' HTML::HTML5::Parser::Charset::DecodeHandle::ByteBuffer: file: lib/HTML/HTML5/Parser/Charset/DecodeHandle.pm HTML::HTML5::Parser::Charset::DecodeHandle::CharString: file: lib/HTML/HTML5/Parser/Charset/DecodeHandle.pm HTML::HTML5::Parser::Charset::DecodeHandle::Encode: file: lib/HTML/HTML5/Parser/Charset/DecodeHandle.pm HTML::HTML5::Parser::Charset::DecodeHandle::ISO2022JP: file: lib/HTML/HTML5/Parser/Charset/DecodeHandle.pm HTML::HTML5::Parser::Charset::Info: file: lib/HTML/HTML5/Parser/Charset/Info.pm version: '0.992' HTML::HTML5::Parser::Charset::USASCII: file: lib/HTML/HTML5/Parser/Charset/WebLatin1.pm HTML::HTML5::Parser::Charset::UnicodeChecker: file: lib/HTML/HTML5/Parser/Charset/UnicodeChecker.pm version: '0.992' HTML::HTML5::Parser::Charset::UniversalCharDet: file: lib/HTML/HTML5/Parser/Charset/UniversalCharDet.pm version: '0.992' HTML::HTML5::Parser::Charset::WebLatin1: file: lib/HTML/HTML5/Parser/Charset/WebLatin1.pm version: '0.992' HTML::HTML5::Parser::Charset::WebLatin5: file: lib/HTML/HTML5/Parser/Charset/WebLatin1.pm HTML::HTML5::Parser::Charset::WebThai: file: lib/HTML/HTML5/Parser/Charset/WebThai.pm version: '0.992' HTML::HTML5::Parser::Charset::WebThai::WebTIS620: file: lib/HTML/HTML5/Parser/Charset/WebThai.pm HTML::HTML5::Parser::Error: file: lib/HTML/HTML5/Parser/Error.pm version: '0.992' HTML::HTML5::Parser::TagSoupParser: file: lib/HTML/HTML5/Parser/TagSoupParser.pm version: '0.992' HTML::HTML5::Parser::TagSoupParser::RestartParser: file: lib/HTML/HTML5/Parser/TagSoupParser.pm HTML::HTML5::Parser::Tokenizer: file: lib/HTML/HTML5/Parser/Tokenizer.pm version: '0.992' HTML::HTML5::Parser::UA: file: lib/HTML/HTML5/Parser/UA.pm version: '0.992' requires: HTML::HTML5::Entities: '0.002' HTTP::Tiny: '0' IO::HTML: '0' Scalar::Util: '0' Try::Tiny: '0' URI::file: '0' XML::LibXML: '1.94' XML::LibXML::Devel: '0' perl: '5.008001' resources: Identifier: http://purl.org/NET/cpan-uri/dist/HTML-HTML5-Parser/project bugtracker: https://github.com/tobyink/p5-html-html5-parser/issues homepage: https://metacpan.org/release/HTML-HTML5-Parser license: http://dev.perl.org/licenses/ repository: git://github.com/tobyink/p5-html-html5-parser.git version: '0.992' x_contributors: - Wakaba - SHELLAC - 'AUDREYT ' - TOBYINK - 'Dorian Taylor' - 'Gregory Williams' - 'Kjetil Kjernsmo' - 'Alex Vandiver' x_provides_scripts: html2xhtml: file: bin/html2xhtml html5debug: file: bin/html5debug x_serialization_backend: 'CPAN::Meta::YAML version 0.018' Makefile.PL000664001750001750 2342714116150214 15156 0ustar00taitai000000000000HTML-HTML5-Parser-0.992use strict; use ExtUtils::MakeMaker 6.17; my $EUMM = eval( $ExtUtils::MakeMaker::VERSION ); my $meta = { "abstract" => "parse HTML reliably", "author" => ["Toby Inkster "], "dynamic_config" => 0, "generated_by" => "Dist::Inkt::Profile::TOBYINK version 0.024, CPAN::Meta::Converter version 2.150010", "keywords" => ["HTML", "HTML5", "Web"], "license" => ["perl_5"], "meta-spec" => { url => "http://search.cpan.org/perldoc?CPAN::Meta::Spec", version => 2, }, "name" => "HTML-HTML5-Parser", "no_index" => { directory => ["eg", "examples", "inc", "t", "xt"] }, "prereqs" => { configure => { requires => { "ExtUtils::MakeMaker" => 6.17 } }, develop => { recommends => { "Dist::Inkt" => 0.001 } }, runtime => { requires => { "HTML::HTML5::Entities" => 0.002, "HTTP::Tiny" => 0, "IO::HTML" => 0, "perl" => 5.008001, "Scalar::Util" => 0, "Try::Tiny" => 0, "URI::file" => 0, "XML::LibXML" => 1.94, "XML::LibXML::Devel" => 0, }, }, test => { recommends => { "LWP::UserAgent" => 0 }, requires => { "FindBin" => 0, "Test::More" => 0.61, "Test::Requires" => 0 }, }, }, "provides" => { "HTML::HTML5::Parser" => { file => "lib/HTML/HTML5/Parser.pm", version => 0.992 }, "HTML::HTML5::Parser::Charset::DecodeHandle" => { file => "lib/HTML/HTML5/Parser/Charset/DecodeHandle.pm", version => 0.992, }, "HTML::HTML5::Parser::Charset::DecodeHandle::ByteBuffer" => { file => "lib/HTML/HTML5/Parser/Charset/DecodeHandle.pm" }, "HTML::HTML5::Parser::Charset::DecodeHandle::CharString" => { file => "lib/HTML/HTML5/Parser/Charset/DecodeHandle.pm" }, "HTML::HTML5::Parser::Charset::DecodeHandle::Encode" => { file => "lib/HTML/HTML5/Parser/Charset/DecodeHandle.pm" }, "HTML::HTML5::Parser::Charset::DecodeHandle::ISO2022JP" => { file => "lib/HTML/HTML5/Parser/Charset/DecodeHandle.pm" }, "HTML::HTML5::Parser::Charset::Info" => { file => "lib/HTML/HTML5/Parser/Charset/Info.pm", version => 0.992 }, "HTML::HTML5::Parser::Charset::UnicodeChecker" => { file => "lib/HTML/HTML5/Parser/Charset/UnicodeChecker.pm", version => 0.992, }, "HTML::HTML5::Parser::Charset::UniversalCharDet" => { file => "lib/HTML/HTML5/Parser/Charset/UniversalCharDet.pm", version => 0.992, }, "HTML::HTML5::Parser::Charset::USASCII" => { file => "lib/HTML/HTML5/Parser/Charset/WebLatin1.pm" }, "HTML::HTML5::Parser::Charset::WebLatin1" => { file => "lib/HTML/HTML5/Parser/Charset/WebLatin1.pm", version => 0.992, }, "HTML::HTML5::Parser::Charset::WebLatin5" => { file => "lib/HTML/HTML5/Parser/Charset/WebLatin1.pm" }, "HTML::HTML5::Parser::Charset::WebThai" => { file => "lib/HTML/HTML5/Parser/Charset/WebThai.pm", version => 0.992, }, "HTML::HTML5::Parser::Charset::WebThai::WebTIS620" => { file => "lib/HTML/HTML5/Parser/Charset/WebThai.pm" }, "HTML::HTML5::Parser::Error" => { file => "lib/HTML/HTML5/Parser/Error.pm", version => 0.992 }, "HTML::HTML5::Parser::TagSoupParser" => { file => "lib/HTML/HTML5/Parser/TagSoupParser.pm", version => 0.992 }, "HTML::HTML5::Parser::TagSoupParser::RestartParser" => { file => "lib/HTML/HTML5/Parser/TagSoupParser.pm" }, "HTML::HTML5::Parser::Tokenizer" => { file => "lib/HTML/HTML5/Parser/Tokenizer.pm", version => 0.992 }, "HTML::HTML5::Parser::UA" => { file => "lib/HTML/HTML5/Parser/UA.pm", version => 0.992 }, }, "release_status" => "stable", "resources" => { bugtracker => { web => "https://github.com/tobyink/p5-html-html5-parser/issues" }, homepage => "https://metacpan.org/release/HTML-HTML5-Parser", license => ["http://dev.perl.org/licenses/"], repository => { type => "git", url => "git://github.com/tobyink/p5-html-html5-parser.git", web => "https://github.com/tobyink/p5-html-html5-parser", }, x_identifier => "http://purl.org/NET/cpan-uri/dist/HTML-HTML5-Parser/project", }, "version" => 0.992, "x_contributors" => [ "Wakaba", "SHELLAC", "AUDREYT ", "TOBYINK", "Dorian Taylor", "Gregory Williams", "Kjetil Kjernsmo", "Alex Vandiver", ], "x_provides_scripts" => { html2xhtml => { file => "bin/html2xhtml" }, html5debug => { file => "bin/html5debug" }, }, }; my %dynamic_config; my %WriteMakefileArgs = ( ABSTRACT => $meta->{abstract}, AUTHOR => ($EUMM >= 6.5702 ? $meta->{author} : $meta->{author}[0]), DISTNAME => $meta->{name}, VERSION => $meta->{version}, EXE_FILES => [ map $_->{file}, values %{ $meta->{x_provides_scripts} || {} } ], NAME => do { my $n = $meta->{name}; $n =~ s/-/::/g; $n }, test => { TESTS => "t/*.t" }, %dynamic_config, ); $WriteMakefileArgs{LICENSE} = $meta->{license}[0] if $EUMM >= 6.3001; sub deps { my %r; for my $stage (@_) { for my $dep (keys %{$meta->{prereqs}{$stage}{requires}}) { next if $dep eq 'perl'; my $ver = $meta->{prereqs}{$stage}{requires}{$dep}; $r{$dep} = $ver if !exists($r{$dep}) || $ver >= $r{$dep}; } } \%r; } my ($build_requires, $configure_requires, $runtime_requires, $test_requires); if ($EUMM >= 6.6303) { $WriteMakefileArgs{BUILD_REQUIRES} ||= deps('build'); $WriteMakefileArgs{CONFIGURE_REQUIRES} ||= deps('configure'); $WriteMakefileArgs{TEST_REQUIRES} ||= deps('test'); $WriteMakefileArgs{PREREQ_PM} ||= deps('runtime'); } elsif ($EUMM >= 6.5503) { $WriteMakefileArgs{BUILD_REQUIRES} ||= deps('build', 'test'); $WriteMakefileArgs{CONFIGURE_REQUIRES} ||= deps('configure'); $WriteMakefileArgs{PREREQ_PM} ||= deps('runtime'); } elsif ($EUMM >= 6.52) { $WriteMakefileArgs{CONFIGURE_REQUIRES} ||= deps('configure'); $WriteMakefileArgs{PREREQ_PM} ||= deps('runtime', 'build', 'test'); } else { $WriteMakefileArgs{PREREQ_PM} ||= deps('configure', 'build', 'test', 'runtime'); } { my ($minperl) = reverse sort( grep defined && /^[0-9]+(\.[0-9]+)?$/, map $meta->{prereqs}{$_}{requires}{perl}, qw( configure build runtime ) ); if (defined($minperl)) { die "Installing $meta->{name} requires Perl >= $minperl" unless $] >= $minperl; $WriteMakefileArgs{MIN_PERL_VERSION} ||= $minperl if $EUMM >= 6.48; } } my $mm = WriteMakefile(%WriteMakefileArgs); exit(0); NEWS000664001750001750 51114116150214 13630 0ustar00taitai000000000000HTML-HTML5-Parser-0.9922013-07-05 HTML-HTML5-Parser-0.300 ================================== Fixed (DORIAN++) some major memory leaks at the expense of some minor API changes. 2012-03-15 HTML-HTML5-Parser-0.109 ================================== We now bundle the tree construction parts of the html5lib test suite, and pass more than half of it. README000664001750001750 3057614116150214 14067 0ustar00taitai000000000000HTML-HTML5-Parser-0.992NAME HTML::HTML5::Parser - parse HTML reliably SYNOPSIS use HTML::HTML5::Parser; my $parser = HTML::HTML5::Parser->new; my $doc = $parser->parse_string(<<'EOT'); Foo

Foo bar.

Baz
Quux. EOT my $fdoc = $parser->parse_file( $html_file_name ); my $fhdoc = $parser->parse_fh( $html_file_handle ); DESCRIPTION This library is substantially the same as the non-CPAN module Whatpm::HTML. Changes include: * Provides an XML::LibXML-like DOM interface. If you usually use XML::LibXML's DOM parser, this should be a drop-in solution for tag soup HTML. * Constructs an XML::LibXML::Document as the result of parsing. * Via bundling and modifications, removed external dependencies on non-CPAN packages. Constructor `new` $parser = HTML::HTML5::Parser->new; # or $parser = HTML::HTML5::Parser->new(no_cache => 1); The constructor does nothing interesting besides take one flag argument, `no_cache => 1`, to disable the global element metadata cache. Disabling the cache is handy for conserving memory if you parse a large number of documents, however, class methods such as `/source_line` will not work, and must be run from an instance of this parser. XML::LibXML-Compatible Methods `parse_file`, `parse_html_file` $doc = $parser->parse_file( $html_file_name [,\%opts] ); This function parses an HTML document from a file or network; $html_file_name can be either a filename or an URL. Options include 'encoding' to indicate file encoding (e.g. 'utf-8') and 'user_agent' which should be a blessed `LWP::UserAgent` (or HTTP::Tiny) object to be used when retrieving URLs. If requesting a URL and the response Content-Type header indicates an XML-based media type (such as XHTML), XML::LibXML::Parser will be used automatically (instead of the tag soup parser). The XML parser can be told to use a DTD catalogue by setting the option 'xml_catalogue' to the filename of the catalogue. HTML (tag soup) parsing can be forced using the option 'force_html', even when an XML media type is returned. If an options hashref was passed, parse_file will set $options->{'parser_used'} to the name of the class used to parse the URL, to allow the calling code to double-check which parser was used afterwards. If an options hashref was passed, parse_file will set $options->{'response'} to the HTTP::Response object obtained by retrieving the URI. `parse_fh`, `parse_html_fh` $doc = $parser->parse_fh( $io_fh [,\%opts] ); `parse_fh()` parses a IOREF or a subclass of `IO::Handle`. Options include 'encoding' to indicate file encoding (e.g. 'utf-8'). `parse_string`, `parse_html_string` $doc = $parser->parse_string( $html_string [,\%opts] ); This function is similar to `parse_fh()`, but it parses an HTML document that is available as a single string in memory. Options include 'encoding' to indicate file encoding (e.g. 'utf-8'). `load_xml`, `load_html` Wrappers for the parse_* functions. These should be roughly compatible with the equivalently named functions in XML::LibXML. Note that `load_xml` first attempts to parse as real XML, falling back to HTML5 parsing; `load_html` just goes straight for HTML5. `parse_balanced_chunk` $fragment = $parser->parse_balanced_chunk( $string [,\%opts] ); This method is roughly equivalent to XML::LibXML's method of the same name, but unlike XML::LibXML, and despite its name it does not require the chunk to be "balanced". This method is somewhat black magic, but should work, and do the proper thing in most cases. Of course, the proper thing might not be what you'd expect! I'll try to keep this explanation as brief as possible... Consider the following string: Hello World What is the proper way to parse that? If it were found in a document like this: X

Hello World
Then the document would end up equivalent to the following XHTML: X
Hello World
The superfluous `` is simply ignored. However, if it were found in a document like this: X World
Hello
Then the result would be: X World
Hello
Yes, `World` gets hoisted up before the ``. This is weird, I know, but it's how browsers do it in real life. So what should: $string = q{HelloWorld}; $fragment = $parser->parse_balanced_chunk($string); actually return? Well, you can choose... $string = q{HelloWorld}; $frag1 = $parser->parse_balanced_chunk($string, {within=>'div'}); say $frag1->toString; # HelloWorld $frag2 = $parser->parse_balanced_chunk($string, {within=>'td'}); say $frag2->toString; # WorldHello If you don't pass a "within" option, then the chunk is parsed as if it were within a `
` element. This is often the most sensible option. If you pass something like `{ within => "foobar" }` where "foobar" is not a real HTML element name (as found in the HTML5 spec), then this method will croak; if you pass the name of a void element (e.g. "br" or "meta") then this method will croak; there are a handful of other unsupported elements which will croak (namely: "noscript", "noembed", "noframes"). Note that the second time around, although we parsed the string "as if it were within a `
World"; my $NS = 'xmlns="http://www.w3.org/1999/xhtml"'; can_ok $parser => 'parse_balanced_chunk'; is( $parser->parse_balanced_chunk($input, {within=>'div'})->toString, "HelloWorld", 'within div', ); is( $parser->parse_balanced_chunk($input, {within=>'td'})->toString, "WorldHello ", 'within td', ); is( $parser->parse_balanced_chunk($input, {force_within=>'td'})->toString, "Hello", 'force within td', ); my $list = $parser->parse_balanced_chunk($input, {mark_outliers=>1, within=>'td', as=>'list'}); ok( $list->get_node(1)->hasAttribute('data-perl-html-html5-parser-outlier'), 'mark outliers', ); =head1 PURPOSE Test C method. =head1 AUTHOR Toby Inkster, Etobyink@cpan.orgE =head1 COPYRIGHT AND LICENCE Copyright (C) 2012 by Toby Inkster This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. 05origins.t000664001750001750 624014116150214 15425 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/t## skip Test::Tabs use Test::More tests => 29; use HTML::HTML5::Parser; my $dom = HTML::HTML5::Parser->load_html(string => <<'HTML'); Test 5: Origins

This is a test! HTML can_ok 'HTML::HTML5::Parser' => 'source_line' or BAIL_OUT('No "source_line" method!!'); my @root = HTML::HTML5::Parser->source_line($dom->documentElement); is($root[0], 2, 'root element has correct line number'); is($root[1], 1, 'root element has correct col number'); ok(!$root[2], 'root element explicit'); my @head = HTML::HTML5::Parser->source_line($dom->getElementsByTagName('head')->get_node(1)); ok(defined $head[0], 'head element has a line number'); ok(defined $head[1], 'head element has a col number'); ok($head[2], 'head element implicit'); my @title_text = HTML::HTML5::Parser->source_line($dom->getElementsByTagName('title')->get_node(1)->childNodes->get_node(1)); is($title_text[0], 3, 'text node in title element has a line number'); is($title_text[1], 10, 'text node in title element has a col number'); ok(!$title_text[2], 'text node in title element explicit'); my @para = HTML::HTML5::Parser->source_line($dom->getElementsByTagName('p')->get_node(1)); is($para[0], 4, 'p element has correct line number'); is($para[1], 3, 'p element has correct col number'); ok(!$para[2], 'para element explicit'); my $para = HTML::HTML5::Parser->source_line($dom->getElementsByTagName('p')->get_node(1)); is($para, 4, 'p element has correct line number (scalar context)'); my @b = HTML::HTML5::Parser->source_line($dom->getElementsByTagName('b')->get_node(1)); is($b[0], 5, 'b element has correct line number'); is($b[1], 5, 'b element has correct col number'); ok(!$b[2], 'b element explicit'); my @i = HTML::HTML5::Parser->source_line($dom->getElementsByTagName('i')->get_node(1)); is($i[0], 5, 'i element has correct line number'); is($i[1], 17, 'i element has correct col number'); ok(!$i[2], 'i element explicit'); my @a = HTML::HTML5::Parser->source_line($dom->getElementsByTagName('a')->get_node(1)); is($a[0], 6, 'a element has correct line number'); is($a[1], 5, 'a element has correct col number'); ok(!$a[2], 'a element explicit'); my @href = HTML::HTML5::Parser->source_line($dom->getElementsByTagName('a')->get_node(1)->getAttributeNode('href')); is($href[0], 6, 'href attribute has correct line number'); is($href[1], 8, 'href attribute has correct col number'); ok(!$href[2], 'href attribute explicit'); # It's not easy to actually find comments in the DOM! my $comment = $dom->getElementsByTagName('p')->[0]->childNodes->[-2]; my @comment = HTML::HTML5::Parser->source_line($comment); is($comment[0], 8, 'comment has correct line number') or diag($comment->toString); is($comment[1], 5, 'comment has correct col number'); ok(!$comment[2], 'comment is explicit'); =head1 PURPOSE Check that line/column numbers are reported. =head1 AUTHOR Toby Inkster, Etobyink@cpan.orgE =head1 COPYRIGHT AND LICENCE Copyright (C) 2012 by Toby Inkster This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. 06xlxdsln.t000664001750001750 265614116150214 15457 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/t## skip Test::Tabs use Test::More; use HTML::HTML5::Parser; eval { require XML::LibXML::Devel::SetLineNumber; 1 } or plan skip_all => "Need XML::LibXML::Devel::SetLineNumber"; plan tests => 6; my $dom = HTML::HTML5::Parser->load_html(string => <<'HTML'); Test 6: XML::LibXML::Devel::SetLineNumber

This is a test! HTML is( $dom->documentElement->line_number, 2, 'root element has correct line number', ); ok( defined $dom->getElementsByTagName('head')->get_node(1)->line_number, 'head element has a line number', ); is( $dom->getElementsByTagName('p')->get_node(1)->line_number, 4, 'p element has correct line number', ); is( $dom->getElementsByTagName('b')->get_node(1)->line_number, 5, 'b element has correct line number', ); is( $dom->getElementsByTagName('i')->get_node(1)->line_number, 5, 'i element has correct line number', ); is( $dom->getElementsByTagName('a')->get_node(1)->line_number, 6, 'a element has correct line number', ); =head1 PURPOSE Check that L can be used to get line number information for XML elements. =head1 AUTHOR Toby Inkster, Etobyink@cpan.orgE =head1 COPYRIGHT AND LICENCE Copyright (C) 2012 by Toby Inkster This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. 07ua.t000664001750001750 276414116150214 14371 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/t## skip Test::Tabs use 5.008; use strict; use lib 'lib'; use lib 't/lib'; use Test::More; use HTML::HTML5::Parser; use HTML::HTML5::Parser::UA; use URI::file; { package Test::HTTP::Server::Request; sub doc1 { shift->{out_headers}{content_type} = 'text/html'; q{ Test!

Test!

}; } } eval { require Test::HTTP::Server; 1; } or plan skip_all => "Could not use Test::HTTP::Server: $@"; plan skip_all => "Test::HTTP::Server 0.03 fails on Win32" if $^O =~ /win/i && Test::HTTP::Server->VERSION lt '0.04'; plan tests => 3; my $server = Test::HTTP::Server->new(); my $baseuri = $server->uri; $HTML::HTML5::Parser::UA::NO_LWP = 1 if $HTML::HTML5::Parser::UA::NO_LWP eq '0'; my $file_response = HTML::HTML5::Parser::UA->get(URI::file->new_abs("t/01basic.t")); is( $file_response->{status}, 200, "simple file response - status 200", ); my $http_response = HTML::HTML5::Parser::UA->get($baseuri . 'doc1'); is( $file_response->{status}, 200, "simple HTTP response - status 200", ); my $dom = HTML::HTML5::Parser->load_html(location => $baseuri.'doc1'); is( $dom->getElementsByTagName('title')->shift->textContent, 'Test!', 'UA usage by parser', ); =head1 PURPOSE Check that L works with L. =head1 AUTHOR Toby Inkster, Etobyink@cpan.orgE =head1 COPYRIGHT AND LICENCE Copyright (C) 2012 by Toby Inkster This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. 08ua-lwp.t000664001750001750 101314116150214 15154 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/t## skip Test::Tabs use Test::More; use Test::Requires 'LWP::UserAgent'; use HTML::HTML5::Parser::UA; $HTML::HTML5::Parser::UA::NO_LWP = ''; do './07ua.t' if -s '07ua.t'; do './t/07ua.t' if -s 't/07ua.t'; =head1 PURPOSE Check that L works with L. =head1 AUTHOR Toby Inkster, Etobyink@cpan.orgE =head1 COPYRIGHT AND LICENCE Copyright (C) 2012 by Toby Inkster This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. 99html5lib.t000664001750001750 1726514116150214 15541 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/t## skip Test::Tabs use Test::More; use HTML::HTML5::Parser; BEGIN { eval { require Moo; 1 } or plan skip_all => 'Need Moo!' }; { package XML::LibXML::Document; sub pythonDebug { my $self = shift; my ($indent, $parser) = @_; $indent = '' unless defined $indent; my $return; my $element = $parser->dtd_element($self); my $public = $parser->dtd_public_id($self) || ''; my $system = $parser->dtd_system_id($self) || ''; if (defined $element) { $return = sprintf( "| \n", $element, (($public||$system) ? " \"$public\"" : ""), (($public||$system) ? " \"$system\"" : ""), ); } $return .= $_->pythonDebug(q{| }, $parser) foreach $self->childNodes; return $return; } } { package XML::LibXML::DocumentFragment; sub pythonDebug { my $self = shift; my ($indent, $parser) = @_; $indent = '' unless defined $indent; $self->normalize; my $return; foreach ($self->childNodes) { $return .= $_->pythonDebug($indent . q{| }, $parser); } return $return; } } { package XML::LibXML::Element; sub pythonDebug { my $self = shift; my ($indent, $parser) = @_; $indent = '' unless defined $indent; $self->normalize; my $nsbit = ''; $nsbit = 'svg ' if $self->namespaceURI =~ /svg/i; $nsbit = 'math ' if $self->namespaceURI =~ /math/i; my $return = sprintf("%s<%s%s>\n", $indent, $nsbit, $self->localname); my @attribs = sort { $a->localname cmp $b->localname } grep { not $_->isa('XML::LibXML::Namespace') } $self->attributes; foreach (@attribs) { $return .= $_->pythonDebug($indent . q{ }, $parser); } if ($self->localname eq 'noscript') { my $innerHTML = join q{}, map { $_->toString } $self->childNodes; $return .= $indent . q{ "} . $innerHTML . "\"\n"; } else { foreach ($self->childNodes) { $return .= $_->pythonDebug($indent . q{ }, $parser); } } return $return; } } { package XML::LibXML::Text; sub pythonDebug { my $self = shift; my ($indent, $parser) = @_; $indent = '' unless defined $indent; return sprintf("%s\"%s\"\n", $indent, $self->data); } } { package XML::LibXML::Comment; sub pythonDebug { my $self = shift; my ($indent, $parser) = @_; $indent = '' unless defined $indent; return sprintf("%s\n", $indent, $self->data); } } { package XML::LibXML::Attr; sub pythonDebug { my $self = shift; my ($indent, $parser) = @_; $indent = '' unless defined $indent; return sprintf("%s%s %s=\"%s\"\n", $indent, split(/:/, $self->nodeName), $self->value) if $self->namespaceURI && $self->nodeName=~/:/; return sprintf("%s%s=\"%s\"\n", $indent, $self->localname, $self->value); } } { package Local::HTML5Lib::Test; use Moo; has test_file => (is => 'rw'); has test_number => (is => 'rw'); has data => (is => 'rw'); has errors => (is => 'rw'); has document => (is => 'rw'); has document_fragment => (is => 'rw'); has parser => (is => 'lazy', builder => '_build_parser'); sub test_id { my $self = shift; if ($self->test_file->filename =~ m{ / ([^/]+) $ }x) { sprintf('%s:%s', $1, $self->test_number||1); } } sub dom { my ($self) = @_; if ($self->document_fragment) { return $self->parser->parse_balanced_chunk( $self->data, {within => $self->document_fragment}, ); } return eval { $self->parser->parse_string($self->data); } || do { my $e = $@; my $xml = 'XML::LibXML::Document'->new('1.0', 'utf-8'); $xml->setDocumentElement( $xml->createElementNS('http://www.w3.org/1999/xhtml', 'html') ); $xml->documentElement->appendText("ERROR: $e"); $xml; } } sub _build_parser { require HTML::HTML5::Parser; 'HTML::HTML5::Parser'->new; } sub __uniscape { my $str = shift; eval { $str =~ s{ ([^\n\x20-\x7E]) }{ sprintf('\x{%04X}', ord($1)) }gex; }; $str; } sub run { my ($self) = @_; my $expected = $self->document."\n"; my $got = $self->dom->pythonDebug(undef, $self->parser); utf8::decode($got); local $Test::Builder::Level = $Test::Builder::Level + 1; SKIP: { my $excuse = $::SKIP->{ $self->test_id }; Test::More::skip($excuse, 1) if defined $excuse; if ($got eq $expected) { Test::More::pass("DATA: ".$self->data); return 1; } else { Test::More::fail("DATA: ".$self->data); Test::More::diag("ID: ".$self->test_id); Test::More::diag("GOT:\n" . __uniscape $got); Test::More::diag("EXPECTED:\n" . __uniscape $expected); return 0; } } } } { package Local::HTML5Lib::TestFile; use Moo; has filename => (is => "rw"); has tests => (is => "rw"); has last_score => (is => "rw"); sub read_file { my ($class, $filename) = @_; my $self = $class->new( filename => $filename, ); my @tests; open my $fh, '<', $filename; push @tests, (my $current_test = { test_file=>$self }); my $current_key; my @lines = <$fh>; # sometimes we need to peek at the next line; while (defined ($_ = shift @lines)) { no warnings; if (!/\S/ and (!defined $lines[0] or $lines[0]=~ /^\#data/)) { $current_test->{test_number} = @tests; chomp $current_test->{$current_key} if defined $current_key; $current_test = { test_file=>$self }; $current_key = undef; push @tests, $current_test; next; } if (/^\#(.+)/) { chomp $current_test->{$current_key} if defined $current_key; ($current_key = $1) =~ s/-/_/g; next; } $current_test->{$current_key} .= $_; } chomp $current_test->{$current_key}; $self->tests([ map { utf8::decode($_->{document}); utf8::decode($_->{data}); Local::HTML5Lib::Test->new(%$_); } @tests]); return $self; } sub run { local $Test::Builder::Level = $Test::Builder::Level + 1; my $self = shift; $self->{last_score} = 0; Test::More::subtest( sprintf("Test file: %s", $self->filename), sub { $self->{last_score} += ($_->run ? 1 : 0) for @{ $self->tests } }, ); } } package main; our $SKIP = { 'tests26.dat:10' => 'requires HTML parser to construct a DOM tree which is illegal in libxml (bad attribute name)', 'webkit01.dat:14' => 'requires HTML parser to construct a DOM tree which is illegal in libxml (bad element name)', 'webkit01.dat:42' => 'requires HTML parser to construct a DOM tree which is illegal in libxml (bad attribute name)', 'webkit02.dat:4' => 'I basically just disagree with this test.', 'html5test-com.dat:1' => 'requires HTML parser to construct a DOM tree which is illegal in libxml (bad element name)', 'html5test-com.dat:2' => 'requires HTML parser to construct a DOM tree which is illegal in libxml (bad attribute name)', 'html5test-com.dat:4' => 'requires HTML parser to construct a DOM tree which is illegal in libxml (bad attribute name)', }; my @fails; my @passes; unless (@ARGV) { @ARGV = ; } plan tests => scalar(@ARGV); while (my $f = shift) { my $F = Local::HTML5Lib::TestFile->read_file($f); if ($F->run) { push @passes, $F; } else { push @fails, $F; } } if (@fails) { diag "FAILED:"; diag sprintf(" %s [%d/%d]", $_->filename, $_->last_score, scalar(@{$_->tests})) for @fails; } if (@passes) { diag "PASSED:"; diag sprintf(" %s [%d/%d]", $_->filename, $_->last_score, scalar(@{$_->tests})) for @passes; } =head1 PURPOSE Tests from html5lib's testdata/tree-construction. =head1 SEE ALSO L. =head1 AUTHOR Toby Inkster, Etobyink@cpan.orgE =head1 COPYRIGHT AND LICENCE Copyright (C) 2012 by Toby Inkster This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. rt-79019.t000664001750001750 123414116150214 14720 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/t## skip Test::Tabs use Test::More tests => 1; use HTML::HTML5::Parser; my $dom = HTML::HTML5::Parser::->load_html(IO => \*DATA); is( $dom->documentElement->lookupNamespaceURI('fb'), 'http://ogp.me/ns/fb#', ); =head1 PURPOSE Check that some weird namespace thing doesn't crash the parser. =head1 SEE ALSO L. =head1 AUTHOR Toby Inkster, Etobyink@cpan.orgE =head1 COPYRIGHT AND LICENCE Copyright (C) 2012 by Toby Inkster This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. =cut __DATA__ rt-96399.t000664001750001750 246714116150214 14743 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/t=head1 PURPOSE Test handling of utf-8 encoded file data =head1 SEE ALSO =over 4 =item * L =item * L =back =head1 AUTHOR Gregory Todd Williams, Egwilliams@cpan.orgE =head1 COPYRIGHT AND LICENCE Copyright (C) 2017 by Gregory Todd Williams This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. =cut use Test::More tests => 2; use Test::Requires 'LWP::UserAgent'; use HTML::HTML5::Parser; use Encode qw(decode_utf8); subtest 'U+2193 DOWNWARDS ARROW' => sub { my $filename = 't/data/rt-96399-1.html'; my $parser = HTML::HTML5::Parser->new; my $doc = $parser->parse_file($filename); is($parser->charset($doc), 'utf-8', 'recognized encoding as utf-8'); like(decode_utf8($doc->toString()), qr/\x{2193}/, 'encoding properly round-trips U+2193 DOWNWARDS ARROW'); }; subtest 'U+00E9 LATIN SMALL LETTER E WITH ACUTE' => sub { my $filename = 't/data/rt-96399-2.html'; my $parser = HTML::HTML5::Parser->new; my $doc = $parser->parse_file($filename); is($parser->charset($doc), 'utf-8', 'recognized encoding as utf-8'); like(decode_utf8($doc->toString()), qr/\x{00E9}/, 'encoding properly round-trips U+00E9 DOWNWARDS ARROW'); }; iso-8859-15.html000664001750001750 5114116150214 20210 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/examples/html

¤100

utf-16.html000664001750001750 11414116150214 17622 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/examples/htmlÿþ<meta charset="utf-16"> <p>¬ 100</p>ÿþ utf-8.html000664001750001750 4514116150214 17526 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/examples/html

€100

rt-96399-1.html000664001750001750 41514116150214 16462 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/t/data     title   

↓

 rt-96399-2.html000664001750001750 41414116150214 16462 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/t/data     title   

é

 domjs-unsafe.dat000664001750001750 1475714116150214 21173 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/t/html5lib-fail#data foo bar #errors #document | | | | | "foo bar" #data foo bar #errors #document | | | | | "foo bar" #data foo bar #errors #document | | | | | "foo bar" #data #errors #document | | | #errors #document | | | #errors #document | | | #errors #document | | | #errors #document | | | #errors #document | | | #errors #document | | | #errors #document | | | #errors #document | | | #errors #document | | | <p>

#errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 28 Unexpected end tag (div). Ignored. #document | | | --> EOF #errors #document | | | | #errors Line: 1 Col: 7 Unexpected start tag (style). Expected DOCTYPE. Line: 1 Col: 51 Unexpected end of file. Expected end tag (style). #document | | |

` element", the `Hello` bit did not strictly end up within the `` element (not even within the `` element!) yet it still gets returned. We'll call things such as this "outliers". There is a "force_within" option which tells parse_balanced_chunk to ignore outliers: $frag3 = $parser->parse_balanced_chunk($string, {force_within=>'td'}); say $frag3->toString; # Hello There is a boolean option "mark_outliers" which marks each outlier with an attribute (`data-perl-html-html5-parser-outlier`) to indicate its outlier status. Clearly, this is ignored when you use "force_within" because no outliers are returned. Some outliers may be XML::LibXML::Text elements; text nodes don't have attributes, so these will not be marked with an attribute. A last note is to mention what gets returned by this method. Normally it's an XML::LibXML::DocumentFragment object, but if you call the method in list context, a list of the individual node elements is returned. Alternatively you can request the data to be returned as an XML::LibXML::NodeList object: # Get an XML::LibXML::NodeList my $list = $parser->parse_balanced_chunk($str, {as=>'list'}); The exact implementation of this method may change from version to version, but the long-term goal will be to approach how common desktop browsers parse HTML fragments when implementing the setter for DOM's `innerHTML` attribute. The push parser and SAX-based parser are not supported. Trying to change an option (such as recover_silently) will make HTML::HTML5::Parser carp a warning. (But you can inspect the options.) Error Handling Error handling is obviously different to XML::LibXML, as errors are (bugs notwithstanding) non-fatal. `error_handler` Get/set an error handling function. Must be set to a coderef or undef. The error handling function will be called with a single parameter, a HTML::HTML5::Parser::Error object. `errors` Returns a list of errors that occurred during the last parse. See HTML::HTML5::Parser::Error. Additional Methods The module provides a few methods to obtain additional, non-DOM data from DOM nodes. `dtd_public_id` $pubid = $parser->dtd_public_id( $doc ); For an XML::LibXML::Document which has been returned by HTML::HTML5::Parser, using this method will tell you the Public Identifier of the DTD used (if any). `dtd_system_id` $sysid = $parser->dtd_system_id( $doc ); For an XML::LibXML::Document which has been returned by HTML::HTML5::Parser, using this method will tell you the System Identifier of the DTD used (if any). `dtd_element` $element = $parser->dtd_element( $doc ); For an XML::LibXML::Document which has been returned by HTML::HTML5::Parser, using this method will tell you the root element declared in the DTD used (if any). That is, if the document has this doctype: ... it will return "html". This may return the empty string if a DTD was present but did not contain a root element; or undef if no DTD was present. `compat_mode` $mode = $parser->compat_mode( $doc ); Returns 'quirks', 'limited quirks' or undef (standards mode). `charset` $charset = $parser->charset( $doc ); The character set apparently used by the document. `source_line` ($line, $col) = $parser->source_line( $node ); $line = $parser->source_line( $node ); In scalar context, `source_line` returns the line number of the source code that started a particular node (element, attribute or comment). In list context, returns a tuple: $line, $column, $implicitness. Tab characters count as one column, not eight. $implicitness indicates that the node was not explicitly marked up in the source code, but its existence was inferred by the parser. For example, in the following markup, the HTML, TITLE and P elements are explicit, but the HEAD and BODY elements are implicit. I have an implicit head

And an implicit body too!

(Note that implicit elements do still have a line number and column number.) The implictness indicator is a new feature, and I'd appreciate any bug reports where it gets things wrong. XML::LibXML::Node has a `line_number` method. In general this will always return 0 and HTML::HTML5::Parser has no way of influencing it. However, if you install XML::LibXML::Devel::SetLineNumber on your system, the `line_number` method will start working (at least for elements). SEE ALSO . HTML::HTML5::Writer, HTML::HTML5::Builder, XML::LibXML, XML::LibXML::PrettyPrint, XML::LibXML::Devel::SetLineNumber. AUTHOR Toby Inkster, COPYRIGHT AND LICENCE Copyright (C) 2007-2011 by Wakaba Copyright (C) 2009-2012 by Toby Inkster This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. DISCLAIMER OF WARRANTIES THIS PACKAGE IS PROVIDED "AS IS" AND WITHOUT ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES OF MERCHANTIBILITY AND FITNESS FOR A PARTICULAR PURPOSE. SIGNATURE000664001750001750 2272314116150214 14466 0ustar00taitai000000000000HTML-HTML5-Parser-0.992This file contains message digests of all files listed in MANIFEST, signed via the Module::Signature module, version 0.87. To verify the content in this distribution, first make sure you have Module::Signature installed, then type: % cpansign -v It will check each file's integrity, as well as the signature's validity. If "==> Signature verified OK! <==" is not displayed, the distribution may already have been compromised, and you should not run its Makefile.PL or Build.PL. -----BEGIN PGP SIGNED MESSAGE----- Hash: RIPEMD160 SHA256 80af4d30a407df8c1b544863187d2fd565fdf5a2dc914b2b9433f9c56a4ab05a COPYRIGHT SHA256 3b6ef79732e47218fca0b10b768890123dc7f01c8b21064eb5f6789aff9ae992 CREDITS SHA256 7116e911acb2a36f90fc8e25737f01e9c545d6c0349f6b1820dd716acdb9f390 Changes SHA256 34b85d2053bae078dd9285413d6953e37407003758b74355da9aaca89272137a INSTALL SHA256 34427749499bd6d6e5d206089bc03d76c647a1350f8b5f95ea628d2b6ff56e4c LICENSE SHA256 3053f9f7acf813ba5943759a4009a2f184e2a6cef05586d108844fed7634b9b6 MANIFEST SHA256 a9822ad457955bebd2308d756f98b99f351ce45f186bbd72e68bf0faac34ac60 META.json SHA256 49fc99da56e9d9e419dab79b3497433e5a4529263a8b2e336fea4149792ca834 META.yml SHA256 ad9f0d2bc41679a5a83b2245024c18b9b85f8e54a551825bc4e3168af9a8dc8e Makefile.PL SHA256 307c4c6de88d67005f7bb8e0078f5419d8a04e87f20ef26b3153c0055d6fa870 NEWS SHA256 116313570b9960ab735336b7747710ee57e9016361767ddd230e27587c648fd4 README SHA256 cd64515147afe0a28ade3140e117e453181960d58e426bc2c3df54d8de4c061b TODO SHA256 fdc7c9e831d86bf0576bbf3ffe042fbbe87f91522431d3950e4b97c986d61109 bin/html2xhtml SHA256 bedb8185a3fb37da94857892cd11ec18f157cebfac5e64c23b327f9b5e5f387a bin/html5debug SHA256 dbf9616300f3fa037c110dd56d7b5297638203cead74b940eac2ea54726035db dist.ini SHA256 215a0989f6cabe231a38cdafadd7307e1307d9065d71261e9568fff7a65183cf doap.ttl SHA256 a4f2e094af2cfade4ee4b1202204bb7427187831b8365507ec245bb8e21c63d0 examples/charsets.pl SHA256 d9707832acd85f76ac9e53e3959b910e9282d2ffd73cfad40c91ffba72a4a3c4 examples/html/iso-8859-15.html SHA256 111cf443366407af9e4309bb12d87d6a46e4bd14e4403a92aceacc3878b91c5e examples/html/utf-16.html SHA256 6f5546b5986a87730656818414455fe7c34b8d30757db781f75760026a473004 examples/html/utf-8.html SHA256 7694c197b3f1c0a385d8d60e193491137385ad3d2207facbd0f8a9dd1f95aa2e lib/HTML/HTML5/Parser.pm SHA256 b67b32a6a613d4e94d5adc19e3430d44963436a91bb91d02df310cafae3fc4e3 lib/HTML/HTML5/Parser/Charset/DecodeHandle.pm SHA256 11e32067f25a239760cd6925adca1f9551a525cd1f94121276cdec560a3c2448 lib/HTML/HTML5/Parser/Charset/Info.pm SHA256 7f35bffdd6958d9289c7ba45bec34612487d8c2a1dfa5b4a0a6050b5123c4bb3 lib/HTML/HTML5/Parser/Charset/UnicodeChecker.pm SHA256 8d0572a9316fe771c4a84248f823803a6e51d23b5e3564f0638c183c78f92e17 lib/HTML/HTML5/Parser/Charset/UniversalCharDet.pm SHA256 8fc35ac4ccf2ae9e0ceb9e1ddbc40f0cf2888ce0145ec9e075cc6d33c991b676 lib/HTML/HTML5/Parser/Charset/WebLatin1.pm SHA256 3bfa2adaa514f487a9db75683a041b7320361457196815c71418074d328e375e lib/HTML/HTML5/Parser/Charset/WebThai.pm SHA256 844dee50fdeb91fee0fca870faeefff9c8e6201b67fc92373ef40ae8fab8b181 lib/HTML/HTML5/Parser/Error.pm SHA256 05a452ae6cb3701ce09a2faf73acf69884f00c921bac0876820a9a1eb46775f3 lib/HTML/HTML5/Parser/TagSoupParser.pm SHA256 3da35ad19ab815faad35af1a0031ec65a761a2b74c552746e34f36ef9b6b13a7 lib/HTML/HTML5/Parser/Tokenizer.pm SHA256 7d9389795dfbff19b146574ad5d86dff67c5c3f6b1fd079ecf16fb446227e6bd lib/HTML/HTML5/Parser/UA.pm SHA256 9eb6e92d24fb8b5044d148f45887b56a51228e39a8ee1cb1d04b645baaee4729 t/01basic.t SHA256 746f404fa2b2835a5d7eb69f7e29bf49c978bef429cdc087d11c8d1c1e1f4e6f t/02html5.t SHA256 9ffd35284d0df2297b105310de749bdbf529865acbe5077865d76df0be9b9a96 t/03html4.t SHA256 d1f233e18127993f5be7220e85499bc092476719c4fa0fd283ffe4fb28ddf2a9 t/04fragments.t SHA256 410a01a53e73287eab7f1c96f9b35afff58f5569441a13e3cf690ac61de22e40 t/05origins.t SHA256 c94be2af03805c5b9dfae97848b144af17f34066196902e15c57f0133d107e6e t/06xlxdsln.t SHA256 c20277e73516a56328e07d524ea1d7274242d9dade2ec2a40fd96dcf0beea5e7 t/07ua.t SHA256 36e0587099b2387c70b224377affc5c71b615f125221f5701957ea56584e006e t/08ua-lwp.t SHA256 fa138fff099a5bf2a615a08dac74484ccea31626f9c8bf07ea864ef7b486b35d t/99html5lib.t SHA256 3192a96e9429635574096990c2bb7aa52473845dbd2c60e54e3fc37d916037b0 t/data/rt-96399-1.html SHA256 63add0e06b545f5b48d5e45db4c8cf5d76fda5401aa60393b301d413ec546641 t/data/rt-96399-2.html SHA256 d4523af0eed25aec52208b19873dd75676b2d91c17a2a939744e38b2d68deac0 t/html5lib-fail/domjs-unsafe.dat SHA256 24f3084af8cb63d67fd6521ef2599a010a2ce03f7ba110ab5e762b31ad07693f t/html5lib-fail/entities02.dat SHA256 170431e1bb402259dfb6abd987b2cf34d6f5c3c26a1b882f046bd1d389ad6a8f t/html5lib-fail/plain-text-unsafe.dat SHA256 b0bf176f1ef02d2e15ee96ea1fd5798d1c487004afd3955945c66743c526c73d t/html5lib-fail/tests1.dat SHA256 a808ff45b57e47f3f51c91f72e30c5765c04edd0b1581ec6d946a8b86136a92c t/html5lib-fail/tests11.dat SHA256 3d84604685cf32f1219630771f3dda94b75de036ada6a03737ed86c94b9de87e t/html5lib-fail/tests14.dat SHA256 f2ea7bab84f06e39ced17738942f277fc82a5e232dddbcad3e0cd8e4f50f05a9 t/html5lib-fail/tests16.dat SHA256 3ee6ea3e6db7bdcc3a37cf565d08f09c5d86102f2999fc4eb037bbcf12925451 t/html5lib-fail/tests21.dat SHA256 8a6fa8041caa6b93538424a89377763957b12132cac38bbfcb5d404ddbc91a5f t/html5lib-fail/tests22.dat SHA256 0a0d5356e6864a0fd5830ac4d47c476b73406de19eca53bea51d294789d14440 t/html5lib-fail/tests23.dat SHA256 4b8c3f495bf06895b4ddb8ed87798ac2c69d9c60785c30534b3b14822407abaf t/html5lib-fail/tests5.dat SHA256 5b6af9446d749358448b0523d5b9fe5aa3d53ca2910eaad7991588e9a707e4a4 t/html5lib-fail/tests6.dat SHA256 b4acdf2833107af127fcc47d396af47f147f0cc6c445c33a36ae4c2df0cf521c t/html5lib-fail/tests9.dat SHA256 0a69ca30b5207607b37f56424184b0345faf5973bd86e24e2f5164e3e983edd8 t/html5lib-fail/tests_innerHTML_1.dat SHA256 877c1ddc166541e7e9197ed5c063aa5d2401d28b678d4b73673b0f42578c1e3d t/html5lib-pass/adoption01.dat SHA256 edb6d2f6d34abc66af649b7b707b5843e4a27bad90525d3b9aa03225c510f346 t/html5lib-pass/adoption02.dat SHA256 5f065fd632e6796a497b196fea2952f4d17c3271619ee6eb3f3c0e626ec257be t/html5lib-pass/comments01.dat SHA256 887f47b1523e7f8cf2183ab1e79708b9f45b02bf6c8d3757c87d7c1adb966fa0 t/html5lib-pass/doctype01.dat SHA256 2091575802187aa5c5c2c38e7dd8923edfa227008b8dcc10feb2ef1104d3597a t/html5lib-pass/entities01.dat SHA256 33c77b0ac689da0a5d54f636211e16e946af0843c4182bd4390feed8e4ca8bdc t/html5lib-pass/html5test-com.dat SHA256 450fef994be02adf3984869aa1b16c9a1374252ec725b2ffdeb5af935c56cc7f t/html5lib-pass/inbody01.dat SHA256 1aef6f007462ab849fde00007d96a35e72bb63537fbf2c3cb6aeb1b36d5a9434 t/html5lib-pass/isindex.dat SHA256 335c624782b92662ef2e2a36cf32e08a063e66fab075f7a84c7d64d369556c75 t/html5lib-pass/pending-spec-changes-plain-text-unsafe.dat SHA256 6087fefa83e8845522137287836a2c8a63b45d601fe1b4e142716d1fe0affafc t/html5lib-pass/pending-spec-changes.dat SHA256 e15bfd60984254475748b219213e827a0dcf56f06b375156a00d3b3ac2361150 t/html5lib-pass/scriptdata01.dat SHA256 384062f5eea31dfcea144ce4ba6b51e667ea7482b2b1807ea39a7c8af5952902 t/html5lib-pass/scripted/adoption01.dat SHA256 28f20ec06ed4cefad77f5122def5ab5d67a77dbe3488da2abe68bb64c6ce8c6b t/html5lib-pass/scripted/ark.dat SHA256 a792b5eb910ccfd7e69271e96afc7482a6c0ab4f59ce26f60364c562184c9b68 t/html5lib-pass/scripted/webkit01.dat SHA256 ba8b2c014c2e9fb1eb34b6521b6c397582b4042f06a4de06e853eeebaafb56da t/html5lib-pass/tables01.dat SHA256 1bfd6777a042730d7f3387a1d989d4c9b48cda61a32ae735aa42f3b43a144ac4 t/html5lib-pass/tests10.dat SHA256 e6c506cea74979a0d6ca47f6b175c7b6177d6e57db708920a89694e31dfd8a42 t/html5lib-pass/tests12.dat SHA256 afff31191e399d0ed34c6c937091b6a1656734f94c84a155279c1d5cc2c1de99 t/html5lib-pass/tests15.dat SHA256 cc195bd6219d5eeab089a70680a25af9dd84dccc0fb7553faf1656a6345541fc t/html5lib-pass/tests17.dat SHA256 6f0c8321aee5d6e39fef58cccf56824b48f1d1b29dc53233adac3508eb9fee02 t/html5lib-pass/tests18.dat SHA256 7feaca5bdca560b2799bd250da8c00348db689ed121d6792645d900001186818 t/html5lib-pass/tests19.dat SHA256 5529aa4182185407ed74d729bf6b0439f4e9e5bb2fb8ea1aeeda7226e9aa9487 t/html5lib-pass/tests2.dat SHA256 c14228cae841cdd24e349fdf9a5195d1633fd4d4c81cf3f8c06f8a7bd21d79a9 t/html5lib-pass/tests20.dat SHA256 fdd5c21f60f42235ded224a03e7182d289088328f460525adaf3c14e772a24ac t/html5lib-pass/tests24.dat SHA256 a218bb0cc22ee4b9e04b28bf4e6657779d6c3a40e484879ad1628c07aba1e5f9 t/html5lib-pass/tests25.dat SHA256 198c495a95094218ddf66d43345d77cb74e31de17253e3276acd7a7fbaac9499 t/html5lib-pass/tests26.dat SHA256 fc272622b260e880730b4300813979880b763e257f7ca36dfbe21c0ee52d5e1c t/html5lib-pass/tests3.dat SHA256 b24985dc5cd48468221e365dbf18f1dc9b1743c58086aaf1c47191ccb2faff3f t/html5lib-pass/tests4.dat SHA256 4de7c4ada88a5dd5f0c1d13b81bbe4213569c5911a6a4272347f306638e751ba t/html5lib-pass/tests7.dat SHA256 fbf40bddf189d97abea886cf525b99b64db1683bfe64b621c2cf9a570af66b67 t/html5lib-pass/tests8.dat SHA256 8b8e99d4c3c56bd9b9cd9231aae3e5385ec3df326e149c330b3456ab6e067dcd t/html5lib-pass/tricky01.dat SHA256 e65ac7465c7d251d675acf749fe22a77cd7116a55e523064cbd3b9d1026a3b60 t/html5lib-pass/webkit01.dat SHA256 eb4745d675b109bbd84abec99286b471d751a6cc7be708254149dda61322f0fa t/html5lib-pass/webkit02.dat SHA256 d0ae4c1da1aad56560d1d6943dacd8a7e1872883cfc54bc9dc725b2c3072cf4b t/lib/Test/HTTP/Server.pm SHA256 002b1dbe81a415b07ee62688ed663337f86051c9b1329480be354264a35bf1e3 t/rt-79019.t SHA256 fb8fde31b5ed21034173c37c7204ca175c6e7eebed5fb2bc27efef3df6f84ad4 t/rt-96399.t -----BEGIN PGP SIGNATURE----- iF0EAREDAB0WIQRVJKj/4+s6z4WzNujOv4Eoaip9OQUCYTjQjAAKCRDOv4Eoaip9 OUthAJ9oOU/O8Xjw/oz6LQfaTEBLdegeQgCdGfRfSX6dyv1ADQ4fKFlG5TlagEo= =rdTY -----END PGP SIGNATURE----- TODO000664001750001750 16314116150214 13624 0ustar00taitai000000000000HTML-HTML5-Parser-0.992* It would be nice to pass some more of the html5lib tests. (The entities tests in particular should be doable.) dist.ini000664001750001750 10214116150214 14571 0ustar00taitai000000000000HTML-HTML5-Parser-0.992;;class='Dist::Inkt::Profile::TOBYINK' ;;name='HTML-HTML5-Parser' doap.ttl000664001750001750 6241714116150214 14656 0ustar00taitai000000000000HTML-HTML5-Parser-0.992@prefix dc: . @prefix doap: . @prefix doap-bugs: . @prefix doap-changeset: . @prefix doap-deps: . @prefix foaf: . @prefix owl: . @prefix rdfs: . @prefix xsd: . a foaf:Person; foaf:name "Wakaba"; foaf:page . a doap:Project; doap-bugs:issue , , , ; doap-deps:develop-recommendation [ doap-deps:on "Dist::Inkt 0.001"^^doap-deps:CpanId ]; doap-deps:runtime-requirement [ doap-deps:on "perl 5.008001"^^doap-deps:CpanId ], [ doap-deps:on "Try::Tiny"^^doap-deps:CpanId ], [ doap-deps:on "IO::HTML"^^doap-deps:CpanId ], [ doap-deps:on "HTML::HTML5::Entities 0.002"^^doap-deps:CpanId; ], [ doap-deps:on "HTTP::Tiny"^^doap-deps:CpanId ], [ doap-deps:on "XML::LibXML 1.94"^^doap-deps:CpanId ], [ doap-deps:on "XML::LibXML::Devel"^^doap-deps:CpanId ], [ doap-deps:on "Scalar::Util"^^doap-deps:CpanId ], [ doap-deps:on "URI::file"^^doap-deps:CpanId ]; doap-deps:test-recommendation [ doap-deps:on "LWP::UserAgent"^^doap-deps:CpanId ]; doap-deps:test-requirement [ doap-deps:on "Test::Requires"^^doap-deps:CpanId ], [ doap-deps:on "FindBin"^^doap-deps:CpanId ], [ doap-deps:on "Test::More 0.61"^^doap-deps:CpanId ]; doap:bug-database ; doap:category [ rdfs:label "Web"@en ], [ rdfs:label "HTML"@en ], [ rdfs:label "HTML5"@en ]; doap:created "2009-11-26"^^xsd:date; doap:developer , ; doap:documenter ; doap:download-page ; doap:homepage ; doap:license ; doap:maintainer ; doap:name "HTML-HTML5-Parser"; doap:programming-language "Perl"; doap:release , , , , , , , , , , , , , , , , , , , , , , , , ; doap:repository [ a doap:GitRepository; doap:browse ; ]; doap:shortdesc "parse HTML reliably"@en; doap:tester . a doap:Version; rdfs:label "Developer preview"@en; dc:issued "2009-12-01"^^xsd:date; doap:file-release ; doap:revision "0.00_01"^^xsd:string. a doap:Version; rdfs:label "Original version"@en; dc:issued "2009-12-03"^^xsd:date; doap:file-release ; doap:revision "0.01"^^xsd:string. a doap:Version; dc:issued "2009-12-16"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ rdfs:label "Replace Inline::Python encoding detection with weaker, but native Perl HTML::Encoding package."@en; ], [ a doap-changeset:Addition, doap-changeset:Packaging; rdfs:label "Bundle the html2xhtml tool."@en; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.02"^^xsd:string. a doap:Version; dc:issued "2010-01-15"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Bugfix; rdfs:label "Module didn't use URI::file properly."@en; doap-changeset:blame [ foaf:nick "shellac" ]; ], [ a doap-changeset:Packaging, doap-changeset:Update; rdfs:label "Upgrade distribution to my new packaging regime (auto-generated changelogs, etc)"@en; ], [ a doap-changeset:Documentation, doap-changeset:Update; rdfs:label "Copyright 2010."@en; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.03"^^xsd:string. a doap:Version; dc:issued "2010-04-21"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Update; rdfs:label "Catch up to revision cf2c0df8a6dfb50fee923dfb21b14c83f282ccdc (2010-02-28) upstream."@en; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.04"^^xsd:string. a doap:Version; dc:issued "2010-06-23"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Bugfix; rdfs:label "Minor bugfixes."@en; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.100"^^xsd:string. a doap:Version; dc:issued "2010-06-30"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Bugfix; rdfs:label "UTF-8 fix."@en; doap-changeset:fixes [ rdfs:label "Wide characters in DOM tree."@en; doap-bugs:reporter ; ]; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.101"^^xsd:string. a doap:Version; dc:issued "2011-01-19"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Bugfix; rdfs:label "Fix source_line method."@en; ], [ a doap-changeset:Update; rdfs:label "Catch up to revision f2c921a886ab0b3dfb8d21b82525e98a4a921ad4 (2010-10-11) upstream."@en; ], [ a doap-changeset:Addition; rdfs:label "Allow element to appear in if document has an HTML4 doctype. This is a willful violation of the HTML5 parsing algorithm. (The may have elements as children, as well as any children that would normally be allowed in the of the document, such as ; any other content is treated as the beginning of the , and thus closes and . That's slightly looser than the HTML 4 spec which says only should be used, but stricter than the HTML 4 DTD which allows pretty much anything in there!)"@en; ], [ a doap-changeset:Addition; rdfs:label "Support
element."@en; ], [ a doap-changeset:Addition; rdfs:label "Support element."@en; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.102"^^xsd:string. a doap:Version; dc:issued "2011-02-09"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Documentation, doap-changeset:Update; rdfs:label "Copyright 2011."@en; ], [ a doap-changeset:Bugfix; rdfs:label "TagSoupParser.pm called a method that is renamed between this distribution and upstream using its upstream name."@en; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.103"^^xsd:string. a doap:Version; dc:issued "2011-09-22"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Update; rdfs:label "Catch up to revision d81fcb920a1a3c351149cd66a64bf1b8ae14a172 (2011-08-21) upstream."@en; ], [ a doap-changeset:Addition; rdfs:label "Support element."@en; ], [ a doap-changeset:Addition; rdfs:label "Some error handling stuff."@en; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.104"^^xsd:string. a doap:Version; dc:issued "2011-10-07"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ rdfs:label "HTML::HTML5::Parser::Error overloads stringification."@en; ], [ a doap-changeset:Packaging; rdfs:label "Module::Package::RDF."@en; ], [ a doap-changeset:Addition; rdfs:label "Bundle 'html5debug' script."@en; ], [ rdfs:label "use HTML::HTML5::Entities"@en ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.105"^^xsd:string. a doap:Version; dc:issued "2011-10-10"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Bugfix; rdfs:label "Tokenizer.pm was still trying to require NamedEntityList.pm."@en; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.106"^^xsd:string. a doap:Version; dc:issued "2011-10-20"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Bugfix; rdfs:label "parse_file wasn't accepting relative file names"@en; ], [ a doap-changeset:Addition; rdfs:label "html2xhtml now reads from STDIN by default."@en; ], [ a doap-changeset:Addition; rdfs:label "html2xhtml can output to a file."@en; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.107"^^xsd:string. a doap:Version; dc:issued "2012-01-18"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Bugfix; rdfs:label "Crashed when generating an error message about some missing closing tags."@en; doap-changeset:blame ; ], [ a doap-changeset:Documentation, doap-changeset:Update; rdfs:label "Copyright 2012."@en; ], [ a doap-changeset:Update; rdfs:label "Provide load_xml and load_html methods for compatibility with XML::LibXML 1.70 and newer."@en; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.108"^^xsd:string. a doap:Version; rdfs:label "Yes, they are come, but they are not past."@en; dc:issued "2012-03-15"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Bugfix; rdfs:label "Use correct case for MathML's definitionURL attribute."@en; ], [ a doap-changeset:Bugfix; rdfs:label "Fix several fatal errors on infrequently used code paths in TagSoupParser.pm."@en; ], [ a doap-changeset:Packaging; rdfs:label "Bundle test cases from html5lib."@en; ], [ a doap-changeset:Addition; rdfs:label "Provide parse_balanced_chunk to parse HTML fragments."@en; ], [ a doap-changeset:Addition; rdfs:label "Provide dtd_element method to get more information about the DTD."@en; ], [ rdfs:label "Make source_line method work more reliably. This requires XML::LibXML::Devel, and thus a fairly recent version of XML-LibXML."@en; ], [ a doap-changeset:Addition; rdfs:label "Nodes now have an implictness flag (returned by source_line called in a list context)."@en; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.109"^^xsd:string. a doap:Version; dc:issued "2012-03-20"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Bugfix; rdfs:label "Removed a use of smart match which was breaking Perl 5.8.x compatibility."@en; doap-changeset:fixes ; ], [ rdfs:label "use XML::LibXML::Devel::SetLineNumber if it is available."; ], [ rdfs:label "Text nodes should keep line numbers and column numbers too."; ], [ rdfs:label "Comment nodes should now keep their line numbers and column numbers."; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.110"^^xsd:string. a doap:Version; dc:issued "2012-06-13"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ rdfs:label "Suppress warnings about invalid Unicode code points."; ], [ rdfs:label "Drop dependency on Error.pm; use Try::Tiny instead."; ], [ rdfs:label "Drop dependency on HTML::Encoding; use IO::HTML instead."; ], [ rdfs:label "Passing a couple more of the html5lib test suite files."; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.200"^^xsd:string. a doap:Version; dc:issued "2012-06-27"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ rdfs:label "Drop dependency on LWP::UserAgent in favour of HTTP::Tiny which is part of core since 5.14."; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.202"^^xsd:string. a doap:Version; dc:issued "2012-06-29"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Bugfix; rdfs:label "Stop using defined-or operator in HTML::HTML5::Parser::UA, as it doesn't work in Perl 5.8."; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.204"^^xsd:string. a doap:Version; dc:issued "2012-06-30"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Bugfix; rdfs:label "Remove 'use 5.010' from t/07ua.t."; ], [ a doap-changeset:Bugfix; rdfs:label "Disable t/07ua.t and t/08ua-lwp.t on Windows because Test::HTTP::Server doesn't work on that platform."; rdfs:seeAlso ; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.206"^^xsd:string. a doap:Version; dc:issued "2012-12-06"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Bugfix; rdfs:label "Minor fix re LWP-independence."; ], [ a doap-changeset:Bugfix; rdfs:label "If two tags were in the same file, attributes on the second element could cause crashes."@en; doap-changeset:fixes ; doap-changeset:thanks [ foaf:name "Luben Karavelov" ]; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.208"^^xsd:string. a doap:Version; dc:issued "2013-07-06"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Bugfix; rdfs:label "Fix many major memory leaks."; doap-changeset:blame [ foaf:name "Dorian Taylor"; foaf:nick "DORIAN"; ]; doap-changeset:fixes ; ], [ a doap-changeset:Change; rdfs:label "Memory leak fix necessitated some API changes; in particular some methods which were available as class methods are now object methods only."@en; ], [ a doap-changeset:Packaging; rdfs:label "t/99html5lib.t now uses Moo instead of Mo."@en; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.300"^^xsd:string. a doap:Version; dc:issued "2013-07-08"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Bugfix; rdfs:label "Fix t/99html5lib.t still using Mo in one place."@en; doap-changeset:fixes ; doap-changeset:thanks [ foaf:mbox ; foaf:name "Fitz Elliott"; ]; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.301"^^xsd:string. a doap:Version; dc:issued "2021-09-08"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Bugfix; rdfs:label "Don't install a global __WARN__ handler in BEGIN."@en; doap-changeset:blame [ foaf:name "Alex Vandiver" ]; ], [ a doap-changeset:Packaging; rdfs:label "Modernize packaging in general."@en; ], [ a doap-changeset:Bugfix; rdfs:label "Don't assume @INC contains ."@en; ], [ a doap-changeset:Bugfix; rdfs:label "Better regexp for differentiating between filenames and URIs."@en; ], [ a doap-changeset:Bugfix; rdfs:label "Fix encoding issues."@en; doap-changeset:blame [ foaf:name "Gregory Williams" ], [ foaf:name "Kjetil Kjernsmo" ]; doap-changeset:fixes ; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.991"^^xsd:string. a doap:Version; dc:issued "2021-09-08"^^xsd:date; doap-changeset:changeset [ doap-changeset:item [ a doap-changeset:Packaging; rdfs:label "Point repo at Github."@en; ], [ a doap-changeset:Packaging; rdfs:label "Point issue tracker at Github."@en; ]; doap-changeset:versus ; ]; doap:file-release ; doap:revision "0.992"^^xsd:string. foaf:nick "audreyt". a doap-bugs:Issue; rdfs:label "Warning produced when run under perl -w"; dc:created "2010-03-23T16:50:39"^^xsd:dateTime; dc:modified "2010-04-23T13:00:02"^^xsd:dateTime; doap-bugs:assignee _:B1; doap-bugs:id "55845"^^xsd:string; doap-bugs:page ; doap-bugs:reporter _:B2; doap-bugs:status . a doap-bugs:Issue; rdfs:label "Please don't require Module::Signature"; dc:created "2011-02-07T07:25:30"^^xsd:dateTime; dc:modified "2012-01-18T13:14:13"^^xsd:dateTime; doap-bugs:assignee _:B1; doap-bugs:id "65517"^^xsd:string; doap-bugs:page ; doap-bugs:reporter _:B2; doap-bugs:status . a doap-bugs:Issue; rdfs:label "LibXML version check uses smart match"; dc:created "2012-03-16T10:30:03"^^xsd:dateTime; dc:modified "2012-03-19T09:12:28"^^xsd:dateTime; doap-bugs:assignee _:B1; doap-bugs:id "75809"^^xsd:string; doap-bugs:page ; doap-bugs:reporter [ a foaf:Agent; foaf:mbox ; ]; doap-bugs:status . a doap-bugs:Issue; rdfs:label "Failure mode of TagSoupParser"; dc:created "2012-08-16T15:47:33"^^xsd:dateTime; dc:modified "2012-08-18T15:56:56"^^xsd:dateTime; doap-bugs:assignee _:B1; doap-bugs:id "79019"^^xsd:string; doap-bugs:page ; doap-bugs:reporter [ a foaf:Agent; foaf:mbox ; ]; doap-bugs:status . a foaf:Person; foaf:homepage ; foaf:mbox ; foaf:name "Toby Inkster"; foaf:page ; owl:sameAs . _:B2 a foaf:Agent; foaf:mbox . _:B1 a foaf:Agent; foaf:nick "TOBYINK". html2xhtml000664001750001750 64114116150214 15733 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/bin#!/usr/bin/perl ## skip Test::Tabs use 5.010; use HTML::HTML5::Parser; my $input = shift // '-'; my $output = shift // '-'; my $parser = HTML::HTML5::Parser->new; my $dom = ($input eq '-') ? $parser->parse_string(do { local $/ = }) : $parser->parse_file($input); if ($output eq '-') { print $dom->toString; } else { open my($fh), '>:encoding(UTF-8)', $output; print $fh $dom->toString; close $fh; }html5debug000664001750001750 307014116150214 15707 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/bin#!/usr/bin/perl ## skip Test::Tabs use Getopt::Long; use HTML::HTML5::Parser; my $output = $ENV{HTML_OUTPUT} || 'debug:json'; my $help; GetOptions( 'output|o=s' => \$output, 'help|usage|h' => \$help, ); if ($help) { my $name = $0; print <new; my $h = join '', <>; my $hash; if ($output =~ /debug/i) { load('XML::LibXML::Debugging'); $hash = $p->parse_string($h)->toDebuggingHash; } elsif ($output =~ /clark/i) { load('XML::LibXML::Debugging'); print $p->parse_string($h)->toClarkML; } elsif ($output =~ /html/i) { load('HTML::HTML5::Writer'); print HTML::HTML5::Writer->new->document($p->parse_string($h)); } elsif ($output =~ /parser/i) { $p->parse_string($h); $hash = $p; } elsif ($output =~ /err/i) { $p->parse_string($h); print "$_\n" foreach $p->errors; } else { print $p->parse_string($h)->toString; } if (defined $hash and $output =~ /json/) { load('JSON'); print to_json($hash, {pretty=>1,canonical=>1}); } elsif (defined $hash) { load('Data::Dumper'); print Dumper($hash); } charsets.pl000664001750001750 121114116150214 17136 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/examplesuse utf8::all; use HTML::HTML5::Parser; my $U = HTML::HTML5::Parser->load_html(location => 'examples/html/utf-8.html'); my $X = HTML::HTML5::Parser->load_html(location => 'examples/html/utf-16.html'); my $W = HTML::HTML5::Parser->load_html(location => 'examples/html/iso-8859-15.html'); print "UTF-8... ", $U->getElementsByTagName('p')->[0]->textContent, "\t", HTML::HTML5::Parser->charset($U), "\n"; print "UTF-16.. ", $X->getElementsByTagName('p')->[0]->textContent, "\t", HTML::HTML5::Parser->charset($X), "\n"; print "Western... ", $W->getElementsByTagName('p')->[0]->textContent, "\t", HTML::HTML5::Parser->charset($W), "\n"; 01basic.t000664001750001750 133714116150214 15032 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/t## skip Test::Tabs use Test::More tests => 3; BEGIN { use_ok('HTML::HTML5::Parser') }; my $parser = new_ok 'HTML::HTML5::Parser'; can_ok $parser, qw/ parse_file parse_html_file parse_fh parse_html_fh parse_string parse_html_string parse_balanced_chunk load_xml load_html error_handler errors compat_mode dtd_public_id dtd_system_id dtd_element source_line /; =head1 PURPOSE Test that L can be loaded and instantiated, and that the object has the expected methods. =head1 AUTHOR Toby Inkster, Etobyink@cpan.orgE =head1 COPYRIGHT AND LICENCE Copyright (C) 2010-2012 by Toby Inkster This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. 02html5.t000664001750001750 200314116150214 14772 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/t## skip Test::Tabs use Test::More tests => 3; use HTML::HTML5::Parser; my $parser = HTML::HTML5::Parser->new; my $html = < foo

Foo

  • Bart< The inequality is 2<3 .

Baz


HHH

CCC
HTML ok(my $dom = $parser->parse_string($html), "parse_string works"); is($parser->dtd_public_id($dom), "+//IDN demiblog.org//Foo Bar//EN", "dtd_public_id works"); my @italics = $dom->getElementsByTagName('i'); my $lone_letter = $italics[1]; is($lone_letter->textContent, 't<', "parsing seems to follow HTML5 rules"); =head1 PURPOSE Test basic functionality. =head1 AUTHOR Toby Inkster, Etobyink@cpan.orgE =head1 COPYRIGHT AND LICENCE Copyright (C) 2012 by Toby Inkster This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. 03html4.t000664001750001750 163214116150214 15001 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/t## skip Test::Tabs use Test::More tests => 2; use HTML::HTML5::Parser; my $parser = HTML::HTML5::Parser->new; my $html = <foo

foo

HTML my $dom_4 = $parser->parse_string(''.$html); my $dom_5 = $parser->parse_string(''.$html); my ($object_4) = $dom_4->getElementsByTagName('object'); my ($object_5) = $dom_5->getElementsByTagName('object'); is($object_4->parentNode->tagName, 'head', 'HTML 4 allows in .'); is($object_5->parentNode->tagName, 'body', 'HTML 5 disallows in .'); =head1 PURPOSE Test that HTML 4 allows C<< >> elements in C<< >>. =head1 AUTHOR Toby Inkster, Etobyink@cpan.orgE =head1 COPYRIGHT AND LICENCE Copyright (C) 2012 by Toby Inkster This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. 04fragments.t000664001750001750 214114116150214 15734 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/t## skip Test::Tabs use Test::More tests => 5; use HTML::HTML5::Parser; my $parser = HTML::HTML5::Parser->new; my $input = "Hello

#errors Line: 1 Col: 9 Unexpected end tag (strong). Expected DOCTYPE. Line: 1 Col: 9 Unexpected end tag (strong) after the (implied) root element. Line: 1 Col: 13 Unexpected end tag (b) after the (implied) root element. Line: 1 Col: 18 Unexpected end tag (em) after the (implied) root element. Line: 1 Col: 22 Unexpected end tag (i) after the (implied) root element. Line: 1 Col: 26 Unexpected end tag (u) after the (implied) root element. Line: 1 Col: 35 Unexpected end tag (strike) after the (implied) root element. Line: 1 Col: 39 Unexpected end tag (s) after the (implied) root element. Line: 1 Col: 47 Unexpected end tag (blink) after the (implied) root element. Line: 1 Col: 52 Unexpected end tag (tt) after the (implied) root element. Line: 1 Col: 58 Unexpected end tag (pre) after the (implied) root element. Line: 1 Col: 64 Unexpected end tag (big) after the (implied) root element. Line: 1 Col: 72 Unexpected end tag (small) after the (implied) root element. Line: 1 Col: 79 Unexpected end tag (font) after the (implied) root element. Line: 1 Col: 88 Unexpected end tag (select) after the (implied) root element. Line: 1 Col: 93 Unexpected end tag (h1) after the (implied) root element. Line: 1 Col: 98 Unexpected end tag (h2) after the (implied) root element. Line: 1 Col: 103 Unexpected end tag (h3) after the (implied) root element. Line: 1 Col: 108 Unexpected end tag (h4) after the (implied) root element. Line: 1 Col: 113 Unexpected end tag (h5) after the (implied) root element. Line: 1 Col: 118 Unexpected end tag (h6) after the (implied) root element. Line: 1 Col: 125 Unexpected end tag (body) after the (implied) root element. Line: 1 Col: 130 Unexpected end tag (br). Treated as br element. Line: 1 Col: 134 End tag (a) violates step 1, paragraph 1 of the adoption agency algorithm. Line: 1 Col: 140 This element (img) has no end tag. Line: 1 Col: 148 Unexpected end tag (title). Ignored. Line: 1 Col: 155 Unexpected end tag (span). Ignored. Line: 1 Col: 163 Unexpected end tag (style). Ignored. Line: 1 Col: 172 Unexpected end tag (script). Ignored. Line: 1 Col: 180 Unexpected end tag (table). Ignored. Line: 1 Col: 185 Unexpected end tag (th). Ignored. Line: 1 Col: 190 Unexpected end tag (td). Ignored. Line: 1 Col: 195 Unexpected end tag (tr). Ignored. Line: 1 Col: 203 This element (frame) has no end tag. Line: 1 Col: 210 This element (area) has no end tag. Line: 1 Col: 217 Unexpected end tag (link). Ignored. Line: 1 Col: 225 This element (param) has no end tag. Line: 1 Col: 230 This element (hr) has no end tag. Line: 1 Col: 238 This element (input) has no end tag. Line: 1 Col: 244 Unexpected end tag (col). Ignored. Line: 1 Col: 251 Unexpected end tag (base). Ignored. Line: 1 Col: 258 Unexpected end tag (meta). Ignored. Line: 1 Col: 269 This element (basefont) has no end tag. Line: 1 Col: 279 This element (bgsound) has no end tag. Line: 1 Col: 287 This element (embed) has no end tag. Line: 1 Col: 296 This element (spacer) has no end tag. Line: 1 Col: 300 Unexpected end tag (p). Ignored. Line: 1 Col: 305 End tag (dd) seen too early. Expected other end tag. Line: 1 Col: 310 End tag (dt) seen too early. Expected other end tag. Line: 1 Col: 320 Unexpected end tag (caption). Ignored. Line: 1 Col: 331 Unexpected end tag (colgroup). Ignored. Line: 1 Col: 339 Unexpected end tag (tbody). Ignored. Line: 1 Col: 347 Unexpected end tag (tfoot). Ignored. Line: 1 Col: 355 Unexpected end tag (thead). Ignored. Line: 1 Col: 365 End tag (address) seen too early. Expected other end tag. Line: 1 Col: 378 End tag (blockquote) seen too early. Expected other end tag. Line: 1 Col: 387 End tag (center) seen too early. Expected other end tag. Line: 1 Col: 393 Unexpected end tag (dir). Ignored. Line: 1 Col: 399 End tag (div) seen too early. Expected other end tag. Line: 1 Col: 404 End tag (dl) seen too early. Expected other end tag. Line: 1 Col: 415 End tag (fieldset) seen too early. Expected other end tag. Line: 1 Col: 425 End tag (listing) seen too early. Expected other end tag. Line: 1 Col: 432 End tag (menu) seen too early. Expected other end tag. Line: 1 Col: 437 End tag (ol) seen too early. Expected other end tag. Line: 1 Col: 442 End tag (ul) seen too early. Expected other end tag. Line: 1 Col: 447 End tag (li) seen too early. Expected other end tag. Line: 1 Col: 454 End tag (nobr) violates step 1, paragraph 1 of the adoption agency algorithm. Line: 1 Col: 460 This element (wbr) has no end tag. Line: 1 Col: 476 End tag (button) seen too early. Expected other end tag. Line: 1 Col: 486 End tag (marquee) seen too early. Expected other end tag. Line: 1 Col: 495 End tag (object) seen too early. Expected other end tag. Line: 1 Col: 513 Unexpected end tag (html). Ignored. Line: 1 Col: 513 Unexpected end tag (frameset). Ignored. Line: 1 Col: 520 Unexpected end tag (head). Ignored. Line: 1 Col: 529 Unexpected end tag (iframe). Ignored. Line: 1 Col: 537 This element (image) has no end tag. Line: 1 Col: 547 This element (isindex) has no end tag. Line: 1 Col: 557 Unexpected end tag (noembed). Ignored. Line: 1 Col: 568 Unexpected end tag (noframes). Ignored. Line: 1 Col: 579 Unexpected end tag (noscript). Ignored. Line: 1 Col: 590 Unexpected end tag (optgroup). Ignored. Line: 1 Col: 599 Unexpected end tag (option). Ignored. Line: 1 Col: 611 Unexpected end tag (plaintext). Ignored. Line: 1 Col: 622 Unexpected end tag (textarea). Ignored. #document | | | |
|

#data

#errors Line: 1 Col: 7 Unexpected start tag (table). Expected DOCTYPE. Line: 1 Col: 20 Unexpected end tag (strong) in table context caused voodoo mode. Line: 1 Col: 20 End tag (strong) violates step 1, paragraph 1 of the adoption agency algorithm. Line: 1 Col: 24 Unexpected end tag (b) in table context caused voodoo mode. Line: 1 Col: 24 End tag (b) violates step 1, paragraph 1 of the adoption agency algorithm. Line: 1 Col: 29 Unexpected end tag (em) in table context caused voodoo mode. Line: 1 Col: 29 End tag (em) violates step 1, paragraph 1 of the adoption agency algorithm. Line: 1 Col: 33 Unexpected end tag (i) in table context caused voodoo mode. Line: 1 Col: 33 End tag (i) violates step 1, paragraph 1 of the adoption agency algorithm. Line: 1 Col: 37 Unexpected end tag (u) in table context caused voodoo mode. Line: 1 Col: 37 End tag (u) violates step 1, paragraph 1 of the adoption agency algorithm. Line: 1 Col: 46 Unexpected end tag (strike) in table context caused voodoo mode. Line: 1 Col: 46 End tag (strike) violates step 1, paragraph 1 of the adoption agency algorithm. Line: 1 Col: 50 Unexpected end tag (s) in table context caused voodoo mode. Line: 1 Col: 50 End tag (s) violates step 1, paragraph 1 of the adoption agency algorithm. Line: 1 Col: 58 Unexpected end tag (blink) in table context caused voodoo mode. Line: 1 Col: 58 Unexpected end tag (blink). Ignored. Line: 1 Col: 63 Unexpected end tag (tt) in table context caused voodoo mode. Line: 1 Col: 63 End tag (tt) violates step 1, paragraph 1 of the adoption agency algorithm. Line: 1 Col: 69 Unexpected end tag (pre) in table context caused voodoo mode. Line: 1 Col: 69 End tag (pre) seen too early. Expected other end tag. Line: 1 Col: 75 Unexpected end tag (big) in table context caused voodoo mode. Line: 1 Col: 75 End tag (big) violates step 1, paragraph 1 of the adoption agency algorithm. Line: 1 Col: 83 Unexpected end tag (small) in table context caused voodoo mode. Line: 1 Col: 83 End tag (small) violates step 1, paragraph 1 of the adoption agency algorithm. Line: 1 Col: 90 Unexpected end tag (font) in table context caused voodoo mode. Line: 1 Col: 90 End tag (font) violates step 1, paragraph 1 of the adoption agency algorithm. Line: 1 Col: 99 Unexpected end tag (select) in table context caused voodoo mode. Line: 1 Col: 99 Unexpected end tag (select). Ignored. Line: 1 Col: 104 Unexpected end tag (h1) in table context caused voodoo mode. Line: 1 Col: 104 End tag (h1) seen too early. Expected other end tag. Line: 1 Col: 109 Unexpected end tag (h2) in table context caused voodoo mode. Line: 1 Col: 109 End tag (h2) seen too early. Expected other end tag. Line: 1 Col: 114 Unexpected end tag (h3) in table context caused voodoo mode. Line: 1 Col: 114 End tag (h3) seen too early. Expected other end tag. Line: 1 Col: 119 Unexpected end tag (h4) in table context caused voodoo mode. Line: 1 Col: 119 End tag (h4) seen too early. Expected other end tag. Line: 1 Col: 124 Unexpected end tag (h5) in table context caused voodoo mode. Line: 1 Col: 124 End tag (h5) seen too early. Expected other end tag. Line: 1 Col: 129 Unexpected end tag (h6) in table context caused voodoo mode. Line: 1 Col: 129 End tag (h6) seen too early. Expected other end tag. Line: 1 Col: 136 Unexpected end tag (body) in the table row phase. Ignored. Line: 1 Col: 141 Unexpected end tag (br) in table context caused voodoo mode. Line: 1 Col: 141 Unexpected end tag (br). Treated as br element. Line: 1 Col: 145 Unexpected end tag (a) in table context caused voodoo mode. Line: 1 Col: 145 End tag (a) violates step 1, paragraph 1 of the adoption agency algorithm. Line: 1 Col: 151 Unexpected end tag (img) in table context caused voodoo mode. Line: 1 Col: 151 This element (img) has no end tag. Line: 1 Col: 159 Unexpected end tag (title) in table context caused voodoo mode. Line: 1 Col: 159 Unexpected end tag (title). Ignored. Line: 1 Col: 166 Unexpected end tag (span) in table context caused voodoo mode. Line: 1 Col: 166 Unexpected end tag (span). Ignored. Line: 1 Col: 174 Unexpected end tag (style) in table context caused voodoo mode. Line: 1 Col: 174 Unexpected end tag (style). Ignored. Line: 1 Col: 183 Unexpected end tag (script) in table context caused voodoo mode. Line: 1 Col: 183 Unexpected end tag (script). Ignored. Line: 1 Col: 196 Unexpected end tag (th). Ignored. Line: 1 Col: 201 Unexpected end tag (td). Ignored. Line: 1 Col: 206 Unexpected end tag (tr). Ignored. Line: 1 Col: 214 This element (frame) has no end tag. Line: 1 Col: 221 This element (area) has no end tag. Line: 1 Col: 228 Unexpected end tag (link). Ignored. Line: 1 Col: 236 This element (param) has no end tag. Line: 1 Col: 241 This element (hr) has no end tag. Line: 1 Col: 249 This element (input) has no end tag. Line: 1 Col: 255 Unexpected end tag (col). Ignored. Line: 1 Col: 262 Unexpected end tag (base). Ignored. Line: 1 Col: 269 Unexpected end tag (meta). Ignored. Line: 1 Col: 280 This element (basefont) has no end tag. Line: 1 Col: 290 This element (bgsound) has no end tag. Line: 1 Col: 298 This element (embed) has no end tag. Line: 1 Col: 307 This element (spacer) has no end tag. Line: 1 Col: 311 Unexpected end tag (p). Ignored. Line: 1 Col: 316 End tag (dd) seen too early. Expected other end tag. Line: 1 Col: 321 End tag (dt) seen too early. Expected other end tag. Line: 1 Col: 331 Unexpected end tag (caption). Ignored. Line: 1 Col: 342 Unexpected end tag (colgroup). Ignored. Line: 1 Col: 350 Unexpected end tag (tbody). Ignored. Line: 1 Col: 358 Unexpected end tag (tfoot). Ignored. Line: 1 Col: 366 Unexpected end tag (thead). Ignored. Line: 1 Col: 376 End tag (address) seen too early. Expected other end tag. Line: 1 Col: 389 End tag (blockquote) seen too early. Expected other end tag. Line: 1 Col: 398 End tag (center) seen too early. Expected other end tag. Line: 1 Col: 404 Unexpected end tag (dir). Ignored. Line: 1 Col: 410 End tag (div) seen too early. Expected other end tag. Line: 1 Col: 415 End tag (dl) seen too early. Expected other end tag. Line: 1 Col: 426 End tag (fieldset) seen too early. Expected other end tag. Line: 1 Col: 436 End tag (listing) seen too early. Expected other end tag. Line: 1 Col: 443 End tag (menu) seen too early. Expected other end tag. Line: 1 Col: 448 End tag (ol) seen too early. Expected other end tag. Line: 1 Col: 453 End tag (ul) seen too early. Expected other end tag. Line: 1 Col: 458 End tag (li) seen too early. Expected other end tag. Line: 1 Col: 465 End tag (nobr) violates step 1, paragraph 1 of the adoption agency algorithm. Line: 1 Col: 471 This element (wbr) has no end tag. Line: 1 Col: 487 End tag (button) seen too early. Expected other end tag. Line: 1 Col: 497 End tag (marquee) seen too early. Expected other end tag. Line: 1 Col: 506 End tag (object) seen too early. Expected other end tag. Line: 1 Col: 524 Unexpected end tag (html). Ignored. Line: 1 Col: 524 Unexpected end tag (frameset). Ignored. Line: 1 Col: 531 Unexpected end tag (head). Ignored. Line: 1 Col: 540 Unexpected end tag (iframe). Ignored. Line: 1 Col: 548 This element (image) has no end tag. Line: 1 Col: 558 This element (isindex) has no end tag. Line: 1 Col: 568 Unexpected end tag (noembed). Ignored. Line: 1 Col: 579 Unexpected end tag (noframes). Ignored. Line: 1 Col: 590 Unexpected end tag (noscript). Ignored. Line: 1 Col: 601 Unexpected end tag (optgroup). Ignored. Line: 1 Col: 610 Unexpected end tag (option). Ignored. Line: 1 Col: 622 Unexpected end tag (plaintext). Ignored. Line: 1 Col: 633 Unexpected end tag (textarea). Ignored. #document | | | |
| | | |

#data #errors Line: 1 Col: 10 Unexpected start tag (frameset). Expected DOCTYPE. Line: 1 Col: 10 Expected closing tag. Unexpected end of file. #document | | | tests11.dat000664001750001750 4137514116150214 20100 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/t/html5lib-fail#data #errors #document | | | | | | attributeName="" | attributeType="" | baseFrequency="" | baseProfile="" | calcMode="" | clipPathUnits="" | contentScriptType="" | contentStyleType="" | diffuseConstant="" | edgeMode="" | externalResourcesRequired="" | filterRes="" | filterUnits="" | glyphRef="" | gradientTransform="" | gradientUnits="" | kernelMatrix="" | kernelUnitLength="" | keyPoints="" | keySplines="" | keyTimes="" | lengthAdjust="" | limitingConeAngle="" | markerHeight="" | markerUnits="" | markerWidth="" | maskContentUnits="" | maskUnits="" | numOctaves="" | pathLength="" | patternContentUnits="" | patternTransform="" | patternUnits="" | pointsAtX="" | pointsAtY="" | pointsAtZ="" | preserveAlpha="" | preserveAspectRatio="" | primitiveUnits="" | refX="" | refY="" | repeatCount="" | repeatDur="" | requiredExtensions="" | requiredFeatures="" | specularConstant="" | specularExponent="" | spreadMethod="" | startOffset="" | stdDeviation="" | stitchTiles="" | surfaceScale="" | systemLanguage="" | tableValues="" | targetX="" | targetY="" | textLength="" | viewBox="" | viewTarget="" | xChannelSelector="" | yChannelSelector="" | zoomAndPan="" #data #errors #document | | | | | | attributeName="" | attributeType="" | baseFrequency="" | baseProfile="" | calcMode="" | clipPathUnits="" | contentScriptType="" | contentStyleType="" | diffuseConstant="" | edgeMode="" | externalResourcesRequired="" | filterRes="" | filterUnits="" | glyphRef="" | gradientTransform="" | gradientUnits="" | kernelMatrix="" | kernelUnitLength="" | keyPoints="" | keySplines="" | keyTimes="" | lengthAdjust="" | limitingConeAngle="" | markerHeight="" | markerUnits="" | markerWidth="" | maskContentUnits="" | maskUnits="" | numOctaves="" | pathLength="" | patternContentUnits="" | patternTransform="" | patternUnits="" | pointsAtX="" | pointsAtY="" | pointsAtZ="" | preserveAlpha="" | preserveAspectRatio="" | primitiveUnits="" | refX="" | refY="" | repeatCount="" | repeatDur="" | requiredExtensions="" | requiredFeatures="" | specularConstant="" | specularExponent="" | spreadMethod="" | startOffset="" | stdDeviation="" | stitchTiles="" | surfaceScale="" | systemLanguage="" | tableValues="" | targetX="" | targetY="" | textLength="" | viewBox="" | viewTarget="" | xChannelSelector="" | yChannelSelector="" | zoomAndPan="" #data #errors #document | | | | | | attributeName="" | attributeType="" | baseFrequency="" | baseProfile="" | calcMode="" | clipPathUnits="" | contentScriptType="" | contentStyleType="" | diffuseConstant="" | edgeMode="" | externalResourcesRequired="" | filterRes="" | filterUnits="" | glyphRef="" | gradientTransform="" | gradientUnits="" | kernelMatrix="" | kernelUnitLength="" | keyPoints="" | keySplines="" | keyTimes="" | lengthAdjust="" | limitingConeAngle="" | markerHeight="" | markerUnits="" | markerWidth="" | maskContentUnits="" | maskUnits="" | numOctaves="" | pathLength="" | patternContentUnits="" | patternTransform="" | patternUnits="" | pointsAtX="" | pointsAtY="" | pointsAtZ="" | preserveAlpha="" | preserveAspectRatio="" | primitiveUnits="" | refX="" | refY="" | repeatCount="" | repeatDur="" | requiredExtensions="" | requiredFeatures="" | specularConstant="" | specularExponent="" | spreadMethod="" | startOffset="" | stdDeviation="" | stitchTiles="" | surfaceScale="" | systemLanguage="" | tableValues="" | targetX="" | targetY="" | textLength="" | viewBox="" | viewTarget="" | xChannelSelector="" | yChannelSelector="" | zoomAndPan="" #data #errors #document | | | | | | attributename="" | attributetype="" | basefrequency="" | baseprofile="" | calcmode="" | clippathunits="" | contentscripttype="" | contentstyletype="" | diffuseconstant="" | edgemode="" | externalresourcesrequired="" | filterres="" | filterunits="" | glyphref="" | gradienttransform="" | gradientunits="" | kernelmatrix="" | kernelunitlength="" | keypoints="" | keysplines="" | keytimes="" | lengthadjust="" | limitingconeangle="" | markerheight="" | markerunits="" | markerwidth="" | maskcontentunits="" | maskunits="" | numoctaves="" | pathlength="" | patterncontentunits="" | patterntransform="" | patternunits="" | pointsatx="" | pointsaty="" | pointsatz="" | preservealpha="" | preserveaspectratio="" | primitiveunits="" | refx="" | refy="" | repeatcount="" | repeatdur="" | requiredextensions="" | requiredfeatures="" | specularconstant="" | specularexponent="" | spreadmethod="" | startoffset="" | stddeviation="" | stitchtiles="" | surfacescale="" | systemlanguage="" | tablevalues="" | targetx="" | targety="" | textlength="" | viewbox="" | viewtarget="" | xchannelselector="" | ychannelselector="" | zoomandpan="" #data #errors #document | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | #data #errors #document | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | #data #errors #document | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | #data #errors #document | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | #data #errors #document | | | | | | �������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������tests14.dat�����������������������������������������������������������������������������������������000664��001750��001750�� 2045�14116150214� 20052� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-fail��������������������������������������������������������������������������������������������������������������������������������#data #errors #document | | | | | #data #errors #document | | | | | | #data #errors 15: Unexpected start tag html #document | | | abc:def="gh" | | | #data #errors 15: Unexpected start tag html #document | | | xml:lang="bar" | | #data #errors #document | | | 123="456" | | #data #errors #document | | | 123="456" | 789="012" | | #data #errors #document | | | | | 789="012" tests16.dat000664001750001750 12525114116150214 20121 0ustar00taitai000000000000HTML-HTML5-Parser-0.992/t/html5lib-fail#data | | | | | | | | | | | | | | | | | | | | | #errors #document | | | | | | | | | | | | | #errors #document | | | | #errors #document | | | | #errors #document | | | | #errors #document | | | | #errors #document | | | | #errors #document | | | | #errors #document | | | | #errors #document | | | | #errors #document | | | | #errors Line: 1 Col: 59 Unexpected end tag (script). #document | | | | #errors #document | | | | | "<body><script><!--...</script></body>" | <body> #data <!doctype html><textarea><!--<textarea></textarea>--></textarea> #errors Line: 1 Col: 64 Unexpected end tag (textarea). #document | <!DOCTYPE html> | <html> | <head> | <body> | <textarea> | "<!--<textarea>" | "-->" #data <!doctype html><textarea>&lt;/textarea></textarea> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <textarea> | "</textarea>" #data <!doctype html><textarea>&lt;</textarea> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <textarea> | "<" #data <!doctype html><textarea>a&lt;b</textarea> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <textarea> | "a<b" #data <!doctype html><iframe><!--<iframe></iframe>--></iframe> #errors Line: 1 Col: 56 Unexpected end tag (iframe). #document | <!DOCTYPE html> | <html> | <head> | <body> | <iframe> | "<!--<iframe>" | "-->" #data <!doctype html><iframe>...<!--X->...<!--/X->...</iframe> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <iframe> | "...<!--X->...<!--/X->..." #data <!doctype html><xmp><!--<xmp></xmp>--></xmp> #errors Line: 1 Col: 44 Unexpected end tag (xmp). #document | <!DOCTYPE html> | <html> | <head> | <body> | <xmp> | "<!--<xmp>" | "-->" #data <!doctype html><noembed><!--<noembed></noembed>--></noembed> #errors Line: 1 Col: 60 Unexpected end tag (noembed). #document | <!DOCTYPE html> | <html> | <head> | <body> | <noembed> | "<!--<noembed>" | "-->" #data <script> #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 8 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | <body> #data <script>a #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 9 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "a" | <body> #data <script>< #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 9 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<" | <body> #data <script></ #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 10 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "</" | <body> #data <script></S #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 11 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "</S" | <body> #data <script></SC #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 12 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "</SC" | <body> #data <script></SCR #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 13 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "</SCR" | <body> #data <script></SCRI #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 14 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "</SCRI" | <body> #data <script></SCRIP #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 15 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "</SCRIP" | <body> #data <script></SCRIPT #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 16 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "</SCRIPT" | <body> #data <script></SCRIPT #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 17 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | <body> #data <script></s #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 11 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "</s" | <body> #data <script></sc #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 12 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "</sc" | <body> #data <script></scr #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 13 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "</scr" | <body> #data <script></scri #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 14 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "</scri" | <body> #data <script></scrip #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 15 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "</scrip" | <body> #data <script></script #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 16 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "</script" | <body> #data <script></script #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 17 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | <body> #data <script><! #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 10 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!" | <body> #data <script><!a #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 11 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!a" | <body> #data <script><!- #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 11 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!-" | <body> #data <script><!-a #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 12 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!-a" | <body> #data <script><!-- #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 12 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--" | <body> #data <script><!--a #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 13 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--a" | <body> #data <script><!--< #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 13 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<" | <body> #data <script><!--<a #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 14 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<a" | <body> #data <script><!--</ #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 14 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--</" | <body> #data <script><!--</script #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 20 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--</script" | <body> #data <script><!--</script #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 21 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--" | <body> #data <script><!--<s #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 14 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<s" | <body> #data <script><!--<script #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 19 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script" | <body> #data <script><!--<script #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 20 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script " | <body> #data <script><!--<script < #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 21 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script <" | <body> #data <script><!--<script <a #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 22 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script <a" | <body> #data <script><!--<script </ #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 22 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script </" | <body> #data <script><!--<script </s #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 23 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script </s" | <body> #data <script><!--<script </script #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 28 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script </script" | <body> #data <script><!--<script </scripta #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 29 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script </scripta" | <body> #data <script><!--<script </script #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 29 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script </script " | <body> #data <script><!--<script </script> #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 29 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script </script>" | <body> #data <script><!--<script </script/ #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 29 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script </script/" | <body> #data <script><!--<script </script < #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 30 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script </script <" | <body> #data <script><!--<script </script <a #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 31 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script </script <a" | <body> #data <script><!--<script </script </ #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 31 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script </script </" | <body> #data <script><!--<script </script </script #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 38 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script </script </script" | <body> #data <script><!--<script </script </script #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 38 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script </script " | <body> #data <script><!--<script </script </script/ #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 38 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script </script " | <body> #data <script><!--<script </script </script> #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. #document | <html> | <head> | <script> | "<!--<script </script " | <body> #data <script><!--<script - #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 21 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script -" | <body> #data <script><!--<script -a #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 22 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script -a" | <body> #data <script><!--<script -- #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 22 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script --" | <body> #data <script><!--<script --a #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 23 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script --a" | <body> #data <script><!--<script --> #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 23 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script -->" | <body> #data <script><!--<script -->< #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 24 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script --><" | <body> #data <script><!--<script --></ #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 25 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script --></" | <body> #data <script><!--<script --></script #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 31 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script --></script" | <body> #data <script><!--<script --></script #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 32 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script -->" | <body> #data <script><!--<script --></script/ #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 32 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script -->" | <body> #data <script><!--<script --></script> #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. #document | <html> | <head> | <script> | "<!--<script -->" | <body> #data <script><!--<script><\/script>--></script> #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. #document | <html> | <head> | <script> | "<!--<script><\/script>-->" | <body> #data <script><!--<script></scr'+'ipt>--></script> #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. #document | <html> | <head> | <script> | "<!--<script></scr'+'ipt>-->" | <body> #data <script><!--<script></script><script></script></script> #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. #document | <html> | <head> | <script> | "<!--<script></script><script></script>" | <body> #data <script><!--<script></script><script></script>--><!--</script> #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. #document | <html> | <head> | <script> | "<!--<script></script><script></script>--><!--" | <body> #data <script><!--<script></script><script></script>-- ></script> #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. #document | <html> | <head> | <script> | "<!--<script></script><script></script>-- >" | <body> #data <script><!--<script></script><script></script>- -></script> #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. #document | <html> | <head> | <script> | "<!--<script></script><script></script>- ->" | <body> #data <script><!--<script></script><script></script>- - ></script> #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. #document | <html> | <head> | <script> | "<!--<script></script><script></script>- - >" | <body> #data <script><!--<script></script><script></script>-></script> #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. #document | <html> | <head> | <script> | "<!--<script></script><script></script>->" | <body> #data <script><!--<script>--!></script>X #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 34 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script>--!></script>X" | <body> #data <script><!--<scr'+'ipt></script>--></script> #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 44 Unexpected end tag (script). #document | <html> | <head> | <script> | "<!--<scr'+'ipt>" | <body> | "-->" #data <script><!--<script></scr'+'ipt></script>X #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. Line: 1 Col: 42 Unexpected end of file. Expected end tag (script). #document | <html> | <head> | <script> | "<!--<script></scr'+'ipt></script>X" | <body> #data <style><!--<style></style>--></style> #errors Line: 1 Col: 7 Unexpected start tag (style). Expected DOCTYPE. Line: 1 Col: 37 Unexpected end tag (style). #document | <html> | <head> | <style> | "<!--<style>" | <body> | "-->" #data <style><!--</style>X #errors Line: 1 Col: 7 Unexpected start tag (style). Expected DOCTYPE. #document | <html> | <head> | <style> | "<!--" | <body> | "X" #data <style><!--...</style>...--></style> #errors Line: 1 Col: 7 Unexpected start tag (style). Expected DOCTYPE. Line: 1 Col: 36 Unexpected end tag (style). #document | <html> | <head> | <style> | "<!--..." | <body> | "...-->" #data <style><!--<br><html xmlns:v="urn:schemas-microsoft-com:vml"><!--[if !mso]><style></style>X #errors Line: 1 Col: 7 Unexpected start tag (style). Expected DOCTYPE. #document | <html> | <head> | <style> | "<!--<br><html xmlns:v="urn:schemas-microsoft-com:vml"><!--[if !mso]><style>" | <body> | "X" #data <style><!--...<style><!--...--!></style>--></style> #errors Line: 1 Col: 7 Unexpected start tag (style). Expected DOCTYPE. Line: 1 Col: 51 Unexpected end tag (style). #document | <html> | <head> | <style> | "<!--...<style><!--...--!>" | <body> | "-->" #data <style><!--...</style><!-- --><style>@import ...</style> #errors Line: 1 Col: 7 Unexpected start tag (style). Expected DOCTYPE. #document | <html> | <head> | <style> | "<!--..." | <!-- --> | <style> | "@import ..." | <body> #data <style>...<style><!--...</style><!-- --></style> #errors Line: 1 Col: 7 Unexpected start tag (style). Expected DOCTYPE. Line: 1 Col: 48 Unexpected end tag (style). #document | <html> | <head> | <style> | "...<style><!--..." | <!-- --> | <body> #data <style>...<!--[if IE]><style>...</style>X #errors Line: 1 Col: 7 Unexpected start tag (style). Expected DOCTYPE. #document | <html> | <head> | <style> | "...<!--[if IE]><style>..." | <body> | "X" #data <title><!--<title></title>--></title> #errors Line: 1 Col: 7 Unexpected start tag (title). Expected DOCTYPE. Line: 1 Col: 37 Unexpected end tag (title). #document | <html> | <head> | <title> | "<!--<title>" | <body> | "-->" #data <title>&lt;/title></title> #errors Line: 1 Col: 7 Unexpected start tag (title). Expected DOCTYPE. #document | <html> | <head> | <title> | "</title>" | <body> #data <title>foo/title><link></head><body>X #errors Line: 1 Col: 7 Unexpected start tag (title). Expected DOCTYPE. Line: 1 Col: 37 Unexpected end of file. Expected end tag (title). #document | <html> | <head> | <title> | "foo/title><link></head><body>X" | <body> #data <noscript><!--<noscript></noscript>--></noscript> #errors Line: 1 Col: 10 Unexpected start tag (noscript). Expected DOCTYPE. Line: 1 Col: 49 Unexpected end tag (noscript). #document | <html> | <head> | <noscript> | "<!--<noscript>" | <body> | "-->" #data <noscript><!--</noscript>X<noscript>--></noscript> #errors Line: 1 Col: 10 Unexpected start tag (noscript). Expected DOCTYPE. #document | <html> | <head> | <noscript> | "<!--" | <body> | "X" | <noscript> | "-->" #data <noscript><iframe></noscript>X #errors Line: 1 Col: 10 Unexpected start tag (noscript). Expected DOCTYPE. #document | <html> | <head> | <noscript> | "<iframe>" | <body> | "X" #data <noframes><!--<noframes>--> #errors Line: 1 Col: 10 Unexpected start tag (noframes). Expected DOCTYPE. Line: 1 Col: 49 Unexpected end tag (noframes). #document | | | | "<!--<noframes>" | <body> | "-->" #data <noframes><body><script><!--...</script></body> #errors Line: 1 Col: 10 Unexpected start tag (noframes). Expected DOCTYPE. #document | | | | "<body><script><!--...</script></body>" | <body> #data <textarea><!--<textarea></textarea>--></textarea> #errors Line: 1 Col: 10 Unexpected start tag (textarea). Expected DOCTYPE. Line: 1 Col: 49 Unexpected end tag (textarea). #document | <html> | <head> | <body> | <textarea> | "<!--<textarea>" | "-->" #data <textarea>&lt;/textarea></textarea> #errors Line: 1 Col: 10 Unexpected start tag (textarea). Expected DOCTYPE. #document | <html> | <head> | <body> | <textarea> | "</textarea>" #data <iframe><!--<iframe></iframe>--></iframe> #errors Line: 1 Col: 8 Unexpected start tag (iframe). Expected DOCTYPE. Line: 1 Col: 41 Unexpected end tag (iframe). #document | <html> | <head> | <body> | <iframe> | "<!--<iframe>" | "-->" #data <iframe>...<!--X->...<!--/X->...</iframe> #errors Line: 1 Col: 8 Unexpected start tag (iframe). Expected DOCTYPE. #document | <html> | <head> | <body> | <iframe> | "...<!--X->...<!--/X->..." #data <xmp><!--<xmp></xmp>--></xmp> #errors Line: 1 Col: 5 Unexpected start tag (xmp). Expected DOCTYPE. Line: 1 Col: 29 Unexpected end tag (xmp). #document | <html> | <head> | <body> | <xmp> | "<!--<xmp>" | "-->" #data <noembed><!--<noembed></noembed>--></noembed> #errors Line: 1 Col: 9 Unexpected start tag (noembed). Expected DOCTYPE. Line: 1 Col: 45 Unexpected end tag (noembed). #document | <html> | <head> | <body> | <noembed> | "<!--<noembed>" | "-->" #data <!doctype html><table> #errors Line 2 Col 0 Unexpected end of file. Expected table content. #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | " " #data <!doctype html><table><td><span><font></span><span> #errors Line 1 Col 26 Unexpected table cell start tag (td) in the table body phase. Line 1 Col 45 Unexpected end tag (span). Line 1 Col 51 Expected closing tag. Unexpected end of file. #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> | <span> | <font> | <font> | <span> #data <!doctype html><form><table></form><form></table></form> #errors 35: Stray end tag “formâ€. 41: Start tag “form†seen in “tableâ€. #document | <!DOCTYPE html> | <html> | <head> | <body> | <form> | <table> | <form> �������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������tests21.dat�����������������������������������������������������������������������������������������000664��001750��001750�� 4731�14116150214� 20054� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-fail��������������������������������������������������������������������������������������������������������������������������������#data <svg><![CDATA[foo]]> #errors #document | <html> | <head> | <body> | <svg svg> | "foo" #data <math><![CDATA[foo]]> #errors #document | <html> | <head> | <body> | <math math> | "foo" #data <div><![CDATA[foo]]> #errors #document | <html> | <head> | <body> | <div> | <!-- [CDATA[foo]] --> #data <svg><![CDATA[foo #errors #document | <html> | <head> | <body> | <svg svg> | "foo" #data <svg><![CDATA[foo #errors #document | <html> | <head> | <body> | <svg svg> | "foo" #data <svg><![CDATA[ #errors #document | <html> | <head> | <body> | <svg svg> #data <svg><![CDATA[]]> #errors #document | <html> | <head> | <body> | <svg svg> #data <svg><![CDATA[]] >]]> #errors #document | <html> | <head> | <body> | <svg svg> | "]] >" #data <svg><![CDATA[]] >]]> #errors #document | <html> | <head> | <body> | <svg svg> | "]] >" #data <svg><![CDATA[]] #errors #document | <html> | <head> | <body> | <svg svg> | "]]" #data <svg><![CDATA[] #errors #document | <html> | <head> | <body> | <svg svg> | "]" #data <svg><![CDATA[]>a #errors #document | <html> | <head> | <body> | <svg svg> | "]>a" #data <svg><foreignObject><div><![CDATA[foo]]> #errors #document | <html> | <head> | <body> | <svg svg> | <svg foreignObject> | <div> | <!-- [CDATA[foo]] --> #data <svg><![CDATA[<svg>]]> #errors #document | <html> | <head> | <body> | <svg svg> | "<svg>" #data <svg><![CDATA[</svg>a]]> #errors #document | <html> | <head> | <body> | <svg svg> | "</svg>a" #data <svg><![CDATA[<svg>a #errors #document | <html> | <head> | <body> | <svg svg> | "<svg>a" #data <svg><![CDATA[</svg>a #errors #document | <html> | <head> | <body> | <svg svg> | "</svg>a" #data <svg><![CDATA[<svg>]]><path> #errors #document | <html> | <head> | <body> | <svg svg> | "<svg>" | <svg path> #data <svg><![CDATA[<svg>]]></path> #errors #document | <html> | <head> | <body> | <svg svg> | "<svg>" #data <svg><![CDATA[<svg>]]><!--path--> #errors #document | <html> | <head> | <body> | <svg svg> | "<svg>" | <!-- path --> #data <svg><![CDATA[<svg>]]>path #errors #document | <html> | <head> | <body> | <svg svg> | "<svg>path" #data <svg><![CDATA[<!--svg-->]]> #errors #document | <html> | <head> | <body> | <svg svg> | "<!--svg-->" ���������������������������������������tests22.dat�����������������������������������������������������������������������������������������000664��001750��001750�� 6341�14116150214� 20054� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-fail��������������������������������������������������������������������������������������������������������������������������������#data <a><b><big><em><strong><div>X</a> #errors #document | <html> | <head> | <body> | <a> | <b> | <big> | <em> | <strong> | <big> | <em> | <strong> | <div> | <a> | "X" #data <a><b><div id=1><div id=2><div id=3><div id=4><div id=5><div id=6><div id=7><div id=8>A</a> #errors #document | <html> | <head> | <body> | <a> | <b> | <b> | <div> | id="1" | <a> | <div> | id="2" | <a> | <div> | id="3" | <a> | <div> | id="4" | <a> | <div> | id="5" | <a> | <div> | id="6" | <a> | <div> | id="7" | <a> | <div> | id="8" | <a> | "A" #data <a><b><div id=1><div id=2><div id=3><div id=4><div id=5><div id=6><div id=7><div id=8><div id=9>A</a> #errors #document | <html> | <head> | <body> | <a> | <b> | <b> | <div> | id="1" | <a> | <div> | id="2" | <a> | <div> | id="3" | <a> | <div> | id="4" | <a> | <div> | id="5" | <a> | <div> | id="6" | <a> | <div> | id="7" | <a> | <div> | id="8" | <a> | <div> | id="9" | "A" #data <a><b><div id=1><div id=2><div id=3><div id=4><div id=5><div id=6><div id=7><div id=8><div id=9><div id=10>A</a> #errors #document | <html> | <head> | <body> | <a> | <b> | <b> | <div> | id="1" | <a> | <div> | id="2" | <a> | <div> | id="3" | <a> | <div> | id="4" | <a> | <div> | id="5" | <a> | <div> | id="6" | <a> | <div> | id="7" | <a> | <div> | id="8" | <a> | <div> | id="9" | <div> | id="10" | "A" #data <cite><b><cite><i><cite><i><cite><i><div>X</b>TEST #errors Line: 1 Col: 6 Unexpected start tag (cite). Expected DOCTYPE. Line: 1 Col: 46 End tag (b) violates step 1, paragraph 3 of the adoption agency algorithm. Line: 1 Col: 50 Expected closing tag. Unexpected end of file. #document | <html> | <head> | <body> | <cite> | <b> | <cite> | <i> | <cite> | <i> | <cite> | <i> | <i> | <i> | <div> | <b> | "X" | "TEST" �����������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������tests23.dat�����������������������������������������������������������������������������������������000664��001750��001750�� 5762�14116150214� 20063� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-fail��������������������������������������������������������������������������������������������������������������������������������#data <p><font size=4><font color=red><font size=4><font size=4><font size=4><font size=4><font size=4><font color=red><p>X #errors 3: Start tag seen without seeing a doctype first. Expected “<!DOCTYPE html>â€. 116: Unclosed elements. 117: End of file seen and there were open elements. #document | <html> | <head> | <body> | <p> | <font> | size="4" | <font> | color="red" | <font> | size="4" | <font> | size="4" | <font> | size="4" | <font> | size="4" | <font> | size="4" | <font> | color="red" | <p> | <font> | color="red" | <font> | size="4" | <font> | size="4" | <font> | size="4" | <font> | color="red" | "X" #data <p><font size=4><font size=4><font size=4><font size=4><p>X #errors #document | <html> | <head> | <body> | <p> | <font> | size="4" | <font> | size="4" | <font> | size="4" | <font> | size="4" | <p> | <font> | size="4" | <font> | size="4" | <font> | size="4" | "X" #data <p><font size=4><font size=4><font size=4><font size="5"><font size=4><p>X #errors #document | <html> | <head> | <body> | <p> | <font> | size="4" | <font> | size="4" | <font> | size="4" | <font> | size="5" | <font> | size="4" | <p> | <font> | size="4" | <font> | size="4" | <font> | size="5" | <font> | size="4" | "X" #data <p><font size=4 id=a><font size=4 id=b><font size=4><font size=4><p>X #errors #document | <html> | <head> | <body> | <p> | <font> | id="a" | size="4" | <font> | id="b" | size="4" | <font> | size="4" | <font> | size="4" | <p> | <font> | id="a" | size="4" | <font> | id="b" | size="4" | <font> | size="4" | <font> | size="4" | "X" #data <p><b id=a><b id=a><b id=a><b><object><b id=a><b id=a>X</object><p>Y #errors #document | <html> | <head> | <body> | <p> | <b> | id="a" | <b> | id="a" | <b> | id="a" | <b> | <object> | <b> | id="a" | <b> | id="a" | "X" | <p> | <b> | id="a" | <b> | id="a" | <b> | id="a" | <b> | "Y" ��������������tests5.dat������������������������������������������������������������������������������������������000664��001750��001750�� 6131�14116150214� 17772� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-fail��������������������������������������������������������������������������������������������������������������������������������#data <style> <!-- </style>x #errors Line: 1 Col: 7 Unexpected start tag (style). Expected DOCTYPE. Line: 1 Col: 22 Unexpected end of file. Expected end tag (style). #document | <html> | <head> | <style> | " <!-- " | <body> | "x" #data <style> <!-- </style> --> </style>x #errors Line: 1 Col: 7 Unexpected start tag (style). Expected DOCTYPE. #document | <html> | <head> | <style> | " <!-- " | " " | <body> | "--> x" #data <style> <!--> </style>x #errors Line: 1 Col: 7 Unexpected start tag (style). Expected DOCTYPE. #document | <html> | <head> | <style> | " <!--> " | <body> | "x" #data <style> <!---> </style>x #errors Line: 1 Col: 7 Unexpected start tag (style). Expected DOCTYPE. #document | <html> | <head> | <style> | " <!---> " | <body> | "x" #data <iframe> <!---> </iframe>x #errors Line: 1 Col: 8 Unexpected start tag (iframe). Expected DOCTYPE. #document | <html> | <head> | <body> | <iframe> | " <!---> " | "x" #data <iframe> <!--- </iframe>->x</iframe> --> </iframe>x #errors Line: 1 Col: 8 Unexpected start tag (iframe). Expected DOCTYPE. #document | <html> | <head> | <body> | <iframe> | " <!--- " | "->x --> x" #data <script> <!-- </script> --> </script>x #errors Line: 1 Col: 8 Unexpected start tag (script). Expected DOCTYPE. #document | <html> | <head> | <script> | " <!-- " | " " | <body> | "--> x" #data <title> <!-- </title> --> </title>x #errors Line: 1 Col: 7 Unexpected start tag (title). Expected DOCTYPE. #document | <html> | <head> | <title> | " <!-- " | " " | <body> | "--> x" #data <textarea> <!--- </textarea>->x</textarea> --> </textarea>x #errors Line: 1 Col: 10 Unexpected start tag (textarea). Expected DOCTYPE. #document | <html> | <head> | <body> | <textarea> | " <!--- " | "->x --> x" #data <style> <!</-- </style>x #errors Line: 1 Col: 7 Unexpected start tag (style). Expected DOCTYPE. #document | <html> | <head> | <style> | " <!</-- " | <body> | "x" #data <p><xmp></xmp> #errors XXX: Unknown #document | <html> | <head> | <body> | <p> | <xmp> #data <xmp> <!-- > --> </xmp> #errors Line: 1 Col: 5 Unexpected start tag (xmp). Expected DOCTYPE. #document | <html> | <head> | <body> | <xmp> | " <!-- > --> " #data <title>&amp;</title> #errors Line: 1 Col: 7 Unexpected start tag (title). Expected DOCTYPE. #document | <html> | <head> | <title> | "&" | <body> #data <title><!--&amp;--></title> #errors Line: 1 Col: 7 Unexpected start tag (title). Expected DOCTYPE. #document | <html> | <head> | <title> | "<!--&-->" | <body> #data <title><!--</title> #errors Line: 1 Col: 7 Unexpected start tag (title). Expected DOCTYPE. Line: 1 Col: 19 Unexpected end of file. Expected end tag (title). #document | <html> | <head> | <title> | "<!--" | <body> #data <noscript><!--</noscript>--></noscript> #errors Line: 1 Col: 10 Unexpected start tag (noscript). Expected DOCTYPE. #document | <html> | <head> | <noscript> | "<!--" | <body> | "-->" ���������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������tests6.dat������������������������������������������������������������������������������������������000664��001750��001750�� 35540�14116150214� 20021� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-fail��������������������������������������������������������������������������������������������������������������������������������#data <!doctype html></head> <head> #errors Line: 1 Col: 29 Unexpected start tag head. Ignored. #document | <!DOCTYPE html> | <html> | <head> | " " | <body> #data <!doctype html><form><div></form><div> #errors 33: End tag "form" seen but there were unclosed elements. 38: End of file seen and there were open elements. #document | <!DOCTYPE html> | <html> | <head> | <body> | <form> | <div> | <div> #data <!doctype html><title>&amp;</title> #errors #document | <!DOCTYPE html> | <html> | <head> | <title> | "&" | <body> #data <!doctype html><title><!--&amp;--></title> #errors #document | <!DOCTYPE html> | <html> | <head> | <title> | "<!--&-->" | <body> #data <!doctype> #errors Line: 1 Col: 9 No space after literal string 'DOCTYPE'. Line: 1 Col: 10 Unexpected > character. Expected DOCTYPE name. Line: 1 Col: 10 Erroneous DOCTYPE. #document | <!DOCTYPE > | <html> | <head> | <body> #data <!---x #errors Line: 1 Col: 6 Unexpected end of file in comment. Line: 1 Col: 6 Unexpected End of file. Expected DOCTYPE. #document | <!-- -x --> | <html> | <head> | <body> #data <body> <div> #errors Line: 1 Col: 6 Unexpected start tag (body). Line: 2 Col: 5 Expected closing tag. Unexpected end of file. #document-fragment div #document | " " | <div> #data <frameset></frameset> foo #errors Line: 1 Col: 10 Unexpected start tag (frameset). Expected DOCTYPE. Line: 2 Col: 3 Unexpected non-space characters in the after frameset phase. Ignored. #document | <html> | <head> | <frameset> | " " #data <frameset></frameset> <noframes> #errors Line: 1 Col: 10 Unexpected start tag (frameset). Expected DOCTYPE. Line: 2 Col: 10 Expected closing tag. Unexpected end of file. #document | <html> | <head> | <frameset> | " " | <noframes> #data <frameset></frameset> <div> #errors Line: 1 Col: 10 Unexpected start tag (frameset). Expected DOCTYPE. Line: 2 Col: 5 Unexpected start tag (div) in the after frameset phase. Ignored. #document | <html> | <head> | <frameset> | " " #data <frameset></frameset> </html> #errors Line: 1 Col: 10 Unexpected start tag (frameset). Expected DOCTYPE. #document | <html> | <head> | <frameset> | " " #data <frameset></frameset> </div> #errors Line: 1 Col: 10 Unexpected start tag (frameset). Expected DOCTYPE. Line: 2 Col: 6 Unexpected end tag (div) in the after frameset phase. Ignored. #document | <html> | <head> | <frameset> | " " #data <form><form> #errors Line: 1 Col: 6 Unexpected start tag (form). Expected DOCTYPE. Line: 1 Col: 12 Unexpected start tag (form). Line: 1 Col: 12 Expected closing tag. Unexpected end of file. #document | <html> | <head> | <body> | <form> #data <button><button> #errors Line: 1 Col: 8 Unexpected start tag (button). Expected DOCTYPE. Line: 1 Col: 16 Unexpected start tag (button) implies end tag (button). Line: 1 Col: 16 Expected closing tag. Unexpected end of file. #document | <html> | <head> | <body> | <button> | <button> #data <table><tr><td></th> #errors Line: 1 Col: 7 Unexpected start tag (table). Expected DOCTYPE. Line: 1 Col: 20 Unexpected end tag (th). Ignored. Line: 1 Col: 20 Expected closing tag. Unexpected end of file. #document | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> #data <table><caption><td> #errors Line: 1 Col: 7 Unexpected start tag (table). Expected DOCTYPE. Line: 1 Col: 20 Unexpected end tag (td). Ignored. Line: 1 Col: 20 Unexpected table cell start tag (td) in the table body phase. Line: 1 Col: 20 Expected closing tag. Unexpected end of file. #document | <html> | <head> | <body> | <table> | <caption> | <tbody> | <tr> | <td> #data <table><caption><div> #errors Line: 1 Col: 7 Unexpected start tag (table). Expected DOCTYPE. Line: 1 Col: 21 Expected closing tag. Unexpected end of file. #document | <html> | <head> | <body> | <table> | <caption> | <div> #data </caption><div> #errors Line: 1 Col: 10 Unexpected end tag (caption). Ignored. Line: 1 Col: 15 Expected closing tag. Unexpected end of file. #document-fragment caption #document | <div> #data <table><caption><div></caption> #errors Line: 1 Col: 7 Unexpected start tag (table). Expected DOCTYPE. Line: 1 Col: 31 Unexpected end tag (caption). Missing end tag (div). Line: 1 Col: 31 Unexpected end of file. Expected table content. #document | <html> | <head> | <body> | <table> | <caption> | <div> #data <table><caption></table> #errors Line: 1 Col: 7 Unexpected start tag (table). Expected DOCTYPE. Line: 1 Col: 24 Unexpected end table tag in caption. Generates implied end caption. #document | <html> | <head> | <body> | <table> | <caption> #data </table><div> #errors Line: 1 Col: 8 Unexpected end table tag in caption. Generates implied end caption. Line: 1 Col: 8 Unexpected end tag (caption). Ignored. Line: 1 Col: 13 Expected closing tag. Unexpected end of file. #document-fragment caption #document | <div> #data <table><caption></body></col></colgroup></html></tbody></td></tfoot></th></thead></tr> #errors Line: 1 Col: 7 Unexpected start tag (table). Expected DOCTYPE. Line: 1 Col: 23 Unexpected end tag (body). Ignored. Line: 1 Col: 29 Unexpected end tag (col). Ignored. Line: 1 Col: 40 Unexpected end tag (colgroup). Ignored. Line: 1 Col: 47 Unexpected end tag (html). Ignored. Line: 1 Col: 55 Unexpected end tag (tbody). Ignored. Line: 1 Col: 60 Unexpected end tag (td). Ignored. Line: 1 Col: 68 Unexpected end tag (tfoot). Ignored. Line: 1 Col: 73 Unexpected end tag (th). Ignored. Line: 1 Col: 81 Unexpected end tag (thead). Ignored. Line: 1 Col: 86 Unexpected end tag (tr). Ignored. Line: 1 Col: 86 Expected closing tag. Unexpected end of file. #document | <html> | <head> | <body> | <table> | <caption> #data <table><caption><div></div> #errors Line: 1 Col: 7 Unexpected start tag (table). Expected DOCTYPE. Line: 1 Col: 27 Expected closing tag. Unexpected end of file. #document | <html> | <head> | <body> | <table> | <caption> | <div> #data <table><tr><td></body></caption></col></colgroup></html> #errors Line: 1 Col: 7 Unexpected start tag (table). Expected DOCTYPE. Line: 1 Col: 22 Unexpected end tag (body). Ignored. Line: 1 Col: 32 Unexpected end tag (caption). Ignored. Line: 1 Col: 38 Unexpected end tag (col). Ignored. Line: 1 Col: 49 Unexpected end tag (colgroup). Ignored. Line: 1 Col: 56 Unexpected end tag (html). Ignored. Line: 1 Col: 56 Expected closing tag. Unexpected end of file. #document | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> #data </table></tbody></tfoot></thead></tr><div> #errors Line: 1 Col: 8 Unexpected end tag (table). Ignored. Line: 1 Col: 16 Unexpected end tag (tbody). Ignored. Line: 1 Col: 24 Unexpected end tag (tfoot). Ignored. Line: 1 Col: 32 Unexpected end tag (thead). Ignored. Line: 1 Col: 37 Unexpected end tag (tr). Ignored. Line: 1 Col: 42 Expected closing tag. Unexpected end of file. #document-fragment td #document | <div> #data <table><colgroup>foo #errors Line: 1 Col: 7 Unexpected start tag (table). Expected DOCTYPE. Line: 1 Col: 20 Unexpected non-space characters in table context caused voodoo mode. Line: 1 Col: 20 Unexpected end of file. Expected table content. #document | <html> | <head> | <body> | "foo" | <table> | <colgroup> #data foo<col> #errors Line: 1 Col: 3 Unexpected end tag (colgroup). Ignored. #document-fragment colgroup #document | <col> #data <table><colgroup></col> #errors Line: 1 Col: 7 Unexpected start tag (table). Expected DOCTYPE. Line: 1 Col: 23 This element (col) has no end tag. Line: 1 Col: 23 Expected closing tag. Unexpected end of file. #document | <html> | <head> | <body> | <table> | <colgroup> #data <frameset><div> #errors Line: 1 Col: 10 Unexpected start tag (frameset). Expected DOCTYPE. Line: 1 Col: 15 Unexpected start tag token (div) in the frameset phase. Ignored. Line: 1 Col: 15 Expected closing tag. Unexpected end of file. #document | <html> | <head> | <frameset> #data </frameset><frame> #errors Line: 1 Col: 11 Unexpected end tag token (frameset) in the frameset phase (innerHTML). #document-fragment frameset #document | <frame> #data <frameset></div> #errors Line: 1 Col: 10 Unexpected start tag (frameset). Expected DOCTYPE. Line: 1 Col: 16 Unexpected end tag token (div) in the frameset phase. Ignored. Line: 1 Col: 16 Expected closing tag. Unexpected end of file. #document | <html> | <head> | <frameset> #data </body><div> #errors Line: 1 Col: 7 Unexpected end tag (body). Ignored. Line: 1 Col: 12 Expected closing tag. Unexpected end of file. #document-fragment body #document | <div> #data <table><tr><div> #errors Line: 1 Col: 7 Unexpected start tag (table). Expected DOCTYPE. Line: 1 Col: 16 Unexpected start tag (div) in table context caused voodoo mode. Line: 1 Col: 16 Unexpected end of file. Expected table content. #document | <html> | <head> | <body> | <div> | <table> | <tbody> | <tr> #data </tr><td> #errors Line: 1 Col: 5 Unexpected end tag (tr). Ignored. #document-fragment tr #document | <td> #data </tbody></tfoot></thead><td> #errors Line: 1 Col: 8 Unexpected end tag (tbody). Ignored. Line: 1 Col: 16 Unexpected end tag (tfoot). Ignored. Line: 1 Col: 24 Unexpected end tag (thead). Ignored. #document-fragment tr #document | <td> #data <table><tr><div><td> #errors Line: 1 Col: 7 Unexpected start tag (table). Expected DOCTYPE. Line: 1 Col: 16 Unexpected start tag (div) in table context caused voodoo mode. Line: 1 Col: 20 Unexpected implied end tag (div) in the table row phase. Line: 1 Col: 20 Expected closing tag. Unexpected end of file. #document | <html> | <head> | <body> | <div> | <table> | <tbody> | <tr> | <td> #data <caption><col><colgroup><tbody><tfoot><thead><tr> #errors Line: 1 Col: 9 Unexpected start tag (caption). Line: 1 Col: 14 Unexpected start tag (col). Line: 1 Col: 24 Unexpected start tag (colgroup). Line: 1 Col: 31 Unexpected start tag (tbody). Line: 1 Col: 38 Unexpected start tag (tfoot). Line: 1 Col: 45 Unexpected start tag (thead). Line: 1 Col: 49 Unexpected end of file. Expected table content. #document-fragment tbody #document | <tr> #data <table><tbody></thead> #errors Line: 1 Col: 7 Unexpected start tag (table). Expected DOCTYPE. Line: 1 Col: 22 Unexpected end tag (thead) in the table body phase. Ignored. Line: 1 Col: 22 Unexpected end of file. Expected table content. #document | <html> | <head> | <body> | <table> | <tbody> #data </table><tr> #errors Line: 1 Col: 8 Unexpected end tag (table). Ignored. Line: 1 Col: 12 Unexpected end of file. Expected table content. #document-fragment tbody #document | <tr> #data <table><tbody></body></caption></col></colgroup></html></td></th></tr> #errors Line: 1 Col: 7 Unexpected start tag (table). Expected DOCTYPE. Line: 1 Col: 21 Unexpected end tag (body) in the table body phase. Ignored. Line: 1 Col: 31 Unexpected end tag (caption) in the table body phase. Ignored. Line: 1 Col: 37 Unexpected end tag (col) in the table body phase. Ignored. Line: 1 Col: 48 Unexpected end tag (colgroup) in the table body phase. Ignored. Line: 1 Col: 55 Unexpected end tag (html) in the table body phase. Ignored. Line: 1 Col: 60 Unexpected end tag (td) in the table body phase. Ignored. Line: 1 Col: 65 Unexpected end tag (th) in the table body phase. Ignored. Line: 1 Col: 70 Unexpected end tag (tr) in the table body phase. Ignored. Line: 1 Col: 70 Unexpected end of file. Expected table content. #document | <html> | <head> | <body> | <table> | <tbody> #data <table><tbody></div> #errors Line: 1 Col: 7 Unexpected start tag (table). Expected DOCTYPE. Line: 1 Col: 20 Unexpected end tag (div) in table context caused voodoo mode. Line: 1 Col: 20 End tag (div) seen too early. Expected other end tag. Line: 1 Col: 20 Unexpected end of file. Expected table content. #document | <html> | <head> | <body> | <table> | <tbody> #data <table><table> #errors Line: 1 Col: 7 Unexpected start tag (table). Expected DOCTYPE. Line: 1 Col: 14 Unexpected start tag (table) implies end tag (table). Line: 1 Col: 14 Unexpected end of file. Expected table content. #document | <html> | <head> | <body> | <table> | <table> #data <table></body></caption></col></colgroup></html></tbody></td></tfoot></th></thead></tr> #errors Line: 1 Col: 7 Unexpected start tag (table). Expected DOCTYPE. Line: 1 Col: 14 Unexpected end tag (body). Ignored. Line: 1 Col: 24 Unexpected end tag (caption). Ignored. Line: 1 Col: 30 Unexpected end tag (col). Ignored. Line: 1 Col: 41 Unexpected end tag (colgroup). Ignored. Line: 1 Col: 48 Unexpected end tag (html). Ignored. Line: 1 Col: 56 Unexpected end tag (tbody). Ignored. Line: 1 Col: 61 Unexpected end tag (td). Ignored. Line: 1 Col: 69 Unexpected end tag (tfoot). Ignored. Line: 1 Col: 74 Unexpected end tag (th). Ignored. Line: 1 Col: 82 Unexpected end tag (thead). Ignored. Line: 1 Col: 87 Unexpected end tag (tr). Ignored. Line: 1 Col: 87 Unexpected end of file. Expected table content. #document | <html> | <head> | <body> | <table> #data </table><tr> #errors Line: 1 Col: 8 Unexpected end tag (table). Ignored. Line: 1 Col: 12 Unexpected end of file. Expected table content. #document-fragment table #document | <tbody> | <tr> #data <body></body></html> #errors Line: 1 Col: 20 Unexpected html end tag in inner html mode. Line: 1 Col: 20 Unexpected EOF in inner html mode. #document-fragment html #document | <head> | <body> #data <html><frameset></frameset></html> #errors Line: 1 Col: 6 Unexpected start tag (html). Expected DOCTYPE. #document | <html> | <head> | <frameset> | " " #data <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01//EN"><html></html> #errors Line: 1 Col: 50 Erroneous DOCTYPE. Line: 1 Col: 63 Unexpected end tag (html) after the (implied) root element. #document | <!DOCTYPE html "-//W3C//DTD HTML 4.01//EN" ""> | <html> | <head> | <body> #data <param><frameset></frameset> #errors Line: 1 Col: 7 Unexpected start tag (param). Expected DOCTYPE. Line: 1 Col: 17 Unexpected start tag (frameset). #document | <html> | <head> | <frameset> #data <source><frameset></frameset> #errors Line: 1 Col: 7 Unexpected start tag (source). Expected DOCTYPE. Line: 1 Col: 17 Unexpected start tag (frameset). #document | <html> | <head> | <frameset> #data <track><frameset></frameset> #errors Line: 1 Col: 7 Unexpected start tag (track). Expected DOCTYPE. Line: 1 Col: 17 Unexpected start tag (frameset). #document | <html> | <head> | <frameset> #data </html><frameset></frameset> #errors 7: End tag seen without seeing a doctype first. Expected “<!DOCTYPE html>â€. 17: Stray “frameset†start tag. 17: “frameset†start tag seen. #document | <html> | <head> | <frameset> #data </body><frameset></frameset> #errors 7: End tag seen without seeing a doctype first. Expected “<!DOCTYPE html>â€. 17: Stray “frameset†start tag. 17: “frameset†start tag seen. #document | <html> | <head> | <frameset> ����������������������������������������������������������������������������������������������������������������������������������������������������������������tests9.dat������������������������������������������������������������������������������������������000664��001750��001750�� 27010�14116150214� 20015� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-fail��������������������������������������������������������������������������������������������������������������������������������#data <!DOCTYPE html><math></math> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <math math> #data <!DOCTYPE html><body><math></math> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <math math> #data <!DOCTYPE html><math><mi> #errors 25: End of file in a foreign namespace context. #document | <!DOCTYPE html> | <html> | <head> | <body> | <math math> | <math mi> #data <!DOCTYPE html><math><annotation-xml><svg><u> #errors 45: HTML start tag “u†in a foreign namespace context. 45: End of file seen and there were open elements. #document | <!DOCTYPE html> | <html> | <head> | <body> | <math math> | <math annotation-xml> | <svg svg> | <u> #data <!DOCTYPE html><body><select><math></math></select> #errors Line: 1 Col: 35 Unexpected start tag token (math) in the select phase. Ignored. Line: 1 Col: 42 Unexpected end tag (math) in the select phase. Ignored. #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> #data <!DOCTYPE html><body><select><option><math></math></option></select> #errors Line: 1 Col: 43 Unexpected start tag token (math) in the select phase. Ignored. Line: 1 Col: 50 Unexpected end tag (math) in the select phase. Ignored. #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> | <option> #data <!DOCTYPE html><body><table><math></math></table> #errors Line: 1 Col: 34 Unexpected start tag (math) in table context caused voodoo mode. Line: 1 Col: 41 Unexpected end tag (math) in table context caused voodoo mode. #document | <!DOCTYPE html> | <html> | <head> | <body> | <math math> | <table> #data <!DOCTYPE html><body><table><math><mi>foo</mi></math></table> #errors Line: 1 Col: 34 Unexpected start tag (math) in table context caused voodoo mode. Line: 1 Col: 46 Unexpected end tag (mi) in table context caused voodoo mode. Line: 1 Col: 53 Unexpected end tag (math) in table context caused voodoo mode. #document | <!DOCTYPE html> | <html> | <head> | <body> | <math math> | <math mi> | "foo" | <table> #data <!DOCTYPE html><body><table><math><mi>foo</mi><mi>bar</mi></math></table> #errors Line: 1 Col: 34 Unexpected start tag (math) in table context caused voodoo mode. Line: 1 Col: 46 Unexpected end tag (mi) in table context caused voodoo mode. Line: 1 Col: 58 Unexpected end tag (mi) in table context caused voodoo mode. Line: 1 Col: 65 Unexpected end tag (math) in table context caused voodoo mode. #document | <!DOCTYPE html> | <html> | <head> | <body> | <math math> | <math mi> | "foo" | <math mi> | "bar" | <table> #data <!DOCTYPE html><body><table><tbody><math><mi>foo</mi><mi>bar</mi></math></tbody></table> #errors Line: 1 Col: 41 Unexpected start tag (math) in table context caused voodoo mode. Line: 1 Col: 53 Unexpected end tag (mi) in table context caused voodoo mode. Line: 1 Col: 65 Unexpected end tag (mi) in table context caused voodoo mode. Line: 1 Col: 72 Unexpected end tag (math) in table context caused voodoo mode. #document | <!DOCTYPE html> | <html> | <head> | <body> | <math math> | <math mi> | "foo" | <math mi> | "bar" | <table> | <tbody> #data <!DOCTYPE html><body><table><tbody><tr><math><mi>foo</mi><mi>bar</mi></math></tr></tbody></table> #errors Line: 1 Col: 45 Unexpected start tag (math) in table context caused voodoo mode. Line: 1 Col: 57 Unexpected end tag (mi) in table context caused voodoo mode. Line: 1 Col: 69 Unexpected end tag (mi) in table context caused voodoo mode. Line: 1 Col: 76 Unexpected end tag (math) in table context caused voodoo mode. #document | <!DOCTYPE html> | <html> | <head> | <body> | <math math> | <math mi> | "foo" | <math mi> | "bar" | <table> | <tbody> | <tr> #data <!DOCTYPE html><body><table><tbody><tr><td><math><mi>foo</mi><mi>bar</mi></math></td></tr></tbody></table> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> | <math math> | <math mi> | "foo" | <math mi> | "bar" #data <!DOCTYPE html><body><table><tbody><tr><td><math><mi>foo</mi><mi>bar</mi></math><p>baz</td></tr></tbody></table> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> | <math math> | <math mi> | "foo" | <math mi> | "bar" | <p> | "baz" #data <!DOCTYPE html><body><table><caption><math><mi>foo</mi><mi>bar</mi></math><p>baz</caption></table> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <caption> | <math math> | <math mi> | "foo" | <math mi> | "bar" | <p> | "baz" #data <!DOCTYPE html><body><table><caption><math><mi>foo</mi><mi>bar</mi><p>baz</table><p>quux #errors Line: 1 Col: 70 HTML start tag "p" in a foreign namespace context. Line: 1 Col: 81 Unexpected end table tag in caption. Generates implied end caption. #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <caption> | <math math> | <math mi> | "foo" | <math mi> | "bar" | <p> | "baz" | <p> | "quux" #data <!DOCTYPE html><body><table><caption><math><mi>foo</mi><mi>bar</mi>baz</table><p>quux #errors Line: 1 Col: 78 Unexpected end table tag in caption. Generates implied end caption. Line: 1 Col: 78 Unexpected end tag (caption). Missing end tag (math). #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <caption> | <math math> | <math mi> | "foo" | <math mi> | "bar" | "baz" | <p> | "quux" #data <!DOCTYPE html><body><table><colgroup><math><mi>foo</mi><mi>bar</mi><p>baz</table><p>quux #errors Line: 1 Col: 44 Unexpected start tag (math) in table context caused voodoo mode. Line: 1 Col: 56 Unexpected end tag (mi) in table context caused voodoo mode. Line: 1 Col: 68 Unexpected end tag (mi) in table context caused voodoo mode. Line: 1 Col: 71 HTML start tag "p" in a foreign namespace context. Line: 1 Col: 71 Unexpected start tag (p) in table context caused voodoo mode. #document | <!DOCTYPE html> | <html> | <head> | <body> | <math math> | <math mi> | "foo" | <math mi> | "bar" | <p> | "baz" | <table> | <colgroup> | <p> | "quux" #data <!DOCTYPE html><body><table><tr><td><select><math><mi>foo</mi><mi>bar</mi><p>baz</table><p>quux #errors Line: 1 Col: 50 Unexpected start tag token (math) in the select phase. Ignored. Line: 1 Col: 54 Unexpected start tag token (mi) in the select phase. Ignored. Line: 1 Col: 62 Unexpected end tag (mi) in the select phase. Ignored. Line: 1 Col: 66 Unexpected start tag token (mi) in the select phase. Ignored. Line: 1 Col: 74 Unexpected end tag (mi) in the select phase. Ignored. Line: 1 Col: 77 Unexpected start tag token (p) in the select phase. Ignored. Line: 1 Col: 88 Unexpected table element end tag (tables) in the select in table phase. #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> | <select> | "foobarbaz" | <p> | "quux" #data <!DOCTYPE html><body><table><select><math><mi>foo</mi><mi>bar</mi><p>baz</table><p>quux #errors Line: 1 Col: 36 Unexpected start tag (select) in table context caused voodoo mode. Line: 1 Col: 42 Unexpected start tag token (math) in the select phase. Ignored. Line: 1 Col: 46 Unexpected start tag token (mi) in the select phase. Ignored. Line: 1 Col: 54 Unexpected end tag (mi) in the select phase. Ignored. Line: 1 Col: 58 Unexpected start tag token (mi) in the select phase. Ignored. Line: 1 Col: 66 Unexpected end tag (mi) in the select phase. Ignored. Line: 1 Col: 69 Unexpected start tag token (p) in the select phase. Ignored. Line: 1 Col: 80 Unexpected table element end tag (tables) in the select in table phase. #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> | "foobarbaz" | <table> | <p> | "quux" #data <!DOCTYPE html><body></body></html><math><mi>foo</mi><mi>bar</mi><p>baz #errors Line: 1 Col: 41 Unexpected start tag (math). Line: 1 Col: 68 HTML start tag "p" in a foreign namespace context. #document | <!DOCTYPE html> | <html> | <head> | <body> | <math math> | <math mi> | "foo" | <math mi> | "bar" | <p> | "baz" #data <!DOCTYPE html><body></body><math><mi>foo</mi><mi>bar</mi><p>baz #errors Line: 1 Col: 34 Unexpected start tag token (math) in the after body phase. Line: 1 Col: 61 HTML start tag "p" in a foreign namespace context. #document | <!DOCTYPE html> | <html> | <head> | <body> | <math math> | <math mi> | "foo" | <math mi> | "bar" | <p> | "baz" #data <!DOCTYPE html><frameset><math><mi></mi><mi></mi><p><span> #errors Line: 1 Col: 31 Unexpected start tag token (math) in the frameset phase. Ignored. Line: 1 Col: 35 Unexpected start tag token (mi) in the frameset phase. Ignored. Line: 1 Col: 40 Unexpected end tag token (mi) in the frameset phase. Ignored. Line: 1 Col: 44 Unexpected start tag token (mi) in the frameset phase. Ignored. Line: 1 Col: 49 Unexpected end tag token (mi) in the frameset phase. Ignored. Line: 1 Col: 52 Unexpected start tag token (p) in the frameset phase. Ignored. Line: 1 Col: 58 Unexpected start tag token (span) in the frameset phase. Ignored. Line: 1 Col: 58 Expected closing tag. Unexpected end of file. #document | <!DOCTYPE html> | <html> | <head> | <frameset> #data <!DOCTYPE html><frameset></frameset><math><mi></mi><mi></mi><p><span> #errors Line: 1 Col: 42 Unexpected start tag (math) in the after frameset phase. Ignored. Line: 1 Col: 46 Unexpected start tag (mi) in the after frameset phase. Ignored. Line: 1 Col: 51 Unexpected end tag (mi) in the after frameset phase. Ignored. Line: 1 Col: 55 Unexpected start tag (mi) in the after frameset phase. Ignored. Line: 1 Col: 60 Unexpected end tag (mi) in the after frameset phase. Ignored. Line: 1 Col: 63 Unexpected start tag (p) in the after frameset phase. Ignored. Line: 1 Col: 69 Unexpected start tag (span) in the after frameset phase. Ignored. #document | <!DOCTYPE html> | <html> | <head> | <frameset> #data <!DOCTYPE html><body xlink:href=foo><math xlink:href=foo></math> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | xlink:href="foo" | <math math> | xlink href="foo" #data <!DOCTYPE html><body xlink:href=foo xml:lang=en><math><mi xml:lang=en xlink:href=foo></mi></math> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | xlink:href="foo" | xml:lang="en" | <math math> | <math mi> | xlink href="foo" | xml lang="en" #data <!DOCTYPE html><body xlink:href=foo xml:lang=en><math><mi xml:lang=en xlink:href=foo /></math> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | xlink:href="foo" | xml:lang="en" | <math math> | <math mi> | xlink href="foo" | xml lang="en" #data <!DOCTYPE html><body xlink:href=foo xml:lang=en><math><mi xml:lang=en xlink:href=foo />bar</math> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | xlink:href="foo" | xml:lang="en" | <math math> | <math mi> | xlink href="foo" | xml lang="en" | "bar" ������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������tests_innerHTML_1.dat�������������������������������������������������������������������������������000664��001750��001750�� 15046�14116150214� 22032� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-fail��������������������������������������������������������������������������������������������������������������������������������#data <body><span> #errors #document-fragment body #document | <span> #data <span><body> #errors #document-fragment body #document | <span> #data <span><body> #errors #document-fragment div #document | <span> #data <body><span> #errors #document-fragment html #document | <head> | <body> | <span> #data <frameset><span> #errors #document-fragment body #document | <span> #data <span><frameset> #errors #document-fragment body #document | <span> #data <span><frameset> #errors #document-fragment div #document | <span> #data <frameset><span> #errors #document-fragment html #document | <head> | <frameset> #data <table><tr> #errors #document-fragment table #document | <tbody> | <tr> #data </table><tr> #errors #document-fragment table #document | <tbody> | <tr> #data <a> #errors #document-fragment table #document | <a> #data <a> #errors #document-fragment table #document | <a> #data <a><caption>a #errors #document-fragment table #document | <a> | <caption> | "a" #data <a><colgroup><col> #errors #document-fragment table #document | <a> | <colgroup> | <col> #data <a><tbody><tr> #errors #document-fragment table #document | <a> | <tbody> | <tr> #data <a><tfoot><tr> #errors #document-fragment table #document | <a> | <tfoot> | <tr> #data <a><thead><tr> #errors #document-fragment table #document | <a> | <thead> | <tr> #data <a><tr> #errors #document-fragment table #document | <a> | <tbody> | <tr> #data <a><th> #errors #document-fragment table #document | <a> | <tbody> | <tr> | <th> #data <a><td> #errors #document-fragment table #document | <a> | <tbody> | <tr> | <td> #data <table></table><tbody> #errors #document-fragment caption #document | <table> #data </table><span> #errors #document-fragment caption #document | <span> #data <span></table> #errors #document-fragment caption #document | <span> #data </caption><span> #errors #document-fragment caption #document | <span> #data <span></caption><span> #errors #document-fragment caption #document | <span> | <span> #data <span><caption><span> #errors #document-fragment caption #document | <span> | <span> #data <span><col><span> #errors #document-fragment caption #document | <span> | <span> #data <span><colgroup><span> #errors #document-fragment caption #document | <span> | <span> #data <span><html><span> #errors #document-fragment caption #document | <span> | <span> #data <span><tbody><span> #errors #document-fragment caption #document | <span> | <span> #data <span><td><span> #errors #document-fragment caption #document | <span> | <span> #data <span><tfoot><span> #errors #document-fragment caption #document | <span> | <span> #data <span><thead><span> #errors #document-fragment caption #document | <span> | <span> #data <span><th><span> #errors #document-fragment caption #document | <span> | <span> #data <span><tr><span> #errors #document-fragment caption #document | <span> | <span> #data <span></table><span> #errors #document-fragment caption #document | <span> | <span> #data </colgroup><col> #errors #document-fragment colgroup #document | <col> #data <a><col> #errors #document-fragment colgroup #document | <col> #data <caption><a> #errors #document-fragment tbody #document | <a> #data <col><a> #errors #document-fragment tbody #document | <a> #data <colgroup><a> #errors #document-fragment tbody #document | <a> #data <tbody><a> #errors #document-fragment tbody #document | <a> #data <tfoot><a> #errors #document-fragment tbody #document | <a> #data <thead><a> #errors #document-fragment tbody #document | <a> #data </table><a> #errors #document-fragment tbody #document | <a> #data <a><tr> #errors #document-fragment tbody #document | <a> | <tr> #data <a><td> #errors #document-fragment tbody #document | <a> | <tr> | <td> #data <a><td> #errors #document-fragment tbody #document | <a> | <tr> | <td> #data <a><td> #errors #document-fragment tbody #document | <a> | <tr> | <td> #data <td><table><tbody><a><tr> #errors #document-fragment tbody #document | <tr> | <td> | <a> | <table> | <tbody> | <tr> #data </tr><td> #errors #document-fragment tr #document | <td> #data <td><table><a><tr></tr><tr> #errors #document-fragment tr #document | <td> | <a> | <table> | <tbody> | <tr> | <tr> #data <caption><td> #errors #document-fragment tr #document | <td> #data <col><td> #errors #document-fragment tr #document | <td> #data <colgroup><td> #errors #document-fragment tr #document | <td> #data <tbody><td> #errors #document-fragment tr #document | <td> #data <tfoot><td> #errors #document-fragment tr #document | <td> #data <thead><td> #errors #document-fragment tr #document | <td> #data <tr><td> #errors #document-fragment tr #document | <td> #data </table><td> #errors #document-fragment tr #document | <td> #data <td><table></table><td> #errors #document-fragment tr #document | <td> | <table> | <td> #data <td><table></table><td> #errors #document-fragment tr #document | <td> | <table> | <td> #data <caption><a> #errors #document-fragment td #document | <a> #data <col><a> #errors #document-fragment td #document | <a> #data <colgroup><a> #errors #document-fragment td #document | <a> #data <tbody><a> #errors #document-fragment td #document | <a> #data <tfoot><a> #errors #document-fragment td #document | <a> #data <th><a> #errors #document-fragment td #document | <a> #data <thead><a> #errors #document-fragment td #document | <a> #data <tr><a> #errors #document-fragment td #document | <a> #data </table><a> #errors #document-fragment td #document | <a> #data </tbody><a> #errors #document-fragment td #document | <a> #data </td><a> #errors #document-fragment td #document | <a> #data </tfoot><a> #errors #document-fragment td #document | <a> #data </thead><a> #errors #document-fragment td #document | <a> #data </th><a> #errors #document-fragment td #document | <a> #data </tr><a> #errors #document-fragment td #document | <a> #data <table><td><td> #errors #document-fragment td #document | <table> | <tbody> | <tr> | <td> | <td> #data </select><option> #errors #document-fragment select #document | <option> #data <input><option> #errors #document-fragment select #document | <option> #data <keygen><option> #errors #document-fragment select #document | <option> #data <textarea><option> #errors #document-fragment select #document | <option> #data </html><!--abc--> #errors #document-fragment html #document | <head> | <body> | <!-- abc --> #data </frameset><frame> #errors #document-fragment frameset #document | <frame> #data #errors #document-fragment html #document | <head> | <body> ������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������adoption01.dat��������������������������������������������������������������������������������������000664��001750��001750�� 4277�14116150214� 20565� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-pass��������������������������������������������������������������������������������������������������������������������������������#data <a><p></a></p> #errors #document | <html> | <head> | <body> | <a> | <p> | <a> #data <a>1<p>2</a>3</p> #errors #document | <html> | <head> | <body> | <a> | "1" | <p> | <a> | "2" | "3" #data <a>1<button>2</a>3</button> #errors #document | <html> | <head> | <body> | <a> | "1" | <button> | <a> | "2" | "3" #data <a>1<b>2</a>3</b> #errors #document | <html> | <head> | <body> | <a> | "1" | <b> | "2" | <b> | "3" #data <a>1<div>2<div>3</a>4</div>5</div> #errors #document | <html> | <head> | <body> | <a> | "1" | <div> | <a> | "2" | <div> | <a> | "3" | "4" | "5" #data <table><a>1<p>2</a>3</p> #errors #document | <html> | <head> | <body> | <a> | "1" | <p> | <a> | "2" | "3" | <table> #data <b><b><a><p></a> #errors #document | <html> | <head> | <body> | <b> | <b> | <a> | <p> | <a> #data <b><a><b><p></a> #errors #document | <html> | <head> | <body> | <b> | <a> | <b> | <b> | <p> | <a> #data <a><b><b><p></a> #errors #document | <html> | <head> | <body> | <a> | <b> | <b> | <b> | <b> | <p> | <a> #data <p>1<s id="A">2<b id="B">3</p>4</s>5</b> #errors #document | <html> | <head> | <body> | <p> | "1" | <s> | id="A" | "2" | <b> | id="B" | "3" | <s> | id="A" | <b> | id="B" | "4" | <b> | id="B" | "5" #data <table><a>1<td>2</td>3</table> #errors #document | <html> | <head> | <body> | <a> | "1" | <a> | "3" | <table> | <tbody> | <tr> | <td> | "2" #data <table>A<td>B</td>C</table> #errors #document | <html> | <head> | <body> | "AC" | <table> | <tbody> | <tr> | <td> | "B" #data <a><svg><tr><input></a> #errors #document | <html> | <head> | <body> | <a> | <svg svg> | <svg tr> | <svg input> ���������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������adoption02.dat��������������������������������������������������������������������������������������000664��001750��001750�� 571�14116150214� 20537� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-pass��������������������������������������������������������������������������������������������������������������������������������#data <b>1<i>2<p>3</b>4 #errors #document | <html> | <head> | <body> | <b> | "1" | <i> | "2" | <i> | <p> | <b> | "3" | "4" #data <a><div><style></style><address><a> #errors #document | <html> | <head> | <body> | <a> | <div> | <a> | <style> | <address> | <a> | <a> ���������������������������������������������������������������������������������������������������������������������������������������comments01.dat��������������������������������������������������������������������������������������000664��001750��001750�� 3045�14116150214� 20565� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-pass��������������������������������������������������������������������������������������������������������������������������������#data FOO<!-- BAR -->BAZ #errors #document | <html> | <head> | <body> | "FOO" | <!-- BAR --> | "BAZ" #data FOO<!-- BAR --!>BAZ #errors #document | <html> | <head> | <body> | "FOO" | <!-- BAR --> | "BAZ" #data FOO<!-- BAR -- >BAZ #errors #document | <html> | <head> | <body> | "FOO" | <!-- BAR -- >BAZ --> #data FOO<!-- BAR -- <QUX> -- MUX -->BAZ #errors #document | <html> | <head> | <body> | "FOO" | <!-- BAR -- <QUX> -- MUX --> | "BAZ" #data FOO<!-- BAR -- <QUX> -- MUX --!>BAZ #errors #document | <html> | <head> | <body> | "FOO" | <!-- BAR -- <QUX> -- MUX --> | "BAZ" #data FOO<!-- BAR -- <QUX> -- MUX -- >BAZ #errors #document | <html> | <head> | <body> | "FOO" | <!-- BAR -- <QUX> -- MUX -- >BAZ --> #data FOO<!---->BAZ #errors #document | <html> | <head> | <body> | "FOO" | <!-- --> | "BAZ" #data FOO<!--->BAZ #errors #document | <html> | <head> | <body> | "FOO" | <!-- --> | "BAZ" #data FOO<!-->BAZ #errors #document | <html> | <head> | <body> | "FOO" | <!-- --> | "BAZ" #data <?xml version="1.0">Hi #errors #document | <!-- ?xml version="1.0" --> | <html> | <head> | <body> | "Hi" #data <?xml version="1.0"> #errors #document | <!-- ?xml version="1.0" --> | <html> | <head> | <body> #data <?xml version #errors #document | <!-- ?xml version --> | <html> | <head> | <body> #data FOO<!----->BAZ #errors #document | <html> | <head> | <body> | "FOO" | <!-- - --> | "BAZ" �������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������doctype01.dat���������������������������������������������������������������������������������������000664��001750��001750�� 13261�14116150214� 20430� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-pass��������������������������������������������������������������������������������������������������������������������������������#data <!DOCTYPE html>Hello #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | "Hello" #data <!dOctYpE HtMl>Hello #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | "Hello" #data <!DOCTYPEhtml>Hello #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE>Hello #errors #document | <!DOCTYPE > | <html> | <head> | <body> | "Hello" #data <!DOCTYPE >Hello #errors #document | <!DOCTYPE > | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato>Hello #errors #document | <!DOCTYPE potato> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato >Hello #errors #document | <!DOCTYPE potato> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato taco>Hello #errors #document | <!DOCTYPE potato> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato taco "ddd>Hello #errors #document | <!DOCTYPE potato> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato sYstEM>Hello #errors #document | <!DOCTYPE potato> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato sYstEM >Hello #errors #document | <!DOCTYPE potato> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato sYstEM ggg>Hello #errors #document | <!DOCTYPE potato> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato SYSTEM taco >Hello #errors #document | <!DOCTYPE potato> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato SYSTEM 'taco"'>Hello #errors #document | <!DOCTYPE potato "" "taco""> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato SYSTEM "taco">Hello #errors #document | <!DOCTYPE potato "" "taco"> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato SYSTEM "tai'co">Hello #errors #document | <!DOCTYPE potato "" "tai'co"> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato SYSTEMtaco "ddd">Hello #errors #document | <!DOCTYPE potato> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato grass SYSTEM taco>Hello #errors #document | <!DOCTYPE potato> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato pUbLIc>Hello #errors #document | <!DOCTYPE potato> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato pUbLIc >Hello #errors #document | <!DOCTYPE potato> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato pUbLIcgoof>Hello #errors #document | <!DOCTYPE potato> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato PUBLIC goof>Hello #errors #document | <!DOCTYPE potato> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato PUBLIC "go'of">Hello #errors #document | <!DOCTYPE potato "go'of" ""> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato PUBLIC 'go'of'>Hello #errors #document | <!DOCTYPE potato "go" ""> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato PUBLIC 'go:hh of' >Hello #errors #document | <!DOCTYPE potato "go:hh of" ""> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE potato PUBLIC "W3C-//dfdf" SYSTEM ggg>Hello #errors #document | <!DOCTYPE potato "W3C-//dfdf" ""> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">Hello #errors #document | <!DOCTYPE html "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd"> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE ...>Hello #errors #document | <!DOCTYPE ...> | <html> | <head> | <body> | "Hello" #data <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> #errors #document | <!DOCTYPE html "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> | <html> | <head> | <body> #data <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Frameset//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-frameset.dtd"> #errors #document | <!DOCTYPE html "-//W3C//DTD XHTML 1.0 Frameset//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-frameset.dtd"> | <html> | <head> | <body> #data <!DOCTYPE root-element [SYSTEM OR PUBLIC FPI] "uri" [ <!-- internal declarations --> ]> #errors #document | <!DOCTYPE root-element> | <html> | <head> | <body> | "]>" #data <!DOCTYPE html PUBLIC "-//WAPFORUM//DTD XHTML Mobile 1.0//EN" "http://www.wapforum.org/DTD/xhtml-mobile10.dtd"> #errors #document | <!DOCTYPE html "-//WAPFORUM//DTD XHTML Mobile 1.0//EN" "http://www.wapforum.org/DTD/xhtml-mobile10.dtd"> | <html> | <head> | <body> #data <!DOCTYPE HTML SYSTEM "http://www.w3.org/DTD/HTML4-strict.dtd"><body><b>Mine!</b></body> #errors #document | <!DOCTYPE html "" "http://www.w3.org/DTD/HTML4-strict.dtd"> | <html> | <head> | <body> | <b> | "Mine!" #data <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN""http://www.w3.org/TR/html4/strict.dtd"> #errors #document | <!DOCTYPE html "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd"> | <html> | <head> | <body> #data <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN"'http://www.w3.org/TR/html4/strict.dtd'> #errors #document | <!DOCTYPE html "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd"> | <html> | <head> | <body> #data <!DOCTYPE HTML PUBLIC"-//W3C//DTD HTML 4.01//EN"'http://www.w3.org/TR/html4/strict.dtd'> #errors #document | <!DOCTYPE html "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd"> | <html> | <head> | <body> #data <!DOCTYPE HTML PUBLIC'-//W3C//DTD HTML 4.01//EN''http://www.w3.org/TR/html4/strict.dtd'> #errors #document | <!DOCTYPE html "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd"> | <html> | <head> | <body> �����������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������entities01.dat��������������������������������������������������������������������������������������000664��001750��001750�� 13421�14116150214� 20603� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-pass��������������������������������������������������������������������������������������������������������������������������������#data FOO&gt;BAR #errors #document | <html> | <head> | <body> | "FOO>BAR" #data FOO&gtBAR #errors #document | <html> | <head> | <body> | "FOO>BAR" #data FOO&gt BAR #errors #document | <html> | <head> | <body> | "FOO> BAR" #data FOO&gt;;;BAR #errors #document | <html> | <head> | <body> | "FOO>;;BAR" #data I'm &notit; I tell you #errors #document | <html> | <head> | <body> | "I'm ¬it; I tell you" #data I'm &notin; I tell you #errors #document | <html> | <head> | <body> | "I'm ∉ I tell you" #data FOO& BAR #errors #document | <html> | <head> | <body> | "FOO& BAR" #data FOO&<BAR> #errors #document | <html> | <head> | <body> | "FOO&" | <bar> #data FOO&&&&gt;BAR #errors #document | <html> | <head> | <body> | "FOO&&&>BAR" #data FOO&#41;BAR #errors #document | <html> | <head> | <body> | "FOO)BAR" #data FOO&#x41;BAR #errors #document | <html> | <head> | <body> | "FOOABAR" #data FOO&#X41;BAR #errors #document | <html> | <head> | <body> | "FOOABAR" #data FOO&#BAR #errors #document | <html> | <head> | <body> | "FOO&#BAR" #data FOO&#ZOO #errors #document | <html> | <head> | <body> | "FOO&#ZOO" #data FOO&#xBAR #errors #document | <html> | <head> | <body> | "FOOºR" #data FOO&#xZOO #errors #document | <html> | <head> | <body> | "FOO&#xZOO" #data FOO&#XZOO #errors #document | <html> | <head> | <body> | "FOO&#XZOO" #data FOO&#41BAR #errors #document | <html> | <head> | <body> | "FOO)BAR" #data FOO&#x41BAR #errors #document | <html> | <head> | <body> | "FOO䆺R" #data FOO&#x41ZOO #errors #document | <html> | <head> | <body> | "FOOAZOO" #data FOO&#x0000;ZOO #errors #document | <html> | <head> | <body> | "FOO�ZOO" #data FOO&#x0078;ZOO #errors #document | <html> | <head> | <body> | "FOOxZOO" #data FOO&#x0079;ZOO #errors #document | <html> | <head> | <body> | "FOOyZOO" #data FOO&#x0080;ZOO #errors #document | <html> | <head> | <body> | "FOO€ZOO" #data FOO&#x0081;ZOO #errors #document | <html> | <head> | <body> | "FOOÂZOO" #data FOO&#x0082;ZOO #errors #document | <html> | <head> | <body> | "FOO‚ZOO" #data FOO&#x0083;ZOO #errors #document | <html> | <head> | <body> | "FOOÆ’ZOO" #data FOO&#x0084;ZOO #errors #document | <html> | <head> | <body> | "FOO„ZOO" #data FOO&#x0085;ZOO #errors #document | <html> | <head> | <body> | "FOO…ZOO" #data FOO&#x0086;ZOO #errors #document | <html> | <head> | <body> | "FOO†ZOO" #data FOO&#x0087;ZOO #errors #document | <html> | <head> | <body> | "FOO‡ZOO" #data FOO&#x0088;ZOO #errors #document | <html> | <head> | <body> | "FOOˆZOO" #data FOO&#x0089;ZOO #errors #document | <html> | <head> | <body> | "FOO‰ZOO" #data FOO&#x008A;ZOO #errors #document | <html> | <head> | <body> | "FOOÅ ZOO" #data FOO&#x008B;ZOO #errors #document | <html> | <head> | <body> | "FOO‹ZOO" #data FOO&#x008C;ZOO #errors #document | <html> | <head> | <body> | "FOOÅ’ZOO" #data FOO&#x008D;ZOO #errors #document | <html> | <head> | <body> | "FOOÂZOO" #data FOO&#x008E;ZOO #errors #document | <html> | <head> | <body> | "FOOŽZOO" #data FOO&#x008F;ZOO #errors #document | <html> | <head> | <body> | "FOOÂZOO" #data FOO&#x0090;ZOO #errors #document | <html> | <head> | <body> | "FOOÂZOO" #data FOO&#x0091;ZOO #errors #document | <html> | <head> | <body> | "FOO‘ZOO" #data FOO&#x0092;ZOO #errors #document | <html> | <head> | <body> | "FOO’ZOO" #data FOO&#x0093;ZOO #errors #document | <html> | <head> | <body> | "FOO“ZOO" #data FOO&#x0094;ZOO #errors #document | <html> | <head> | <body> | "FOOâ€ZOO" #data FOO&#x0095;ZOO #errors #document | <html> | <head> | <body> | "FOO•ZOO" #data FOO&#x0096;ZOO #errors #document | <html> | <head> | <body> | "FOO–ZOO" #data FOO&#x0097;ZOO #errors #document | <html> | <head> | <body> | "FOO—ZOO" #data FOO&#x0098;ZOO #errors #document | <html> | <head> | <body> | "FOOËœZOO" #data FOO&#x0099;ZOO #errors #document | <html> | <head> | <body> | "FOOâ„¢ZOO" #data FOO&#x009A;ZOO #errors #document | <html> | <head> | <body> | "FOOÅ¡ZOO" #data FOO&#x009B;ZOO #errors #document | <html> | <head> | <body> | "FOO›ZOO" #data FOO&#x009C;ZOO #errors #document | <html> | <head> | <body> | "FOOÅ“ZOO" #data FOO&#x009D;ZOO #errors #document | <html> | <head> | <body> | "FOOÂZOO" #data FOO&#x009E;ZOO #errors #document | <html> | <head> | <body> | "FOOžZOO" #data FOO&#x009F;ZOO #errors #document | <html> | <head> | <body> | "FOOŸZOO" #data FOO&#x00A0;ZOO #errors #document | <html> | <head> | <body> | "FOO ZOO" #data FOO&#xD7FF;ZOO #errors #document | <html> | <head> | <body> | "FOO퟿ZOO" #data FOO&#xD800;ZOO #errors #document | <html> | <head> | <body> | "FOO�ZOO" #data FOO&#xD801;ZOO #errors #document | <html> | <head> | <body> | "FOO�ZOO" #data FOO&#xDFFE;ZOO #errors #document | <html> | <head> | <body> | "FOO�ZOO" #data FOO&#xDFFF;ZOO #errors #document | <html> | <head> | <body> | "FOO�ZOO" #data FOO&#xE000;ZOO #errors #document | <html> | <head> | <body> | "FOOZOO" #data FOO&#x10FFFE;ZOO #errors #document | <html> | <head> | <body> | "FOOô¿¾ZOO" #data FOO&#x1087D4;ZOO #errors #document | <html> | <head> | <body> | "FOOôˆŸ”ZOO" #data FOO&#x10FFFF;ZOO #errors #document | <html> | <head> | <body> | "FOOô¿¿ZOO" #data FOO&#x110000;ZOO #errors #document | <html> | <head> | <body> | "FOO�ZOO" #data FOO&#xFFFFFF;ZOO #errors #document | <html> | <head> | <body> | "FOO�ZOO" �����������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������html5test-com.dat�����������������������������������������������������������������������������������000664��001750��001750�� 5162�14116150214� 21306� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-pass��������������������������������������������������������������������������������������������������������������������������������#data <div<div> #errors #document | <html> | <head> | <body> | <div<div> #data <div foo<bar=''> #errors #document | <html> | <head> | <body> | <div> | foo<bar="" #data <div foo=`bar`> #errors #document | <html> | <head> | <body> | <div> | foo="`bar`" #data <div \"foo=''> #errors #document | <html> | <head> | <body> | <div> | \"foo="" #data <a href='\nbar'></a> #errors #document | <html> | <head> | <body> | <a> | href="\nbar" #data <!DOCTYPE html> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> #data &lang;&rang; #errors #document | <html> | <head> | <body> | "⟨⟩" #data &apos; #errors #document | <html> | <head> | <body> | "'" #data &ImaginaryI; #errors #document | <html> | <head> | <body> | "â…ˆ" #data &Kopf; #errors #document | <html> | <head> | <body> | "ð•‚" #data &notinva; #errors #document | <html> | <head> | <body> | "∉" #data <?import namespace="foo" implementation="#bar"> #errors #document | <!-- ?import namespace="foo" implementation="#bar" --> | <html> | <head> | <body> #data <!--foo--bar--> #errors #document | <!-- foo--bar --> | <html> | <head> | <body> #data <![CDATA[x]]> #errors #document | <!-- [CDATA[x]] --> | <html> | <head> | <body> #data <textarea><!--</textarea>--></textarea> #errors #document | <html> | <head> | <body> | <textarea> | "<!--" | "-->" #data <textarea><!--</textarea>--> #errors #document | <html> | <head> | <body> | <textarea> | "<!--" | "-->" #data <style><!--</style>--></style> #errors #document | <html> | <head> | <style> | "<!--" | <body> | "-->" #data <style><!--</style>--> #errors #document | <html> | <head> | <style> | "<!--" | <body> | "-->" #data <ul><li>A </li> <li>B</li></ul> #errors #document | <html> | <head> | <body> | <ul> | <li> | "A " | " " | <li> | "B" #data <table><form><input type=hidden><input></form><div></div></table> #errors #document | <html> | <head> | <body> | <input> | <div> | <table> | <form> | <input> | type="hidden" #data <i>A<b>B<p></i>C</b>D #errors #document | <html> | <head> | <body> | <i> | "A" | <b> | "B" | <b> | <p> | <b> | <i> | "C" | "D" #data <div></div> #errors #document | <html> | <head> | <body> | <div> #data <svg></svg> #errors #document | <html> | <head> | <body> | <svg svg> #data <math></math> #errors #document | <html> | <head> | <body> | <math math> ��������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������inbody01.dat����������������������������������������������������������������������������������������000664��001750��001750�� 665�14116150214� 20211� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-pass��������������������������������������������������������������������������������������������������������������������������������#data <button>1</foo> #errors #document | <html> | <head> | <body> | <button> | "1" #data <foo>1<p>2</foo> #errors #document | <html> | <head> | <body> | <foo> | "1" | <p> | "2" #data <dd>1</foo> #errors #document | <html> | <head> | <body> | <dd> | "1" #data <foo>1<dd>2</foo> #errors #document | <html> | <head> | <body> | <foo> | "1" | <dd> | "2" ���������������������������������������������������������������������������isindex.dat�����������������������������������������������������������������������������������������000664��001750��001750�� 1074�14116150214� 20242� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-pass��������������������������������������������������������������������������������������������������������������������������������#data <isindex> #errors #document | <html> | <head> | <body> | <form> | <hr> | <label> | "This is a searchable index. Enter search keywords: " | <input> | name="isindex" | <hr> #data <isindex name="A" action="B" prompt="C" foo="D"> #errors #document | <html> | <head> | <body> | <form> | action="B" | <hr> | <label> | "C" | <input> | foo="D" | name="isindex" | <hr> #data <form><isindex> #errors #document | <html> | <head> | <body> | <form> ��������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������pending-spec-changes-plain-text-unsafe.dat����������������������������������������������������������000664��001750��001750�� 163�14116150214� 26101� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-pass��������������������������������������������������������������������������������������������������������������������������������#data <body><table>�filler�text� #errors #document | <html> | <head> | <body> | "fillertext" | <table> �������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������pending-spec-changes.dat����������������������������������������������������������������������������000664��001750��001750�� 2455�14116150214� 22565� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-pass��������������������������������������������������������������������������������������������������������������������������������#data <input type="hidden"><frameset> #errors 21: Start tag seen without seeing a doctype first. Expected “<!DOCTYPE html>â€. 31: “frameset†start tag seen. 31: End of file seen and there were open elements. #document | <html> | <head> | <frameset> #data <!DOCTYPE html><table><caption><svg>foo</table>bar #errors 47: End tag “table†did not match the name of the current open element (“svgâ€). 47: “table†closed but “caption†was still open. 47: End tag “table†seen, but there were open elements. 36: Unclosed element “svgâ€. #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <caption> | <svg svg> | "foo" | "bar" #data <table><tr><td><svg><desc><td></desc><circle> #errors 7: Start tag seen without seeing a doctype first. Expected “<!DOCTYPE html>â€. 30: A table cell was implicitly closed, but there were open elements. 26: Unclosed element “descâ€. 20: Unclosed element “svgâ€. 37: Stray end tag “descâ€. 45: End of file seen and there were open elements. 45: Unclosed element “circleâ€. 7: Unclosed element “tableâ€. #document | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> | <svg svg> | <svg desc> | <td> | <circle> �������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������scriptdata01.dat������������������������������������������������������������������������������������000664��001750��001750�� 10301�14116150214� 21107� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-pass��������������������������������������������������������������������������������������������������������������������������������#data FOO<script>'Hello'</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | "'Hello'" | "BAR" #data FOO<script></script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | "BAR" #data FOO<script></script >BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | "BAR" #data FOO<script></script/>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | "BAR" #data FOO<script></script/ >BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | "BAR" #data FOO<script type="text/plain"></scriptx>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | type="text/plain" | "</scriptx>BAR" #data FOO<script></script foo=">" dd>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | "BAR" #data FOO<script>'<'</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | "'<'" | "BAR" #data FOO<script>'<!'</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | "'<!'" | "BAR" #data FOO<script>'<!-'</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | "'<!-'" | "BAR" #data FOO<script>'<!--'</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | "'<!--'" | "BAR" #data FOO<script>'<!---'</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | "'<!---'" | "BAR" #data FOO<script>'<!-->'</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | "'<!-->'" | "BAR" #data FOO<script>'<!-->'</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | "'<!-->'" | "BAR" #data FOO<script>'<!-- potato'</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | "'<!-- potato'" | "BAR" #data FOO<script>'<!-- <sCrIpt'</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | "'<!-- <sCrIpt'" | "BAR" #data FOO<script type="text/plain">'<!-- <sCrIpt>'</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | type="text/plain" | "'<!-- <sCrIpt>'</script>BAR" #data FOO<script type="text/plain">'<!-- <sCrIpt> -'</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | type="text/plain" | "'<!-- <sCrIpt> -'</script>BAR" #data FOO<script type="text/plain">'<!-- <sCrIpt> --'</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | type="text/plain" | "'<!-- <sCrIpt> --'</script>BAR" #data FOO<script>'<!-- <sCrIpt> -->'</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | "'<!-- <sCrIpt> -->'" | "BAR" #data FOO<script type="text/plain">'<!-- <sCrIpt> --!>'</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | type="text/plain" | "'<!-- <sCrIpt> --!>'</script>BAR" #data FOO<script type="text/plain">'<!-- <sCrIpt> -- >'</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | type="text/plain" | "'<!-- <sCrIpt> -- >'</script>BAR" #data FOO<script type="text/plain">'<!-- <sCrIpt '</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | type="text/plain" | "'<!-- <sCrIpt '</script>BAR" #data FOO<script type="text/plain">'<!-- <sCrIpt/'</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | type="text/plain" | "'<!-- <sCrIpt/'</script>BAR" #data FOO<script type="text/plain">'<!-- <sCrIpt\'</script>BAR #errors #document | <html> | <head> | <body> | "FOO" | <script> | type="text/plain" | "'<!-- <sCrIpt\'" | "BAR" #data FOO<script type="text/plain">'<!-- <sCrIpt/'</script>BAR</script>QUX #errors #document | <html> | <head> | <body> | "FOO" | <script> | type="text/plain" | "'<!-- <sCrIpt/'</script>BAR" | "QUX" �������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������tables01.dat����������������������������������������������������������������������������������������000664��001750��001750�� 5322�14116150214� 20212� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-pass��������������������������������������������������������������������������������������������������������������������������������#data <table><th> #errors #document | <html> | <head> | <body> | <table> | <tbody> | <tr> | <th> #data <table><td> #errors #document | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> #data <table><col foo='bar'> #errors #document | <html> | <head> | <body> | <table> | <colgroup> | <col> | foo="bar" #data <table><colgroup></html>foo #errors #document | <html> | <head> | <body> | "foo" | <table> | <colgroup> #data <table></table><p>foo #errors #document | <html> | <head> | <body> | <table> | <p> | "foo" #data <table></body></caption></col></colgroup></html></tbody></td></tfoot></th></thead></tr><td> #errors #document | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> #data <table><select><option>3</select></table> #errors #document | <html> | <head> | <body> | <select> | <option> | "3" | <table> #data <table><select><table></table></select></table> #errors #document | <html> | <head> | <body> | <select> | <table> | <table> #data <table><select></table> #errors #document | <html> | <head> | <body> | <select> | <table> #data <table><select><option>A<tr><td>B</td></tr></table> #errors #document | <html> | <head> | <body> | <select> | <option> | "A" | <table> | <tbody> | <tr> | <td> | "B" #data <table><td></body></caption></col></colgroup></html>foo #errors #document | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> | "foo" #data <table><td>A</table>B #errors #document | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> | "A" | "B" #data <table><tr><caption> #errors #document | <html> | <head> | <body> | <table> | <tbody> | <tr> | <caption> #data <table><tr></body></caption></col></colgroup></html></td></th><td>foo #errors #document | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> | "foo" #data <table><td><tr> #errors #document | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> | <tr> #data <table><td><button><td> #errors #document | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> | <button> | <td> #data <table><tr><td><svg><desc><td> #errors #document | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> | <svg svg> | <svg desc> | <td> ��������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������tests10.dat�����������������������������������������������������������������������������������������000664��001750��001750�� 33335�14116150214� 20127� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-pass��������������������������������������������������������������������������������������������������������������������������������#data <!DOCTYPE html><svg></svg> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <svg svg> #data <!DOCTYPE html><svg></svg><![CDATA[a]]> #errors 29: Bogus comment #document | <!DOCTYPE html> | <html> | <head> | <body> | <svg svg> | <!-- [CDATA[a]] --> #data <!DOCTYPE html><body><svg></svg> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <svg svg> #data <!DOCTYPE html><body><select><svg></svg></select> #errors 35: Stray “svg†start tag. 42: Stray end tag “svg†#document | <!DOCTYPE html> | <html> | <head> | <body> | <select> #data <!DOCTYPE html><body><select><option><svg></svg></option></select> #errors 43: Stray “svg†start tag. 50: Stray end tag “svg†#document | <!DOCTYPE html> | <html> | <head> | <body> | <select> | <option> #data <!DOCTYPE html><body><table><svg></svg></table> #errors 34: Start tag “svg†seen in “tableâ€. 41: Stray end tag “svgâ€. #document | <!DOCTYPE html> | <html> | <head> | <body> | <svg svg> | <table> #data <!DOCTYPE html><body><table><svg><g>foo</g></svg></table> #errors 34: Start tag “svg†seen in “tableâ€. 46: Stray end tag “gâ€. 53: Stray end tag “svgâ€. #document | <!DOCTYPE html> | <html> | <head> | <body> | <svg svg> | <svg g> | "foo" | <table> #data <!DOCTYPE html><body><table><svg><g>foo</g><g>bar</g></svg></table> #errors 34: Start tag “svg†seen in “tableâ€. 46: Stray end tag “gâ€. 58: Stray end tag “gâ€. 65: Stray end tag “svgâ€. #document | <!DOCTYPE html> | <html> | <head> | <body> | <svg svg> | <svg g> | "foo" | <svg g> | "bar" | <table> #data <!DOCTYPE html><body><table><tbody><svg><g>foo</g><g>bar</g></svg></tbody></table> #errors 41: Start tag “svg†seen in “tableâ€. 53: Stray end tag “gâ€. 65: Stray end tag “gâ€. 72: Stray end tag “svgâ€. #document | <!DOCTYPE html> | <html> | <head> | <body> | <svg svg> | <svg g> | "foo" | <svg g> | "bar" | <table> | <tbody> #data <!DOCTYPE html><body><table><tbody><tr><svg><g>foo</g><g>bar</g></svg></tr></tbody></table> #errors 45: Start tag “svg†seen in “tableâ€. 57: Stray end tag “gâ€. 69: Stray end tag “gâ€. 76: Stray end tag “svgâ€. #document | <!DOCTYPE html> | <html> | <head> | <body> | <svg svg> | <svg g> | "foo" | <svg g> | "bar" | <table> | <tbody> | <tr> #data <!DOCTYPE html><body><table><tbody><tr><td><svg><g>foo</g><g>bar</g></svg></td></tr></tbody></table> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> | <svg svg> | <svg g> | "foo" | <svg g> | "bar" #data <!DOCTYPE html><body><table><tbody><tr><td><svg><g>foo</g><g>bar</g></svg><p>baz</td></tr></tbody></table> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> | <svg svg> | <svg g> | "foo" | <svg g> | "bar" | <p> | "baz" #data <!DOCTYPE html><body><table><caption><svg><g>foo</g><g>bar</g></svg><p>baz</caption></table> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <caption> | <svg svg> | <svg g> | "foo" | <svg g> | "bar" | <p> | "baz" #data <!DOCTYPE html><body><table><caption><svg><g>foo</g><g>bar</g><p>baz</table><p>quux #errors 70: HTML start tag “p†in a foreign namespace context. 81: “table†closed but “caption†was still open. #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <caption> | <svg svg> | <svg g> | "foo" | <svg g> | "bar" | <p> | "baz" | <p> | "quux" #data <!DOCTYPE html><body><table><caption><svg><g>foo</g><g>bar</g>baz</table><p>quux #errors 78: “table†closed but “caption†was still open. 78: Unclosed elements on stack. #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <caption> | <svg svg> | <svg g> | "foo" | <svg g> | "bar" | "baz" | <p> | "quux" #data <!DOCTYPE html><body><table><colgroup><svg><g>foo</g><g>bar</g><p>baz</table><p>quux #errors 44: Start tag “svg†seen in “tableâ€. 56: Stray end tag “gâ€. 68: Stray end tag “gâ€. 71: HTML start tag “p†in a foreign namespace context. 71: Start tag “p†seen in “tableâ€. #document | <!DOCTYPE html> | <html> | <head> | <body> | <svg svg> | <svg g> | "foo" | <svg g> | "bar" | <p> | "baz" | <table> | <colgroup> | <p> | "quux" #data <!DOCTYPE html><body><table><tr><td><select><svg><g>foo</g><g>bar</g><p>baz</table><p>quux #errors 50: Stray “svg†start tag. 54: Stray “g†start tag. 62: Stray end tag “g†66: Stray “g†start tag. 74: Stray end tag “g†77: Stray “p†start tag. 88: “table†end tag with “select†open. #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> | <select> | "foobarbaz" | <p> | "quux" #data <!DOCTYPE html><body><table><select><svg><g>foo</g><g>bar</g><p>baz</table><p>quux #errors 36: Start tag “select†seen in “tableâ€. 42: Stray “svg†start tag. 46: Stray “g†start tag. 54: Stray end tag “g†58: Stray “g†start tag. 66: Stray end tag “g†69: Stray “p†start tag. 80: “table†end tag with “select†open. #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> | "foobarbaz" | <table> | <p> | "quux" #data <!DOCTYPE html><body></body></html><svg><g>foo</g><g>bar</g><p>baz #errors 41: Stray “svg†start tag. 68: HTML start tag “p†in a foreign namespace context. #document | <!DOCTYPE html> | <html> | <head> | <body> | <svg svg> | <svg g> | "foo" | <svg g> | "bar" | <p> | "baz" #data <!DOCTYPE html><body></body><svg><g>foo</g><g>bar</g><p>baz #errors 34: Stray “svg†start tag. 61: HTML start tag “p†in a foreign namespace context. #document | <!DOCTYPE html> | <html> | <head> | <body> | <svg svg> | <svg g> | "foo" | <svg g> | "bar" | <p> | "baz" #data <!DOCTYPE html><frameset><svg><g></g><g></g><p><span> #errors 31: Stray “svg†start tag. 35: Stray “g†start tag. 40: Stray end tag “g†44: Stray “g†start tag. 49: Stray end tag “g†52: Stray “p†start tag. 58: Stray “span†start tag. 58: End of file seen and there were open elements. #document | <!DOCTYPE html> | <html> | <head> | <frameset> #data <!DOCTYPE html><frameset></frameset><svg><g></g><g></g><p><span> #errors 42: Stray “svg†start tag. 46: Stray “g†start tag. 51: Stray end tag “g†55: Stray “g†start tag. 60: Stray end tag “g†63: Stray “p†start tag. 69: Stray “span†start tag. #document | <!DOCTYPE html> | <html> | <head> | <frameset> #data <!DOCTYPE html><body xlink:href=foo><svg xlink:href=foo></svg> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | xlink:href="foo" | <svg svg> | xlink href="foo" #data <!DOCTYPE html><body xlink:href=foo xml:lang=en><svg><g xml:lang=en xlink:href=foo></g></svg> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | xlink:href="foo" | xml:lang="en" | <svg svg> | <svg g> | xlink href="foo" | xml lang="en" #data <!DOCTYPE html><body xlink:href=foo xml:lang=en><svg><g xml:lang=en xlink:href=foo /></svg> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | xlink:href="foo" | xml:lang="en" | <svg svg> | <svg g> | xlink href="foo" | xml lang="en" #data <!DOCTYPE html><body xlink:href=foo xml:lang=en><svg><g xml:lang=en xlink:href=foo />bar</svg> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | xlink:href="foo" | xml:lang="en" | <svg svg> | <svg g> | xlink href="foo" | xml lang="en" | "bar" #data <svg></path> #errors #document | <html> | <head> | <body> | <svg svg> #data <div><svg></div>a #errors #document | <html> | <head> | <body> | <div> | <svg svg> | "a" #data <div><svg><path></div>a #errors #document | <html> | <head> | <body> | <div> | <svg svg> | <svg path> | "a" #data <div><svg><path></svg><path> #errors #document | <html> | <head> | <body> | <div> | <svg svg> | <svg path> | <path> #data <div><svg><path><foreignObject><math></div>a #errors #document | <html> | <head> | <body> | <div> | <svg svg> | <svg path> | <svg foreignObject> | <math math> | "a" #data <div><svg><path><foreignObject><p></div>a #errors #document | <html> | <head> | <body> | <div> | <svg svg> | <svg path> | <svg foreignObject> | <p> | "a" #data <!DOCTYPE html><svg><desc><div><svg><ul>a #errors 40: HTML start tag “ul†in a foreign namespace context. 41: End of file in a foreign namespace context. #document | <!DOCTYPE html> | <html> | <head> | <body> | <svg svg> | <svg desc> | <div> | <svg svg> | <ul> | "a" #data <!DOCTYPE html><svg><desc><svg><ul>a #errors 35: HTML start tag “ul†in a foreign namespace context. 36: End of file in a foreign namespace context. #document | <!DOCTYPE html> | <html> | <head> | <body> | <svg svg> | <svg desc> | <svg svg> | <ul> | "a" #data <!DOCTYPE html><p><svg><desc><p> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <p> | <svg svg> | <svg desc> | <p> #data <!DOCTYPE html><p><svg><title><p> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <p> | <svg svg> | <svg title> | <p> #data <div><svg><path><foreignObject><p></foreignObject><p> #errors #document | <html> | <head> | <body> | <div> | <svg svg> | <svg path> | <svg foreignObject> | <p> | <p> #data <math><mi><div><object><div><span></span></div></object></div></mi><mi> #errors #document | <html> | <head> | <body> | <math math> | <math mi> | <div> | <object> | <div> | <span> | <math mi> #data <math><mi><svg><foreignObject><div><div></div></div></foreignObject></svg></mi><mi> #errors #document | <html> | <head> | <body> | <math math> | <math mi> | <svg svg> | <svg foreignObject> | <div> | <div> | <math mi> #data <svg><script></script><path> #errors #document | <html> | <head> | <body> | <svg svg> | <svg script> | <svg path> #data <table><svg></svg><tr> #errors #document | <html> | <head> | <body> | <svg svg> | <table> | <tbody> | <tr> #data <math><mi><mglyph> #errors #document | <html> | <head> | <body> | <math math> | <math mi> | <math mglyph> #data <math><mi><malignmark> #errors #document | <html> | <head> | <body> | <math math> | <math mi> | <math malignmark> #data <math><mo><mglyph> #errors #document | <html> | <head> | <body> | <math math> | <math mo> | <math mglyph> #data <math><mo><malignmark> #errors #document | <html> | <head> | <body> | <math math> | <math mo> | <math malignmark> #data <math><mn><mglyph> #errors #document | <html> | <head> | <body> | <math math> | <math mn> | <math mglyph> #data <math><mn><malignmark> #errors #document | <html> | <head> | <body> | <math math> | <math mn> | <math malignmark> #data <math><ms><mglyph> #errors #document | <html> | <head> | <body> | <math math> | <math ms> | <math mglyph> #data <math><ms><malignmark> #errors #document | <html> | <head> | <body> | <math math> | <math ms> | <math malignmark> #data <math><mtext><mglyph> #errors #document | <html> | <head> | <body> | <math math> | <math mtext> | <math mglyph> #data <math><mtext><malignmark> #errors #document | <html> | <head> | <body> | <math math> | <math mtext> | <math malignmark> #data <math><annotation-xml><svg></svg></annotation-xml><mi> #errors #document | <html> | <head> | <body> | <math math> | <math annotation-xml> | <svg svg> | <math mi> #data <math><annotation-xml><svg><foreignObject><div><math><mi></mi></math><span></span></div></foreignObject><path></path></svg></annotation-xml><mi> #errors #document | <html> | <head> | <body> | <math math> | <math annotation-xml> | <svg svg> | <svg foreignObject> | <div> | <math math> | <math mi> | <span> | <svg path> | <math mi> #data <math><annotation-xml><svg><foreignObject><math><mi><svg></svg></mi><mo></mo></math><span></span></foreignObject><path></path></svg></annotation-xml><mi> #errors #document | <html> | <head> | <body> | <math math> | <math annotation-xml> | <svg svg> | <svg foreignObject> | <math math> | <math mi> | <svg svg> | <math mo> | <span> | <svg path> | <math mi> ���������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������tests12.dat�����������������������������������������������������������������������������������������000664��001750��001750�� 3114�14116150214� 20101� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-pass��������������������������������������������������������������������������������������������������������������������������������#data <!DOCTYPE html><body><p>foo<math><mtext><i>baz</i></mtext><annotation-xml><svg><desc><b>eggs</b></desc><g><foreignObject><P>spam<TABLE><tr><td><img></td></table></foreignObject></g><g>quux</g></svg></annotation-xml></math>bar #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <p> | "foo" | <math math> | <math mtext> | <i> | "baz" | <math annotation-xml> | <svg svg> | <svg desc> | <b> | "eggs" | <svg g> | <svg foreignObject> | <p> | "spam" | <table> | <tbody> | <tr> | <td> | <img> | <svg g> | "quux" | "bar" #data <!DOCTYPE html><body>foo<math><mtext><i>baz</i></mtext><annotation-xml><svg><desc><b>eggs</b></desc><g><foreignObject><P>spam<TABLE><tr><td><img></td></table></foreignObject></g><g>quux</g></svg></annotation-xml></math>bar #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | "foo" | <math math> | <math mtext> | <i> | "baz" | <math annotation-xml> | <svg svg> | <svg desc> | <b> | "eggs" | <svg g> | <svg foreignObject> | <p> | "spam" | <table> | <tbody> | <tr> | <td> | <img> | <svg g> | "quux" | "bar" ����������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������tests15.dat�����������������������������������������������������������������������������������������000664��001750��001750�� 10057�14116150214� 20130� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-pass��������������������������������������������������������������������������������������������������������������������������������#data <!DOCTYPE html><p><b><i><u></p> <p>X #errors Line: 1 Col: 31 Unexpected end tag (p). Ignored. Line: 1 Col: 36 Expected closing tag. Unexpected end of file. #document | <!DOCTYPE html> | <html> | <head> | <body> | <p> | <b> | <i> | <u> | <b> | <i> | <u> | " " | <p> | "X" #data <p><b><i><u></p> <p>X #errors Line: 1 Col: 3 Unexpected start tag (p). Expected DOCTYPE. Line: 1 Col: 16 Unexpected end tag (p). Ignored. Line: 2 Col: 4 Expected closing tag. Unexpected end of file. #document | <html> | <head> | <body> | <p> | <b> | <i> | <u> | <b> | <i> | <u> | " " | <p> | "X" #data <!doctype html></html> <head> #errors Line: 1 Col: 22 Unexpected end tag (html) after the (implied) root element. #document | <!DOCTYPE html> | <html> | <head> | <body> | " " #data <!doctype html></body><meta> #errors Line: 1 Col: 22 Unexpected end tag (body) after the (implied) root element. #document | <!DOCTYPE html> | <html> | <head> | <body> | <meta> #data <html></html><!-- foo --> #errors Line: 1 Col: 6 Unexpected start tag (html). Expected DOCTYPE. Line: 1 Col: 13 Unexpected end tag (html) after the (implied) root element. #document | <html> | <head> | <body> | <!-- foo --> #data <!doctype html></body><title>X</title> #errors Line: 1 Col: 22 Unexpected end tag (body) after the (implied) root element. #document | <!DOCTYPE html> | <html> | <head> | <body> | <title> | "X" #data <!doctype html><table> X<meta></table> #errors Line: 1 Col: 24 Unexpected non-space characters in table context caused voodoo mode. Line: 1 Col: 30 Unexpected start tag (meta) in table context caused voodoo mode. #document | <!DOCTYPE html> | <html> | <head> | <body> | " X" | <meta> | <table> #data <!doctype html><table> x</table> #errors Line: 1 Col: 24 Unexpected non-space characters in table context caused voodoo mode. #document | <!DOCTYPE html> | <html> | <head> | <body> | " x" | <table> #data <!doctype html><table> x </table> #errors Line: 1 Col: 25 Unexpected non-space characters in table context caused voodoo mode. #document | <!DOCTYPE html> | <html> | <head> | <body> | " x " | <table> #data <!doctype html><table><tr> x</table> #errors Line: 1 Col: 28 Unexpected non-space characters in table context caused voodoo mode. #document | <!DOCTYPE html> | <html> | <head> | <body> | " x" | <table> | <tbody> | <tr> #data <!doctype html><table>X<style> <tr>x </style> </table> #errors Line: 1 Col: 23 Unexpected non-space characters in table context caused voodoo mode. #document | <!DOCTYPE html> | <html> | <head> | <body> | "X" | <table> | <style> | " <tr>x " | " " #data <!doctype html><div><table><a>foo</a> <tr><td>bar</td> </tr></table></div> #errors Line: 1 Col: 30 Unexpected start tag (a) in table context caused voodoo mode. Line: 1 Col: 37 Unexpected end tag (a) in table context caused voodoo mode. #document | <!DOCTYPE html> | <html> | <head> | <body> | <div> | <a> | "foo" | <table> | " " | <tbody> | <tr> | <td> | "bar" | " " #data <frame></frame></frame><frameset><frame><frameset><frame></frameset><noframes></frameset><noframes> #errors 6: Start tag seen without seeing a doctype first. Expected “<!DOCTYPE html>â€. 13: Stray start tag “frameâ€. 21: Stray end tag “frameâ€. 29: Stray end tag “frameâ€. 39: “frameset†start tag after “body†already open. 105: End of file seen inside an [R]CDATA element. 105: End of file seen and there were open elements. XXX: These errors are wrong, please fix me! #document | <html> | <head> | <frameset> | <frame> | <frameset> | <frame> | <noframes> | "</frameset><noframes>" #data <!DOCTYPE html><object></html> #errors 1: Expected closing tag. Unexpected end of file #document | <!DOCTYPE html> | <html> | <head> | <body> | <object> ���������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������tests17.dat�����������������������������������������������������������������������������������������000664��001750��001750�� 3721�14116150214� 20112� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-pass��������������������������������������������������������������������������������������������������������������������������������#data <!doctype html><table><tbody><select><tr> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> | <table> | <tbody> | <tr> #data <!doctype html><table><tr><select><td> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> | <table> | <tbody> | <tr> | <td> #data <!doctype html><table><tr><td><select><td> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> | <select> | <td> #data <!doctype html><table><tr><th><select><td> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <tbody> | <tr> | <th> | <select> | <td> #data <!doctype html><table><caption><select><tr> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <caption> | <select> | <tbody> | <tr> #data <!doctype html><select><tr> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> #data <!doctype html><select><td> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> #data <!doctype html><select><th> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> #data <!doctype html><select><tbody> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> #data <!doctype html><select><thead> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> #data <!doctype html><select><tfoot> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> #data <!doctype html><select><caption> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> #data <!doctype html><table><tr></table>a #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <tbody> | <tr> | "a" �����������������������������������������������tests18.dat�����������������������������������������������������������������������������������������000664��001750��001750�� 10056�14116150214� 20132� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-pass��������������������������������������������������������������������������������������������������������������������������������#data <!doctype html><plaintext></plaintext> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <plaintext> | "</plaintext>" #data <!doctype html><table><plaintext></plaintext> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <plaintext> | "</plaintext>" | <table> #data <!doctype html><table><tbody><plaintext></plaintext> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <plaintext> | "</plaintext>" | <table> | <tbody> #data <!doctype html><table><tbody><tr><plaintext></plaintext> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <plaintext> | "</plaintext>" | <table> | <tbody> | <tr> #data <!doctype html><table><tbody><tr><plaintext></plaintext> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <plaintext> | "</plaintext>" | <table> | <tbody> | <tr> #data <!doctype html><table><td><plaintext></plaintext> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> | <plaintext> | "</plaintext>" #data <!doctype html><table><caption><plaintext></plaintext> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <caption> | <plaintext> | "</plaintext>" #data <!doctype html><table><tr><style></script></style>abc #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | "abc" | <table> | <tbody> | <tr> | <style> | "</script>" #data <!doctype html><table><tr><script></style></script>abc #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | "abc" | <table> | <tbody> | <tr> | <script> | "</style>" #data <!doctype html><table><caption><style></script></style>abc #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <caption> | <style> | "</script>" | "abc" #data <!doctype html><table><td><style></script></style>abc #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> | <style> | "</script>" | "abc" #data <!doctype html><select><script></style></script>abc #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> | <script> | "</style>" | "abc" #data <!doctype html><table><select><script></style></script>abc #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> | <script> | "</style>" | "abc" | <table> #data <!doctype html><table><tr><select><script></style></script>abc #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> | <script> | "</style>" | "abc" | <table> | <tbody> | <tr> #data <!doctype html><frameset></frameset><noframes>abc #errors #document | <!DOCTYPE html> | <html> | <head> | <frameset> | <noframes> | "abc" #data <!doctype html><frameset></frameset><noframes>abc #errors #document | | | | | | "abc" | <!-- abc --> #data <!doctype html><frameset></frameset></html><noframes>abc #errors #document | <!DOCTYPE html> | <html> | <head> | <frameset> | <noframes> | "abc" #data <!doctype html><frameset></frameset></html><noframes>abc #errors #document | | | | | | "abc" | <!-- abc --> #data <!doctype html><table><tr></tbody><tfoot> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <tbody> | <tr> | <tfoot> #data <!doctype html><table><td><svg></svg>abc<td> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | <tbody> | <tr> | <td> | <svg svg> | "abc" | <td> ����������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������tests19.dat�����������������������������������������������������������������������������������������000664��001750��001750�� 42027�14116150214� 20136� 0����������������������������������������������������������������������������������������������������ustar�00tai�����������������������������tai�����������������������������000000��000000��HTML-HTML5-Parser-0.992/t/html5lib-pass��������������������������������������������������������������������������������������������������������������������������������#data <!doctype html><math><mn DefinitionUrl="foo"> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <math math> | <math mn> | definitionURL="foo" #data <!doctype html><html></p><!--foo--> #errors #document | <!DOCTYPE html> | <html> | <!-- foo --> | <head> | <body> #data <!doctype html><head></head></p><!--foo--> #errors #document | <!DOCTYPE html> | <html> | <head> | <!-- foo --> | <body> #data <!doctype html><body><p><pre> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <p> | <pre> #data <!doctype html><body><p><listing> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <p> | <listing> #data <!doctype html><p><plaintext> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <p> | <plaintext> #data <!doctype html><p><h1> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <p> | <h1> #data <!doctype html><form><isindex> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <form> #data <!doctype html><isindex action="POST"> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <form> | action="POST" | <hr> | <label> | "This is a searchable index. Enter search keywords: " | <input> | name="isindex" | <hr> #data <!doctype html><isindex prompt="this is isindex"> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <form> | <hr> | <label> | "this is isindex" | <input> | name="isindex" | <hr> #data <!doctype html><isindex type="hidden"> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <form> | <hr> | <label> | "This is a searchable index. Enter search keywords: " | <input> | name="isindex" | type="hidden" | <hr> #data <!doctype html><isindex name="foo"> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <form> | <hr> | <label> | "This is a searchable index. Enter search keywords: " | <input> | name="isindex" | <hr> #data <!doctype html><ruby><p><rp> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <ruby> | <p> | <rp> #data <!doctype html><ruby><div><span><rp> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <ruby> | <div> | <span> | <rp> #data <!doctype html><ruby><div><p><rp> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <ruby> | <div> | <p> | <rp> #data <!doctype html><ruby><p><rt> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <ruby> | <p> | <rt> #data <!doctype html><ruby><div><span><rt> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <ruby> | <div> | <span> | <rt> #data <!doctype html><ruby><div><p><rt> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <ruby> | <div> | <p> | <rt> #data <!doctype html><math/><foo> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <math math> | <foo> #data <!doctype html><svg/><foo> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <svg svg> | <foo> #data <!doctype html><div></body><!--foo--> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <div> | <!-- foo --> #data <!doctype html><h1><div><h3><span></h1>foo #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <h1> | <div> | <h3> | <span> | "foo" #data <!doctype html><p></h3>foo #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <p> | "foo" #data <!doctype html><h3><li>abc</h2>foo #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <h3> | <li> | "abc" | "foo" #data <!doctype html><table>abc<!--foo--> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | "abc" | <table> | <!-- foo --> #data <!doctype html><table> <!--foo--> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <table> | " " | <!-- foo --> #data <!doctype html><table> b <!--foo--> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | " b " | <table> | <!-- foo --> #data <!doctype html><select><option><option> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> | <option> | <option> #data <!doctype html><select><option></optgroup> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> | <option> #data <!doctype html><select><option></optgroup> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <select> | <option> #data <!doctype html><p><math><mi><p><h1> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <p> | <math math> | <math mi> | <p> | <h1> #data <!doctype html><p><math><mo><p><h1> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <p> | <math math> | <math mo> | <p> | <h1> #data <!doctype html><p><math><mn><p><h1> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <p> | <math math> | <math mn> | <p> | <h1> #data <!doctype html><p><math><ms><p><h1> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <p> | <math math> | <math ms> | <p> | <h1> #data <!doctype html><p><math><mtext><p><h1> #errors #document | <!DOCTYPE html> | <html> | <head> | <body> | <p> | <math math> | <math mtext> | <p> | <h1> #data <!doctype html><frameset> #errors #document | | | | #data #errors #document | | | a="b" | c="d" | | #data #errors #document | | | a="b" | c="d" | | #data #errors #document | | | | | #data #errors #document | | | | | " " #data abc #errors #document | | | | #data

#errors #document | | | | #data

#errors #document | | | | #data #errors #document | | | #data #errors #document | | | | #data

#errors #document | | | | | #data

a #errors #document | | | | |

| "a" #data

#errors #document | | | | | #data


#errors
#document
| 
| 
|   
|   
|     

#data

#errors
#document
| 
| 
|   
|   
|     
#data
  • #errors #document | | | | |
  • #data
    #errors #document | | | | |
    #data
    #errors #document | | | | |
    #data
  • #errors #document | | | | |
    #data #errors #document | | | | | #data #errors #document | | | | | #data #errors #document | | | | | #data
    #errors #document | | | | |
    #data #errors #document | | | | | #data #errors #document | | | | | #data #errors #document | | | | | #data #errors #document | | | | | #data #errors #document | | | | | #data
    #errors #document | | | | |
    #data #errors #document | | | | | #errors #document | | | | | #errors #document | | | | | #errors #document | | | | |