oga/lib/oga
Yorick Peterse 13e2c3d82f Better handling of incorrect XML/HTML tags
The XML/HTML lexer is now capable of processing most invalid XML/HTML
(that I can think of at least). This is achieved by inserting missing
closing tags (where needed) and/or ignoring excessive closing tags. For
example, HTML such as this:

    <a></a></p>

Results in the following tokens:

    [:T_ELEM_START, nil, 1]
    [:T_ELEM_NAME, 'a', 1]
    [:T_ELEM_CLOSE, nil, 1]

In turn this HTML:

    <a>

Results in these tokens:

    [:T_ELEM_START, nil, 1]
    [:T_ELEM_NAME, 'a', 1]
    [:T_ELEM_CLOSE, nil, 1]

Fixes #84
2015-04-19 23:19:02 +02:00
..
css Added CSS/XPath Parser.parse_with_cache 2015-03-23 00:22:59 +01:00
html Lazy decoding of XML/HTML entities. 2015-03-05 23:00:43 +01:00
xml Better handling of incorrect XML/HTML tags 2015-04-19 23:19:02 +02:00
xpath XPath queries match nodes in the default namespace 2015-03-26 01:13:55 +01:00
entity_decoder.rb Added Oga::EntityDecoder 2015-04-07 21:18:15 +02:00
lru.rb Added LRU#maximum=/maximum 2015-03-23 00:26:48 +01:00
oga.rb Corrected YARD argument name. 2014-09-16 14:50:17 +02:00
version.rb Release 0.3.4 2015-04-19 22:19:02 +02:00