On Mon, 2013-02-25 at 23:37 -0500, Anil Jangam wrote:
> Team,
>
>
> I observed that HTML parser (hubbub-0.1.2) is breaking when it finds a
> SEMICOLON in the text field. I am giving below an example of the text
> string.
[...]
> <meta http-equiv="content-type" content="text/html; charset=UTF-8" />
> When it finds the ';', it stops working. When I remove this ';' from
> the string, it works fine. Can you please check, if this is an issue
> with the parser or if I am missing anything?
Can you explain what you mean by "stops working"? The output below is
exactly what I would expect to see, given the input, above.
> ELEMENT meta
> ATTRIBUTE http-equiv
> TEXT
> content=content-type
> ATTRIBUTE content
> TEXT
> content=text/html; charset=UTF-8
J.
No comments:
Post a Comment