On Thu, Apr 09, 2015 at 05:20:52PM +0200, David Feugey wrote:
> When I made a page with accents, all is OK with Unicode.
> For example "élément"
>
> But if I use HTML codes, (éléments), NetSurf considers that
> there are 3 words "é"+"lé"+"memts". A cut after each special characters.
>
> And so carriage return is sometimes applied at the wrong place.
> Will this bug be corrected?
>
> Bye, David
This is in the tracker already as bugs #467 [1], #408 [2] and #476 [3]
It is caused because our text reflow and word breaking algorithm does
not meet the standard and breaks words where it really ought not to.
For an explanation of the complexity involved in reflowing text
efficiently on constrained system computerphile did a couple of
excellent videos. [4][5] While these specifically talk about e-readers,
a browser faces similar challenges (and more)
Fixing this is earmarked for our 4.0 series and needs a re-written
render engine. The new engine is a job comparable in size to the
entire project to date and has not yet been started.
[1] http://bugs.netsurf-browser.org/mantis/view.php?id=467
[2] http://bugs.netsurf-browser.org/mantis/view.php?id=408
[3] http://bugs.netsurf-browser.org/mantis/view.php?id=476
[4] https://www.youtube.com/watch?v=kzdugwr4Fgk
[5] https://www.youtube.com/watch?v=CdbvgRqyC-0
--
Regards Vincent
http://www.kyllikki.org/
No comments:
Post a Comment