[wp-trac] [WordPress Trac] #30130: Normalize characters with combining marks to precomposed characters
WordPress Trac
noreply at wordpress.org
Mon Nov 3 11:31:59 UTC 2014
#30130: Normalize characters with combining marks to precomposed characters
-------------------------------------------------+-------------------------
Reporter: zodiac1978 | Owner:
Type: enhancement | Status: new
Priority: normal | Milestone: Awaiting
Component: TinyMCE | Review
Severity: normal | Version: trunk
Keywords: has-patch dev-feedback needs- | Resolution:
testing | Focuses:
-------------------------------------------------+-------------------------
Comment (by zodiac1978):
Replying to [comment:12 azaozz]:
> - Happens only when copying from a PDF file that is viewed in the
Preview app on Mac OSX 10.7.5 and 10.9.5 (and possibly all versions in
between). Works properly in 10.10.
- Do '''not''' work properly in 10.10 (it is not easy to catch, because in
Chrome search works and the appearance in the text editor looks right, but
try to paste it in visual editor or in Firefox, then you can see the
problem)
> - Doesn't happen when copying from Acrobat?
- Acrobat and Adobe Reader also have this problem but just with the first
character (I don't know why - weird behavior)
> Perhaps some additional tests:
> - What if copying from the same PDF file that is viewed in the internal
viewer in Chrome?
Problem is not there if pasting from internal PDF-viewer from Firefox or
from internal PDF viewer Chrome or from Adobe Reader Plugin in Safari.
> - Does it happen for all PDF files?
Well, I can't test every PDF file ... ;)
My test pdf is from LibreOffice 4.2 (PDF Version 1.4)
> If we decide to fix this, thinking it should probably be fixed from JS
on one of the events fired in the `paste` TinyMCE plugin. There we can run
it only on pasting on MacOS, etc.
I can't help with that (beside of testing), but maybe this is the better
way to solve this.
If someone wants to do more tests: If you turn on Permalinks and paste the
words from the pdf into the title, then WordPress replaces "ü" with "ue",
"ä" with "ae" and "ö" with "oe" for the permalink. This replacement
doesn't work if the character isn't precombined, but a vowel followed by a
diaeresis.
In the database the post_name should look something like this "fuenf-
gaebe-schoen-direct-enter" - if you have the wrong characters you see
something like this: "fu%cc%88nf-ga%cc%88be-scho%cc%88n-firefox".
%cc%88 is the diaeresis
(http://www.fileformat.info/info/unicode/char/0308/index.htm)
Seems to be broken in FF and Chrome. Preview app to Safari seems to be
okay.
--
Ticket URL: <https://core.trac.wordpress.org/ticket/30130#comment:13>
WordPress Trac <https://core.trac.wordpress.org/>
WordPress publishing platform
More information about the wp-trac
mailing list