[wp-trac] [WordPress Trac] #30130: Normalize characters with combining marks to precomposed characters

WordPress Trac noreply at wordpress.org
Mon Nov 3 11:31:59 UTC 2014


#30130: Normalize characters with combining marks to precomposed characters
-------------------------------------------------+-------------------------
 Reporter:  zodiac1978                           |       Owner:
     Type:  enhancement                          |      Status:  new
 Priority:  normal                               |   Milestone:  Awaiting
Component:  TinyMCE                              |  Review
 Severity:  normal                               |     Version:  trunk
 Keywords:  has-patch dev-feedback needs-        |  Resolution:
  testing                                        |     Focuses:
-------------------------------------------------+-------------------------

Comment (by zodiac1978):

 Replying to [comment:12 azaozz]:
 > - Happens only when copying from a PDF file that is viewed in the
 Preview app on Mac OSX 10.7.5 and 10.9.5 (and possibly all versions in
 between). Works properly in 10.10.

 - Do '''not''' work properly in 10.10 (it is not easy to catch, because in
 Chrome search works and the appearance in the text editor looks right, but
 try to paste it in visual editor or in Firefox, then you can see the
 problem)

 > - Doesn't happen when copying from Acrobat?

 - Acrobat and Adobe Reader also have this problem but just with the first
 character (I don't know why - weird behavior)


 > Perhaps some additional tests:
 > - What if copying from the same PDF file that is viewed in the internal
 viewer in Chrome?

 Problem is not there if pasting from internal PDF-viewer from Firefox or
 from internal PDF viewer Chrome or from Adobe Reader Plugin in Safari.

 > - Does it happen for all PDF files?

 Well, I can't test every PDF file ... ;)
 My test pdf is from LibreOffice 4.2 (PDF Version 1.4)

 > If we decide to fix this, thinking it should probably be fixed from JS
 on one of the events fired in the `paste` TinyMCE plugin. There we can run
 it only on pasting on MacOS, etc.

 I can't help with that (beside of testing), but maybe this is the better
 way to solve this.

 If someone wants to do more tests: If you turn on Permalinks and paste the
 words from the pdf into the title, then WordPress replaces "ü" with "ue",
 "ä" with "ae" and "ö" with "oe" for the permalink. This replacement
 doesn't work if the character isn't precombined, but a vowel followed by a
 diaeresis.

 In the database the post_name should look something like this "fuenf-
 gaebe-schoen-direct-enter" - if you have the wrong characters you see
 something like this: "fu%cc%88nf-ga%cc%88be-scho%cc%88n-firefox".

 %cc%88 is the diaeresis
 (http://www.fileformat.info/info/unicode/char/0308/index.htm)

 Seems to be broken in FF and Chrome. Preview app to Safari seems to be
 okay.

--
Ticket URL: <https://core.trac.wordpress.org/ticket/30130#comment:13>
WordPress Trac <https://core.trac.wordpress.org/>
WordPress publishing platform


More information about the wp-trac mailing list