[wp-trac] [WordPress Trac] #15197: WXR export/import umbrella ticket

WordPress Trac wp-trac at lists.automattic.com
Fri Oct 29 18:49:39 UTC 2010


#15197: WXR export/import umbrella ticket
--------------------------+-------------------------------------------------
 Reporter:  duck_         |       Owner:  duck_    
     Type:  defect (bug)  |      Status:  assigned 
 Priority:  normal        |   Milestone:  3.1      
Component:  Export        |     Version:           
 Severity:  normal        |    Keywords:  has-patch
--------------------------+-------------------------------------------------

Comment(by lloydbudd):

 Replying to [comment:26 duck_]:
 > Yeah. Looks like the same error that nacin got, I think that if you were
 > to go to line 2102605 you would find a Unicode character of some kind, if
 > you were to delete that your problems should go away. See
 > [http://core.trac.wordpress.org/ticket/15197#comment:17 my comment above]
 > about the previous error involving ⌘. The fix would be to apply encoding
 > to CDATA sections for all new WXR files (if that is the correct thing to
 > do, I'm not big on character encoding and XML etc.) and also to have a
 > fallback to the previous parser which doesn't care about this kind of
 > thing.

 Thanks for the detailed info. I fired up a hex editor and confirmed that
 this is the nature of the issue. The reported error is "2102605:41 PCDATA
 invalid Char value 31": a comment contains a unit separator (0x1F).

 It seems pretty easy for a comment to invalidate the WXR. I'm very
 interested in seeing this fixed in the exporter, so we can one day phase
 out the old importer with confidence.
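
 To illustrate the kind of exporter-side fix meant here, below is a
 minimal sketch in Python (not WordPress code) that drops characters
 outside the XML 1.0 "Char" range before text is written into the file.
 The names strip_invalid_xml_chars and is_well_formed are hypothetical
 helpers invented for this example, and whether the exporter should drop
 or escape such characters is left open.

```python
import re
import xml.etree.ElementTree as ET

# Anything outside the XML 1.0 "Char" production:
#   #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
_INVALID_XML_CHARS = re.compile(
    r"[^\x09\x0A\x0D\x20-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]"
)


def strip_invalid_xml_chars(text: str) -> str:
    """Hypothetical helper: remove characters that XML 1.0 forbids."""
    return _INVALID_XML_CHARS.sub("", text)


def is_well_formed(xml_text: str) -> bool:
    """Hypothetical helper: True if a strict XML parser accepts the text."""
    try:
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False


# A comment body containing a unit separator (char value 31), as in the
# export discussed above. Wrapping it in CDATA does not make it legal.
comment = "nice post\x1f!"
raw_export = "<comment><![CDATA[" + comment + "]]></comment>"
clean_export = (
    "<comment><![CDATA[" + strip_invalid_xml_chars(comment) + "]]></comment>"
)

print(is_well_formed(raw_export))
print(is_well_formed(clean_export))
```

 Legitimate non-ASCII text such as ⌘ falls inside the allowed range and
 is left untouched by this filter; only control characters and other
 code points that XML 1.0 rejects are removed.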

-- 
Ticket URL: <http://core.trac.wordpress.org/ticket/15197#comment:29>
WordPress Trac <http://core.trac.wordpress.org/>
WordPress blogging software

