[wp-trac] [WordPress Trac] #40759: Word Count Discrepancies

WordPress Trac noreply at wordpress.org
Sun May 14 13:46:51 UTC 2017


#40759: Word Count Discrepancies
--------------------------+------------------------------
 Reporter:  pento         |       Owner:
     Type:  defect (bug)  |      Status:  new
 Priority:  normal        |   Milestone:  Awaiting Review
Component:  Editor        |     Version:
 Severity:  normal        |  Resolution:
 Keywords:                |     Focuses:  administration
--------------------------+------------------------------
Description changed by pento:

Old description:

> I've noticed several discrepancies between how WordPress, Pages, Google
> Docs, and Word count words. Given the following text, all three count
> things quite differently.
>
> {{{
> a 1
> foo-bar
> e.g.
> jack & jill
> 5 @ $4.99
> .
> fuzz@baz.blog
> }}}
>
> || ||= WordPress =||= Word =||= Pages =||= Docs =||
> || Individual Words (a, jack, jill) ||  3  ||  3  ||  3  ||  3  ||
> || Individual Numbers (1, 5) ||  0  ||  2  ||  2  ||  2  ||
> || Hyphenated Words (foo-bar) ||  1  ||  1  ||  2  ||  1  ||
> || Abbreviations (e.g.) ||  1  ||  1  ||  2  ||  2  ||
> || Punctuation that translates to a word (&) ||  0  ||  1  ||  0  ||  0  ||
> || Punctuation that translates to a word in this usage (@) ||  0  ||  1  ||  0  ||  0  ||
> || Punctuation that doesn't translate to a word (.) ||  0  ||  1  ||  0  ||  0  ||
> || Compound number ($4.99) ||  0  ||  1  ||  1  ||  2  ||
> || Email address (`fuzz@baz.blog`) ||  1  ||  1  ||  3  ||  3  ||
>
> I tend to fall in the camp of "what would a reasonable native speaker
> count as a word", which is probably closest to Word's definition, minus
> the punctuation that doesn't translate to a word.

New description:

 I've noticed several discrepancies between how WordPress, Pages, Google
 Docs, and Word count words. Given the following text, all four count
 things quite differently.

 {{{
 a 1
 foo-bar
 e.g.
 jack & jill
 5 @ $4.99
 .
 fuzz@baz.blog
 }}}

 || ||= WordPress =||= Word =||= Pages =||= Docs =||
 || Individual Words (a, jack, jill) ||  3  ||  3  ||  3  ||  3  ||
 || Individual Numbers (1, 5) ||  0  ||  2  ||  2  ||  2  ||
 || Hyphenated Words (foo-bar) ||  1  ||  1  ||  2  ||  1  ||
 || Abbreviations (e.g.) ||  1  ||  1  ||  2  ||  2  ||
 || Punctuation that translates to a word (&) ||  0  ||  1  ||  0  ||  0  ||
 || Punctuation that translates to a word in this usage (@) ||  0  ||  1  ||  0  ||  0  ||
 || Punctuation that doesn't translate to a word (.) ||  0  ||  1  ||  0  ||  0  ||
 || Compound number ($4.99) ||  0  ||  1  ||  1  ||  2  ||
 || Email address (`fuzz@baz.blog`) ||  1  ||  1  ||  3  ||  3  ||

 I tend to fall in the camp of "what would a reasonable native speaker
 count as a word", which is probably closest to Word's definition, minus
 the punctuation that doesn't translate to a word.
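
 As a rough illustration of that rule (not WordPress's actual counter, nor
 Word's), the sketch below splits on whitespace and skips tokens made up
 only of punctuation, unless the symbol reads aloud as a word. The names
 `count_words` and `WORDLIKE_SYMBOLS` are hypothetical, and the sample
 assumes the email address is the single token `fuzz@baz.blog`.

```python
import string

# Hypothetical set: symbols a reader would say aloud as a word ("and", "at").
WORDLIKE_SYMBOLS = {"&", "@"}


def count_words(text: str) -> int:
    """Count whitespace-separated tokens, skipping punctuation-only tokens
    that are not in WORDLIKE_SYMBOLS."""
    count = 0
    for token in text.split():
        if token in WORDLIKE_SYMBOLS:
            count += 1  # "&" and "@" stand in for a spoken word
        elif all(ch in string.punctuation for ch in token):
            continue  # e.g. a lone "." does not translate to a word
        else:
            count += 1  # words, numbers, "foo-bar", "e.g.", "$4.99", emails
    return count


# Sample text from the ticket (email assumed to be one token).
SAMPLE = """a 1
foo-bar
e.g.
jack & jill
5 @ $4.99
.
fuzz@baz.blog
"""

if __name__ == "__main__":
    print(count_words(SAMPLE))
```

 Under these assumptions the sample yields 11: the 12 whitespace-separated
 tokens, which matches the sum of the Word column above, minus the lone
 ".".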

--
Ticket URL: <https://core.trac.wordpress.org/ticket/40759#comment:3>
WordPress Trac <https://core.trac.wordpress.org/>
WordPress publishing platform