[wp-trac] [WordPress Trac] #40759: Word Count Discrepancies
WordPress Trac
noreply at wordpress.org
Sun May 14 13:46:51 UTC 2017
#40759: Word Count Discrepancies
--------------------------+------------------------------
Reporter: pento | Owner:
Type: defect (bug) | Status: new
Priority: normal | Milestone: Awaiting Review
Component: Editor | Version:
Severity: normal | Resolution:
Keywords: | Focuses: administration
--------------------------+------------------------------
Description changed by pento:
Old description:
> I've noticed several discrepancies between how WordPress, Pages, Google
> Docs, and Word count words. Given the following text, all three count
> things quite differently.
>
> {{{
> a 1
> foo-bar
> e.g.
> jack & jill
> 5 @ $4.99
> .
> fuzz at baz.blog
> }}}
>
> || ||= WordPress =||= Word =||= Pages =||= Docs =||
> || Individual Words (a, jack, jill) || 3 || 3 || 3 || 3 ||
> || Individual Numbers (1, 5) || 0 || 2 || 2 || 2 ||
> || Hyphenated Words (foo-bar) || 1 || 1 || 2 || 1 ||
> || Abbreviations (e.g.) || 1 || 1 || 2 || 2 ||
> || Punctuation that translates to a word (&) || 0 || 1 || 0 || 0
> ||
> || Punctuation that translates to a word in this usage (@) || 0 || 1
> || 0 || 0 ||
> || Punctuation that doesn't translate to a word (.) || 0 || 1 || 0
> || 0 ||
> || Compound number ($4.99) || 0 || 1 || 1 || 2 ||
> || Email address (`fuzz at baz.blog`) || 1 || 1 || 3 || 3 ||
>
> I tend to fall in the camp of "what would a reasonable native speaker
> count as a word", which is probably closest to Word's definition, minus
> the punctuation that doesn't translate to a word.
New description:
I've noticed several discrepancies between how WordPress, Pages, Google
Docs, and Word count words. Given the following text, all four count
things quite differently.
{{{
a 1
foo-bar
e.g.
jack & jill
5 @ $4.99
.
fuzz at baz.blog
}}}
|| ||= WordPress =||= Word =||= Pages =||= Docs =||
|| Individual Words (a, jack, jill) || 3 || 3 || 3 || 3 ||
|| Individual Numbers (1, 5) || 0 || 2 || 2 || 2 ||
|| Hyphenated Words (foo-bar) || 1 || 1 || 2 || 1 ||
|| Abbreviations (e.g.) || 1 || 1 || 2 || 2 ||
|| Punctuation that translates to a word (&) || 0 || 1 || 0 || 0
||
|| Punctuation that translates to a word in this usage (@) || 0 || 1
|| 0 || 0 ||
|| Punctuation that doesn't translate to a word (.) || 0 || 1 || 0
|| 0 ||
|| Compound number ($4.99) || 0 || 1 || 1 || 2 ||
|| Email address (`fuzz at baz.blog`) || 1 || 1 || 3 || 3 ||
I tend to fall in the camp of "what would a reasonable native speaker
count as a word", which is probably closest to Word's definition, minus
the punctuation that doesn't translate to a word.
--
--
Ticket URL: <https://core.trac.wordpress.org/ticket/40759#comment:3>
WordPress Trac <https://core.trac.wordpress.org/>
WordPress publishing platform
More information about the wp-trac
mailing list