[wp-trac] [WordPress Trac] #51769: Full-text search support

WordPress Trac noreply at wordpress.org
Wed Aug 3 10:51:37 UTC 2022


#51769: Full-text search support
--------------------------------------+------------------------------
 Reporter:  zieladam                  |       Owner:  (none)
     Type:  enhancement               |      Status:  new
 Priority:  normal                    |   Milestone:  Awaiting Review
Component:  Database                  |     Version:
 Severity:  normal                    |  Resolution:
 Keywords:  dev-feedback 2nd-opinion  |     Focuses:
--------------------------------------+------------------------------

Comment (by zieladam):

 Good notes @azaozz! Here's some more thoughts:


 > 1. Speed consideration for when saving to the DB. If there's a fulltext
 index on post_content, it will have to be updated every time a new post is
 saved or a post is updated. Normally that should be quite fast, but might
 become problematic for very big sites with millions of rows in wp_posts
 where the index will be several MB (saving a post is quite slow afaik).
 Perhaps worth a look/test.

 The trade-off here is between the speed of writes and reads.

 [https://makandracards.com/makandra/12813-performance-analysis-of-mysql-s
 -fulltext-indexes-and-like-queries-for-full-text-search: MATCH is much
 faster than LIKE] so the searching would be faster.

 I haven't found any INSERT benchmarks for the fulltext feature though, I
 found a few complaints about slow writes when using fulltext index and
 some discussion how it's not suitable for larger datasets because of slow
 INSERTs, but not much charts, tables, and reliable data.

 > 2. Disk space: how much bigger the DB will be with a fulltext index on
 post_content?

 As for the disk space: I did a few quick searches and found that the index
 could take somewhere between 10% and 80% of the size of the text. That's
 all anecdotal as I haven't done any measurements myself. While that's a
 pretty large range, it already tells us that the size of the index is an
 important factor here.

-- 
Ticket URL: <https://core.trac.wordpress.org/ticket/51769#comment:14>
WordPress Trac <https://core.trac.wordpress.org/>
WordPress publishing platform


More information about the wp-trac mailing list