乱码乱a∨中文字幕,在线免费激情视频,亚洲欧美久久夜夜潮,国产在线网址

  1. <sub id="hjl7n"></sub>

    1. <sub id="hjl7n"></sub>

      <legend id="hjl7n"></legend>

      當(dāng)前位置:首頁(yè) >  站長(zhǎng) >  數(shù)據(jù)庫(kù) >  正文

      postgresql 中的 like 查詢優(yōu)化方案

       2021-05-24 16:28  來源: 腳本之家   我來投稿 撤稿糾錯(cuò)

        阿里云優(yōu)惠券 先領(lǐng)券再下單

      當(dāng)時(shí)數(shù)量量比較龐大的時(shí)候,做模糊查詢效率很慢,為了優(yōu)化查詢效率,嘗試如下方法做效率對(duì)比。

      一、對(duì)比情況說明:

      1、數(shù)據(jù)量100w條數(shù)據(jù)

      2、執(zhí)行sql

      二、對(duì)比結(jié)果

      explain analyze SELECT
       c_patent,
       c_applyissno,
       d_applyissdate,
       d_applydate,
       c_patenttype_dimn,
       c_newlawstatus,
       c_abstract
      FROM
       public.t_knowl_patent_zlxx_temp
      WHERE
       c_applicant LIKE '%本溪滿族自治縣連山關(guān)鎮(zhèn)安平安養(yǎng)殖場(chǎng)%';

       

      1、未建索時(shí)執(zhí)行計(jì)劃:

      "Gather (cost=1000.00..83803.53 rows=92 width=1278) (actual time=217.264..217.264 rows=0 loops=1)
       Workers Planned: 2
       Workers Launched: 2
       -> Parallel Seq Scan on t_knowl_patent_zlxx (cost=0.00..82794.33 rows=38 width=1278) (actual time=212.355..212.355 rows=0 loops=3)
        Filter: ((c_applicant)::text ~~ '%本溪滿族自治縣連山關(guān)鎮(zhèn)安平安養(yǎng)殖場(chǎng)%'::text)
        Rows Removed by Filter: 333333
      Planning time: 0.272 ms
      Execution time: 228.116 ms"

      2、btree索引

      建索引語(yǔ)句

      1CREATE INDEX idx_public_t_knowl_patent_zlxx_applicant ON public.t_knowl_patent_zlxx(c_applicant varchar_pattern_ops);

      執(zhí)行計(jì)劃

      "Gather (cost=1000.00..83803.53 rows=92 width=1278) (actual time=208.253..208.253 rows=0 loops=1)
       Workers Planned: 2
       Workers Launched: 2
       -> Parallel Seq Scan on t_knowl_patent_zlxx (cost=0.00..82794.33 rows=38 width=1278) (actual time=203.573..203.573 rows=0 loops=3)
        Filter: ((c_applicant)::text ~~ '%本溪滿族自治縣連山關(guān)鎮(zhèn)安平安養(yǎng)殖場(chǎng)%'::text)
        Rows Removed by Filter: 333333
      Planning time: 0.116 ms
      Execution time: 218.189 ms"

      但是如果將查詢sql稍微改動(dòng)一下,把like查詢中的前置%去掉是這樣的

      Index Scan using idx_public_t_knowl_patent_zlxx_applicant on t_knowl_patent_zlxx_temp (cost=0.55..8.57 rows=92 width=1278) (actual time=0.292..0.292 rows=0 loops=1)
       Index Cond: (((c_applicant)::text ~>=~ '本溪滿族自治縣連山關(guān)鎮(zhèn)安平安養(yǎng)殖場(chǎng)'::text) AND ((c_applicant)::text ~<~ '本溪滿族自治縣連山關(guān)鎮(zhèn)安平安養(yǎng)殖圻'::text))
       Filter: ((c_applicant)::text ~~ '本溪滿族自治縣連山關(guān)鎮(zhèn)安平安養(yǎng)殖場(chǎng)%'::text)
      Planning time: 0.710 ms
      Execution time: 0.378 ms

      3、gin索引

      創(chuàng)建索引語(yǔ)句(postgresql要求在9.6版本及以上)

      create extension pg_trgm;CREATE INDEX idx_public_t_knowl_patent_zlxx_applicant ON public.t_knowl_patent_zlxx USING gin (c_applicant gin_trgm_ops);

      執(zhí)行計(jì)劃

      Bitmap Heap Scan on t_knowl_patent_zlxx (cost=244.71..600.42 rows=91 width=1268) (actual time=0.649..0.649 rows=0 loops=1)
       Recheck Cond: ((c_applicant)::text ~~ '%本溪滿族自治縣連山關(guān)鎮(zhèn)安平安養(yǎng)殖場(chǎng)%'::text)
       -> Bitmap Index Scan on idx_public_t_knowl_patent_zlxx_applicant (cost=0.00..244.69 rows=91 width=0) (actual time=0.647..0.647 rows=0 loops=1)
        Index Cond: ((c_applicant)::text ~~ '%本溪滿族自治縣連山關(guān)鎮(zhèn)安平安養(yǎng)殖場(chǎng)%'::text)
      Planning time: 0.673 ms
      Execution time: 0.740 ms

      三、結(jié)論

      btree索引可以讓后置% "abc%"的模糊匹配走索引,gin + gp_trgm可以讓前后置% "%abc%" 走索引。但是gin 索引也有弊端,以下情況可能導(dǎo)致無法命中:

      搜索字段少于3個(gè)字符時(shí),不會(huì)命中索引,這是gin自身機(jī)制導(dǎo)致。

      當(dāng)搜索字段過長(zhǎng)時(shí),比如email檢索,可能也不會(huì)命中索引,造成原因暫時(shí)未知。

      補(bǔ)充:PostgreSQL LIKE 查詢效率提升實(shí)驗(yàn)

      一、未做索引的查詢效率

      作為對(duì)比,先對(duì)未索引的查詢做測(cè)試

      EXPLAIN ANALYZE select * from gallery_map where author = '曹志耘';
                   QUERY PLAN            
      -----------------------------------------------------------------------------------------------------------------
       Seq Scan on gallery_map (cost=0.00..7002.32 rows=1025 width=621) (actual time=0.011..39.753 rows=1031 loops=1)
       Filter: ((author)::text = '曹志耘'::text)
       Rows Removed by Filter: 71315
       Planning time: 0.194 ms
       Execution time: 39.879 ms
      (5 rows)
       
      Time: 40.599 ms
      EXPLAIN ANALYZE select * from gallery_map where author like '曹志耘';
                   QUERY PLAN            
      -----------------------------------------------------------------------------------------------------------------
       Seq Scan on gallery_map (cost=0.00..7002.32 rows=1025 width=621) (actual time=0.017..41.513 rows=1031 loops=1)
       Filter: ((author)::text ~~ '曹志耘'::text)
       Rows Removed by Filter: 71315
       Planning time: 0.188 ms
       Execution time: 41.669 ms
      (5 rows)
       
      Time: 42.457 ms
       
      EXPLAIN ANALYZE select * from gallery_map where author like '曹志耘%';
                   QUERY PLAN            
      -----------------------------------------------------------------------------------------------------------------
       Seq Scan on gallery_map (cost=0.00..7002.32 rows=1028 width=621) (actual time=0.017..41.492 rows=1031 loops=1)
       Filter: ((author)::text ~~ '曹志耘%'::text)
       Rows Removed by Filter: 71315
       Planning time: 0.307 ms
       Execution time: 41.633 ms
      (5 rows)
       
      Time: 42.676 ms

       

      很顯然都會(huì)做全表掃描

      二、創(chuàng)建btree索引

      PostgreSQL默認(rèn)索引是btree

      CREATE INDEX ix_gallery_map_author ON gallery_map (author);
       
      EXPLAIN ANALYZE select * from gallery_map where author = '曹志耘'; 
                      QUERY PLAN               
      -------------------------------------------------------------------------------------------------------------------------------------
       Bitmap Heap Scan on gallery_map (cost=36.36..2715.37 rows=1025 width=621) (actual time=0.457..1.312 rows=1031 loops=1)
       Recheck Cond: ((author)::text = '曹志耘'::text)
       Heap Blocks: exact=438
       -> Bitmap Index Scan on ix_gallery_map_author (cost=0.00..36.10 rows=1025 width=0) (actual time=0.358..0.358 rows=1031 loops=1)
         Index Cond: ((author)::text = '曹志耘'::text)
       Planning time: 0.416 ms
       Execution time: 1.422 ms
      (7 rows)
       
      Time: 2.462 ms
       
      EXPLAIN ANALYZE select * from gallery_map where author like '曹志耘';
                      QUERY PLAN               
      -------------------------------------------------------------------------------------------------------------------------------------
       Bitmap Heap Scan on gallery_map (cost=36.36..2715.37 rows=1025 width=621) (actual time=0.752..2.119 rows=1031 loops=1)
       Filter: ((author)::text ~~ '曹志耘'::text)
       Heap Blocks: exact=438
       -> Bitmap Index Scan on ix_gallery_map_author (cost=0.00..36.10 rows=1025 width=0) (actual time=0.560..0.560 rows=1031 loops=1)
         Index Cond: ((author)::text = '曹志耘'::text)
       Planning time: 0.270 ms
       Execution time: 2.295 ms
      (7 rows)
       
      Time: 3.444 ms
      EXPLAIN ANALYZE select * from gallery_map where author like '曹志耘%';
                   QUERY PLAN            
      -----------------------------------------------------------------------------------------------------------------
       Seq Scan on gallery_map (cost=0.00..7002.32 rows=1028 width=621) (actual time=0.015..41.389 rows=1031 loops=1)
       Filter: ((author)::text ~~ '曹志耘%'::text)
       Rows Removed by Filter: 71315
       Planning time: 0.260 ms
       Execution time: 41.518 ms
      (5 rows)
       
      Time: 42.430 ms
      EXPLAIN ANALYZE select * from gallery_map where author like '%研究室';
                   QUERY PLAN            
      -----------------------------------------------------------------------------------------------------------------
       Seq Scan on gallery_map (cost=0.00..7002.32 rows=2282 width=621) (actual time=0.064..52.824 rows=2152 loops=1)
       Filter: ((author)::text ~~ '%研究室'::text)
       Rows Removed by Filter: 70194
       Planning time: 0.254 ms
       Execution time: 53.064 ms
      (5 rows)
       
      Time: 53.954 ms

       

      可以看到,等于、like的全匹配是用到索引的,like的模糊查詢還是全表掃描

      三、創(chuàng)建gin索引

      CREATE EXTENSION pg_trgm;
       
      CREATE INDEX ix_gallery_map_author ON gallery_map USING gin (author gin_trgm_ops);
      EXPLAIN ANALYZE select * from gallery_map where author like '曹%';
                      QUERY PLAN               
      -------------------------------------------------------------------------------------------------------------------------------------
       Bitmap Heap Scan on gallery_map (cost=19.96..2705.69 rows=1028 width=621) (actual time=0.419..1.771 rows=1031 loops=1)
       Recheck Cond: ((author)::text ~~ '曹%'::text)
       Heap Blocks: exact=438
       -> Bitmap Index Scan on ix_gallery_map_author (cost=0.00..19.71 rows=1028 width=0) (actual time=0.312..0.312 rows=1031 loops=1)
         Index Cond: ((author)::text ~~ '曹%'::text)
       Planning time: 0.358 ms
       Execution time: 1.916 ms
      (7 rows)
       
      Time: 2.843 ms
      EXPLAIN ANALYZE select * from gallery_map where author like '%耘%';
                   QUERY PLAN            
      -----------------------------------------------------------------------------------------------------------------
       Seq Scan on gallery_map (cost=0.00..7002.32 rows=1028 width=621) (actual time=0.015..51.641 rows=1031 loops=1)
       Filter: ((author)::text ~~ '%耘%'::text)
       Rows Removed by Filter: 71315
       Planning time: 0.268 ms
       Execution time: 51.957 ms
      (5 rows)
       
      Time: 52.899 ms
      EXPLAIN ANALYZE select * from gallery_map where author like '%研究室%';
                      QUERY PLAN               
      -------------------------------------------------------------------------------------------------------------------------------------
       Bitmap Heap Scan on gallery_map (cost=31.83..4788.42 rows=2559 width=621) (actual time=0.914..4.195 rows=2402 loops=1)
       Recheck Cond: ((author)::text ~~ '%研究室%'::text)
       Heap Blocks: exact=868
       -> Bitmap Index Scan on ix_gallery_map_author (cost=0.00..31.19 rows=2559 width=0) (actual time=0.694..0.694 rows=2402 loops=1)
         Index Cond: ((author)::text ~~ '%研究室%'::text)
       Planning time: 0.306 ms
       Execution time: 4.403 ms
      (7 rows)
       
      Time: 5.227 ms

      gin_trgm索引的效果好多了

      由于pg_trgm的索引是把字符串切成多個(gè)3元組,然后使用這些3元組做匹配,所以gin_trgm索引對(duì)于少于3個(gè)字符(包括漢字)的查詢,只有前綴匹配會(huì)走索引

      另外,還測(cè)試了btree_gin,效果和btree一樣

      注意:

      gin_trgm要求數(shù)據(jù)庫(kù)必須使用UTF-8編碼

      demo_v1 # \l demo_v1
              List of databases
       Name | Owner | Encoding | Collate | Ctype | Access privileges
      ---------+-----------+----------+-------------+-------------+-------------------
       demo_v1 | wmpp_user | UTF8  | en_US.UTF-8 | en_US.UTF-8 |

      文章來源:腳本之家

      來源地址:https://www.jb51.net/article/204875.htm

      申請(qǐng)創(chuàng)業(yè)報(bào)道,分享創(chuàng)業(yè)好點(diǎn)子。點(diǎn)擊此處,共同探討創(chuàng)業(yè)新機(jī)遇!

      相關(guān)文章

      熱門排行

      信息推薦