博客
关于我
强烈建议你试试无所不能的chatGPT,快点击我
PostgreSQL 收缩膨胀表或索引 - pg_squeeze or pg_repack
阅读量:6713 次
发布时间:2019-06-25

本文共 2599 字,大约阅读时间需要 8 分钟。

PostgreSQL 收缩膨胀表或索引 - pg_squeeze or pg_repack

作者

digoal

日期

2016-10-30

标签

PostgreSQL , pg_repack , pg_reorg , 表膨胀收缩 , 自动回收垃圾 , 自动收缩 , pg_squeeze


背景

PostgreSQL的表或索引发生膨胀后,用户可以使用vacuum full或rewrite table(如cluster)的方式重建表。

但是vacuum full或者rewrite都需要持有排它锁,会堵塞读操作。

为了减少锁冲突,社区有一个名为pg_reorg或pg_repack的插件,使用了增量的方式重组数据,最后通过切换FILENODE完成数据重组。

仅仅在切换FILENODE时需要持有排他锁,非常短暂,影响比VACUUM FULL和rewrite的方式小多了。

但是pg_reorg或pg_repack都需要建触发器,记录下增量重组时,原表产生的增量数据。

因此重组时,触发器会带来一定的开销,对被重组的表,有一定的DML性能影响。

本文将要介绍另一个重组插件,名为pg_squeeze,它使用REDO和logical replication实现增量重组,不需要建立触发器,但是要求表上面有PK或者UK。

pg_squeeze的优点

相比pg_repack或pg_reorg,pg_squeeze不需要建触发器,所以在重组时对原表的DML几乎没有性能影响。

pg_squeeze支持自动的重组,即通过设置阈值、比较用户表与阈值,自动启动WORKER进程,将数据复制到重组表,最后加锁,切换FILENODE。

pg_squeeze 使用注意

由于pg_squeeze需要使用logical replication,所以必须设置足够多的slots,而且必须注意可能与STANDBY争抢SLOTS,必须预留足够的SLOTS。

另外由于pg_squeeze可以自动,也可以不设置自动的收缩。 对于自动的收缩,建议不要对繁忙的数据库开启,以免在高峰期触发,带来一定的性能影响。

参考

pg_squeeze, an open-source PostgreSQL extension from Cybertec, enables automatic and transparent fixing of one of the few weak points of PostgreSQL – bloated tables.

Unlike with built-in commands “VACUUM FULL” or “CLUSTER”, with “pg_squeeze” there are no extended periods of full table locking,

thus reads and writes are not blocked during the rebuild!
Also the rebuilding process is very efficient due to a novel approach of using transaction log files and logical decoding (instead of triggers) to capture possible data changes to the table being rebuild.
This helps to save firstly on disk space and IO throughput and even more importantly enables very short locking-times, making it a perfect fit for mission-critical OLTP systems.

How does pg_squeeze work?

The extension is implemented as a background worker process (a framework introduced in version 9.4)

that periodically monitors user-defined tables and when it detects that a table exceeded the “bloat threshold”,
it kicks in and rebuilds that table automatically! Rebuilding happens concurrently in the background with minimal storage and computational overhead due to use of Postgres’ built-in
replication slots together with logical decoding to extract possible table changes happening during the rebuild from XLOG.
Bloat threshold is of course configurable and bloat ratio calculation is based on the Free Space Map (taking also FILLFACTOR into account) or under certain conditions on the “pgstattuple”
extension when it’s available. Additionally many customization parameters like “minimum table size” can be set,
with non-suitable tables being ignored. Also reordering by an index or moving the table or indexes to new tablespace is possible.

转载地址:http://coolo.baihongyu.com/

你可能感兴趣的文章
“亚健康”网络安全环境是规模性攻击的温床
查看>>
MaxCompute - ODPS重装上阵 第四弹 - CTE,VALUES,SEMIJOIN
查看>>
DirectInput8Create
查看>>
SGI OpenGL Teapot
查看>>
启创卓越智慧园区创新中心落户津宁蓉
查看>>
10月10日云栖精选夜读:阿里云Tech Insight 企业迁云实战专场强势来袭!
查看>>
物联网带动医疗服务升级:六大原则守护网路安全
查看>>
WCF后续之旅(1): WCF是如何通过Binding进行通信的
查看>>
【眼力测试】看图识字
查看>>
设计模式小结
查看>>
快播关闭服务器,你怎么看?
查看>>
免费好用的阿里云云盾证书服务(https证书)申请步骤
查看>>
2017杭州云栖大会100位大咖视频+讲义全分享
查看>>
【云栖大会】持续拥抱开源阿里云计算能力三大突破
查看>>
在linux下制作静态库和动态链接库的方法
查看>>
ZeroMQ试用笔记之REQ & ROUTER
查看>>
PowerDesigner列名、注释内容互换
查看>>
[译] 利用 Immutability(不可变性)编写更为简洁高效的代码
查看>>
云终端推动证券网系统升级
查看>>
Alibaba Cloud Network Attached Storage Now Available
查看>>