This extension provides a means to collect metrics within a PostgreSQL database to be used by an external collection source (Prometheus exporter, Icinga/Nagios scraper, etc). Certain metrics are collected, their results stored as materialized views or tables, and refreshed on a per-query configurable timer. This allows the metric scraper to not have to be concerned about the underlying runtime of queries that can be more expensive, especially as the size of the database grows (object size, table statistics, etc). It also allows a functional interface for some metrics to account for differences in underlying system catalogs between PostgreSQL major versions (Ex. new columns in pg_stat_activity, pg_stat_statements).
A background worker is provided to refresh the materialized views and tables automatically without the need for any third-party schedulers.
To see a practical application of this extension using Prometheus/Grafana we welcome you to try out pgMonitor - https://github.com/CrunchyData/pgmonitor
Requirement:
- PostgreSQL >= 12
In the directory where you downloaded pgmonitor, run
make install
If you do not want the background worker compiled and just want the plain SQL install, you can run this instead:
make NO_BGW=1 install
Note that without the BGW, refreshing of the materialized views and tables will require manual management outside of the pgMonitor extension.
The background worker must be loaded on database start by adding the library to shared_preload_libraries in postgresql.conf
shared_preload_libraries = 'pgmonitor_bgw' # (change requires restart)
You can also set other control variables for the BGW in postgresql.conf. These can be added/changed at anytime with a simple reload. See the documentation for more details.
pgmonitor_bgw.dbname
is required at a minimum for maintenance to run on the given database(s). This can be a comma separated list if pgMonitor is installed on more than one database to collect per-database metrics.
pgmonitor_bgw.dbname = 'proddb'
At this time pgmonitor_bgw.role
must be a superuser due to elevated privileges that are required to gather all metrics as well as refresh the materialized views. Work is underway to see if this can be run as a non-supuseruser. It currently defaults to postgres
if not set manually.
pgmonitor_bgw.role = 'postgres'
The interval defaults to 30 seconds and generally doesn't need to be changed. If you're trying to adjust materialized view or tables refresh timing, see the configuration tables below.
pgmonitor_bgw.interval = 30
Log into PostgreSQL and run the following commands. Schema is optional (but recommended) and can be whatever you wish, but it cannot be changed after installation. If you're using the BGW, the database cluster can be safely started without having the extension first created in the configured database(s). You can create the extension at any time and the BGW will automatically pick up that it exists without restarting the cluster (as long as shared_preload_libraries was set) and begin running maintenance as configured.
CREATE SCHEMA pgmonitor_ext;
CREATE EXTENSION pgmonitor SCHEMA pgmonitor_ext;
The names of all normal views are stored in metric_views
. All metrics that are either normal views or have the option to be backed by either a normal view or materialized view are stored here. If a metric is only backed by a materialized view and has no option for a normal view, it will only be stored in metrics_matviews
.
Table "pgmonitor_ext.metric_views"
Column | Type | Collation | Nullable | Default
----------------+---------+-----------+----------+-----------------------
view_schema | text | | not null | 'pgmonitor_ext'::text
view_name | text | | not null |
matview_source | boolean | | not null | false
active | boolean | | not null | true
scope | text | | not null | 'global'::text
Indexes:
view_schema
- Schema containing the view_name
view_name
- Name of the view or materialized view in system catalogs
matview_source
- Boolean to set whether the metric view is backed by a materialized view. If true, the view must be defined in a way that allows a choice to be made. Example: built in metrics use a function-backed view that checks this flag. Defaults to false.
active
- Boolean that external monitoring tools can use to determine whether this metric is actively used or not
scope
- Valid values are "global" or "database"
- "global" means the values of this metric are the same on every database in the instance (ex. connections, replication, etc)
- "database" means the values of this metric are only defined on a per database basis (database and table statisics, bloat, etc)
- Can be used by external scrape tools to be able to determine whether to collect these metrics only once per PostgreSQL instance or once per database inside that instance
Materialized views are stored in the metric_matviews
configuration table. The background worker uses this table to determine which materialized views are "active" to be refreshed and how often to refresh them.
Table "pgmonitor_ext.metric_matviews"
Column | Type | Collation | Nullable | Default
--------------------+--------------------------+-----------+----------+-----------------------
view_schema | text | | not null | 'pgmonitor_ext'::text
view_name | text | | not null |
concurrent_refresh | boolean | | not null | true
run_interval | interval | | not null | '00:14:00'::interval
last_run | timestamp with time zone | | |
last_run_time | interval | | |
active | boolean | | not null | true
scope | text | | not null | 'global'::text
view_schema
- Schema containing the view_name
view_name
- Name of the view or materialized view in system catalogs
concurrent_refresh
- Boolean to set whether the materalized view can be refreshed concurrently. It is highly recommended that all matviews be written in a manner to support a unique key. Concurrent refreshes avoid any contention while metrics are being scraped by external tools.
run_interval
- How often the materalized view should be refreshed. Must be a valid value of the PostgreSQL interval type
last_run
- Timestamp of the last time this materalized view was refreshed
last_run_time
- How long the last run of this refresh took
active
- Boolean to determine whether this materialized view is refreshed as part of automatic maintenance. If false, this matview will not be refreshed automatically. Defaults to true.
scope
- See
metric_views
for the purpose of this column
- See
For metrics that still require storage of results for fast scraping but cannot use a normal or materialized view, it is also possible to use a table and give pgMonitor an SQL statement to run to refresh that table. For example, the included pgBackRest metrics need to use a function that uses a COPY statement.
Table "pgmonitor_ext.metric_tables"
Column | Type | Collation | Nullable | Default
-------------------+--------------------------+-----------+----------+-----------------------
table_schema | text | | not null | 'pgmonitor_ext'::text
table_name | text | | not null |
refresh_statement | text | | not null |
run_interval | interval | | not null | '00:10:00'::interval
last_run | timestamp with time zone | | |
last_run_time | interval | | |
active | boolean | | not null | true
scope | text | | not null | 'global'::text
table_schema
- Schema containing the table_name
table_name
- Name of the table in system catalogs
refresh_statement
- The full SQL statement that is run to refresh the data in
table_name
. Ex:SELECT pgmonitor_ext.pgbackrest_info()
- The full SQL statement that is run to refresh the data in
active
- Boolean to determin whether maintenance will call the refresh_statement as part of regular maintenance. If false, this refresh will not run.
- See
metric_matviews
for purpose of remaining columns
NORMAL VIEWS:
ccp_archive_command_status
ccp_backrest_last_diff_backup
ccp_backrest_last_full_backup
ccp_backrest_last_incr_backup
ccp_backrest_last_info
ccp_backrest_oldest_full_backup
ccp_connection_stats
ccp_database_size
ccp_data_checksum_failure
ccp_locks
ccp_pg_is_in_recovery
ccp_postgresql_version
ccp_postmaster_runtime
ccp_postmaster_uptime
ccp_replication_lag
ccp_replication_lag_size
ccp_replication_slots
ccp_settings_pending_restart
ccp_stat_bgwriter
ccp_stat_checkpointer
ccp_stat_database
ccp_stat_io_bgwriter
ccp_stat_user_tables
ccp_table_size
ccp_transaction_wraparound
ccp_wal_activity
MAT VIEWS:
ccp_database_size_matview
ccp_pg_hba_checksum
ccp_pg_settings_checksum
ccp_sequence_exhaustion
ccp_stat_user_tables_matview
ccp_table_size_matview
TABLES:
metric_matviews
metric_tables
metric_views
pgbackrest_info
pg_hba_checksum
pg_settings_checksum
pg_stat_statements_reset_info
FUNCTIONS:
ccp_database_size_view_choice() RETURNS TABLE
ccp_replication_slots_func() RETURNS TABLE
ccp_stat_checkpointer_func() RETURNS TABLE
ccp_stat_io_bgwriter_func() RETURNS TABLE
ccp_stat_user_tables_func() RETURNS TABLE
ccp_stat_user_tables_view_choice() RETURNS TABLE
ccp_table_size_view_choice() RETURNS TABLE
pgbackrest_info() RETURNS SETOF pgbackrest_info
pg_settings_checksum_set_valid() RETURNS smallint
pg_hba_checksum_set_valid() RETURNS smallint
pg_hba_checksum(p_known_hba_hash text DEFAULT NULL)
pg_settings_checksum_set_valid() RETURNS smallint
pg_settings_checksum(p_known_settings_hash text DEFAULT NULL)
pg_stat_statements_func() RETURNS TABLE
pg_stat_statements_reset_info() RETURNS bigint
refresh_metrics_legacy (p_object_schema text DEFAULT 'monitor', p_object_name text DEFAULT NULL) RETURNS void
sequence_exhaustion (p_percent integer DEFAULT 75, OUT count bigint)
sequence_status() RETURNS TABLE (sequence_name text, last_value bigint, slots numeric, used numeric, percent int, cycle boolean, numleft numeric, table_usage text)
PROCEDURE:
refresh_metrics (p_object_schema text DEFAULT 'monitor', p_object_name text DEFAULT NULL)