PostgreSQL Tablespace
What is postgresql tablespace:
- Tablespace is a logical storage to map a logical name(tbs1) to a physical location on disk(/u02/tbs1)
- Postgres tablespaces are cluster level objects
- User/superuser must have CREATE privilege create tablespaces, but they can assign ownership of tablespaces to Non-superusers/Non-CREATE privilege users
- By default, the user who executed the CREATE TABLESPACE is the owner of the tablespace. The statement also allows assign the ownership of tablespace to another user specified in the OWNER clause.
- The name of the tablespace should not begin with pg_, because these names are reversed for the system tablespaces.
- The directory must be empty and must be owned by the PostgreSQL system user. The directory must be specified by an absolute path name.
- The location must not be on removable or transient storage, as the cluster might fail to function if the tablespace is missing or lost.
Uses Of Tablespace :
Assign default tablespace to particular user:
ALTER ROLE someuser SET default_tablespace = tbs1;
Disk space occupied by a tablespace:
select pg_size_pretty(pg_tablespace_size('tbs1'));
(or)
/u02/tbs1/du -c -h
- if a partition on which the cluster was initialized is out of space, you can create a new tablespace on a different partition
- users can explicitly say that they want to store some specific tables/indexes in that new “linked folder” or one can also decide to make it the default for new objects or even move all old objects over to the new tablespace.
- Tables, indexes, and entire databases can be assigned to particular tablespaces.
- Table partition - After version postgresql 10 , table partition is introduced based on tablespace this is one of the performance System in postgresql.
- You can tune IO using tablespaces but before that you have to understand this parameter "seq_page_cost, random_page_cost and effective_io_concurrency".
- you can easily move existing TABLES,INDEX, AND DATABASE to a new tablespace.
- pg_global : is used for shared system catalogs.
- pg_default : is the default tablespace of the template1 and template0 databases,default tablespace used for tables, indexes, and temporary files created within the database, if no TABLESPACE clause is given.whenever you create a table/Database without specifying a tablespace in the create table statement it will go to the pg_default tablespace.
Syntax For Tablespace Creation:
Before creating tablespace you have to create OS level directory and you have to change the directory owner permission to postgres user
Most “CREATE” SQL commands come with a “TABLESPACE” option using which you can specify the tablespace in which to create that SQL object. Let’s try a few:
If you want to know where exactly the files that make up the tables are you can use oid2name:
oid2name is a utility program that helps administrators to examine the file structure used by PostgreSQL.
CREATE TABLESPACE tablespace_name [ OWNER { new_owner | CURRENT_USER | SESSION_USER } ] LOCATION 'directory' [ WITH ( tablespace_option = value [, ... ] ) ]How To create tablespace:
Before creating tablespace you have to create OS level directory and you have to change the directory owner permission to postgres user
1.creating tbs1 directory: cd /tab1/ mkdir tbs1 2.changing tbs1 permission to postgres user: chown -R postgres:postgres tbs1 3.creating tablespace with name of tbs1 ,tbs1 is a logical name you can change whatever you want: CREATE TABLESPACE tbs1 LOCATION '/tab1/tbs1'; 4.Listing postgresql tablespace: postgres=# \db+ List of tablespaces Name | Owner | Location | Access privileges | Options | Size | Description ------------+----------+------------+-------------------+---------+---------+------------- pg_default | postgres | | | | 23 MB | pg_global | postgres | | | | 573 kB | tbs1 | postgres | /tab1/tbs1 | | | 0 bytes | (3 rows)Creating Tables With/Without Tablespace :
Most “CREATE” SQL commands come with a “TABLESPACE” option using which you can specify the tablespace in which to create that SQL object. Let’s try a few:
postgres=# create database dbname2 tablespace tbs1; CREATE DATABASE postgres=# \c dbname2 You are now connected to database "dbname2" as user "nijam". dbname2=# create table t1 (a int); CREATE TABLE dbname2=# create table t2 (a int) tablespace tbs1; CREATE TABLE dbname2=# create table t3 (a int) tablespace tbs2; CREATE TABLEHere’s what happened:
- We created a database called “dbname2” in the tablespace “tbs1”. The default tablespace for all objects in the database also becomes tbs1.
- The tables “t1” and “t2” are created in tbs1. You can explicitly specify the tablespace for the table, or use the database’s default.
- The table “t3” is created in the tbs2 tablespace. It is possible to have only some objects in another tablespace.
postgres=# create table t4 ( a int ); CREATE TABLE postgres=# select tablespace from pg_tables where tablename = 't4'; tablespace ------------ NULL (1 row)NULL, in this case, means default tablespace.
If you want to know where exactly the files that make up the tables are you can use oid2name:
oid2name is a utility program that helps administrators to examine the file structure used by PostgreSQL.
$ oid2name -t t4 From database "postgres": Filenode Table Name ---------------------- 24592 t4 $ find $PGDATA -name 2459* /u02/pgdata/PG961/base/13322/24592In addition oid2name tells you more about the databases and the default tablespace associated to them:
$ oid2name All databases: Oid Database Name Tablespace ---------------------------------- 13322 postgres pg_default 13321 template0 pg_default 1 template1 pg_defaultThere are three main patterns paths in Tablespace:
- 1.For files in the default tablespace: base/database_oid/table_and_index_files_oid
- 2.For files in Non-default tablespace:The directory $PGDATA/pg_tblspc contains symbolic links that point to each of the non-built-in tablespaces defined in the cluster. pg_tblspc / tablespace_oid / tablespace_version_subdir / database_oid /table_and_index_files_oid
- 3.For shared relations (see below): global/table_and_index_files_oid
postgres=#\! ls -l /u02/tbs1/ total 0 drwx------. 2 postgres postgres 6 Nov 25 11:03 PG_9.6_201608131At least a directory which contains the version of PostgreSQL was created. What is inside this directory?
postgres=#\! ls -l /u02/tbs1/PG_9.6_201608131/ total 0Nothing, so lets create a table in this brand new tablespace:
postgres=#create table t1 ( a int ) tablespace tbs1; CREATE TABLE postgres=# \d+ t1 Table "public.t1" Column | Type | Modifiers | Storage | Stats target | Description --------+---------+-----------+---------+--------------+------------- a | integer | | plain | | Tablespace: "tbs1"How does the directory look like now?:
postgres=#\! ls -l /u02/tbs1/PG_9.6_201608131/ total 0 drwx------. 2 postgres postgres 18 Nov 25 12:02 13322Ok, 13322 is the OID of the database which the table belongs to:
postgres=# \! oid2name All databases: Oid Database Name Tablespace ---------------------------------- 13322 postgres pg_default 13321 template0 pg_default 1 template1 pg_defaultAnd below that?
postgres=#\! ls -l /u02/tbs1/PG_9.6_201608131/13322/ total 0 -rw-------. 1 postgres postgres 0 Nov 25 12:02 24596This is the OID of the table.
For More Reference about Tablespace
Layout :
Basic Commands of Tablespace:
To determine the set of existing tablespaces:
select oid,spcname from pg_tablespace;
Following meta-command is also useful for listing the existing tablespaces:
\db+
Tablespace Rename:
alter tablespace tbs1 rename to tbs3;
Changing Tablespace ownership:
alter tablespace tbs1 owner to scott;
Tablespace reset:
alter tablepace tbs1 reset default_tablespace;
Tablespace Drop:
drop tablespace tbs1;
Note: A tablespace cannot be dropped until all objects in all databases using the tablespace have been removed.
To determine the set of existing tablespaces:
select oid,spcname from pg_tablespace;
Following meta-command is also useful for listing the existing tablespaces:
\db+
Tablespace Rename:
alter tablespace tbs1 rename to tbs3;
Changing Tablespace ownership:
alter tablespace tbs1 owner to scott;
Tablespace reset:
alter tablepace tbs1 reset default_tablespace;
Tablespace Drop:
drop tablespace tbs1;
Note: A tablespace cannot be dropped until all objects in all databases using the tablespace have been removed.
Assign default tablespace to particular user:
ALTER ROLE someuser SET default_tablespace = tbs1;
Disk space occupied by a tablespace:
select pg_size_pretty(pg_tablespace_size('tbs1'));
(or)
/u02/tbs1/du -c -h
Temporarily for current session while you are creating a batch of tables using
SET default_tablespace = tbs2;
Changing the default tablespace for the whole instance:
all newly created objects go into a new tablespace.
For table:
all newly created objects go into a new tablespace.
postgres=# alter system set default_tablespace='tbs3'; ALTER SYSTEM postgres=# select pg_reload_conf(); pg_reload_conf ---------------- t (1 row) postgres=# show default_tablespace ; default_tablespace -------------------- tbs3 (1 row)Tablespace creation with I/O cost:
CREATE TABLESPACE tbs3 LOCATION '/some_disk_mount' WITH (seq_page_cost=0.5, random_page_cost=0.75, effective_io_concurrency=10);How to find what tablespace a table/index is in on PostgreSQL?
For table:
SELECT tablespace FROM pg_tables WHERE tablename = 't1' AND schemaname = 'schema1';For index:
SELECT tablespace FROM pg_indexes WHERE indexname = 't1_index' AND schemaname = 'schema1';
Temporary Objects
- Temporary tables and indexes are created by PostgreSQL either when explicitly asked to (“CREATE TEMP TABLE..”) or when it needs to hold large datasets temporarily for completing a query.
- It is possible to tell PostgreSQL to place such objects in a separate tablespace. For example, if too many temporary tables are created in normal course, it might be possible to speed up your queries by placing such objects in a tablespace with faster hardware, faster/unjournaled/uncompressed filesystems, or even in-memory filesystems.
- The default value is an empty string, which results in all temporary objects being created in the default tablespace of the current database.
- It is a simple trick to speed up complex Postgres queries
- Use the option temp_tablespaces to tell Postgres which tablespace to use for creating temporary tables.
postgres=# alter system set temp_tablespaces='tbs2'; ALTER SYSTEM postgres=# select pg_reload_conf(); pg_reload_conf ---------------- t (1 row) postgres=# show temp_tablespaces ; temp_tablespaces ------------------ tbs2 (1 row)( or)
CREATE TABLESPACE tbs4 LOCATION '/u02/tbs1;We then modify postgresql.conf to tell postgres to use this tablespace for temporary objects:
temp_tablespaces = 'tbs4'Finally, restart postgres server
Temporary tables in PostgreSQL have three properties that distinguish them from ordinary tables:
- They're stored in a special schema, so that they are normally visible only to the creating backend.
- They are managed by the local buffer manager rather than the shared buffer manager.
- They are not WAL-logged.
Backup Of Tablespace:
Using pg_basebackup to back up a PostgreSQL cluster that has multiple tablespaces needs a couple of extra steps.
If you’re using tarball-format backup, each tablespace comes out as it’s own tarball (with the tablespace OID as the filename). While restoring, this must be restored to the same path (like “/tmp/space2”) that used to be present while restoring. This is probably a bit of a pain because the backup script needs to store this additional information also somewhere alongside the backup.
Here’s how the tarball backup happens:
Using pg_basebackup to back up a PostgreSQL cluster that has multiple tablespaces needs a couple of extra steps.
If you’re using tarball-format backup, each tablespace comes out as it’s own tarball (with the tablespace OID as the filename). While restoring, this must be restored to the same path (like “/tmp/space2”) that used to be present while restoring. This is probably a bit of a pain because the backup script needs to store this additional information also somewhere alongside the backup.
Here’s how the tarball backup happens:
/tmp$ pg_basebackup --format=t --gzip --compress=9 -D tarbackup /tmp$ ls -l tarbackup total 3684 -rw-r--r-- 1 alice alice 937355 May 8 13:22 16385.tar.gz -rw-r--r-- 1 alice alice 2812516 May 8 13:22 base.tar.gz -rw------- 1 alice alice 19259 May 8 13:22 pg_wal.tar.gzFor plain format backups, it is possible to specify a new location for each tablespace. The data from each tablespace is written out into a new location. Every tablespace (other than pg_default and pg_global) must be mapped to a new location in the command-line, like this:
/tmp$ pg_basebackup --format=p --tablespace-mapping=/tmp/space2=/tmp/space2backup -D plainbackup /tmp$ ls -l plainbackup/pg_tblspc/ total 0 lrwxrwxrwx 1 alice alice 17 May 8 13:35 16385 -> /tmp/space2backup
Streaming Replication with Tablespace:
More About Tablespace
https://www.tutorialdba.com/search/label/PostgreSQL%20Tablespace
Postgresql moving tablespaces
- Setting up a new standby for a primary that already has tablespaces involves bringing over the main data directory and each tablespace directories over to the standby. If you’re using pg_basebackup to do this, then use the plain format backup to also specify appropriate new locations for the tablespaces on the standby.
- Creating a tablespace on the primary of a replicated server is bit tricker, because the paths for the new tablespace go over unmodified to the standby. The standby server expects an existing directory at the same location as in the primary, and creates a tablespace at that location. Typically, you’d want to:
- prepare and mount filesystems at both primary and standby, mount points have to be the same
- create empty directories within mount points if needed
- create tablespace at primary
More About Tablespace
https://www.tutorialdba.com/search/label/PostgreSQL%20Tablespace
Postgresql moving tablespaces
Comments
Post a Comment