Basic Administration Flashcards

Question 1

Q

Name some security features

Answer

A

SSL
- host-based authentication
- object-level permissions
- logging
- groups & roles

Question 2

Q

Name some recovery and availability features

Answer

A

Streaming replication (async and sync), aka hot standby
- Cascading streaming replication
- pg_basebackup, hot backup
- PITR (point in time recovery)

Question 3

Q

Name some cool advanced features

Answer

A

triggers and functions
- many procedural languages (pgSQL, Perl, TCL, PHP, Java, Python, etc)
- custom procedural languages
- upgrade using pg_upgrade
- unlogged tables
- materialized views

Question 4

Q

What does MVCC stand for

Answer

A

Multi-Version Concurrency Control

Question 5

Q

List some characteristics of MVCC

Answer

A

Maintains data consistency.
- Each transaction sees a snapshot of the database as it was at the beginning of the transaction.
- Writers block writers; nothing else blocks.
- Concurrent transactions are isolated.
- Prevents transactions from seeing inconsistent data.

Question 6

Q

What does WAL stand for?

Answer

A

Write-Ahead Logs (or Write-Ahead Logging)

Question 7

Q

How does WAL work?

Answer

A

Records each data change before it actually takes place.
- Data is not considered ‘safe’ until the log is written to disk.
- Provides recovery in case of crash or failure.
- As DML is executed, changes are recorded in memory in WAL buffers (as well as the shared data buffers).
- When COMMIT is executed, the WAL buffers are flushed to the WAL segment files
- Later the dirty data buffers are written to disk

Question 8

Q

What is ACID?

Answer

A

Atomicity, Consistency, Isolation, Durability

Question 9

Q

List some commonly used pg_dump switches

Answer

A

a Data only
s Definitions (“schema”) only
n NAME Dump named schema (“namespace”)
t NAME Dump named table
Fp Plain format
Fc Custom format
Fd Directory format
Ft Tar format
f FILE Dump to named file
o Dump OIDs
j JOBS Parallelize directory format dumps
v Verbose

Question 10

Q

Name four general methods for backup

Answer

A

SQL dump (pg_dump, pg_dumpall)
- Filesystem dump
- Continuous archiving
- Streaming replication

Question 11

Q

List two methods (command line examples) for dealing with very large databases when doing SQL backups

Answer

A

pg_dump … | gzip -c > file.sql.gz

* pg_dump … | split -b 1m - filename

Question 12

Q

Describe two methods for restoring SQL dumps done with pg_dump

Answer

A

psql < TEXT_FORMAT_DUMP_FILE (or psql -f TEXT_FORMAT_DUMP_FILE)
- pg_restore NON_TEXT_FORMAT_DUMP_FILE

Question 13

Q

Using command defaults for pg_dump, when you restore a backup into a new cluster, is there anything you need to do first?

Answer

A

Yes, create the target database. By default, pg_dump does not include the command to create the target database.

Question 14

Q

What does the –create (-C) switch to pg_dump do?

Answer

A

pg_dump --create inserts a command to create and reconnect to the target database before the commands to populate that database.

Question 15

Q

What does the –clean switch to pg_dump do?

Answer

A

pg_dump --clean drops the target database prior to recreating it, when using the –create (-C) switch.

Question 16

Q

List important pg_restore options

Answer

A

pg_restore

d DB Connect to the specified database, and, if -C (–create) is not specified, restore into this database also.
C Create the database specified in the backup and restore into it
a Restore data only (assumes the SQL objects like tables have already been created)
s Restore object definitions (DDL) only, not data
t NAME Restore named table
n NAME Restore named schema/namespace
v Verbose

Question 17

Q

What command can dump an entire cluster as SQL?

Answer

A

pg_dumpall

Question 18

Q

What does pg_dumpall do?

Answer

A

pg_dumpall dumps all databases in the cluster, and also global objects such as roles and tablespaces.

Question 19

Q

List commonly used pg_dumpall switches

Answer

A

pg_dumpall

a Dump data only
s Dump definitions only
c Clean (drop) objects before recreating
g Dump global objects (roles, etc) only, not databases
r Dump roles only, not databases or other global objects
O Skip ownership commands
x Skip privileges
-disable-triggers Disable triggers during data-only restore
v Verbose

Question 20

Q

How many times does pg_dumpall ask for authentication credentials?

Answer

A

Once per database (and more?)

Question 21

Q

Why is it a good idea to use a pgpass file when using pg_dumpall?

Answer

A

A pgpass file is a good idea for pg_dumpall because the user’s credentials are requested once per database in the cluster.

Question 22

Q

List some characteristics of SQL dumps as a backup mechanism

Answer

A

Generate a text file containing SQL commands (or a binary representation thereof)
pg_dump is the relevant command (or pg_dumpall for the entire cluster)
pg_dump does not block readers or writers
pg_dump does not operate with special permissions
pg_dump dumps are internally consistent and are a snapshot of the database at the time pg_dump begins running

Question 23

Q

Next to SQL dump, what is the next most simple backup approach?

Answer

A

Simple file system level backup is an alternative to SQL dumps, provided that either 1) the database is shut down during the backup, or 2) the native snapshot feature of the filesystem is used (if available).

Question 24

Q

Why mightn’t filesystem snapshots work without database downtime, even if your filesystem supports this feature?

Answer

A

Filesystem snapshots might not work if the database is spread across multiple filesystems; if not, you would have to stop the database to take the multiple snapshots.

Question 25

Q

Describe the process of restoring a backup made via a filesystem snapshot

Answer

A

A filesystem snapshot backup can be restored by 1) copying the backup into place (including the WAL logs) and 2) starting the database, which will go into crash recovery mode and replay WAL logs.

Question 26

Q

How can you take your filesystem snapshot backup so that the restore is as fast as possible?

Answer

A

To minimize restoration time of a filesystem snapshot, force a CHECKPOINT right before the snapshot – this will minimize the amount of WAL logs that have to be replayed.

Question 27

Q

Can you use tar to back up a database? If so, describe how.

Answer

A

Yes, you can back up a PostgreSQL database by 1) shutting down the database; 2) using tar to copy all of the files in the database cluster; and 3) restarting the database.

Question 28

Q

How would you use rsync to backup a database?

Answer

A

You can back up a database using rsync as follows: 1) With the database running, use rsync to copy all the files. 2) Shut down the database. 3) Use rsync again to copy the files; this second rsync will create a consistent image of the database and will be quite fast, minimizing downtime.

Question 29

Q

After SQL dumps and simple filesystem backups, what is a third , more complicated mechanism for backups?

Answer

A

A third backup approach is continuous archiving of WAL logs (combined with a possibly inconsistent filesystem backup, such as produced by tar, even with the database running).

Question 30

Q

What is another name for continuous archiving as used for backup purposes?

Answer

A

Online backup

Question 31

Q

What is the primary purpose of WAL logs?

Answer

A

The primary purpose of WAL logs is to allow database commits to happen quickly (without the data being fully written to the final data pages) but to prevent loss of information in case of a crash – the WAL logs can be “played” when the database starts up after a crash, thus restoring the physical database to match its logical state at the time of the crash.

Question 32

Q

What is a secondary use to which WAL logs can be put?

Answer

A

Online backup – first, get WAL log archiving started, in which full and switched WAL logs are copied to backup storage before being recycled; then take a file system backup while the database is running (tar, rsync, etc). If the database crashes, or if you want to revert the database to a specified point in time (PIT), you can copy the original full backup into place along with the archived WAL files, and start the database.

Question 33

Q

How do you enable continuous archiving of WAL files?

Answer

A

wal_level must be set to archive or hot_standby (as opposed to, say, minimal).
archive_mode must be set to on (default is off).
archive_command must be defined – a command to copy the WAL files somewhere.

Question 34

Q

How can you achieve point in time recovery (PITR) with continuous archiving?

Answer

A

PITR can be accomplished by having the database replay the WAL files only up to a specified file, not all the way to the last file.

Question 35

Q

How can you use continuous archiving to achieve warm standby?

Answer

A

Have a second server loaded with the base backup file (filesystem-level backup), and feed the archived WAL files to this second server. At any time, the WAL files can be replayed on this second server so that the second server can take over from the first with a nearly identical state.

Question 36

Q

Why can’t pg_dump and pg_dumpall be used with online backup via continuous archiving?

Answer

A

pg_dump and pg_dumpall produce logical, not physical backups of the database. They don’t capture enough information for the WAL files to

Question 37

Q

What is the default size of WAL segment files?

Answer

A

16MB – this can be changed by recompiling PostgreSQL

Question 38

Q

How are WAL segment files named?

Answer

A

Numerically, according to the position in the abstract WAL sequence. I say abstract sequence because there is only a small number of physical files, which are recycled by renaming when a particular WAL file is no longer needed (it has been checkpointed; i.e. the changes it encodes have been reflected to the actual data pages).

Question 39

Q

What does the simplest useful archive_command value on Unix systems?

Answer

A

archive_command = ‘test ! -f /mnt/server/archivedir/%f && cp %p /mnt/server/archivedir/%f’

Question 40

Q

What does %f mean in the archive_command?

Answer

A

%f is the base file name of the WAL file

Question 41

Q

What does %p mean in the archive_command?

Answer

A

%p is the full path name of the WAL file

Question 42

Q

How do you specify a literal percent sign in the archive_command?

Question 43

Q

Describe the return value/exist status of the archive_command

Answer

A

0 if the file could be copied successfully; otherwise, non-zero.

Question 44

Q

What happens if the archive_command starts returning non-zero for every file (perhaps because of network error, unmounted fs, etc)?

Answer

A

The pg_xlog directory will continue to fill up with WAL files. If the containing filesystem fills up, PostgreSQL will do a “panic” shutdown – no committed transactions will be lost, however.

Question 45

Q

Does continuous archiving back up changes made to postgresql.conf, pg_hba.conf, etc?

Answer

A

No, you must have another approach to back up changes to the configuration files in the data directory.

Question 46

Q

If you exclude the PostgreSQL data directory from the server’s normal backup procedures, how can the config files be continuously backed up (since continuous archiving does not do so)?

Answer

A

Start PostgreSQL with -D config_dir, where config_dir is a directory outside of the data directory, some place where the normal operating system backup procedures will back the directory up. Then, in postgresql.conf within this directory, use the data_directory parameter to point to the actual data directory.

Question 47

Q

How could you temporarily turn off WAL archiving without stopping the server?

Answer

A

Set archive_command to the empty string and reload the server – WAL files will start to accumulate in the pg_xlog directory, though.

Question 48

Q

What is pg_basebackup?

Answer

A

pg_basebackup is a command for taking a base backup of the live database, to which you would apply archived WAL files in order to recover from a disaster.

Question 49

Q

Describe pg_receivexlog

Answer

A

streams transaction logs from a running cluster
uses streaming replication protocol
these files can be used for PITR
logs streamed in real time
can be used instead of archive command
example: pg_receivexlog -h localhost -D /usr/local/pgsql/archive

Question 50

Q

Describe how the archive_command cp -i %p /mnt/server/archivedir/%f </dev/null works

Answer

A

Uses the Unix file copy (cp) command to copy the source WAL file (%p - path) to the desired destination path (/mnt/server/archivedir/%f). The twist is that the interactive (-i) switch is used, presumably to prevent an already existing file of the same name from being overwritten. “But wait”, you say, “how can this command be interactive? There’s no human involved!” That’s where the redirection from /dev/null comes in – if a file would be overwritten, the prompt is written, and the empty response causes an error (status 1), which is what we would want in this scenario.

Question 51

Q

Describe the “low level API” for base backup that is an alternative to using the pg_basebackup command

Answer

A

Connect with psql and issue the command “select pg_start_backup(‘backup label’);”
Back up the data directory using tar or rsync, etc.
Now issue the SQL command “select pg_stop_backup()”.
[Note: no label is required for the stop; it’s assumed (or enforced) that there is only one backup happening at a time.]

Question 52

Q

What CLI command can be used to take a base backup of a live cluster?

Answer

A

pg_basebackup

Question 53

Q

Can a backup taken by pg_basebackup be used for PITR?

Question 54

Q

Can a backup taken by pg_basebackup be used for streaming replication?

Question 55

Q

What does pg_basebackup do, exactly?

Answer

A

It uses the low-level backup API (pg_start_backup(‘label’) and pg_stop_backup()) wrapped around some binary copying mechanism, i.e. it automatically puts the database in and out of backup mode.

Question 56

Q

List important pg_basebackup switches/options

Answer

A

D - destination for backup
F <p> - format: plain or tar
X - include log files created during the backup, so the backup is fully usable
z - compress with gzip
Z - compression level
P - enable progress reporting
h - host on which cluster is running
p - cluster port

Question 57

Q

Describe what this command is doing: pg_basebackup -h localhost -D /usr/local/pgsql/backup

Answer

A

pg_basebackup is being used to take a base backup of the cluster running on localhost; the files will be written to /usr/local/pgsql/backup/.

Question 58

Q

Describe the configuration steps required to use pg_basebackup

Answer

A

Modify pg_hba.conf to add a replication connection, e.g.: “host replication postgres IP_ADDR/32 trust”
archive_command = ‘cp -i %p /dest/dir/%f
archive_mode = on # requires restart
max_wal_senders = 3
wal_keep_segments = NUM
wal_level = archive

Question 59

Q

Describe the steps to performing (Point-In-TIme) Recovery

Answer

A

Stop the server (if it is running).
If you have enough space, keep a copy of the data directory and the transaction logs.
Remove all directories and files from the cluster data directory.
Restore the database files from the base backup (file system backup).
Verify the ownership of restored backup directories (must not be root).
Remove any files in pg_xlog/.
If you have any unarchived WAL segment files recovered from the crashed cluster, copy them in pg_xlog.
Create a recovery command file recovery.conf in the cluster data directory.
Start the server.
Upon completion of the recovery process, the server will rename recovery.conf to recovery.done.

Question 60

Q

List the settings (with types) in the recovery.conf file used for point-in-time recovery

Answer

A

restore_command (string) 
# unix: restore_command = 'cp /archive/dir/%f "%p"'
# windows: restore_command = 'copy c:\\archive\\\dir\\"%f" "%p"'
recovery_target_name (string)
recovery_target_time (timestamp)
recovery_target_xird (string)
recovery_target_inclusive (boolean)
recovery_target_timeline (string)
pause_at_recovery_target (boolean)

Question 61

Q

Why might one choose to set archive_timeout to something other than the default of 0?

Answer

A

When doing WAL archiving, remember that only full (16MB) WAL files are shipped, so if the transaction rate and volume are low, you could be exposed to losing data in case of catastrophe. The archive_timeout forces a WAL log to be shipped after the specified number of seconds has passed, regardless of whether it is full or not. If the value is too low (but not zero), it could lead to WAL bloat in the archive.

Question 62

Q

Describe what the pg_xlogdump contrib module and how to use it

Answer

A

pg_xlogdump displays the WAL in human-readable format
can give wrong results when the server is running
Syntax: pg_xlogdump [[startseg] [endseg]]

Question 63

Q

What is the name of the contrib module that can display the WAL in a human-readable format

Answer

A

pg_xlogdump

Question 64

Q

What are the two methods for upgrading PostgreSQL?

Answer

A

) Old-school: pg_dump and then restore into the new cluster
) New-school: pg_upgrade

Answer 62

A

Helps upgrade between major releases of PostgreSQL (that would ordinarily require dump and restore)
Supports upgrading PG 8.3.X or later to latest version
Verifies old and new clusters are binary compatible
Option for checking clusters (?)
Can do in-place or side-by-side upgrade
Side-by-side upgrade requires double storage
Can be done with parallel jobs in PG 9.3+

Answer 63

A

b old_bindir
B new_bindir
d old_datadir
D new_datadir
p old_port
P new_port
c check clusters only (no change)
j # of jobs
k use hard links instead of copying files
r retain SQL and log files even after successful completion
u user_name

Answer 64

A

Optimizer

Answer 65

A

row estimates (cardinality)
access method: sequential or index
join method: hash, nested loop, etc
join type and order
sorting and aggregates

Answer 66

A

EXPLAIN (options) statement
where option can be one of:
ANALYZE
VERBOSE
COSTS
BUFFERS
TIMING
FORMAT {TEXT|XML|JSON|YAML}

Answer 67

A

X=estimated startup cost
Y=estimated total cost
R=estimated rows output
W=estimated average width in bytes of rows output

Answer 68

A

Table statistics in pg_class include retuples, the estimated number of rows in a table, and relpages, the estimated number of disk blocks taken up by a table or index.

Answer 69

A

No, table statistics are updated using ANALYZE or VACUUM ANALYZE

Answer 70

A

pg_class and pg_statistic store the table statistics; pg_stats is a view on pg_statistic that is more commonly used.

Answer 71

A

The tables and indexes in the current database that the current user owns (is that right?)

Answer 72

A

Yes - “ANALYZE some_table” or “ANALYZE some_table(some_col)”

Answer 73

A

Yes, autovacuum runs ANALYZE by default.

Answer 74

A

No, DELETE merely marks a row as deleted; the row space can be reused or removed by VACUUM .

Answer 75

A

Can recover or reuse space occupied by obsolete rows (from deletes and updates)
Can (via ANALYZE option) update data statistics
Updates the visibility map, which speeds up index-only scans
Protects against loss of very old data due to transaction ID wraparound

Answer 76

A

It makes index-only scans more efficient (and more likely to be chosen by the query planner) - the full heap tuple doesn’t need to be read in order to determine whether an index entry is valid or not.

Answer 77

A

removes dead rows and marks the space available for future reuse.
Does not shrink the file except for dead rows at the end of the table

Answer 78

A

More aggressive algorithm (what does this mean?)
Rewrites the entire table with no dead space
Takes a hell of a lot more time
Requires at least twice the disk space of the original copy of the table

Answer 79

A

Each transaction has a 32-bit ID (XID) that is allocated serially, and each row version is marked with the XID of the transaction that created it. A transaction is not allowed to see rows with XIDs that are larger than its own, because these are “in the future”. But because there is a limit to how large XIDs can be (2^32, approx 4 billion), at some point XIDs wrap around to 0, at which point a row with a new, low XID is seen as older than higher XIDs, when actually it is newer. Vacuuming can “freeze” old rows by assigning them a special XID that means “old, visible by all”. This prevents wraparound failure, although I am not sure about the details.

Answer 80

A

At least once every two billion transactions.

Answer 81

A

The visibility map for a table keeps track of which pages contain only tuples that are visible to all current and future transactions (until the page is modified, anyway). This has two benefits: 1) it helps vacuum avoid looking at pages unnecessarily, and 2) it allows index scans to avoid grabbing the heap tuple merely to determine if the current transaction is allowed to see the tuple for an index entry. The VM is tiny compared to the table/heap, so for very large tables, the cost savings can be significant. This allows “index-only scans” to be used.

Answer 82

A

VACUUM updates it.

Answer 83

A

The visibility map is very small, so it is readily cached, and many index entries can be checked for visibility with very little memory or disk I/O.

Answer 84

A

REINDEX should be run when:

an index is corrupted (rare)
an index is bloated (many almost pages)
a storage parameter for the index (like fillfactor) has been changed
an index build with CONCURRENTLY failed, leaving an invalid index

Answer 85

A

REINDEX {INDEX|TABLE|DATABASE|SYSTEM} name [FORCE]

Answer 86

A

The pg_catalog schema is where system information about a database is stored.

Answer 87

A

The following objects are stored in the pg_catalog schema:

System tables (like pg_class)
System functions (e.g. pg_database_size)
System views (pg_stat_activity)

Answer 88

A

Yes, pg_catalog is effectively part of the search_path.

Answer 89

A

The \dS psql command lists tables and views from the pg_catalog schema (in addition to other tables and schemas in the search path).

Answer 90

A

pg_tables, pg_constraint (no s), pg_indexes, pg_trigger (no s), and pg_views - (thanks for the consistency, guys!) These are all views except for pg_constraint.

Answer 91

A

current_database() is the system function for showing the database to which you are connected

Answer 92

A

current_schema() is the system function for showing the first schema in the search path.

Answer 93

A

inet_client_addr, inet_client_port, inet_server_addr, inet_server_port

Answer 94

A

pg_postmaster_start_time

Answer 95

A

schema, relation name, relation type, and owner

Answer 96

A

size and description (the latter is empty for system objects)

Answer 97

A

session_user() is analogous to the UNIX “real user”, current_user() to the “effective user”

Answer 98

A

current_schemas(boolean)

Answer 99

A

If true, the schemas implicitly in the search path (usually pg_catalog) are also included. If false, just the schemas from the normal explicit search_path are included.

Answer 100

A

current_setting(setting_name_str)

Answer 101

A

set_config()

Answer 102

A

pg_cancel_backend(pid) cancels the current query in a backend process - the argument is pid

Answer 103

A

pg_terminate_backend(pid)

Answer 104

A

set_config(setting, new_value, is_local_to_transaction) (if not transaction-specific, then applies to the session).

Answer 105

A

pg_reload_conf() reloads the configuration files

Answer 106

A

pg_rotate_logfile

Answer 107

A

pg_start_backup(label, [fast]) and pg_stop_backup()

Answer 108

A

pg_tablespace_size(name_or_oid)

Answer 109

A

pg_database_size(name_or_oid)

Answer 110

A

pg_relation_size(name_or_oid)

Answer 111

A

pg_total_relation_size(name_or_oid)

Answer 112

A

Neither one!

Answer 113

A

pg_column_size(something)

Answer 114

A

pg_ls_dir, pg_read_file, pg_stat_file

Answer 115

A

pg_ls_dir(dir_relative_to_data_dir) - superuser only. E.g.: select pg_ls_dir(‘.’) lists all files in the data directory

Answer 116

A

pg_read_file(path) reads a file, one line per row

Answer 117

A

\df func_name

Answer 118

A

pg_stat_activity

Answer 119

A

pg_stat_database

Answer 120

A

pg_stat_user_tables

Answer 121

A

pg_stat_user_indexes

Answer 122

A

pg_stat_user_functions

Answer 123

A

-- Show all schemas explicitly in search path:
select current_schemas(False);

Answer 124

A

select viewname, definition from pg_views where schemaname = ‘edbstore’;

Answer 125

A

select pg_reload_conf();

Answer 126

A

select usename as user, now()-backend_start as session_time from pg_stat_activity;

Answer 127

A

select pg_terminate_backend(pid) from pg_stat_activity where usename = ‘blah’;

Answer 128

A

select datname, pg_size_pretty(pg_database_size(oid)) from pg_database order by pg_database_size(oid);

Answer 129

A

COPY moves data between tables and file-system files on the database server

Answer 130

A

COPY table_name FROM ‘filename’
– or:
COPY table_name FROM PROGRAM ‘command’

Answer 131

A

Copies data from a file into a table

Answer 132

A

COPY table_name TO ‘filename’
– or:
COPY table_name TO PROGRAM ‘command’

Answer 133

A

COPY { table_name [ ( column_name [, …] ) ] | ( query ) }
TO { ‘filename’ | PROGRAM ‘command’ | STDOUT }
[ [ WITH ] ( option [, …] ) ]
where option can be one of:
FORMAT format_name
OIDS [ boolean ]
FREEZE [ boolean ]
DELIMITER ‘delimiter_character’
NULL ‘null_string’
HEADER [ boolean ]
QUOTE ‘quote_character’
ESCAPE ‘escape_character’
FORCE_QUOTE { ( column_name [, …] ) | * }
FORCE_NOT_NULL ( column_name [, …] )
ENCODING ‘encoding_name’

Answer 134

A

COPY emp TO ‘/tmp/emp.csv’ WITH (FORMAT CSV, HEADER);

– Don’t forget the parentheses around options!

Answer 135

A

cat emp.csv | ssh remote.host “psql -U edbstore edbstore -c ‘copy emp from stdin;’”

Answer 136

A

COPY tablename FROM filename (FREEZE) will freeze the loaded rows. It can only be used if the target table was previously created or truncated in the same transaction. This prevents VACUUM from having to do this freezing at some point in the future. The caveat is that the rows will be visible to all other transactions as soon as they are loaded (before the end of the enclosing transaction) – this is a violation of MVCC.

Answer 137

A

COPY emp TO ‘/tmp/emp.csv’ WITH (FORMAT CSV, HEADER, DELIMITER ‘|’)

Answer 138

A

CREATE TABLE copyemp (LIKE emp);

Answer 139

A

ACID-compliant
Supports transactions
Supports savepoints
Uses Write-Ahead Logging

Answer 140

A

MVCC (connection scalability)
Table partitioning (size)
Tablespaces (size)

Answer 141

A

unlimited

Answer 142

A

Unlimited

Answer 143

A

250-1600, depending on column types

Answer 144

A

Unlimited

Answer 145

A

A table or index

Answer 146

A

Attribute

Answer 147

A

UC Berkeley

Answer 148

A

SQL Server, Informix, Ingres

Answer 149

A

The postmaster listens for connections from clients and spawns new a new backend process to handle each connection. The postmaster manages these backend processes as well as other background utility processes

Answer 150

A

No, it uses processes

Answer 151

A

Shared buffers (data buffers)
WAL buffers
Process array

Answer 152

A

bgwriter
stats collector
checkpointer
archiver
autovacuum
log writer
WAL writer

Answer 153

A

Data files and friends (indexes, visibility map)
WAL segments
Archived WAL
Error/diagnostic log files

Answer 154

A

Writes dirty data blocks to disk when room is needed for more blocks in shared memory.

Answer 155

A

background writer (bgwriter)

Answer 156

A

Flushes write-ahead log to disk

Answer 157

A

The WAL writer process

Answer 158

A

The checkpointer process

Answer 159

A

It performs checkpoints (syncing of dirty data blocks to disk) at intervals or otherwise according to configuration parameters

Answer 160

A

There is one autovacuum launcher process, which launches multiple autovacuum workers processes

Answer 161

A

The autovacuum launcher process

Answer 162

A

Launches autovacuum worker processes

Answer 163

A

Recover free space for reuse

Answer 164

A

Autovacuum worker processes

Answer 165

A

Logging collector

Answer 166

A

Routes log messages to syslog, eventlog or log files

Answer 167

A

Stats collector

Answer 168

A

Collects usage statistics by relation and block

Answer 169

A

Archives write-ahead log files in pg_xlog when full (e.g. copies them to a mounted SAN share).

Answer 170

A

Shared memory and semaphores

Answer 171

A

IP address, user, password, key

Answer 172

A

Verifying permissions in the database

Answer 173

A

Shared buffers

Answer 174

A

To read OS and disk reads

Answer 175

A

Shared buffer blocks are written to disk only when needed:

1) to make room for new block
2) at checkpoint time

Answer 176

A

When DML is executed to change data, the changes are made to the data blocks in shared memory and also (in a different form) to WAL buffers in shared memory. The WAL writer process flushes WAL buffers to WAL segment files on disk (the “transaction log”) periodically, or on commit, or when the WAL buffers are full. As of 9.2, there is a group commit feature which attempts to batch together WAL-writing from multiple commits that occur nearly at the same time.

Answer 177

A

Before commit: changes are stored in memory in the shared data buffers and also in the WAL buffers. (It is possible under conditions of high activity and/or tight memory that changes may be forced out to WAL files and data files).
After commit: changes have been written to write-ahead log files on disk (but not necessarily to the data files).
After checkpoint: changes have been written from the shared buffers to data files.

Answer 178

A

Parsing, optimizing, and execution

Answer 179

A

Syntax check
Call Traffic Cop (what is that?)
Identify query type
Command processor if needed
Break query in tokens

Answer 180

A

Planner generates plans using database statistics
Query cost calculation
Choose best plan

Answer 181

A

Haha, fooled you. Just one step: execution.

Answer 182

A

A cluster is a collection of one or more databases managed by one server instance

Answer 183

A

Each cluster has a separate:

data directory
TCP port
set of processes

Answer 184

A

PG_VERSION A file containing the major version number of PostgreSQL

base - Subdirectory containing per-database subdirectories

global - Subdirectory containing cluster-wide tables, such as pg_database

pg_clog - Subdirectory containing transaction commit status data

pg_multixact - Subdirectory containing multitransaction status data (used for shared row locks)

pg_notify - Subdirectory containing LISTEN/NOTIFY status data

pg_serial - Subdirectory containing information about committed serializable transactions

pg_snapshots - Subdirectory containing exported snapshots

pg_stat_tmp - Subdirectory containing temporary files for the statistics subsystem

pg_subtrans - Subdirectory containing subtransaction status data

pg_tblspc - Subdirectory containing symbolic links to tablespaces

pg_twophase - Subdirectory containing state files for prepared transactions

pg_xlog - Subdirectory containing WAL (Write Ahead Log) files

postmaster. opts - A file recording the command-line options the server was last started with
postmaster. pid A lock file recording the current postmaster process ID (PID), cluster data directory path, postmaster start timestamp, port number, Unix-domain socket directory path (empty on Windows), first valid listen_address (IP address or *, or empty if not listening on TCP), and shared memory segment ID (this file is not present after server shutdown)

Usually: postgresql.conf

Usually: pg_hba.conf

Usually: pg_ident.conf

Answer 185

A

Cluster-wide system tables, like pg_database

Answer 186

A

Per-database subdirectories for databases having data in the default tablespace

Answer 187

A

Write Ahead Log files

Answer 188

A

Transaction commit status data

Answer 189

A

Subtransaction commit status data

Answer 190

A

Multitransaction status data (used for shared row locks)

Answer 191

A

LISTEN/NOTIFY status data

Answer 192

A

Information about committed serializable transactions

Answer 193

A

Exported snapshots

Answer 194

A

Temporary files for the statistics subsystem

Answer 195

A

Symbolic links to tablespaces

Answer 196

A

A file recording the command-line options the server was last started with

Answer 197

A

A lock file recording the current postmaster process ID (PID), cluster data directory path, postmaster start timestamp, port number, Unix-domain socket directory path (empty on Windows), first valid listen_address (IP address or *, or empty if not listening on TCP), and shared memory segment ID (this file is not present after server shutdown)

Answer 198

A

State files for prepared transactions

Answer 199

A

One directory per database in the cluster, the directory being named with the OID of the database. This is the default location for the cluster’s databases and files, and the system catalogs are stored here at a minimum

Answer 200

A

One or more files for each table or index in the database. For ordinary relations, these files are named after the table or index’s filenode number, which can be found in pg_class.relfilenode. But for temporary relations, the file name is of the form tBBB_FFF, where BBB is the backend ID of the backend which created the file, and FFF is the filenode number. In either case, in addition to the main file (a/k/a main fork), each table and index has a free space map, which stores information about free space available in the relation. The free space map is stored in a file named with the filenode number plus the suffix _fsm. Tables also have a visibility map, stored in a fork with the suffix _vm, to track which pages are known to have no dead tuples. Unlogged tables and indexes have a third fork, known as the initialization fork, which is stored in a fork with the suffix _init.

Answer 201

A

Each tablespace is a directory. base is the default tablespace. All other tablespaces can be located anywhere, but a soft link to each tablespace must be placed in data/tblspc/.

Answer 202

A

Yes, a database can have files in multiple tablespaces. Each tablespace has a subdirectory for each database that has files in the tablespace, so the OID of the tablespace may appear in multiple tablespace directories.

Answer 203

A

A table or index is stored in one or more physical files.
For non-temporary relations, the first such file is named as the relation’s file node number (pg_class.relfilenode), and subsequent files for the same relation are named _N where N is a serial number.
For temporary relations, the file name is of the form tBBB_FFF, where BBB is the backend ID of the backend which created the file, and FFF is the filenode number

Answer 204

A

Visibility map: RELFILENODE_vm
Free space map: RELFILENODE_fsm
(For unlogged relations): initialization fork: RELFILENODE_init

Answer 205

A

bin - programs
data - data directory
doc - documentation
include - header files
installer, scripts - installer files (EDB)
lib - libraries
pgAdmin III (EDB)
StackBuilder (EDB)
pg_env.{bat,sh}

Answer 206

A

Page header - approx 24 bytes long; pointer(s) to free space in the page; general info
Row/index pointers - array of offset/length pairs pointing to row or index entries later in the page
Free space - unallocated space. New pointers are allocated from the front , new rows/index entries from the rear
Row/Index entries - actual row or index entries
Special - index access method-specific data (empty in regular tables) [Really? Empty?]

Answer 207

A

No; a password is required

Answer 208

A

EDB One-Click Installer
OS system package (RPM/YUM, Debian/Ubuntu DEB, FreeBSD port, Solaris package, Mac OS X Homebrew
Source code

Answer 209

A

PATH - should include the correct PG bin directory
PGDATA - points to data cluster directory
PGPORT - point to port on which cluster is running
PGUSER - default database user name
PGDATABASE - default database

Answer 210

A

Edit your shell .profile or .bash_profile

Answer 211

A

Use the Windows My Computer properties page

Answer 212

A

initdb creates a database cluster’s data directory

Answer 213

A

initdb –D

a - specifies the authentication method for local users
D - Database cluster directory
U - Select the database super user name
E - Specify the database encoding
k –data-checksums - Use checksums on data pages to help detect corruption
W – prompt for superuser password
X, –xlogdir=XLOGDIR location for the transaction log directory

Answer 214

A

postgtesql.conf - to set the correct listening address and port, and to set appropriate configuration in general
pg_hba.conf - to define what users should be able to connect to which databases from which IP addresses

Answer 215

A

−pg_ctl initdb [] - creates a new PostgreSQL database cluster
−pg_ctl start [] - Start the server
−pg_ctl stop [] - Stop the server
−pg_ctl restart [] - Restart the server
−pg_ctl status [] - Display server status
−pg_ctl reload [] - Reload configuration file
−pg_ctl promote [-D DATADIR] – Promote Standby to be Primary
−pg_ctl kill signal_name process_id – send a signal(ABRT HUP INT QUIT TERM USR1 USR2) to a process
−Pg_ctl register|unregister - a system service on Microsoft Windows

Answer 216

A

−-m smart (the defaults) waits for all clients to exit
−-m fast rolls back active transactions, closes open connections, and shuts down cleanly
−-m immediate performs an immediate, abnormal shutdown (i.e. a crash)

Answer 217

A

−-D to specify an alternate cluster location
−-l to specify an alternate log file, when starting the server
−-c, –core-files allow postgres to produce core files
Starting and Stopping the Server (pg_ctl)

Answer 218

A

\c [DBNAME [USERNAME]]

Answer 219

A

listen_addresses (default localhost) - IP addresses to listen on; ‘*’ means all
port (default 5432) - port to listen on
max_connections (default 100) - max concurrent connections
superuser_reserved_connections (default 3) - number of connections reserved for superusers (out of the defined max_connections)
unix_socket_directory (default /tmp) - directory to be used for UNIX socket connections
unix_socket_permissions (default 0777) - access permissions of the UNIX-domain socket

Answer 220

A

authentication_timeout (default 1 minute)
ssl (default: off) - enable SSL connections
ssl_ca_file - SSL certificate authority file
ssl_cert_file - SSL certification
ssl_key_file - SSL private key
ssl_ciphers - list of eligible SSL ciphers
ssl_renegotiation_limit (default 512 MB) - how much data can flow through the connection before renegotiation occurs

Answer 221

A

shared_buffers (default: <=128MB) - size of shared buffer pool; rule of thumb: 25% of system memory to a max of 8GB on Linux, or 512 MB on Windows
temp_buffers (default: 8 MB) - amount of memory used by each backend for caching temp table data
work_mem (default: 1MB) - amount of memory used for each sort or hash operation before switching to temporary disk files
maintenance_work_mem (default: 16 MB) - amount used for each index build or VACUUM
temp_file_limit (default: -1) - amount of disk space that a session can use for temporary files. Default is unlimited. Attempting to exceed the limit will abort a transaction.

Answer 222

A

25% of system memory up to a maximum of 512 MB.

Answer 223

A

25% of system memory, up to a maximum of 8 GB.

Answer 224

A

Memory setting controlling the size of the shared data buffer cache. Default is <= 128 MB.

Answer 225

A

temp_buffers (default: 8MB): Amount of memory used by each backend for caching temporary table data.

Answer 226

A

work_mem (default: 1MB): Amount of memory used for each sort or hash operation before switching to temporary disk files. Default is conservative, but don’t overdo it.

Answer 227

A

maintenance_work_mem (default: 16MB): Amount of memory used for each index build or VACUUM.

Answer 228

A

temp_file_limit (default -1): amount of disk space that a session can use for temporary files. A transaction attempting to exceed this limit will be cancelled. Default is unlimited.
Memory Settings

Answer 229

A

random_page_cost (default 4.0): Estimated cost of a random page fetch, in abstract cost units. May need to be reduced to account for caching effects.
seq_page_cost (default 1.0): Estimated cost of a sequential page fetch, in abstract cost units. May need to be reduced to account for caching effects. Must always set random_page_cost >= seq_page_cost.
effective_cache_size (default 128M): Used to estimate the cost of an index scan. Rule of thumb is 75% of system memory.
There are plenty of enable_* parameters which influence the planner in choosing an optimal plan. For example:
enable_indexonlyscan enables or disables the query planner’s use of index-only-scan plan types

Answer 230

A

random_page_cost (default 4.0): Estimated cost of a random page fetch, in abstract cost units. May need to be reduced to account for caching effects.

Answer 231

A

seq_page_cost (default 1.0): Estimated cost of a sequential page fetch, in abstract cost units. May need to be reduced to account for caching effects. Must always set random_page_cost >= seq_page_cost.

Answer 232

A

effective_cache_size (default 128M): Used to estimate the cost of an index scan. Rule of thumb is 75% of system memory.

Answer 233

A

wal_level
fsync
wal_buffers
checkpoint_segments
checkpoint_timeout

Answer 234

A

wal_level (default: minimal). Determines how much information is written to the WAL. Change this to enable WAL archiving/replication. Other values are archive and hot_standby.

Answer 235

A

fsync (default on): Turn this off to make your database much faster – and silently cause arbitrary corruption in case of a system crash.

Answer 236

A

wal_buffers (default: -1, autotune): The amount of memory used in shared memory for WAL data. The default setting of -1 selects a size equal to 1/32nd (about 3%) of shared_buffers

Answer 237

A

checkpoint_segments (default 3): Maximum number of 16MB WAL file segments between checkpoints. Default is too small!

Answer 238

A

checkpoint_timeout (default 5 minutes): Maximum time between checkpoints.

Answer 239

A

log_destination
logging_collector
log_directory
log_filename
log_file_mode
log_rotation_age
log_rotation_size

Answer 240

A

log_destination. Valid values are combinations of stderr, csvlog, syslog, and eventlog, depending on platform.

Answer 241

A

strftime (but system strftime is not used so you can’t use local extensions). postgresql-%Y-%M-%d.log

Answer 242

A

client_min_messages (default NOTICE). Messages of this severity level or above are sent to the client.
log_min_messages (default WARNING). Messages of this severity level or above are sent to the server.
log_min_error_statement (default ERROR). When a message of this severity or higher is written to the server log, the statement that caused it is logged along with it.
log_min_duration_statement (default -1, disabled): When a statement runs for at least this long, it is written to the server log, with its duration.

Answer 243

A

log_connections (default off): Log successful connections to the server log.
log_disconnections (default off): Log some information each time a session disconnects, including the duration of the session.
log_error_verbosity (default “default”): Can also select “terse” or “verbose”.
log_duration (default off): Log duration of each statement.
log_line_prefix: Additional details to log with each line.
log_statement (default none): Legal values are none, ddl, mod (DDL and all other data-modifying statements), or all.
log_temp_files (default -1): Log temporary files of this size or larger, in kilobytes.
log_checkpoints (default off): Causes checkpoints and restartpoints to be logged in the server log.

Answer 244

A

Logs temporary files of this size or larger, in kilobytes

Answer 245

A

bgwriter_delay
bgwriter_lru_maxpages
bgwriter_lru_multiplier

Answer 246

A

bgwriter_delay (default 200 ms): Specifies time between activity rounds for the background writer.

Answer 247

A

bgwriter_lru_maxpages (default 100): Maximum number of pages that the background writer may clean per activity round.

Answer 248

A

bgwriter_lru_multiplier (default 2.0): Multiplier on buffers scanned per round. By default, if system thinks 10 pages will be needed, it cleans 10 * bgwriter_lru_multiplier of 2.0 = 20.

Answer 249

A

The primary background writer tuning technique is to lower the bgwriter_delay.

Answer 250

A

search_path specifies the order in which schemas are searched. The default value for this parameter is: “$user”, public

Answer 251

A

search_path

Answer 252

A

default_tablespace is the name of the tablespace in which objects are created by default

Answer 253

A

temp_tablespaces holds the tablespace name(s) in which objects are created by default. (The temp table load is evenly spread across the tablespaces in this list).

Answer 254

A

Any statement that takes more than the specified number of milliseconds will be aborted. The default value is 0 (no maximum statement time).

Answer 255

A

vacuum_cost_delay
vacuum_cost_page_hit
vacuum_cost_page_miss
vacuum_cost_page_dirty
vacuum_cost_limit

Answer 256

A

vacuum_cost_delay is the length of time, in milliseconds, that the process will wait when the cost limit it exceeded. Default for manual VACUUM is 0, so if you want a low-impact manual vacuum, you should set this to a non-zero value (autovacuum uses 20 ms ….)

Answer 257

A

autovacuum (default on) turns on or off autovacuuming

Answer 258

A

Autovacuum tasks running longer than this duration (in milliseconds) are logged. -1 is the default, which disables this logging.

Answer 259

A

autovacuum_max_workers (default 3) is the max number of autovacuum worker processes that may be running at one time

Answer 260

A

autovacuum_max_workers

Answer 261

A

autovacuum!

Answer 262

A

include ‘filename’

Answer 263

A

include_dir ‘somedir’. All files named *.conf will be included from the named directory, in C locale filename order. Thus, you could name the files 00_foo.conf, 01_bar.conf, 02_baz.conf to control the order of loading while still having the names be meaningful.

Answer 264

A

superuser_reserved_connections

Answer 265

A

log_min_duration_statement

Answer 266

A

In postgresql.conf, set: max_connections = 200

and restart the server

Answer 267

A

In postgresql.conf, set: superuser_reserved_connections = 10

and restart the server

Answer 268

A

In postgresql.conf set: authentication_timeout = 10s

and reload the server

Answer 269

A

Set logging_collector to on

Answer 270

A

Set log_min_duration_statement to 5000

Answer 271

A

Set log_connections to on and set the log_line_prefix to something include ‘%u’

Answer 272

A

set autovacuum_max_workers to 6
set autovacuum_vacuum_threshold to 100
set autovacuum_vacuum_scale_factor to 0.3
set autovacuum_analyze_threshold to 100
set autoavacuum_cost_limit to 100

Answer 273

A

A database cluster contains:

Roles (users, groups)
Tablespaces
Databases

Answer 274

A

A database contains:

catalogs
extensions
schemas

Answer 275

A

tables
views
sequences
functions
event triggers

Answer 276

A

select datname from pg_database;

Answer 277

A

\l (lower-case L)

Answer 278

A

CREATE DATABASE name [ [ WITH ] [ OWNER [=] dbowner ] [ TEMPLATE [=] template ] [ ENCODING [=] encoding ] [ TABLESPACE [=] tablespace ] [ CONNECTION LIMIT [=] connlimit ] ]

Answer 279

A

create schema fooz authorization foozer;

Answer 280

A

“$user”, public

Answer 281

A

The current schema is the first schema in the search_path

Answer 282

A

When trying to resolve object names (tables, etc), PG searches the schemas in the search_path in order. When creating a table, by default the table is created in the “current schema” – the first schema in the search_path.

Answer 283

A

Users are roles that can log into any database; groups are roles that can NOT log into any database.

Answer 284

A

> create user fred with password 'flinstone';
> create database fred with owner fred;
> \c fred fred
> create schema fred;
> \dn
> \q

Answer 285

A

select datname from pg_database;
– Advanced: list databases with owners:
select d.datname, u.usename from pg_database d join pg_user u on (d.datdba = u.usesysid);

Answer 286

A

\l (lower case L)

Answer 287

A

Two methods:

(psql meta-command) \d+
(SQL) select schemaname, tablename, tableowner from pg_tables

Answer 288

A

Double-quotes are used to specify an exact name, preserving case

Answer 289

A

There are two different and equivalent usages of psql:
1) psql [DBNAME [USER]]
and
2) psql -U USER DBNAME

Answer 290

A

If the PGUSER and PGDATABASE environment variables are not defined, the default values for USER and DBNAME are the name of the operating system user. If PGUSER and PGDATABASE are defined, those values are used. If just PGUSER is defined, that value is also the default database name.

Answer 291

A

\c DBNAME [USER]

Answer 292

A

\c - NEWUSER

– (note the dash character)

Answer 293

A

\c DBNAME [USER]

Answer 294

A

\c - NEWUSER

– (note the dash character)

Answer 295

A

1) If the PGHOST and PGPORT environment variables are defined, they are used. Otherwise, on UNIX systems, a local UNIX socket connection is attempted, or, on Windows, a local TCP connection.

Answer 296

A

psql always runs commands in ~/.psqlrc unless -X is specified

Answer 297

A

Shows or saves the command history

Answer 298

A

Edits (and then executes) the query buffer

Answer 299

A

\w FILENAME

Answer 300

A

Saves the query buffer (to the specified FILENAME)

Answer 301

A

Two methods.

1) (Command line): -v NAME=VALUE
2) (Inside psql): \set NAME VALUE

Answer 302

A

Three ways:
1) Unadorned (numbers)
\:NAME
\set NAME 10
select :NAME;
2) Quoted strings
\:'NAME'
\set NAME testing
select :'NAME';
3) Identifiers
\:"NAME"
\set NAME empno
select :"NAME" from emp;

Answer 303

A

-o FILENAME or \o FILENAME (FILENAME may be a pipe)
Example:
\o | grep 4
select empno from emp;
  7499
  7654
  7844
  7934
(14 rows)

Answer 304

A

Three ways:
1) Unadorned (numbers)
\:NAME
\set NAME 10
select :NAME;
2) Quoted strings
\:'NAME'
\set NAME 'testing'
select :'NAME';
3) Identifiers
\:"NAME"
\set NAME 'empno'
select :"NAME" from emp;

Answer 305

A

-o FILENAME or \o FILENAME (FILENAME may be a pipe)

Answer 306

A

\g [FILENAME] (filename may be a pipe)

Answer 307

A

“Tuples only” mode means that column headings are not output, and the final count of rows is not output. Within psql, \t toggle tuples-only output. On the psql command line, -t turns on tuples-only mode. An alternative to \t is \pset tuples_only

Answer 308

A

Toggles expanded output, where columns are output as rows.

Answer 309

A

“Tuples only” mode means that column headings are not output, and the final count of rows is not output. Within psql, \x

Answer 310

A

\echo [-n] [string]
Prints a string on STDOUT, followed by a newline, unless the -n switch is used, which suppresses the newline. The output of \echo is not affected by -o or \o.

Answer 311

A

\qecho is like \echo, but its output is redirected by -o or \o.

Answer 312

A

name, owner, encoding, collation, ctype, access privileges

Answer 313

A

size, tablespace, and descripion

Answer 314

A

Lists schemas (namespaces)

Answer 315

A

name and owner

Answer 316

A

name, owner, access privileges, and description

Answer 317

A

List functions

Answer 318

A

schema, name, result data type, argument data types, type

Answer 319

A

Same as \df, but in addition: security, volatility, owner, language, source code

Answer 320

A

List info about indexes, sequences, tables, views, or System objects; any combination of letters is possible

Answer 321

A

Lists per-database role settings

Answer 322

A

List access privileges (for tables, views, and sequences, by default)

Answer 323

A

This command fetches and edits the definition of the named function (Can also be used to create new functions - just don’t supply a name, and your editor will be invoked on a function definition template).

Answer 324

A

\password

Answer 325

A

\d+ TABLENAME

Answer 326

A

\d+ also gives you storage, stats target, and description

Answer 327

A

Two ways:
1. With \g ("one-shot \o"):
select * from emp \g FILENAME
2. With \o:
\o FILENAME
select * from emp;
\o
3. From command line:
psql -U edbstore -o /tmp/emp.dat -c "select * from emp" edbstore

Answer 328

A

\t
\o FILENAME
select * from emp;
\o
\t

Answer 329

A

Three ways:

\i FILENAME
Command line: psql … < FILENAME
Command line: psql … -f FILENAME …

Answer 330

A

Two approaches, sort of:

\df+ myfunc (primitive)
\ef myfunc (opens your EDITOR to allow you to view and edit source code of the function)

Answer 331

A

It means either that no PostgreSQL server is running on the specified host, or that it is not listening on the specified host and port.

Answer 332

A

Server/application
Database
Object

Answer 333

A

Checking pg_hba.conf: there must be an entry matching type (e.g. host, hostssl), database name, user name, client IP address, and authentication method (md5, trust, etc).

Answer 334

A

Checking user/password combo
CONNECT privilege on the database
SCHEMA permissions

Answer 335

A

Object (e.g. table) level privileges, administered with GRANT and REVOKE

Answer 336

A

Type (e.g. host, hostssl)
Database name (or ‘all’)
User name (or ‘all’)
Host spec, incl IPv4, IPv6, or DNS hostname
Client IP address modified by CIDR mask
authentication method (trust, reject, md5, password, gss, sspi, krb5, ident, peer, pam, ldap, radius or cert.)

Answer 337

A

trust, reject, md5, password, gss, sspi, krb5, ident, peer, pam, ldap, radius or cert.

Answer 338

A

replication

Answer 339

A

Add a line to pg_hba.conf (usually in the cluster’s data directory, unless you have specified that it go elsewhere), like:
host somedb someuser a.b.c.d/32 md5
Reload the server to activate the change to pg_hba.conf - send a HUP signal to the postmaster, or execute the SQL “select pg_reload_conf();” as a superuser, or use “pg_ctl -D datadir reload” (or the OS service equivalent that wraps pg_ctl).

Answer 340

A

ALTER DEFAULT PRIVILEGES FOR ddl_user IN SCHEMA public GRANT SELECT ON TABLES
TO readonly_user;

Answer 341

A

DROP OWNED or REASSIGN OWNED

Answer 342

A

) pg_hba.conf - ability to authenticate
) CONNECT privilege on the database (this is default)
) USAGE privilege on the relevant schema
) SELECT, UPDATE, INSERT, DELETE, EXECUTE, etc privilege on the object in the schema (grant some_privs on all tables in schema foo to some_user)

Answer 343

A

bigint (int8)

Answer 344

A

bigserial

Answer 345

A

real (float4)

Answer 346

A

double precision (float8)

Answer 347

A

numeric(p,s) (p=precision, the total number of digits, and s=scale, the number of digits in the fractional part)

Answer 348

A

Yes, it has a ‘json’ data type for storing JSON (JavaScript Object Notation) data

Answer 349

A

int4range, int8range, numrange, tsrange, tstzrange, daterange (note: no floating point range)

Answer 350

A

The column name comes first, before the column type.

Answer 351

A

insert into departments(dep_id, name) values (1, ‘Finance’);

Answer 352

A

insert into departments(dep_id, name) values (1, ‘Finance’), (2, ‘Silly Walks’);

Answer 353

A

update departments set name=’development’ where dep_id = 1;

Answer 354

A

delete from departments where department_id = 2

Answer 355

A

$$Don’t want no stinkin’ single quotes$$
– or:
$foo$Don’t want it$foo$

Answer 356

A

Double quotes are used to delimit database object names that clash with keywords, contain mixed case, or contain special characters (something other than a-z, 0-9, or underscore).

Answer 357

A

check constraints
not-null constraints
unique constraints
primary keys
foreign keys

Answer 358

A

CREATE [TEMPORARY | TEMP] SEQUENCE name [INCREMENT [ BY ] increment]
[ MINVALUE minvalue | NO MINVALUE ] [ MAXVALUE maxvalue | NO MAXVALUE ]
[ START [ WITH ] start ] [ CACHE cache ] [ [ NO ] CYCLE ]

Answer 359

A

nextval(‘myseq’)

Answer 360

A

Advances the sequence and returns a new value. The single argument should be the name of the sequence, as a string.

Answer 361

A

currval returns the most recently used value for a specific sequence.

Answer 362

A

An error; currval is the most recently used/allocated value for the sequence, so it is undefined until nextval() has been called.

Answer 363

A

Sets the next value to be returned by the sequence.

Answer 364

A

Not by default; you have to create rules in order to get updatable views.

Answer 365

A

No, it is not secure to do so, unless the view is create with the “with (security_barrier)” option.

Answer 366

A

Subquery. Don’t ask me what this means, though.

Answer 367

A

PG 9.3 has materialized views, which are like pre-computed views, with the option to refresh the snapshot of data stored in the materialized view. You could also view materialized views as being like tables that can only be populated with a single query.

Answer 368

A

CREATE MATERIALIZED VIEW myview AS SELECT blah, blah blah;

Answer 369

A

REFRESH MATERIALIZED VIEW myview;

Answer 370

A

Subqueries appearing in FROM can be preceded by the key word LATERAL. This allows them to reference columns provided by preceding FROM items. (Without LATERAL, each subquery is evaluated independently and so cannot cross-reference any other FROM item.) LATERAL is primarily useful when the cross-referenced column is necessary for computing the row(s) to be joined. A common application is providing an argument value for a set-returning function.

Answer 371

A

B-tree (default)
Hash (not crash safe)
Index on expressions (use when quick retrieval is needed on a frequently used expression)
Partial index (index only rows that satisfy the WHERE clause, which need not include the indexed column; a query must use the same WHERE clause in order to use the partial index)
SP-GiST indexes (space-partitioned GiST supported partitioned search trees)

Answer 372

A

CREATE [ UNIQUE ] INDEX [ CONCURRENTLY ] [ name ] ON table_name [ USING method ]
( { column_name | ( expression ) } [ COLLATE collation ] [ opclass ] [ ASC |
DESC ] [ NULLS { FIRST | LAST } ] [, …] )
[ WITH ( storage_parameter = value [, … ] ) ]
[ TABLESPACE tablespace_name ]
[ WHERE predicate ]

Answer 373

A

No. (However, it is necessary to wrap the column argument of “USING “ in parentheses in a JOIN query that uses one).

Answer 374

A

It is equivalent to “SELECT * FROM “

Answer 375

A

No, OIDs are not dumped by default. Use the -o (little o) switch to dump OIDs

Answer 376

A

You must use the -Fd (directory format) switch along with -j. E.g.: pg_dump -Fd -j 4 would dump to directory format using 4 parallel jobs.

Answer 377

A

It should be restored into an empty database. If you drop the database dbname and recreate it, then you can run:
psql dbname < dbname.sql

Answer 378

A

Yes, all backups made by pg_dump are portable across architectures.

Answer 379

A

r -- SELECT ("read")
            w -- UPDATE ("write")
            a -- INSERT ("append")
            d -- DELETE
            D -- TRUNCATE
            x -- REFERENCES
            t -- TRIGGER
            X -- EXECUTE
            U -- USAGE
            C -- CREATE
            c -- CONNECT
            T -- TEMPORARY

Answer 380

A

recovery.conf in the data directory will have been renamed to recovery.done

Answer 381

A

Just:

restore_command = ‘cp /mnt/server/archivedir/%f “%p”’ or what have you

Answer 382

A

recover_command = 'cp /mnt/server/archivedir/%f "%p"'
recover_target_time = '