74. System Catalog Declarations and Initial Contents
PostgreSQL uses many different system catalogs to keep track of the existence and properties of database objects, such as tables and functions. Physically there is no difference between a system catalog and a plain user table, but the backend C code knows the structure and properties of each catalog, and can manipulate it directly at a low level. Thus, for example, it is inadvisable to attempt to alter the structure of a catalog on-the-fly; that would break assumptions built into the C code about how rows of the catalog are laid out. But the structure of the catalogs can change between major versions.
The structures of the catalogs are declared in specially formatted C header files in the src/include/catalog/
directory of the source tree. In particular, for each catalog there is a header file named after the catalog (e.g., pg_class.h
for pg_class
), which defines the set of columns the catalog has, as well as some other basic properties such as its OID. Other critical files defining the catalog structure include indexing.h
, which defines the indexes present on all the system catalogs, and toasting.h
, which defines TOAST tables for catalogs that need one.
Many of the catalogs have initial data that must be loaded into them during the “bootstrap” phase of initdb, to bring the system up to a point where it is capable of executing SQL commands. (For example, pg_class.h
must contain an entry for itself, as well as one for each other system catalog and index.) This initial data is kept in editable form in data files that are also stored in the src/include/catalog/
directory. For example, pg_proc.dat
describes all the initial rows that must be inserted into the pg_proc
catalog.
To create the catalog files and load this initial data into them, a backend running in bootstrap mode reads a BKI (Backend Interface) file containing commands and initial data. The postgres.bki
file used in this mode is prepared from the aforementioned header and data files, while building a PostgreSQL distribution, by a Perl script named genbki.pl
. Although it's specific to a particular PostgreSQL release, postgres.bki
is platform-independent and is installed in the share
subdirectory of the installation tree.
genbki.pl
also produces a derived header file for each catalog, for example pg_class_d.h
for the pg_class
catalog. This file contains automatically-generated macro definitions, and may contain other macros, enum declarations, and so on that can be useful for client C code that reads a particular catalog.
Most PostgreSQL developers don't need to be directly concerned with the BKI file, but almost any nontrivial feature addition in the backend will require modifying the catalog header files and/or initial data files. The rest of this chapter gives some information about that, and for completeness describes the BKI file format.