# 51.49. pg\_statistic

The catalog `pg_statistic` stores statistical data about the contents of the database. Entries are created by [ANALYZE](https://www.postgresql.org/docs/13/sql-analyze.html) and subsequently used by the query planner. Note that all the statistical data is inherently approximate, even assuming that it is up-to-date.

Normally there is one entry, with `stainherit` = `false`, for each table column that has been analyzed. If the table has inheritance children, a second entry with `stainherit` = `true` is also created. This row represents the column's statistics over the inheritance tree, i.e., statistics for the data you'd see with `SELECT`` `*`column`* FROM *`table`*\*, whereas the `stainherit` = `false` row represents the results of `SELECT`` `*`column`* FROM ONLY *`table`*.

`pg_statistic` also stores statistical data about the values of index expressions. These are described as if they were actual data columns; in particular, `starelid` references the index. No entry is made for an ordinary non-expression index column, however, since it would be redundant with the entry for the underlying table column. Currently, entries for index expressions always have `stainherit` = `false`.

Since different kinds of statistics might be appropriate for different kinds of data, `pg_statistic` is designed not to assume very much about what sort of statistics it stores. Only extremely general statistics (such as nullness) are given dedicated columns in `pg_statistic`. Everything else is stored in “slots”, which are groups of associated columns whose content is identified by a code number in one of the slot's columns. For more information see `src/include/catalog/pg_statistic.h`.

`pg_statistic` should not be readable by the public, since even statistical information about a table's contents might be considered sensitive. (Example: minimum and maximum values of a salary column might be quite interesting.) [`pg_stats`](https://www.postgresql.org/docs/13/view-pg-stats.html) is a publicly readable view on `pg_statistic` that only exposes information about those tables that are readable by the current user.

#### **Table 51.49. `pg_statistic` Columns**

| <p>Column Type</p><p>Description</p>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
| --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| <p><code>starelid</code> <code>oid</code> (references <a href="https://www.postgresql.org/docs/13/catalog-pg-class.html"><code>pg\_class</code></a>.<code>oid</code>)</p><p>The table or index that the described column belongs to</p>                                                                                                                                                                                                                                                                                           |
| <p><code>staattnum</code> <code>int2</code> (references <a href="https://www.postgresql.org/docs/13/catalog-pg-attribute.html"><code>pg\_attribute</code></a>.<code>attnum</code>)</p><p>The number of the described column</p>                                                                                                                                                                                                                                                                                                   |
| <p><code>stainherit</code> <code>bool</code></p><p>If true, the stats include inheritance child columns, not just the values in the specified relation</p>                                                                                                                                                                                                                                                                                                                                                                        |
| <p><code>stanullfrac</code> <code>float4</code></p><p>The fraction of the column's entries that are null</p>                                                                                                                                                                                                                                                                                                                                                                                                                      |
| <p><code>stawidth</code> <code>int4</code></p><p>The average stored width, in bytes, of nonnull entries</p>                                                                                                                                                                                                                                                                                                                                                                                                                       |
| <p><code>stadistinct</code> <code>float4</code></p><p>The number of distinct nonnull data values in the column. A value greater than zero is the actual number of distinct values. A value less than zero is the negative of a multiplier for the number of rows in the table; for example, a column in which about 80% of the values are nonnull and each nonnull value appears about twice on average could be represented by <code>stadistinct</code> = -0.4. A zero value means the number of distinct values is unknown.</p> |
| <p><code>stakind</code><em><code>N</code></em> <code>int2</code></p><p>A code number indicating the kind of statistics stored in the <em><code>N</code></em>th “slot” of the <code>pg\_statistic</code> row.</p>                                                                                                                                                                                                                                                                                                                  |
| <p><code>staop</code><em><code>N</code></em> <code>oid</code> (references <a href="https://www.postgresql.org/docs/13/catalog-pg-operator.html"><code>pg\_operator</code></a>.<code>oid</code>)</p><p>An operator used to derive the statistics stored in the <em><code>N</code></em>th “slot”. For example, a histogram slot would show the <code><</code> operator that defines the sort order of the data.</p>                                                                                                                 |
| <p><code>stacoll</code><em><code>N</code></em> <code>oid</code> (references <a href="https://www.postgresql.org/docs/13/catalog-pg-collation.html"><code>pg\_collation</code></a>.<code>oid</code>)</p><p>The collation used to derive the statistics stored in the <em><code>N</code></em>th “slot”. For example, a histogram slot for a collatable column would show the collation that defines the sort order of the data. Zero for noncollatable data.</p>                                                                    |
| <p><code>stanumbers</code><em><code>N</code></em> <code>float4\[]</code></p><p>Numerical statistics of the appropriate kind for the <em><code>N</code></em>th “slot”, or null if the slot kind does not involve numerical values</p>                                                                                                                                                                                                                                                                                              |
| <p><code>stavalues</code><em><code>N</code></em> <code>anyarray</code></p><p>Column data values of the appropriate kind for the <em><code>N</code></em>th “slot”, or null if the slot kind does not store any data values. Each array's element values are actually of the specific column's data type, or a related type such as an array's element type, so there is no way to define these columns' type more specifically than <code>anyarray</code>.</p>                                                                     |
