Version 9 (modified by 3 years ago) ( diff ) | ,
---|
RFC-7: Managing PostGIS Extension Paths with Fewer Files
Date: | 2022/02/11 |
Author: | Regina Obe |
Last Edited: | 2022/02/11 |
Last Updated By: | Regina Obe <lr at pcorp dot us> |
Status: | In Discussion |
PROBLEM STATEMENT
With each new version of PostGIS, we need to ship more and more extension upgrade scripts. In reality all the upgrade scripts are just a symlink to the main one or exact copies of the main one. This results in a users extensions folder cluttered with 100s or more of PostGIS related scripts.
Not only that, but for each new version we ship, we always need to remember to add the recently shipped micros of other versions to the extensions/updateable.mk script.
The largeness of the script files (when not symlinked, which is often the case), is the biggest issue of all, and will become even more of an issue with shipping spatial_ref_sys updates.
The solution of the above issues is to have: Solution 1, Solution 2, and Solution 3 implemented, as some solutions are not backward compatible but do provide extra benefits.
SOLTUION 1
This has already been committed in master branch for PostGIS 3.3.0. Embedded in:
SELECT postgis_extensions_upgrade();
What it does: Update the current version to ANY and then run the updates. This has the affect of always forcing the postgis--ANY--<current version>.sql script to run which handles all versions.
Benefits of this approach:
- We only need to ship one extension upgrade script file per extension. The postgis--ANY--<current version>.sql
- No clutter of 100s of 1000s of files.
- We don't need to maintain extensions/updateable.mk and remember to update it.
- It is possible to upgrade from postgis (older PostgreSQL Version) 3.3.0 to (new PostgreSQL version) 3.3.0. This is generally what is needed in pg_upgrade.
Drawbacks:
- People can only upgrade using
`
SELECT postgis_extensions_upgrade();
`
Or do the manually mock with the pg_catalog.pg_extension table themselves
- We are mucking with system tables, pg_catalog.pg_extension, which we have done for other things, but is a questionable practice.
- Packagers who have put in nice features to auto upgrade extensions during pg_upgrade, now have to treat us as special snowflakes.
SOLUTION 2
Beg and plea upstream to recognize a % or ANY as a first class citizen to mean any prior version.
This approach allows for defining a new line item in the control file, that denotes something like
upgrade_script_pattern = %--<version>
Where the % will be treated as a wildcard to match to provider the best match.
Benefits of this approach:
- We have made what we are doing, built into the PostgreSQL machinery, so no special care needs to be done by packagers or users.
It makes us not a special snowflake.
- It will help other extensions which have similar issues and just want a single upgrade script. We expect many extensions to benefit, though we will benefit the most.
Drawbacks:
- Old PostgreSQL versions, won't understand this new world order, so we still need to maintain a backward compatible way for 5 years or more.
SOLUTION 3
The 0-byte file and strip model. This approach creates 0 or very small byte files, with their existense solely to maintain PostgreSQL extension chain model. To reduce file bloat, we will have <version-minor>--<version-minor-curren>MAX files. Anyone who is coming from a prior minor will have a file such as postgis--3.2.1--3.2MAX version OR use the postgis_extensions_upgrade() function.
In this model, our extension files look like below:
our generated extension upgrade scripts will look like the below, with only the postgis--3.3MAX--3.3.0.sql having any upgrade statements in it.
: postgis--3.1MAX--3.3MAX.sql postgis--3.1.5--3.3MAX.sql postgis--3.2MAX--3.3MAX.sql postgis--3.3MAX--3.3.0.sql postgis--3.3.0--3.3MAX.sql
Benefits of this approach:
- We are staying between the lines. Respecting known practices, but still having much lighter weight scripts.
- A little less clutter, but we are relying on the user having the old version still installed so that they have a chain
postgis--<old-version-minor-micro>--<old-version-minor>MAX.sql
- A little little easier release cycle if we don't need to maintain extensions/updateable.mk and remember to update it.
- No change in upgrade extension workflow for packagers (or users who think of PostGIS as any other extension).
- It will work for all PostgreSQL versions supported
Downsides:
- We still have some clutter of files, and without addendum, we have just as much clutter of files as before.
- With addendum, it is not possible for a user to upgrade using ALTER EXTENSION from a version we don't ship a script for. They would have to use and that assumes this version -- has the ANY/MAX hack.
SELECT postgis_extensions_upgrade();
- It is not possible to upgrade from version (prior PostgreSQL version) 3.3.0 to (newer PostgreSQL version) 3.3.0 which is needed often in pg_upgrade to gain the benefits of new features in latest PostgreSQL version that are not exposed in older PostgreSQL versions. Such operation would just state "You are already at latest version" and not do anything, unless one does it via
SELECT postgis_extensions_upgrade();
which would have SOLUTION 1.