Changing tires at 100mph: A guide to zero downtime migrations

May 4, 2022

As a backend developer at a mobile app company, a common task was migrating a database schema. This could be to improve query performance, change column names/types, or adapt data to new use cases. While this may seem like a straightforward set of SQL commands, it becomes a complex choreographed dance to be achieved with zero downtime.

The steps are as follows:

Create the new empty table
Write to both old and new table
Copy data (in chunks) from old to new
Validate consistency
Switch reads to new table
Stop writes to the old table
Cleanup old table

This guide will go through the step-by-step process of migrating tables in PostgreSQL. While the examples are for a PostgreSQL table migration, the same steps can apply to almost any migration.

Background

Existing schema

Let’s suppose we have an existing schema with a table named old, with an API that runs CRUD operations against the table.

Old

CREATE EXTENSION IF NOT EXISTS "uuid-ossp";

CREATE TABLE IF NOT EXISTS old (
    old_id uuid PRIMARY KEY DEFAULT uuid_generate_v4(),
    data TEXT NOT NULL
);

-- Create
INSERT INTO old (data)
VALUES (?)
RETURNING *;

-- Read
SELECT *
FROM old
WHERE old_id = ?;

-- Update
UPDATE old
SET data = ?
WHERE old_id = ?;

-- Delete
DELETE
FROM old
WHERE old_id = ?;

Desired schema

The data column was type TEXT for flexibility. It is now used exclusively for timestamps. We then get a request from Product on a hot codepath to count all entries between 2 timestamps. While this is possible with the current schema, we decided a better approach would be to update data to be of type TIMESTAMP. In addition, old is no longer an accurate name, and that new would be a lot better.

Our desired schema is:

New schema

CREATE EXTENSION IF NOT EXISTS "uuid-ossp";

CREATE TABLE IF NOT EXISTS new (
    new_id uuid PRIMARY KEY DEFAULT uuid_generate_v4(),
    created_date TIMESTAMP WITH TIME ZONE NOT NULL
);

Migration requirements

We can further specify the requirements through the migration:

The system must fully respond to requests throughout the migration process
No action can take a write lock against a significant percentage of the table
No unsafe operations
We must be able to roll back any changes to the previous step if we encounter issues

Procedure

Create a new, empty table

Create a new table

CREATE TABLE IF NOT EXISTS new (
    new_id uuid PRIMARY KEY DEFAULT uuid_generate_v4(),
    created_date TIMESTAMP WITH TIME ZONE NOT NULL
);

Write to both tables

Now that we have two tables, we write to both simultaneously. While the old table remains the source of truth, we are setting ourselves up to be eventually consistent.

Write to both tables

-- Create
WITH new_rows AS (
    INSERT INTO new (created_date)
    VALUES (?)
    RETURNING *
)
INSERT
INTO old (old_id, data)
SELECT new_id, CAST(created_date AS TEXT)
FROM new_rows
RETURNING *;

-- Update
UPDATE old
SET data = ?
WHERE old_id = ?;

UPDATE new
SET created_date = ?
WHERE new_id = ?;


-- Delete
DELETE
FROM old
WHERE old_id = ?;

DELETE
FROM new
WHERE new_id = ?;

Note that our create operation appears slightly more complex than before. We are creating a row in the new table, then using the record to populate the values of the old table. This is all done in a single transaction to ensure our randomly generated UUIDs are in sync.

Copy data to new table

Once we know that all new records will be replicated, we can start copy old records.

Replicate data between tables

INSERT INTO new(new_id, created_date)
SELECT old_id, CAST(data AS TIMESTAMP)
FROM OLD
WHERE NOT EXISTS(SELECT *
                 FROM new
                 WHERE new_id = OLD.old_id)
LIMIT 1000
RETURNING *;

We are inserting values into the new table from the old table that don’t yet exist in new. To keep the database responsive, we split the operation using limit 1000. While chunk size can be tuned up or down depending on the table, I prefer smaller chunks to avoid large write locks.

Validate consistency

The often overlooked step. Before we switch over the reads, we should ensure that our data is fully in sync between tables. Here are a few sample queries to validate consistency between old and new.

Are we missing any records?

SELECT *
FROM old
    FULL OUTER JOIN new ON old_id = new_id
WHERE new_id IS NULL
   OR old_id IS NULL

Is any data inconsistent?

SELECT *
FROM old
    INNER JOIN new ON old_id = new_id
WHERE CAST(data AS TIMESTAMP) <> created_date

Switch reads

This is usually the most burdensome step. There are often dozens of different codepaths reading from the table. Since that data is in sync between tables, we can take our time with this part of the migration.

Switch reads

SELECT *
FROM new
WHERE new_id = ?;

This stage is where we’d update any views, foreign keys, triggers, etc to reference the new table.

Stop writes

Now that we’ve switched all reads over to the new system, we no longer need to update the old database.

Drop writes

-- Create
INSERT INTO new (created_date)
VALUES (?)
RETURNING *

-- Update
UPDATE new
SET created_date = ?
WHERE new_id = ?;

-- Delete
DELETE
FROM new
WHERE new_id = ?;

Cleanup table

When we’re confident that our system no longer references the old table, we can drop it.

New schema

DROP TABLE IF EXISTS old;

Done

Congratulations! Migration Complete!

It's done GIF

Now the most challenging part: explaining to Product why their seemingly small request took 3x longer than expected. If it helps you can send them this article. Good luck!