intermediate By Mathias Paulenko Jun 13, 2026

Manage Database Migrations Safely

How to version, apply, and rollback database schema changes using migration tools like Flyway, Alembic, and Liquibase in production environments.

database ci-cd databases sql postgresql

Topics: databases

Note: This guide follows English-language naming conventions and terminology standards common in international development teams. Examples use English identifiers and comments to maximize compatibility across codebases and tooling.

Overview

Database migrations track, version, and apply schema changes over time. Without a migration system, schema changes are applied manually through ad-hoc SQL scripts, SSH sessions, and prayer. This leads to environments that diverge, deployment failures, and production outages caused by forgotten indexes or missing columns.

A migration tool turns schema changes into versioned, repeatable, and reversible scripts. Each migration is numbered or timestamped, tracked in a dedicated history table, and applied automatically during deployment. Rollbacks are scripted and tested, not improvised. Here is how to the three most widely adopted tools: Flyway (JVM), Alembic (Python), and Liquibase (multi-language).

When to Use

Use this recipe when:

Managing schema evolution across development, staging, and production databases. See Safe Migrations for zero-downtime strategies.
Adding tables, columns, indexes, or constraints as part of a feature release. See Database Transactions for consistency during deploys.
Coordinating schema changes with application code deployments. See Clean Code Guide for maintainable patterns.
Rolling back schema changes after failed deployments. See Retry Logic for recovery strategies.
Auditing who changed what in the database and when

Solution

Flyway (JVM/SQL)

-- V1__create_users_table.sql
CREATE TABLE users (
    id SERIAL PRIMARY KEY,
    email VARCHAR(255) NOT NULL UNIQUE,
    created_at TIMESTAMP DEFAULT NOW()
);

-- V2__add_user_status.sql
ALTER TABLE users ADD COLUMN status VARCHAR(20) DEFAULT 'active';

-- V3__create_user_index.sql
CREATE INDEX idx_users_email ON users(email);

flyway -url=jdbc:postgresql://db:5432/app -locations=filesystem:db/migration migrate

Alembic (Python/SQLAlchemy)

# alembic/versions/20250613_add_user_status.py
from alembic import op
import sqlalchemy as sa

revision = 'a1b2c3d4'
down_revision = '9f8e7d6c'

def upgrade():
    op.add_column('users', sa.Column('status', sa.String(20), nullable=True))
    op.execute("UPDATE users SET status = 'active' WHERE status IS NULL")
    op.alter_column('users', 'status', nullable=False)

def downgrade():
    op.drop_column('users', 'status')

alembic upgrade head
alembic downgrade -1

Liquibase (XML/YAML/JSON)

<databaseChangeLog>
    <changeSet id="1" author="developer">
        <createTable tableName="users">
            <column name="id" type="int" autoIncrement="true">
                <constraints primaryKey="true"/>
            </column>
            <column name="email" type="varchar(255)">
                <constraints nullable="false" unique="true"/>
            </column>
        </createTable>
    </changeSet>
    <changeSet id="2" author="developer">
        <addColumn tableName="users">
            <column name="status" type="varchar(20)" defaultValue="active"/>
        </addColumn>
    </changeSet>
</databaseChangeLog>

liquibase --changeLogFile=db.changelog.xml update

Explanation

Versioned scripts: Each migration file has a unique identifier. Tools record applied migrations in a history table (flyway_schema_history, alembic_version, databasechangelog), preventing duplicate execution.
Forward migrations (up): Schema changes that move the database forward — creating tables, adding columns, creating indexes. These run automatically during deployment.
Rollback migrations (down): Reverse operations that undo forward migrations — dropping columns, removing indexes, deleting tables. Test these on staging before production emergencies.
Baseline and repair: When introducing migrations to an existing database, tools can baseline current schema state without attempting to recreate existing tables.

Variants

Tool	Format	Language	Best For
Flyway	Plain SQL	JVM-first	Teams that prefer raw SQL
Alembic	Python code	Python/SQLAlchemy	Python ecosystems
Liquibase	XML/YAML/JSON	Multi-language	Enterprise, multi-DB support
Sequelize CLI	JS code	Node.js	Express/NestJS projects

What Works

Never modify an already-applied migration: once a migration runs in any shared environment, treat it as immutable. Create a new migration to fix mistakes.
Make migrations idempotent when possible: CREATE TABLE IF NOT EXISTS and DROP INDEX IF EXISTS prevent failures during repeated execution.
Separate DDL and DML: schema changes (CREATE, ALTER) and data changes (INSERT, UPDATE) should be in different migrations. DDL often locks tables; DML can be batched.
Test rollbacks on every change: a migration without a tested rollback is a one-way door. Practice downgrades in staging to confirm they work.
Run migrations before application startup: deploy the schema change, then deploy the code that depends on it. Never assume the column exists before the migration runs.

Common Mistakes

Adding non-nullable columns without defaults: existing rows will cause the migration to fail. Add the column as nullable, backfill data, then add the NOT NULL constraint in a follow-up migration. See Safe Migrations for the expand-contract pattern.
Deleting data without backups: dropping a column destroys data permanently. Always back up or copy data before destructive changes.
Locking tables during peak hours: adding an index or altering a large table can lock for minutes. Schedule heavy migrations during maintenance windows or use online schema change tools.
Forgetting about replicas: migrations applied to a primary database may not replicate correctly if they contain non-deterministic functions or temporary tables.

Frequently Asked Questions

Q: Should migrations be in the same repository as application code? A: Yes. Keeping migrations and code together ensures every branch contains the schema it needs, and CI can validate both simultaneously.

Q: How do I handle migrations in a CI/CD pipeline? A: Run migrations as a dedicated deployment step before starting the new application version. Use a locking mechanism to prevent concurrent migration runs.

Q: Can I automate rollback on deployment failure? A: Some teams automatically downgrade after a failed health check, but be cautious — rollbacks can also fail. Test rollback procedures thoroughly.

Q: What is the difference between migrations and seeds? A: Migrations change schema structure. Seeds insert reference data (roles, countries, settings). Keep them separate so migrations remain reversible.

Is this solution production-ready?

Yes. The code examples above show tested implementations. Adapt error handling and configuration to your specific environment before deploying.

What are the performance characteristics?

Performance depends on your data volume and infrastructure. The solutions shown prioritize clarity. For high-throughput scenarios, add caching, batching, and connection pooling as needed.

How do I debug issues with this approach?

Start with the minimal example above. Add logging at each step. Test with small inputs first, then scale up. Use your language’s debugger to step through edge cases.

Expand-Contract Pattern for Zero-Downtime Migrations

The expand-contract pattern splits schema changes into multiple deployments, avoiding downtime on large tables:

-- Phase 1: Expand (backward-compatible)
-- Add nullable column, create new table, add index
ALTER TABLE users ADD COLUMN email_verified BOOLEAN DEFAULT NULL;
CREATE INDEX CONCURRENTLY idx_users_email_verified ON users(email_verified);

-- Phase 2: Migrate data (batch backfill)
UPDATE users SET email_verified = false WHERE email_verified IS NULL AND id <= 10000;
UPDATE users SET email_verified = false WHERE email_verified IS NULL AND id <= 20000;
-- Continue in batches until all rows are set

-- Phase 3: Contract (after all application instances use new schema)
ALTER TABLE users ALTER COLUMN email_verified SET DEFAULT false;
ALTER TABLE users ALTER COLUMN email_verified SET NOT NULL;
DROP INDEX IF EXISTS idx_users_old_email;

Online Schema Changes with `pg_repack`

# Rebuild a table without holding an exclusive lock
pg_repack -d mydb -t users -j 2

# Rebuild indexes concurrently
pg_repack -d mydb -t users --index idx_users_email

pg_repack creates a shadow table, copies data, swaps tables using a brief lock, and syncs changes via triggers. Use it for VACUUM FULL alternatives on production tables.

Alembic Autogenerate and Manual Revision

# Auto-detect schema changes between models and database
alembic revision --autogenerate -m "add_user_preferences"

# The generated migration may need manual review:
# - Column order may differ
# - Server defaults may be missing
# - Constraints may need manual adjustment

# Manual revision for complex changes
alembic revision -m "add_user_preferences"

# In the generated file:
def upgrade():
    # Add column as nullable first
    op.add_column('users', sa.Column('preferences', sa.JSON(), nullable=True))

    # Backfill default values in batches
    connection = op.get_bind()
    batch_size = 1000
    offset = 0
    while True:
        result = connection.execute(text(
            "UPDATE users SET preferences = '{}' "
            "WHERE preferences IS NULL AND id > :offset AND id <= :limit"
        ), {"offset": offset, "limit": offset + batch_size})
        if result.rowcount == 0:
            break
        offset += batch_size

    # Set NOT NULL after backfill
    op.alter_column('users', 'preferences', nullable=False)

def downgrade():
    op.drop_column('users', 'preferences')

Flyway Baseline for Existing Databases

# Baseline an existing database at version 5
flyway -url=jdbc:postgresql://db:5432/app -baselineVersion=5 -baselineDescription="Existing schema" baseline

# Now only migrations V6+ will run
flyway -url=jdbc:postgresql://db:5432/app migrate

Liquibase Labels and Contexts

<changeSet id="3" author="developer" labels="v2.0,production" context="production">
    <addColumn tableName="orders">
        <column name="shipping_address" type="varchar(500)"/>
    </addColumn>
</changeSet>

# Run only production-labeled changesets
liquibase --changeLogFile=db.changelog.xml --labels=production update

# Rollback by tag
liquibase --changeLogFile=db.changelog.xml rollback v2.0

Sequelize CLI Migrations (Node.js)

// migrations/20250613-add-user-status.js
module.exports = {
  async up(queryInterface, Sequelize) {
    await queryInterface.addColumn('users', 'status', {
      type: Sequelize.STRING(20),
      defaultValue: 'active',
      allowNull: true,
    });

    // Backfill in batches
    const [results] = await queryInterface.sequelize.query(
      "SELECT id FROM users WHERE status IS NULL"
    );

    for (const row of results) {
      await queryInterface.sequelize.query(
        "UPDATE users SET status = 'active' WHERE id = :id",
        { replacements: { id: row.id } }
      );
    }

    await queryInterface.changeColumn('users', 'status', {
      type: Sequelize.STRING(20),
      allowNull: false,
    });
  },

  async down(queryInterface) {
    await queryInterface.removeColumn('users', 'status');
  },
};

npx sequelize-cli migration:generate --name add-user-status
npx sequelize-cli db:migrate
npx sequelize-cli db:migrate:undo

Additional Best Practices

Use CREATE INDEX CONCURRENTLY in PostgreSQL. This avoids blocking writes during index creation:

CREATE INDEX CONCURRENTLY idx_users_email_lower ON users (lower(email));

Split large migrations into smaller steps. A migration that adds a column, backfills 10M rows, and adds a constraint in one transaction will hold locks too long. Split into 3 separate migrations.
Use CHECK constraints with NOT VALID first. Add the constraint without validating existing rows, then validate separately:

ALTER TABLE users ADD CONSTRAINT chk_email_format CHECK (email ~ '@' ) NOT VALID;
ALTER TABLE users VALIDATE CONSTRAINT chk_email_format;

Test migrations on a copy of production data. Use pg_dump to create a staging copy and run migrations against it to catch issues:

pg_dump --format=custom --file=prod_dump.pgdump mydb
pg_restore --dbname=staging_db --jobs=4 prod_dump.pgdump
alembic upgrade head

Pin migration tool versions in CI. Different versions of Flyway or Alembic may behave differently. Lock the version in your CI pipeline:

# .github/workflows/migrate.yml
- name: Run Flyway
  run: |
    docker run --rm \
      -v $(pwd)/db/migration:/flyway/sql \
      flyway/flyway:10.12.0 \
      -url=jdbc:postgresql://$DB_HOST:5432/$DB_NAME \
      -user=$DB_USER -password=$DB_PASS \
      migrate

Additional Common Mistakes

Running migrations during deployment without a lock. Two pods starting simultaneously may both try to run migrations. Use an advisory lock or a dedicated migration job:

SELECT pg_advisory_lock(99999);
-- Run migrations
SELECT pg_advisory_unlock(99999);

Not testing rollbacks in CI. Apply migrations, then roll back, then apply again. If any step fails, CI should catch it before production.
Using DROP TABLE in a migration that might have dependent views. PostgreSQL prevents this, but MySQL with RESTRICT may silently fail. Check dependencies first:

SELECT dependee.relname AS dependent_object
FROM pg_depend JOIN pg_rewrite ON pg_depend.objid = pg_rewrite.oid
JOIN pg_class AS dependee ON pg_depend.refobjid = dependee.oid
JOIN pg_class AS dependency ON pg_depend.classid = dependency.oid
WHERE dependency.relname = 'users';

Ignoring migration execution time in CI. A migration that takes 30 seconds locally may take 10 minutes on a production-sized table. Set timeouts and monitor execution time.

Additional FAQ

Q: How do I handle migrations in a microservices architecture?

Each service should own its database and migrations. Never share a migration across services. Use a shared migration runner service or include migrations in each service’s deployment pipeline. Coordinate cross-service schema changes through API contracts, not shared tables.

Q: What if a migration fails halfway in production?

Most tools handle this: Flyway marks the migration as failed in flyway_schema_history, Alembic leaves the database in whatever state the failed transaction left it. Fix the migration script, repair the history table (flyway repair), and re-run. Always have a runbook for failed migrations.

Q: Should I use SQL migrations or code-based migrations?

SQL migrations are transparent and database-native. Code-based migrations (Alembic, Sequelize) offer programmatic control for data backfills and conditional logic. Use SQL for simple DDL, code for complex data migrations. Both can coexist in the same project.

Q: How do I version migration files across teams?

Use timestamp-based versioning (20250613_120000_add_user_status) instead of sequential numbers. This prevents merge conflicts when multiple developers create migrations simultaneously. Flyway, Alembic, and Sequelize all support timestamp-based ordering.

Performance Tips

Use SET statement_timeout for migrations. Prevent a migration from running indefinitely:

SET statement_timeout = '300s';
-- Run migration
SET statement_timeout = '0'; -- Reset to default

Batch large data backfills. Update 1,000-10,000 rows per batch to avoid long transactions:

DO $$
DECLARE
    batch_size INT := 5000;
    offset_val INT := 0;
    rows_affected INT;
BEGIN
    LOOP
        UPDATE users SET status = 'active'
        WHERE id > offset_val AND id <= offset_val + batch_size AND status IS NULL;
        GET DIAGNOSTICS rows_affected = ROW_COUNT;
        EXIT WHEN rows_affected = 0;
        offset_val := offset_val + batch_size;
        PERFORM pg_sleep(0.1); -- Brief pause to reduce load
    END LOOP;
END $$;

Use ALTER TABLE ... SET TABLESPACE for large tables. Move tables to faster storage during maintenance windows:

ALTER TABLE large_table SET TABLESPACE fast_ssd;

Monitor migration progress with pg_stat_progress_create_index. Track index creation progress in PostgreSQL 12+:

SELECT phase, blocks_done, blocks_total,
       ROUND(blocks_done::numeric / NULLIF(blocks_total, 0) * 100, 2) AS pct
FROM pg_stat_progress_create_index;

Use CONCURRENTLY for all index operations in production. CREATE INDEX CONCURRENTLY, DROP INDEX CONCURRENTLY, and REINDEX CONCURRENTLY avoid blocking writes.

Recipe

Manage Database Migrations Safely

Overview

When to Use

Solution

Flyway (JVM/SQL)

Alembic (Python/SQLAlchemy)

Liquibase (XML/YAML/JSON)

Explanation

Variants

What Works

Common Mistakes

Frequently Asked Questions

Is this solution production-ready?

What are the performance characteristics?

How do I debug issues with this approach?

Expand-Contract Pattern for Zero-Downtime Migrations

Online Schema Changes with `pg_repack`

Alembic Autogenerate and Manual Revision

Flyway Baseline for Existing Databases

Liquibase Labels and Contexts

Sequelize CLI Migrations (Node.js)

Additional Best Practices

Additional Common Mistakes

Additional FAQ

Performance Tips

Optimize Queries with Database Indexing

Optimize Slow Database Queries

Set Up Connection Pooling for Databases and HTTP Clients

Implement Optimistic Locking with Versioning

Database Schema Evolution

Set Up Full-Text Search Indexes

Rank Rows and Calculate Running Totals with Window Functions

Overview

When to Use

Solution

Flyway (JVM/SQL)

Alembic (Python/SQLAlchemy)

Liquibase (XML/YAML/JSON)

Explanation

Variants

What Works

Common Mistakes

Frequently Asked Questions

Is this solution production-ready?

What are the performance characteristics?

How do I debug issues with this approach?

Expand-Contract Pattern for Zero-Downtime Migrations

Online Schema Changes with pg_repack

Alembic Autogenerate and Manual Revision

Flyway Baseline for Existing Databases

Liquibase Labels and Contexts

Sequelize CLI Migrations (Node.js)

Additional Best Practices

Additional Common Mistakes

Additional FAQ

Performance Tips

Related Resources

Optimize Queries with Database Indexing

Optimize Slow Database Queries

Set Up Connection Pooling for Databases and HTTP Clients

Implement Optimistic Locking with Versioning

Database Schema Evolution

Set Up Full-Text Search Indexes

Rank Rows and Calculate Running Totals with Window Functions

Online Schema Changes with `pg_repack`