Tech Bytes

Practical MySQL Performance Tuning

2026-04-26T00:00:00+00:00

Performance tuning matters because users feel database latency directly at the application layer.

In OLTP systems, slow checkout flows and query latency affect revenue directly. In batch-oriented systems, the tuning goal shifts toward predictable throughput and completing work inside the available processing window. Good tuning starts with the workload, not with random parameter changes.

This article condenses the practical approach from the source material into one workflow:

Set realistic baseline expectations
Size the server for the workload
Tune the MySQL configuration for the deployment type
Identify bottlenecks before changing SQL
Use built-in MySQL tools to validate changes
Review index design and maintenance choices

Start with the Right Goal

Do not start by tuning the loudest query you find.

Start by answering these questions:

What performance does the application need?
Where does the workload spend time today?
Which resource becomes the bottleneck first: CPU, memory, disk, or SQL design?
What does good enough look like for this system?

That framing matters because a query that runs slowly in isolation may not be the real problem. The real issue may come from memory pressure, I/O waits, missing indexes, or a database layout that does not match the workload.

Size the Server for the Workload

Database performance depends on hardware resources and configuration together.

CPU

CPU throughput affects concurrency, parsing, and execution speed. This guide uses systems ranging from a handful of cores to larger deployments with many more. The right CPU choice depends on the transaction rate, query complexity, and the amount of parallel work the application generates.

Memory

Memory drives cache efficiency. A larger InnoDB buffer pool keeps more of the working set in memory, which reduces disk reads and improves response time for repeated access patterns.

Disk

Disk performance becomes critical in write-heavy systems and workloads with frequent modifications. SSDs or other high-throughput storage outperform spinning disks for most transactional systems.

If the workload modifies data heavily, storage latency quickly becomes visible in user-facing response time.

Tune the Core Database Settings

MySQL defaults target small or moderate deployments. Larger or latency-sensitive systems need a deliberate configuration review.

`innodb_dedicated_server`

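Use innodb_dedicated_server only on hosts that exist primarily for MySQL. When enabled, MySQL sizes the buffer pool and redo log capacity automatically from the memory it detects on the host. An explicit value for either setting overrides the automatic one.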

[mysqld]
innodb_dedicated_server=ON

Check the current value:

SHOW VARIABLES LIKE 'innodb_dedicated_server';

`innodb_buffer_pool_size`

InnoDB buffer pool tuning ranks among the most important performance settings. A larger buffer pool keeps data and index pages in memory and reduces physical I/O.

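In practice, this guide recommends assigning a large share of server memory to the buffer pool on dedicated database hosts. A common starting point is 50 to 75 percent of physical memory, which leaves headroom for the operating system and per-connection buffers. The 10G value below is an example, not a recommendation.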

[mysqld]
innodb_buffer_pool_size=10G

Verify it with:

SHOW VARIABLES LIKE 'innodb_buffer_pool_size';
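
As a rough sizing aid, the share-of-memory rule can be written as arithmetic. The sketch below assumes a 70 percent share as its default, which is only an illustrative midpoint, and suggest_buffer_pool_mb is a hypothetical helper name, not part of MySQL.

```shell
# Suggest an innodb_buffer_pool_size in MB from total RAM in MB.
# The 70 percent default is an assumption for a dedicated host.
suggest_buffer_pool_mb() {
  total_mb=$1
  percent=${2:-70}
  echo $(( total_mb * percent / 100 ))
}

# Example: a host with 16 GB of RAM
suggest_buffer_pool_mb 16384      # prints 11468
```

On Linux, the total memory in MB is the second column of the Mem: line in free -m output. Treat the result as a starting value and confirm it against observed memory pressure.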

`innodb_buffer_pool_instances`

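Multiple buffer pool instances can reduce contention on busy systems. The setting takes effect only when the buffer pool is 1GB or larger, and the MySQL documentation advises sizing the pool so that each instance holds at least 1GB.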

[mysqld]
innodb_buffer_pool_instances=24

The exact value depends on the deployment size and workload shape. Start with a sensible baseline and monitor the result.
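
One interaction is easy to miss: MySQL keeps innodb_buffer_pool_size at a multiple of innodb_buffer_pool_chunk_size times innodb_buffer_pool_instances, and it rounds a non-conforming value up. The sketch below reproduces that arithmetic, assuming the default 128M chunk size; effective_pool_mb is a hypothetical helper name.

```shell
# Effective buffer pool size in MB after MySQL rounds the request up to a
# multiple of (chunk size * instances). Chunk size defaults to 128 MB.
effective_pool_mb() {
  requested_mb=$1
  instances=$2
  chunk_mb=${3:-128}
  unit=$(( chunk_mb * instances ))
  echo $(( (requested_mb + unit - 1) / unit * unit ))
}

effective_pool_mb 10240 24   # prints 12288: a 10G request becomes 12G
effective_pool_mb 10240 8    # prints 10240: already a multiple of 1024
```

With the example values of 10G and 24 instances, the server would therefore run with a 12G pool. Compare the configured value with SHOW VARIABLES output after a restart to see what MySQL actually applied.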

`innodb_log_buffer_size`

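The log buffer holds redo records in memory before InnoDB writes them to the redo log. A larger buffer lets large transactions run without writing redo to disk before commit, which helps workloads that modify many rows per transaction.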

[mysqld]
innodb_log_buffer_size=48M

`innodb_flush_log_at_trx_commit`

This setting controls how aggressively InnoDB flushes log records at commit time.

[mysqld]
innodb_flush_log_at_trx_commit=1

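Keep the value at 1 for the strongest durability behavior. Values 0 and 2 reduce flush overhead, but a crash can then lose up to about one second of committed transactions.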

`innodb_flush_method`

innodb_flush_method controls how MySQL flushes data to disk.

SHOW VARIABLES LIKE 'innodb_flush_method';

Linux and Unix deployments commonly use fsync. Some fast local storage systems perform better with O_DIRECT, which avoids extra buffering overhead.

`innodb_file_per_table`

Use file-per-table tablespaces for most modern deployments.

[mysqld]
innodb_file_per_table=ON

That setting stores each table in its own .ibd file and simplifies some maintenance operations.

`innodb_redo_log_capacity`

In MySQL 8.0.30 and later, innodb_redo_log_capacity replaces the older redo file sizing model based on innodb_log_file_size and innodb_log_files_in_group.

[mysqld]
innodb_redo_log_capacity=32G

`sort_buffer_size` and `join_buffer_size`

These buffers matter when the optimizer must sort or join without an efficient index path.

MySQL allocates them per session, and one join buffer per qualifying join, so a large global value multiplies across connections. Raise them only for the sessions or queries that need it; indexing still provides the better fix.

SHOW VARIABLES LIKE 'sort_buffer_size';
SHOW VARIABLES LIKE 'join_buffer_size';
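
Because these buffers are allocated per session, it helps to estimate what a setting costs at full connection count. The sketch below is a rough upper bound under the assumption that every connection holds one sort, one join, and one read buffer at once; real usage is usually lower, and a query with several index-less joins can hold more than one join buffer. The helper name session_buffers_mb is hypothetical.

```shell
# Rough upper bound, in MB, for per-session buffers if every connection
# allocated one sort, one join, and one read buffer at the same time.
# Sizes are in KB; the defaults below are the MySQL 8.0 defaults.
session_buffers_mb() {
  connections=$1
  sort_kb=${2:-256}
  join_kb=${3:-256}
  read_kb=${4:-128}
  echo $(( connections * (sort_kb + join_kb + read_kb) / 1024 ))
}

session_buffers_mb 500              # prints 312
session_buffers_mb 500 8192 8192    # prints 8062
```

The second call shows why raising both buffers to 8M globally is expensive with max_connections=500: the worst case grows from about 0.3GB to about 8GB.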

`read_buffer_size`

This setting matters less for InnoDB than for MyISAM, but this guide includes it as part of the broader tuning review.

SHOW VARIABLES LIKE 'read_buffer_size';

Use a Practical Baseline Configuration

This guide gives two example configurations: one for dedicated servers and one for systems that do not dedicate all resources to MySQL.

On dedicated MySQL servers, allocate resources so InnoDB can use the machine effectively.

Example baseline:

[mysqld]
innodb_dedicated_server=1
innodb_buffer_pool_instances=24
innodb_log_buffer_size=48M
innodb_file_per_table=1
max_connections=500
slow-query-log=1
slow_query_log_file=/var/log/slow_query.log

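For a non-dedicated system, set the memory explicitly. MySQL rounds the buffer pool size up to a multiple of the chunk size times the instance count, so with 24 instances and the default 128M chunk size a 10G request becomes 12G; lower the instance count if that extra memory matters: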

[mysqld]
innodb_buffer_pool_size=10G
innodb_buffer_pool_instances=24
innodb_redo_log_capacity=32G
innodb_log_buffer_size=48M
innodb_file_per_table=1
max_connections=500
slow-query-log=1
slow_query_log_file=/var/log/slow_query.log

Analyze Bottlenecks Before Changing SQL

Performance issues usually span the OS, the database, and the application. If you only inspect SQL, you may miss the real bottleneck.

Check CPU

Use top or similar tools to watch CPU saturation, run queue pressure, and the percentage of idle time.

Check Memory

Use free -g or free -gt to look for swap activity and low available memory.

Check I/O

Use iostat or similar tools to find disk bottlenecks, write pressure, and elevated I/O wait.

Example commands:

top
free -gt
iostat 2 3

Focus on the resource that limits the system first; the command matters less than the signal it exposes.
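
When you want the swap signal as a single number for a script or alert, it can be read from the kernel directly. This is a minimal sketch that assumes a Linux host exposing /proc/meminfo; swap_used_kb is a hypothetical helper name.

```shell
# Print swap in use (KB) from /proc/meminfo-style text on stdin.
swap_used_kb() {
  awk '/^SwapTotal:/ { total = $2 }
       /^SwapFree:/  { avail = $2 }
       END { print total - avail }'
}

# Linux usage: swap_used_kb < /proc/meminfo
printf 'SwapTotal: 2097148 kB\nSwapFree: 2000000 kB\n' | swap_used_kb   # prints 97148
```

A value that keeps growing on a database host usually means the buffer pool and per-session buffers together exceed available memory.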

Turn on Slow Query Logging

MySQL can record the statements that run longer than a chosen threshold.

[mysqld]
slow-query-log=1
slow_query_log_file=/var/log/slow_query.log
long_query_time=1

The slow query log gives you a practical starting point for workload analysis.

Do not assume that every query in the log deserves tuning. Use it as a signal, then validate with execution plans, row counts, and access patterns.
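
For a first pass over the log, counting entries above a stricter threshold shows whether the problem is a few very slow statements or many moderately slow ones. The sketch below relies on the "# Query_time:" header line that MySQL writes for each entry; slow_entries_over is a hypothetical helper name.

```shell
# Count slow-log entries at or above a threshold (seconds), reading the
# log text from stdin. Relies on the "# Query_time:" header line.
slow_entries_over() {
  awk -v t="$1" '/^# Query_time:/ { if ($3 + 0 >= t + 0) n++ } END { print n + 0 }'
}

# Usage: slow_entries_over 5 < /var/log/slow_query.log
printf '%s\n' \
  '# Query_time: 7.200000  Lock_time: 0.000100 Rows_sent: 1  Rows_examined: 900000' \
  'SELECT 1;' \
  '# Query_time: 1.300000  Lock_time: 0.000000 Rows_sent: 10  Rows_examined: 10' \
  'SELECT 2;' | slow_entries_over 5     # prints 1
```

For grouped summaries by statement pattern, the mysqldumpslow utility that ships with MySQL is the more complete tool.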

Use Performance Schema for Deeper Analysis

Performance Schema ships with MySQL and stores runtime metrics, waits, locks, and statement history.

Verify that MySQL enables it:

SHOW VARIABLES LIKE 'performance_schema';

Then inspect the available tables:

SELECT TABLE_NAME, TABLE_ROWS
FROM INFORMATION_SCHEMA.TABLES
WHERE TABLE_SCHEMA = 'performance_schema';

The useful tables include:

events_statements_summary_by_digest
events_waits_summary_global_by_event_name
data_locks
metadata_locks
threads
table_io_waits_summary_by_table

That schema gives you the raw material for spotting lock contention, hot statements, and wait behavior.
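
As one example of using that raw material, the statement digest table can rank statements by total time consumed. The sketch below only prints the query so that you can review it before piping it to the client; top_digests_sql is a hypothetical helper name, and the column aliases are illustrative.

```shell
# Print a query that ranks statement digests by total execution time.
# SUM_TIMER_WAIT is in picoseconds, hence the division by 1e12.
top_digests_sql() {
  limit=${1:-10}
  printf '%s\n' \
    "SELECT SCHEMA_NAME," \
    "       LEFT(DIGEST_TEXT, 80) AS digest_sample," \
    "       COUNT_STAR AS executions," \
    "       ROUND(SUM_TIMER_WAIT / 1e12, 3) AS total_seconds," \
    "       SUM_ROWS_EXAMINED AS rows_examined" \
    "FROM performance_schema.events_statements_summary_by_digest" \
    "ORDER BY SUM_TIMER_WAIT DESC" \
    "LIMIT ${limit};"
}

# Usage: top_digests_sql 5 | mysql -u root -p
```

Ranking by total time rather than by single-execution latency surfaces cheap statements that run very often, which the slow query log tends to miss.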

Use Maintenance Tools Carefully

This guide treats maintenance tools as targeted utilities, not as blanket fixes.

Run maintenance tools during non-peak hours because some operations take locks or pause writes.

`ANALYZE TABLE`

ANALYZE TABLE refreshes statistics that the optimizer uses to choose access paths.

ANALYZE TABLE EMPLOYEE1;

Use it after large DML changes or when execution plans no longer match reality.

`OPTIMIZE TABLE`

OPTIMIZE TABLE reorganizes physical storage and can reclaim space in some cases.

OPTIMIZE TABLE EMPLOYEE1;

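For InnoDB tables, MySQL maps OPTIMIZE TABLE to a table rebuild followed by an analyze step, and the command output reports "Table does not support optimize, doing recreate + analyze instead". That message is expected behavior, not an error.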

`CHECK TABLE`

CHECK TABLE helps validate table and index integrity.

CHECK TABLE EMPLOYEE1;

Use it when you suspect corruption, compatibility issues, or index problems.

Review Table Statistics

Performance work often depends on understanding table size, row estimates, and index footprint.

This guide uses information_schema.INNODB_TABLESTATS to inspect statistics for a table:

SELECT *
FROM information_schema.INNODB_TABLESTATS
WHERE NAME='test/EMPLOYEE1'\G

Useful fields include:

TABLE_ID
NAME
STATS_INITIALIZED
NUM_ROWS
CLUST_INDEX_SIZE
OTHER_INDEX_SIZE
MODIFIED_COUNTER
AUTOINC
REF_COUNT

This information helps you understand whether MySQL has collected usable statistics and how much storage the clustered and secondary indexes consume.
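
CLUST_INDEX_SIZE and OTHER_INDEX_SIZE are reported in pages, not bytes, so a small conversion makes them easier to read. The sketch below assumes the default 16KB innodb_page_size; pages_to_mb is a hypothetical helper name.

```shell
# Convert an index size in pages to MB. Page size defaults to 16 KB,
# the default innodb_page_size.
pages_to_mb() {
  pages=$1
  page_kb=${2:-16}
  echo $(( pages * page_kb / 1024 ))
}

pages_to_mb 6400      # prints 100
```

If the server uses a non-default page size, pass it as the second argument after checking SHOW VARIABLES LIKE 'innodb_page_size'.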

Index Design Still Matters

Good tuning usually comes back to indexes.

An index can reduce scans, shorten response time, and eliminate expensive sorts or joins. When you cannot add a useful index, buffer settings may help temporarily, but the index problem remains.

Add a Non-Unique Index

ALTER TABLE tablename ADD INDEX (colname);
CREATE INDEX indexname ON tablename (colname);

Add a Unique Index

ALTER TABLE tablename ADD UNIQUE (colname);
CREATE UNIQUE INDEX indexname ON tablename (colname);

Add a Primary Key

Use a primary key constraint instead of CREATE INDEX.

ALTER TABLE tablename ADD PRIMARY KEY (col1, col2);

Add a Functional Index

ALTER TABLE tablename ADD INDEX ((func(colname)));
CREATE INDEX indexname ON tablename ((func(colname)));

Drop an Index

ALTER TABLE tablename DROP INDEX indexname;
DROP INDEX indexname ON tablename;

Practical Tuning Flow

If you need a repeatable tuning sequence, use this order:

Measure the workload and capture the symptom.
Check CPU, memory, and storage behavior on the host.
Enable or review the slow query log.
Inspect Performance Schema for waits, locks, and hot statements.
Refresh table statistics with ANALYZE TABLE when needed.
Review indexes before raising buffer sizes.
Only then adjust configuration or SQL.

That order prevents guesswork and keeps changes tied to observed behavior.

Final Takeaway

Treat performance tuning as a process that matches hardware, configuration, indexes, and workload behavior to application needs.

If you size the server correctly, configure InnoDB intentionally, watch the right bottlenecks, and validate query plans and index choices, you will solve more performance problems than you would by chasing one slow statement at a time.

Practical MySQL Backup Utilities: mysqldump, mydumper, and XtraBackup

2026-04-12T00:00:00+00:00

This article pulls the source material into one operational guide instead of splitting it into a series.

Production MySQL teams often start serious backup planning only after the first restore request arrives. Start earlier. Build a usable backup process around three elements: a tool that fits the workload, a repeatable command pattern, and a restore procedure you have already verified.

This guide walks through three practical backup paths covered in the source material:

mysqldump for straightforward logical exports
mydumper and myloader for faster multithreaded logical backup and restore
Percona XtraBackup for hot physical backup and incremental recovery workflows

1. Use `mysqldump` for Simple Logical Exports

mysqldump remains the easiest way to capture MySQL objects as SQL statements that you can review, store, and replay later.

Many environments still benefit from mysqldump when the job calls for a simple export of one database, a few tables, or schema-only metadata.

Single Database Backup

mysqldump -u root -p employees > single_db_bk_employees.sql

Multiple Databases

mysqldump --databases -u root -p db2 db3 employees > multiple_db_bk.sql

All Databases

mysqldump --all-databases -u root -p > alldbs.sql

Schema-Only Backup

Use --no-data when you need DDL without row data.

mysqldump -u root -p --no-data employees > employees_metadata.sql

Single Table Backup

mysqldump -u root -p db1 tab1 > db1_tab1_table.sql

Table Schema Only

mysqldump -u root -p --no-data db1 tab1 > db1_emp_table_metadata.sql

Table Data Only

Use --no-create-info when the target schema already exists and you only want row data.

mysqldump -u root -p --no-create-info db1 tab1 > db1_emp_data.sql

Exclude a Table

mysqldump -u root -p db1 --ignore-table=db1.emp > db1_wo_emp_table.sql

Compress During Backup

mysqldump -u root -p db1 | gzip > db1_gzip_compressed.sql.gz

Add a Timestamp to the Output File

mysqldump -u root -p db1 > db1-$(date +%Y%m%d).sql
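
Compression and timestamps combine naturally, but a pipeline reports the exit status of gzip, so a failed mysqldump can still leave a valid-looking archive. The sketch below names the file and then checks both archive integrity and the "-- Dump completed" trailer that mysqldump writes when it finishes. Both helper names are hypothetical, and the trailer check assumes the dump was not taken with --skip-comments or --compact.

```shell
# Build a dated, compressed dump file name such as db1-20260426.sql.gz.
dump_file_name() {
  printf '%s-%s.sql.gz\n' "$1" "$(date +%Y%m%d)"
}

# Succeed only if the archive is intact and ends with the trailer that
# mysqldump writes when it finishes (absent with --skip-comments).
dump_is_complete() {
  gzip -t "$1" 2>/dev/null || return 1
  gzip -dc "$1" | tail -n 1 | grep -q '^-- Dump completed'
}

# Usage sketch:
#   out=$(dump_file_name db1)
#   mysqldump -u root -p db1 | gzip > "$out" && dump_is_complete "$out"
```

The check is cheap enough to run after every scheduled dump, although it does not replace a real restore test.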

Take a Global Read Lock During Backup

mysqldump -u root -p --lock-all-tables db1 > db1_global_readlock.sql

Record Binary Log Coordinates

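If you plan to use the dump for replication bootstrap or point-in-time recovery planning, include source log metadata. MySQL 8.0.26 and later name the option --source-data; earlier releases use --master-data, which is now deprecated.

mysqldump -u root -p --source-data db1 > db1_master_data.sql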

2. Restore a `mysqldump` Backup Carefully

Treat a logical backup as incomplete until you can run a predictable restore.

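Basic restore pattern (create the target database first, because a single-database dump taken without --databases contains no CREATE DATABASE statement):

mysql -u root -p db1 < db1.sql > db1_restore.log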

After the restore, validate the target database immediately:

SHOW DATABASES;
USE db1;
SHOW TABLES;

That quick verification step catches more issues than most teams expect, especially when the backup contains only part of the original schema.

3. Use `mydumper` and `myloader` for Faster Logical Backups

mydumper solves the biggest operational limitation of mysqldump: single-threaded execution. On larger datasets, multithreaded logical backup can reduce runtime significantly.

mydumper writes the dump files. myloader reads the backup set and restores the objects into MySQL.

Install the Tools

The source workflow installs the upstream GitHub releases. After you install the packages, confirm that both binaries are available.

which mydumper
which myloader

Back Up a Single Database

mydumper \
  --database=db1 \
  --host=localhost \
  --user=root \
  --password='' \
  --outputdir=mysql_backup/ \
  -G -E -R \
  --threads=4 \
  --rows=10

Important flags from the source material:

-G dumps triggers
-E dumps events
-R dumps routines
--threads controls parallelism
--rows splits each table into chunks of roughly that many rows (10 is a lab-sized value; the later examples use 50000)

Restore with `myloader`

myloader \
  --host=localhost \
  --user=root \
  --password='' \
  --database=db1 \
  --directory=/home/mysql/mysql_backup/mysql_backup \
  --queries-per-transaction=10 \
  --threads=4 \
  --verbose=3

The source examples validate restore success by dropping the database first, loading it back, and then checking the database and table inventory.

Back Up Selected Databases with Regex

mydumper \
  --host=localhost \
  --user=root \
  --password='' \
  --outputdir=/home/mysql/mysql_backup/mysql_backup \
  --rows=50000 \
  -G -E -R \
  --threads=4 \
  --regex '^(db3\.|db4\.)' \
  -L /tmp/mydumper-logs.txt

Back Up Selected Tables

mydumper \
  --host=localhost \
  --user=root \
  --password='' \
  --outputdir=/home/mysql/mysql_backup/mysql_backup \
  --rows=50000 \
  -G -E -R \
  --threads=8 \
  --regex '^(db1\.emp$|db1\.country$)' \
  -L /tmp/mydumper-logs.txt

Back Up a Single Table with Compression

mydumper \
  --host=localhost \
  --user=root \
  --password='' \
  --outputdir=/home/mysql/mysql_backup/mysql_backup \
  --rows=50000 \
  -G -E -R \
  --threads=4 \
  --regex '^(db1\.mgr$)' \
  --compress \
  --verbose 3 \
  -L /tmp/mydumper-logs.txt

Restore the Compressed Table Backup

myloader \
  --host=localhost \
  --user=root \
  --password='' \
  -B db1 \
  --directory=/home/mysql/mysql_backup/mysql_backup \
  --queries-per-transaction=50000 \
  --threads=4 \
  --verbose=3 \
  --overwrite-tables

Use this pattern when you want selective logical restore without replaying a full database export.

4. Use Percona XtraBackup When You Need Hot Physical Backups

Although XtraBackup does not produce logical dumps, this guide includes it because many MySQL backup strategies combine logical and physical methods.

Use XtraBackup when you need:

Hot backups against active InnoDB workloads
Faster recovery of large datasets
Incremental backup support
A backup that preserves physical storage state and binlog position metadata

Install and Verify XtraBackup

The source workflow installs the Percona release package, enables the repository, and then installs percona-xtrabackup-80.

Validation commands:

rpm -qa | grep percona-xtra
xtrabackup --version

Take a Full Backup

xtrabackup --backup \
  --user=root \
  --password='' \
  --target-dir=/var/lib/backup/

After the backup completes, inspect the output directory and metadata files such as:

xtrabackup_info
xtrabackup_checkpoints
xtrabackup_binlog_info
backup-my.cnf

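xtrabackup_checkpoints records the backup type and the LSN range, and xtrabackup_binlog_info records the binlog position the backup captured. A successful run also ends its output with "completed OK!".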

Take an Incremental Backup

After the full backup, point the next backup to the full backup directory as the base.

xtrabackup --backup \
  --user=root \
  --password='' \
  --target-dir=/var/lib/incremental_backup/ \
  --incremental-basedir=/var/lib/backup/

The resulting directory contains .delta and .meta files that represent changed pages relative to the full backup.
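
Before relying on an incremental, it is worth confirming that it chains onto the intended base: the incremental's from_lsn should equal the base backup's to_lsn. The sketch below reads both values from xtrabackup_checkpoints, assuming the "key = value" layout that XtraBackup 8.0 writes; checkpoint_field and chain_is_valid are hypothetical helper names.

```shell
# Read one field from an xtrabackup_checkpoints file ("key = value" lines).
checkpoint_field() {
  awk -F' = ' -v key="$2" '$1 == key { print $2 }' "$1"
}

# Succeed when the incremental starts at the LSN where the base ended.
chain_is_valid() {
  base_to=$(checkpoint_field "$1/xtrabackup_checkpoints" to_lsn)
  incr_from=$(checkpoint_field "$2/xtrabackup_checkpoints" from_lsn)
  [ -n "$base_to" ] && [ "$base_to" = "$incr_from" ]
}

# Usage: chain_is_valid /var/lib/backup /var/lib/incremental_backup
```

Run the check before the prepare phase, since preparing with the wrong base fails only after the work has started.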

5. Prepare and Restore an XtraBackup Recovery Set

The source material simulates data loss by creating tables, taking an incremental backup, dropping those tables, and then restoring the prepared backup set.

Prepare the Full Backup

xtrabackup --prepare=TRUE --apply-log-only=TRUE --target-dir=/var/lib/backup/

Apply the Incremental Backup

xtrabackup --prepare=TRUE \
  --target-dir=/var/lib/backup/ \
  --incremental-dir=/var/lib/incremental_backup/
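
The ordering rule behind those two commands is that --apply-log-only skips the rollback phase, so every prepare step except the last must use it; otherwise later incrementals can no longer be applied. The sketch below prints the command sequence for a base plus any number of incrementals without executing anything. prepare_plan is a hypothetical helper name, and the commands use the bare flag form rather than =TRUE.

```shell
# Print the prepare commands for a base backup followed by zero or more
# incremental directories, oldest first. Only the final step omits
# --apply-log-only.
prepare_plan() {
  base=$1; shift
  if [ "$#" -eq 0 ]; then
    echo "xtrabackup --prepare --target-dir=$base"
    return
  fi
  echo "xtrabackup --prepare --apply-log-only --target-dir=$base"
  while [ "$#" -gt 0 ]; do
    inc=$1; shift
    if [ "$#" -gt 0 ]; then
      echo "xtrabackup --prepare --apply-log-only --target-dir=$base --incremental-dir=$inc"
    else
      echo "xtrabackup --prepare --target-dir=$base --incremental-dir=$inc"
    fi
  done
}

# Usage: prepare_plan /var/lib/backup/ /var/lib/incremental_backup/
```

Printing the plan first makes it easy to review the order before running the steps by hand or from a script.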

Stop MySQL and Recreate the Target Directory

sudo systemctl stop mysqld
cd /var/lib/
mv mysql mysql_old
mkdir mysql
chown -R mysql:mysql /var/lib/mysql

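Copy Back the Prepared Backup

xtrabackup --copy-back --target-dir=/var/lib/backup/
chown -R mysql:mysql /var/lib/mysql

Repeat the ownership change after the copy, because the copied files are usually owned by the user who ran the backup rather than by mysql.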

Start MySQL and Verify Recovery

sudo systemctl start mysqld
mysql -u root -p

Then validate the restored schema and tables:

SHOW DATABASES;
USE employees;
SHOW TABLES;

That validation step closes the loop. Skip it, and you only know that the process copied files; you still do not know whether the restored dataset is usable.

How to Choose Between These Tools

Use mysqldump when you need portability, simple exports, schema-only dumps, or targeted object extraction.

Use mydumper and myloader when logical backup remains the right fit but mysqldump takes too long for the dataset size or restore window.

Use XtraBackup when you need hot physical backup, incremental capture, or faster recovery on larger MySQL environments.

In practice, many teams combine these approaches:

Logical dumps for selective export and object-level recovery
Physical backups for full-server protection and faster restore objectives

Final Takeaway

Do not look for one backup tool to win every scenario. Match the backup method to the recovery objective.

If you only script the backup and never test the restore, the process remains unfinished. A working MySQL backup strategy includes verified recovery steps, metadata inspection, and enough operational discipline to reproduce the workflow under pressure.

Practical MySQL Replication and Scalability - Part 4: Scale-Out with Clone and Chain Replication

2026-03-27T00:00:00+00:00

Part 4 closes the series by moving from replication setup into scale-out operations.

Once GTID-based replication is stable, the next question is usually how to add capacity without repeatedly taking full manual backups from the primary source. MySQL’s clone plugin gives you a faster bootstrap path, and chained replication helps distribute replication load more flexibly.

The Scale-Out Pattern

In the example topology, mysql-a already replicates to mysql-b. Instead of building mysql-c directly from mysql-a, you can clone from mysql-b and then configure mysql-c to replicate downstream.

That creates a chain topology:

mysql-a -> mysql-b -> mysql-c

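This can reduce operational pressure on the original source and give you more options for replica placement. The intermediate server, mysql-b, must write the changes it replicates to its own binary log so that mysql-c can read them; log_replica_updates controls this and is enabled by default in MySQL 8.0.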

Step 1: Install the Clone Plugin

Install the plugin on both the donor and the receiving replica.

INSTALL PLUGIN clone SONAME 'mysql_clone.so';

SELECT plugin_name, plugin_status
FROM information_schema.plugins
WHERE plugin_name = 'clone';

You want the plugin status to return ACTIVE.

Step 2: Create a Donor Account

Create a dedicated user for clone operations on the donor (mysql-b in this topology) rather than reusing the replication account.

Example pattern:

CREATE USER 'donor_clone_user'@'mysql-c' IDENTIFIED BY '';
GRANT ALL PRIVILEGES ON *.* TO 'donor_clone_user'@'mysql-c';

In a production environment, narrow both privileges and host scope: the donor account needs only BACKUP_ADMIN, and the account that runs CLONE INSTANCE on the recipient needs CLONE_ADMIN.

Step 3: Define the Valid Donor List

On the receiving server, tell MySQL which donor host is allowed.

SET GLOBAL clone_valid_donor_list='mysql-b:3306';
SHOW VARIABLES LIKE '%clone_valid%';

This prevents arbitrary clone sources from being used accidentally.

Step 4: Run the Clone Operation

Now clone the donor instance onto the new replica.

CLONE INSTANCE FROM 'donor_clone_user'@'mysql-b':3306 IDENTIFIED BY '';

The clone operation replaces existing user-created objects on the target and restarts MySQL as part of the workflow. Treat it as a provisioning action, not a casual maintenance command.

Step 5: Reattach Replication

If GTID-based replication is already enabled, you can connect the cloned server to its upstream source without manually supplying binary log coordinates.

Example:

CHANGE REPLICATION SOURCE TO
  SOURCE_HOST='mysql-b',
  SOURCE_PORT=3306,
  SOURCE_USER='replication_user',
  SOURCE_PASSWORD='',
  SOURCE_AUTO_POSITION=1,
  GET_SOURCE_PUBLIC_KEY=1;

START REPLICA;
SHOW REPLICA STATUS\G

This is where GTID pays off again. The new replica can align based on executed transactions instead of a manually captured file position.

Step 6: Validate the Chain Topology

To prove the scale-out path works, create a new object on mysql-a and verify that it appears on both downstream servers.

Example:

CREATE DATABASE db8;

Then verify on mysql-b and mysql-c:

SHOW DATABASES;

If db8 appears on both servers and the replica status remains healthy, the chained flow is working as expected.

Operational Notes

This pattern improves scalability, but it does not automatically solve failover orchestration, write conflict management, or split-brain prevention. Those concerns belong to a fuller high-availability design using tools such as InnoDB Cluster and related topologies.

Still, for read scaling and faster replica provisioning, clone plus GTID-based chained replication is a practical and effective pattern.

This closes this replication series. The next natural step is to move from replication and scale-out into the higher-level MySQL HA patterns built on top of them.

Practical MySQL Replication and Scalability - Part 3: Moving to GTID-Based Replication

2026-03-10T00:00:00+00:00

Part 3 moves the replication topology from manual file-position coordination to GTID-based auto-positioning.

If you have ever had to rebuild a replica under pressure, you already know the weakness of classic binlog coordinates: they work, but they add manual bookkeeping exactly when you want the process to be simpler. GTID reduces that operational burden.

What GTID Changes

A GTID is a globally unique identifier attached to each committed transaction. Instead of asking a replica to start at a specific binary log byte offset, you ask it to continue from the transactions it has not yet executed.

Conceptually:

GTID = source_uuid:transaction_id

That means the replica can synchronize based on transaction history rather than a human-tracked log coordinate.
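
In practice, GTIDs appear as sets such as uuid:1-5:11-18, where each colon-separated part after the UUID is a single transaction number or an inclusive range. The sketch below counts the transactions in such a set, which is handy for comparing two servers at a glance. It assumes untagged GTIDs in the MySQL 8.0 format; gtid_count is a hypothetical helper name.

```shell
# Count the transactions in a GTID set such as
# "uuid:1-5:11-18, other-uuid:7". Each part after the UUID is either a
# single number or an inclusive range.
gtid_count() {
  printf '%s\n' "$1" | tr ',' '\n' | awk -F: '
    { for (i = 2; i <= NF; i++) {
        n = split($i, r, "-")
        total += (n == 2) ? r[2] - r[1] + 1 : 1
      } }
    END { print total + 0 }'
}

gtid_count "3E11FA47-71CA-11E1-9E33-C80AA9429562:1-5:11-18"   # prints 13
```

A gap between ranges, as in 1-5:11-18, means the server never executed transactions 6 through 10 from that source, which is a useful clue during troubleshooting.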

In this GTID phase, the topology still uses one source and two replicas. The change is not the shape of the environment, but the way replicas determine where to resume replication.

Enable GTID on the Source

The source must be started with GTID support enabled.

Example configuration:

[mysqld]
log-bin=mysql-bin
log-bin-index=mysql-bin.index
server-id=1
binlog-format=ROW
innodb_flush_log_at_trx_commit=1
sync-binlog=1
gtid_mode=ON
enforce_gtid_consistency=ON

enforce_gtid_consistency=ON prevents statements that would break GTID-safe replication semantics.

Enable GTID on the Replicas

Each replica needs the same GTID-related settings, along with its existing relay log configuration.

Example:

[mysqld]
server-id=2
relay-log=relay-mysql-b
relay-log-index=relay-mysql-b.index
skip-slave-start
gtid_mode=ON
enforce_gtid_consistency=ON

Repeat the same pattern on additional replicas, changing only the server-specific identifiers and relay log names.

Verify GTID Settings After Restart

After restarting MySQL on each server, confirm that the required variables are active.

SHOW VARIABLES LIKE 'gtid_mode';
SHOW VARIABLES LIKE 'enforce_gtid_consistency';

You want both variables to report ON.

Reconfigure the Replica to Use Auto-Positioning

Once GTID is enabled, change the replication source configuration to use transaction auto-positioning.

Example:

CHANGE REPLICATION SOURCE TO
  SOURCE_HOST='10.0.0.10',
  SOURCE_USER='replication_user',
  SOURCE_PASSWORD='',
  SOURCE_AUTO_POSITION=1,
  GET_SOURCE_PUBLIC_KEY=1;

Then start replication:

START REPLICA;
SHOW REPLICA STATUS\G

What to Look for in Replica Status

When the replica is healthy, these signals matter most:

Replica_IO_Running: Yes
Replica_SQL_Running: Yes
Auto_Position: 1
Seconds_Behind_Source: 0 or near zero

You can also inspect the GTID tracking fields:

Retrieved_Gtid_Set
Executed_Gtid_Set

These fields become especially useful when troubleshooting lag, reparenting, or partial recovery.
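
The health signals above can be checked mechanically from saved SHOW REPLICA STATUS\G output. This is a minimal sketch that assumes the MySQL 8.0.22+ field names shown in this guide; replica_healthy is a hypothetical helper name, and the lag limit is whatever your application tolerates.

```shell
# Read SHOW REPLICA STATUS\G text on stdin. Succeed when both threads run
# and Seconds_Behind_Source is a number no greater than the given limit.
replica_healthy() {
  awk -v max="$1" '
    { sub(/^[ \t]+/, "") }
    /^Replica_IO_Running:/    { io = $2 }
    /^Replica_SQL_Running:/   { sql = $2 }
    /^Seconds_Behind_Source:/ { lag = $2 }
    END { exit !(io == "Yes" && sql == "Yes" && lag ~ /^[0-9]+$/ && lag + 0 <= max + 0) }'
}

# Usage:
#   mysql -u root -p -e 'SHOW REPLICA STATUS\G' | replica_healthy 5 && echo healthy
```

A NULL value for Seconds_Behind_Source fails the check on purpose, because it means the replica cannot currently measure its lag.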

Why GTID Is Usually Better

GTID-based replication is not magically simpler in every edge case, but it is much easier to operate day to day.

Main advantages:

No need to manually capture log file and position during every reprovision
Easier replica rebuilds and source changes
Better fit for automated failover or topology management tools
Clearer transaction history tracking across servers

Functional Validation

Once GTID replication is online, validate it the same way you validated binlog-based replication.

Example:

CREATE DATABASE db4;
CREATE DATABASE db5;

Then confirm those changes appear on every replica.

If the databases arrive cleanly and SHOW REPLICA STATUS\G remains healthy, the topology is working under GTID.

In Part 4, I extend the discussion from replication into scalability by using the clone plugin and chained replication to build out additional capacity more efficiently.

Practical MySQL Replication and Scalability - Part 2: Bootstrapping Binlog Replicas

2026-02-23T00:00:00+00:00

Part 2 focuses on the most operationally sensitive part of classic replication: provisioning replicas from a consistent source snapshot.

Once your source and replicas are configured correctly, the next job is to capture a known-good data state and align the replicas to the matching binary log position.

Step 1: Confirm the Source Is Ready

Before taking a snapshot, verify that binary logging is enabled and that the source is healthy.

Useful checks:

SHOW VARIABLES LIKE 'log_bin';
SHOW VARIABLES LIKE 'binlog_format';
SHOW MASTER STATUS;

You should also verify that the replication user already exists and that network access from the replicas is possible.

Step 2: Lock the Source Long Enough to Capture Coordinates

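To align the dump with a precise binary log position, place the source under a read lock, capture the coordinates, and keep the lock only as long as necessary. Run both statements in one client session and leave that session open, because the lock is released as soon as the session ends.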

FLUSH TABLES WITH READ LOCK;
SHOW MASTER STATUS;

The output from SHOW MASTER STATUS (SHOW BINARY LOG STATUS from MySQL 8.2, and the only form in 8.4) gives you the two values the replicas need:

File
Position

Make a note of both before moving on.
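
Copying the two values by hand is where typos creep in, so it can help to capture them in a form you can paste later. The sketch below turns the tab-separated output of the client's batch mode into the matching replication clause; source_coords_clause is a hypothetical helper name.

```shell
# Turn the first row of tab-separated SHOW MASTER STATUS output
# (File, Position, ...) into the clause used when pointing a replica.
source_coords_clause() {
  awk -F'\t' 'NR == 1 { printf "SOURCE_LOG_FILE=\047%s\047, SOURCE_LOG_POS=%s\n", $1, $2 }'
}

# Usage, from a second session while the lock is held:
#   mysql -u root -p -N -B -e 'SHOW MASTER STATUS' | source_coords_clause
printf 'mysql-bin.000001\t1482\t\t\t\n' | source_coords_clause
# prints SOURCE_LOG_FILE='mysql-bin.000001', SOURCE_LOG_POS=1482
```

Save the output alongside the dump file so the coordinates and the snapshot travel together.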

Step 3: Take a Logical Backup

While the lock is in place, create the bootstrap dump from the source.

Example:

mysqldump -uroot -p \
  --all-databases \
  --triggers \
  --routines \
  --events \
  --source-data \
  --set-gtid-purged=OFF \
  > replication_db_dump.sql

This combination works well for a full-environment bootstrap because it captures:

All databases
Stored routines
Events
Triggers
Source log metadata inside the dump file

When the dump is complete, release the read lock:

UNLOCK TABLES;

Step 4: Load the Snapshot on Each Replica

Copy the dump file to each replica and import it before enabling replication.

Example import flow:

mysql -uroot -p < replication_db_dump.sql

At this point, the replica has the same logical data set as the source had at the captured binlog coordinate.

Step 5: Point Each Replica at the Source

Now configure replication using the recorded binary log file and position.

Example:

CHANGE REPLICATION SOURCE TO
  SOURCE_HOST='10.0.0.10',
  SOURCE_USER='replication_user',
  SOURCE_PASSWORD='',
  SOURCE_LOG_FILE='mysql-bin.000001',
  SOURCE_LOG_POS=1482,
  GET_SOURCE_PUBLIC_KEY=1;

If you are working with older syntax or legacy automation, you may still see CHANGE MASTER TO. From MySQL 8.0.23 onward, CHANGE REPLICATION SOURCE TO is the preferred form.

Step 6: Start Replication

Once the source coordinates are configured, start the replica threads.

START REPLICA;
SHOW REPLICA STATUS\G

These fields matter most during first validation:

Replica_IO_Running: Yes
Replica_SQL_Running: Yes
Seconds_Behind_Source: 0 or a small transient value
Replica_SQL_Running_State showing the replica has caught up and is waiting for new events

Step 7: Validate End-to-End Replication

A simple validation pattern is to create test objects on the source and confirm that they appear on each replica.

Example:

CREATE DATABASE db1;
CREATE DATABASE db2;
CREATE DATABASE db3;

Then on a replica:

SHOW DATABASES;

If you want a stronger test, create a table and insert a few rows, then verify both schema and data replication.

Common Failure Points

If the replica does not start cleanly, check these first:

Wrong source host or port
Incorrect replication credentials
Wrong SOURCE_LOG_FILE or SOURCE_LOG_POS
Duplicate server-id or server_uuid
Firewall or network path issues

Binlog-based replication is reliable, but it demands careful coordinate handling. That manual dependency is exactly why many teams prefer GTID once the baseline topology is working.

In Part 3, I move the same topology to GTID-based replication and show how auto-positioning simplifies source alignment.

Practical MySQL Replication and Scalability - Part 1: Replication Models and Binlog Prerequisites

2026-02-08T00:00:00+00:00

Part 1 starts the next MySQL series with the fundamentals behind replication and scale-out planning.

When teams talk about MySQL high availability, they often jump straight to failover tooling. Before that, it helps to understand how replication actually moves data, what assumptions it depends on, and which server settings must be correct before you bootstrap replicas.

Why Replication Matters

MySQL replication is commonly used for four practical outcomes:

Scale-out read traffic across multiple servers
Keep analytical or reporting workloads away from the primary write path
Distribute data closer to remote users or applications
Improve resilience by maintaining additional synchronized copies of data

Replication by itself is not a full high-availability strategy, but it is the base layer that most HA designs build on.

Binlog Position vs GTID Replication

MySQL 8.0 supports two mainstream replication approaches.

Binlog Position-Based Replication

This is the traditional model. Replicas connect to the source and start reading changes from a specific binary log file and position.

That means you need two pieces of state when configuring the replica:

The source binary log file name
The byte position within that binary log

This method works well, but it is more manual during provisioning and recovery.

GTID-Based Replication

GTID replication assigns a unique transaction identifier to each committed transaction. Instead of telling a replica where to start in a log file, you tell it to auto-position based on executed transaction history.

A GTID looks like this:

source_uuid:transaction_id

GTID-based replication is usually the better operational choice because it reduces manual coordination and makes source changes easier to reason about.
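
To make auto-positioning more concrete, here is a minimal Python sketch of the idea. It is not MySQL's protocol code: the helper names (parse_gtid, missing_transactions) and the sample UUID are assumptions made up for illustration, and real GTID sets also use interval notation such as uuid:1-5, which this sketch ignores.

```python
from __future__ import annotations

import uuid


def parse_gtid(gtid: str) -> tuple[uuid.UUID, int]:
    """Split a single GTID of the form source_uuid:transaction_id."""
    source_uuid, _, transaction_id = gtid.rpartition(":")
    return uuid.UUID(source_uuid), int(transaction_id)


def missing_transactions(source_executed: set[int], replica_executed: set[int]) -> list[int]:
    """Transaction ids (for one source UUID) that the replica has not applied yet."""
    return sorted(source_executed - replica_executed)


# Hypothetical values, for illustration only.
SAMPLE_GTID = "3e11fa47-71ca-11e1-9e33-c80aa9429562:23"
source_id, txn_id = parse_gtid(SAMPLE_GTID)

# The replica reports what it has executed; the source works out the rest.
to_send = missing_transactions({1, 2, 3, 4, 5}, {1, 2, 3})
print(source_id, txn_id, to_send)
```

The point of the sketch is that the replica never names a file or byte offset. It only reports which transactions it already has, and the difference determines what still needs to be sent.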

High-Level Overview

This series uses a simple topology with one source and two replicas. The same shape supports the initial binlog-based setup before moving to GTID-based auto-positioning later in the series.

Baseline Prerequisites

Before configuring replicas, verify the following across the topology:

Binary logging is enabled on the source
Every server has a unique server-id
Every server has a unique server_uuid
Replicas can reach the source over the network
A dedicated replication user exists on the source

One of the easiest mistakes in cloned lab environments is duplicated UUID metadata. If multiple servers were copied from the same image, validate UUID uniqueness immediately.

Checks to run:

SELECT @@server_id;
SELECT @@server_uuid;
SHOW VARIABLES LIKE 'skip_networking';

If skip_networking is ON, the replica will not be able to connect to the source over TCP.
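
Once the values from those checks are collected for every server, spotting collisions is a simple grouping exercise. The Python sketch below assumes you have already gathered the results by hand or with a script; the host names, IDs, and UUIDs are hypothetical sample data, and find_duplicates is an illustrative helper, not part of any MySQL tool.

```python
from collections import defaultdict


def find_duplicates(values_by_host: dict) -> dict:
    """Return {value: [hosts...]} for every value reported by more than one host."""
    hosts_by_value = defaultdict(list)
    for host, value in values_by_host.items():
        hosts_by_value[value].append(host)
    return {value: hosts for value, hosts in hosts_by_value.items() if len(hosts) > 1}


# Hypothetical output of SELECT @@server_id / SELECT @@server_uuid per host.
server_ids = {"source": 1, "replica1": 2, "replica2": 3}
server_uuids = {
    "source": "11111111-1111-1111-1111-111111111111",
    "replica1": "22222222-2222-2222-2222-222222222222",
    # replica2 was cloned from replica1's image, so it inherited the same UUID.
    "replica2": "22222222-2222-2222-2222-222222222222",
}

duplicate_ids = find_duplicates(server_ids)
duplicate_uuids = find_duplicates(server_uuids)
print(duplicate_ids, duplicate_uuids)
```

In this sample the server IDs are clean, but the cloned replica shares a UUID with its origin, which is the situation the UUID section later in this post fixes.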

Source Configuration for Binlog Replication

On the source server, the MySQL configuration needs durable binary logging behavior and a unique identifier.

Example configuration:

[mysqld]
log-bin=mysql-bin
log-bin-index=mysql-bin.index
server-id=1
binlog-format=ROW
innodb_flush_log_at_trx_commit=1
sync-binlog=1

ROW format is the safer default for modern replication because it avoids a number of ambiguity issues found in statement-based logging.

Replica Configuration Basics

Each replica needs its own server ID and relay log settings.

Example configuration:

[mysqld]
server-id=2
relay-log=relay-mysql-b
relay-log-index=relay-mysql-b.index
skip-slave-start

For a second replica, keep the same structure but assign a different server-id and relay log name.

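skip-slave-start is useful during initial provisioning because it prevents replication from starting before the configuration is complete. From MySQL 8.0.26 onward the preferred spelling is skip-replica-start; the older name still works but is deprecated.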

Handling Duplicate Server UUIDs

If two servers report the same @@server_uuid, stop MySQL on the affected replica, remove or move the auto.cnf file, and start MySQL again so a fresh UUID is generated.

Example flow:

sudo systemctl stop mysqld
sudo mv /var/lib/mysql/auto.cnf /tmp/auto.cnf.backup
sudo systemctl start mysqld

Then verify:

SELECT @@server_uuid;

Creating a Dedicated Replication User

Create a dedicated account on the source instead of reusing an administrative login.

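Example pattern (replace the <replication_password> placeholder with a real secret; an empty password should never be used for this account):

CREATE USER 'replication_user'@'10.0.0.%' IDENTIFIED BY '<replication_password>';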
GRANT REPLICATION SLAVE ON *.* TO 'replication_user'@'10.0.0.%';
FLUSH PRIVILEGES;

Use the narrowest host specification that fits your environment. Avoid % when you know the replica subnet or host list.
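
To see how much a host pattern admits, the following Python sketch approximates the wildcard rules for account host values, where % stands for any run of characters and _ for a single character. It is a simplified model under that assumption, not MySQL's actual matching code, and host_pattern_matches is a hypothetical helper name.

```python
import re


def host_pattern_matches(pattern: str, client_host: str) -> bool:
    """Approximate account host matching: % is any run of characters, _ is one character."""
    regex = "".join(
        ".*" if ch == "%" else "." if ch == "_" else re.escape(ch)
        for ch in pattern
    )
    return re.fullmatch(regex, client_host, flags=re.IGNORECASE) is not None


# Hypothetical client addresses checked against two candidate patterns.
clients = ["10.0.0.25", "10.0.1.25", "203.0.113.9"]
subnet_only = [c for c in clients if host_pattern_matches("10.0.0.%", c)]
anywhere = [c for c in clients if host_pattern_matches("%", c)]
print(subnet_only, anywhere)
```

The subnet pattern admits only the first address, while a bare % admits all three, which is why the narrower form is the safer default.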

What Comes Next

At this stage, the source and replicas are prepared, but data has not yet been synchronized. In Part 2, I walk through the bootstrap flow for binlog-based replication: capturing a consistent snapshot, recording binary log coordinates, loading the snapshot on replicas, and starting replication cleanly.

Practical MySQL Tablespace and Partitioning - Part 6: COLUMNS, HASH, KEY, and Subpartitioning

2026-01-31T00:00:00+00:00

Part 6 closes this partitioning series with advanced partitioning patterns.

These methods are useful when RANGE/LIST alone do not match the distribution characteristics of your workload.

1. COLUMNS Partitioning

COLUMNS partitioning extends RANGE/LIST concepts to multiple columns and supports non-integer types such as date values.

Example pattern:

CREATE TABLE emp_columns (
  id INT NOT NULL,
  fname VARCHAR(30),
  lname VARCHAR(30),
  hired DATE NOT NULL DEFAULT '2023-01-01',
  position INT NOT NULL,
  fired VARCHAR(5) NOT NULL DEFAULT 'No',
  dep_id INT NOT NULL
)
PARTITION BY RANGE COLUMNS(fname,lname,hired) (
  PARTITION p1 VALUES LESS THAN ('a','a','2023-02-02'),
  PARTITION p2 VALUES LESS THAN ('z','z','2099-12-31')
);
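
RANGE COLUMNS compares the whole column tuple against each bound, left to right, rather than testing each column separately. The Python sketch below models that rule for the emp_columns bounds. It assumes non-NULL names and uses lowercasing as a stand-in for a case-insensitive collation; range_columns_partition is an illustrative helper, and the error text is not MySQL's exact message.

```python
from datetime import date

# Upper bounds copied from the emp_columns definition.
BOUNDS = [
    ("p1", ("a", "a", date(2023, 2, 2))),
    ("p2", ("z", "z", date(2099, 12, 31))),
]


def range_columns_partition(fname: str, lname: str, hired: date) -> str:
    """Return the first partition whose bound tuple is strictly greater than the row tuple."""
    # Assumption: lowercasing approximates a case-insensitive collation.
    row = (fname.lower(), lname.lower(), hired)
    for name, bound in BOUNDS:
        if row < bound:  # tuple comparison: first differing column decides
            return name
    raise ValueError(f"no partition accepts row {row}")


early = range_columns_partition("a", "a", date(2023, 1, 15))
typical = range_columns_partition("Bob", "Smith", date(2023, 5, 1))
print(early, typical)
```

A first name such as Zoe sorts after the bound 'z', so with these two partitions the row has nowhere to go. Adding a final partition with MAXVALUE bounds avoids that failure.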

2. HASH Partitioning

HASH distributes rows using a user-defined expression.

CREATE TABLE emp_hash (
  id INT NOT NULL,
  fname VARCHAR(30),
  lname VARCHAR(30),
  hired DATE NOT NULL DEFAULT '2023-01-01',
  position INT NOT NULL,
  fired VARCHAR(5) NOT NULL DEFAULT 'No',
  dep_id INT NOT NULL
)
PARTITION BY HASH(id)
PARTITIONS 5;

Check spread:

SELECT partition_name, table_rows
FROM information_schema.partitions
WHERE table_name='emp_hash';
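
The spread you see there follows from modulus arithmetic: plain HASH partitioning places a row in partition MOD(expr, N). A minimal Python sketch of that rule for emp_hash, assuming non-negative integer ids and the default partition names p0 to p4 (hash_partition is an illustrative helper):

```python
def hash_partition(expr_value, num_partitions):
    """Partition index for PARTITION BY HASH(expr) PARTITIONS n."""
    if expr_value is None:
        # MySQL treats a NULL partitioning expression as 0 for HASH and KEY.
        expr_value = 0
    return expr_value % num_partitions


# Where ids 1..10 land in the five partitions of emp_hash.
placement = {emp_id: f"p{hash_partition(emp_id, 5)}" for emp_id in range(1, 11)}
print(placement)
```

Sequential ids therefore rotate evenly through the partitions, while a skewed expression (for example one that only ever yields multiples of 5) would pile every row into a single partition.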

3. KEY Partitioning

KEY partitioning uses MySQL’s internal hashing logic instead of a user-defined hash expression. With an empty column list, as in KEY() below, MySQL partitions on the table's primary key.

CREATE TABLE emp_key (
  id INT NOT NULL PRIMARY KEY,
  fname VARCHAR(30),
  lname VARCHAR(30),
  hired DATE NOT NULL DEFAULT '2023-01-01',
  position INT NOT NULL,
  fired VARCHAR(5) NOT NULL DEFAULT 'No',
  dep_id INT NOT NULL
)
PARTITION BY KEY()
PARTITIONS 4;

4. Subpartitioning (Composite Partitioning)

Subpartitioning means partitions within partitions.

Example structure:

CREATE TABLE emp_subpart (
  bill_no INT,
  sale_date DATE,
  cust_code VARCHAR(15),
  amount DECIMAL(8,2)
)
PARTITION BY RANGE (YEAR(sale_date))
SUBPARTITION BY HASH (TO_DAYS(sale_date))
SUBPARTITIONS 4 (
  PARTITION p0 VALUES LESS THAN (1990),
  PARTITION p1 VALUES LESS THAN (2000),
  PARTITION p2 VALUES LESS THAN (2010),
  PARTITION p3 VALUES LESS THAN MAXVALUE
);
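
Row placement in this table happens in two steps: the year selects the RANGE partition, then TO_DAYS(sale_date) modulo 4 selects the subpartition inside it. The Python sketch below models both steps. The helper names are illustrative assumptions, and the result is a subpartition index rather than a subpartition name.

```python
from bisect import bisect_right
from datetime import date

YEAR_BOUNDS = [1990, 2000, 2010, float("inf")]  # inf stands in for MAXVALUE
PARTITION_NAMES = ["p0", "p1", "p2", "p3"]
SUBPARTITIONS = 4


def to_days(d: date) -> int:
    """Mirror MySQL TO_DAYS(): it equals Python's proleptic ordinal plus 365."""
    return d.toordinal() + 365


def locate(sale_date: date) -> tuple:
    """Return (partition name, subpartition index) for a sale date."""
    # VALUES LESS THAN is exclusive, so a year equal to a bound moves to the next partition.
    partition = PARTITION_NAMES[bisect_right(YEAR_BOUNDS, sale_date.year)]
    subpartition_index = to_days(sale_date) % SUBPARTITIONS
    return partition, subpartition_index


sample = locate(date(1995, 5, 1))
print(sample)
```

Because consecutive days rotate through the four subpartitions, daily sales data spreads evenly inside each year range.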

Inspect partition and subpartition metadata:

SELECT partition_name, subpartition_name, table_rows
FROM information_schema.partitions
WHERE table_name='emp_subpart';

Operational Wrap-Up

Across this series, the most useful pattern is consistent verification after each DDL change:

information_schema.partitions
information_schema.files
SHOW VARIABLES checks
EXPLAIN for partition-aware query behavior

This closes the practical tablespace and partitioning series. The next series moves to high availability and replication topics.

Practical MySQL Tablespace and Partitioning - Part 5: RANGE and LIST Partitioning

2026-01-16T00:00:00+00:00

Part 5 starts the partitioning half of the series.

Partitioning helps when large datasets need predictable pruning boundaries and easier lifecycle operations. In this part, I focus on RANGE and LIST because they are usually the first practical patterns teams adopt.

RANGE Partitioning Basics

Example table definition:

CREATE TABLE emp_range (
  id INT NOT NULL,
  fname VARCHAR(30),
  lname VARCHAR(30),
  hired DATE NOT NULL DEFAULT '2023-01-01',
  position INT NOT NULL,
  fired VARCHAR(5) NOT NULL DEFAULT 'No'
)
PARTITION BY RANGE (id) (
  PARTITION p0 VALUES LESS THAN (5),
  PARTITION p1 VALUES LESS THAN (10),
  PARTITION p2 VALUES LESS THAN (15),
  PARTITION p3 VALUES LESS THAN (20)
);

Inspect distribution:

SELECT partition_name, table_rows
FROM information_schema.partitions
WHERE table_name='emp_range';

Common RANGE Error and Fix

Inserting a value outside the defined ranges fails with an error such as:

ERROR 1526 (HY000): Table has no partition for value 23

Add a catch-all partition:

ALTER TABLE emp_range
ADD PARTITION (PARTITION p4 VALUES LESS THAN MAXVALUE);
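
The lookup rule behind both the error and the fix can be sketched in a few lines of Python. This is a model of the VALUES LESS THAN logic for emp_range, not MySQL internals; range_partition is an illustrative helper and float("inf") stands in for MAXVALUE.

```python
from bisect import bisect_right


def range_partition(value, bounds, names):
    """Return the first partition whose exclusive upper bound is greater than value."""
    index = bisect_right(bounds, value)
    if index == len(bounds):
        raise ValueError(f"Table has no partition for value {value}")
    return names[index]


bounds = [5, 10, 15, 20]
names = ["p0", "p1", "p2", "p3"]

try:
    before_fix = range_partition(23, bounds, names)
except ValueError as exc:
    before_fix = str(exc)

# After ALTER TABLE ... ADD PARTITION p4 VALUES LESS THAN MAXVALUE.
after_fix = range_partition(23, bounds + [float("inf")], names + ["p4"])
print(before_fix, after_fix)
```

Note that the bounds are exclusive: id 5 belongs to p1, not p0.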

LIST Partitioning Basics

LIST uses explicit value sets per partition.

CREATE TABLE emp_list (
  id INT NOT NULL,
  fname VARCHAR(30),
  lname VARCHAR(30),
  hired DATE NOT NULL DEFAULT '2023-01-01',
  position INT NOT NULL,
  fired VARCHAR(5) NOT NULL DEFAULT 'No',
  dep_id INT NOT NULL
)
PARTITION BY LIST(dep_id) (
  PARTITION first_dep VALUES IN (3,5,20),
  PARTITION second_dep VALUES IN (25,50,75),
  PARTITION third_dep VALUES IN (80,85,90,100,120,140,150)
);

Out-of-list values trigger the same class of partition-missing error.
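
Since LIST has no catch-all clause, it helps to check incoming values against the defined lists before loading data. A minimal Python sketch, with the value sets copied from emp_list and a hypothetical batch of incoming dep_id values:

```python
LIST_PARTITIONS = {
    "first_dep": {3, 5, 20},
    "second_dep": {25, 50, 75},
    "third_dep": {80, 85, 90, 100, 120, 140, 150},
}


def list_partition(dep_id: int) -> str:
    """Return the partition whose value list contains dep_id."""
    for name, values in LIST_PARTITIONS.items():
        if dep_id in values:
            return name
    raise ValueError(f"Table has no partition for value {dep_id}")


# Hypothetical batch of dep_id values about to be inserted.
incoming = [3, 50, 60, 150, 200]
known_values = set().union(*LIST_PARTITIONS.values())
unmapped = sorted(set(incoming) - known_values)
print(unmapped)
```

Any value reported as unmapped needs a partition that lists it before the insert can succeed.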

Validation Workflow

For both RANGE and LIST:

SELECT partition_name, table_rows
FROM information_schema.partitions
WHERE table_name IN ('emp_range','emp_list');

EXPLAIN SELECT * FROM emp_range;
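EXPLAIN SELECT * FROM emp_list;

The partitions column of the EXPLAIN output lists the partitions each query reads. Without a WHERE clause on the partitioning column, expect every partition to appear; add a filter on id or dep_id to confirm that pruning narrows the list.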

Practical Design Notes

Use RANGE for natural ordered growth keys.
Use LIST for explicit business bucket values.
Decide early how you will handle future values to avoid frequent emergency DDL.

In Part 6, I cover COLUMNS, HASH, KEY, and subpartitioning.

Practical MySQL Tablespace and Partitioning - Part 4: File-per-Table and General Tablespaces

2026-01-02T00:00:00+00:00

Part 4 explores practical file-per-table and general tablespace operations.

This is where logical design and physical placement intersect. You can control where table data lives, but you must stay within InnoDB rules.

Confirm File-per-Table Mode

SHOW VARIABLES LIKE 'innodb_file_per_table';

With ON, each InnoDB table typically maps to its own .ibd file.

Configure External Directories for General Tablespaces

If you create a general tablespace outside datadir, configure allowed directories.

innodb-directories=/var/lib/tbs/

Then restart and validate:

SHOW VARIABLES LIKE 'innodb_directories';

Create General Tablespace and Move Table

Example pattern:

CREATE TABLESPACE db1_tbs ADD DATAFILE '/var/lib/tbs/db1_tbs.ibd';

Inspect metadata:

SELECT name
FROM information_schema.innodb_tablespaces;

SELECT file_name, tablespace_name, extent_size, initial_size, autoextend_size
FROM information_schema.files
WHERE tablespace_name='db1_tbs'\G

Move a non-partitioned table:

ALTER TABLE emp_range TABLESPACE db1_tbs;

Important Constraint: Partitioned Tables

MySQL 8.0 (from 8.0.13) no longer supports placing a partitioned table in a shared tablespace, and a general tablespace counts as shared.

You can encounter errors like:

ERROR 1478 (HY000): InnoDB: A partitioned table is not allowed in a shared tablespace.

So evaluate table design first, then decide tablespace strategy.

Operational Checklist

Validate directory ownership before creating tablespace files.
Keep naming conventions clear for tablespace files.
Use metadata queries after each structural change.

In Part 5, I begin partitioning with RANGE and LIST patterns.

Practical MySQL Tablespace and Partitioning - Part 3: Managing UNDO and Temporary Tablespaces

2025-12-15T00:00:00+00:00

Part 3 covers UNDO and temporary tablespace management in MySQL 8.0.

These two areas are operationally important: UNDO affects transaction rollback and MVCC behavior, while temporary tablespace growth can become a capacity issue in busy environments.

Move UNDO Tablespaces

Typical workflow:

Inspect current undo files.
Set innodb_fast_shutdown=0.
Stop MySQL.
Move undo files to target directory.
Configure innodb-undo-directory.
Start MySQL and validate.

Command pattern:

ls -lrth /var/lib/mysql/undo*

SHOW GLOBAL VARIABLES LIKE 'innodb_fast_shutdown';
SET GLOBAL innodb_fast_shutdown = 0;

sudo systemctl stop mysqld
sudo mkdir -p /var/lib/mysql/innodb
sudo mv /var/lib/mysql/undo_* /var/lib/mysql/innodb/
sudo chown -R mysql:mysql /var/lib/mysql

Configuration snippet:

innodb-undo-directory=/var/lib/mysql/innodb/

Validate UNDO Move

sudo systemctl start mysqld

Then check:

SHOW GLOBAL VARIABLES LIKE 'innodb_undo%';

Resize Temporary Tablespace

If you need to cap or tune temporary tablespace growth, configure innodb-temp-data-file-path. The setting is read at startup, so restart MySQL after changing it.

Example:

innodb-temp-data-file-path=ibtmp1:12M:autoextend:max:2G
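
The value packs several settings into one colon-separated string: file name, initial size, the autoextend flag, and an optional maximum. The Python sketch below unpacks it so the numbers can be compared with what information_schema reports. It is an illustrative parser for a single-file value like the one above, not a full implementation of the option's grammar.

```python
UNITS = {"K": 1024, "M": 1024**2, "G": 1024**3}


def size_to_bytes(text: str) -> int:
    """Convert sizes such as 12M or 2G to bytes; plain numbers pass through."""
    text = text.strip().upper()
    if text[-1] in UNITS:
        return int(text[:-1]) * UNITS[text[-1]]
    return int(text)


def parse_temp_data_file_path(value: str) -> dict:
    """Parse file_name:size[:autoextend[:max:max_size]] for a single data file."""
    parts = value.split(":")
    spec = {
        "file_name": parts[0],
        "initial_bytes": size_to_bytes(parts[1]),
        "autoextend": False,
        "max_bytes": None,
    }
    rest = parts[2:]
    if rest and rest[0] == "autoextend":
        spec["autoextend"] = True
        if len(rest) == 3 and rest[1] == "max":
            spec["max_bytes"] = size_to_bytes(rest[2])
    return spec


spec = parse_temp_data_file_path("ibtmp1:12M:autoextend:max:2G")
print(spec)
```

For the example setting, the file starts at 12 MiB (12582912 bytes) and may grow to at most 2 GiB (2147483648 bytes).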

Verification queries:

SELECT @@innodb_temp_tablespaces_dir;
SELECT @@innodb_temp_data_file_path;

SELECT file_name, tablespace_name, initial_size,
       total_extents * extent_size AS totalsizebytes,
       data_free, maximum_size
FROM information_schema.files
WHERE tablespace_name='innodb_temporary'\G

Practical Guidance

Use predictable directory standards for easier backup and incident response.
Keep ownership/permissions checks in every move procedure.
Always validate both config variables and actual file placement.

In Part 4, I move to file-per-table and general tablespace operations.

