
These terms are commonly used in information about the InnoDB storage engine.
A
An acronym standing for atomicity, consistency, isolation, and durability. These properties are all desirable in a database system, and are all closely tied to the notion of a transaction. The transactional features of InnoDB adhere to the ACID principles.
Transactions are atomic units of work that can be committed or rolled back. When a transaction makes multiple changes to the database, either all the changes succeed when the transaction is committed, or all the changes are undone when the transaction is rolled back.
The database remains in a consistent state at all times -- after each commit or rollback, and while transactions are in progress.
Transactions are protected (isolated) from each other while they are in progress; they cannot interfere with each other or see each other's uncommitted data. This isolation is achieved through the notion of locking. (Expert users can adjust the isolation level, trading off less protection in favor of increased performance, when they can be sure that the transactions really do not interfere with each other.)
The results of transactions are durable: once a commit operation succeeds, the changes made by that transaction are safe from power failures, system crashes, race conditions, or other potential dangers that many non-database applications are vulnerable to. Durability typically involves writing to disk storage, with a certain amount of redundancy to protect against power failures or software crashes during write operations. (In InnoDB, the doublewrite buffer assists with durability.)
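As a minimal sketch of atomicity, the following transaction groups two statements so that both succeed or both are undone (the table and column names are illustrative):

```sql
-- Transfer funds atomically: both updates take effect at COMMIT,
-- or neither does if the transaction is rolled back.
START TRANSACTION;
UPDATE accounts SET balance = balance - 100 WHERE id = 1;
UPDATE accounts SET balance = balance + 100 WHERE id = 2;
COMMIT;   -- or ROLLBACK; to undo both changes
```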
See also commit.
See also doublewrite buffer.
See also isolation level.
See also locking.
See also rollback.
See also transaction.
An algorithm, first introduced in the InnoDB Plugin, that smooths out the I/O overhead introduced by checkpoints. Instead of flushing all modified pages from the buffer pool to the data files at once, the InnoDB storage engine periodically flushes small sets of modified pages. The adaptive flushing algorithm extends this process by estimating the optimal rate to perform these periodic flushes, based on the rate of flushing and how fast redo information is generated.
See also buffer pool.
See also checkpoint.
See also data files.
See also flush.
See also plugin.
If a table fits almost entirely in main memory, the fastest way to perform queries on it is to use hash indexes. InnoDB has a mechanism that monitors index searches made to the indexes defined for a table. If InnoDB notices that queries could benefit from a hash index, it builds one automatically.
The hash index is always built based on an existing B-tree index on the table. InnoDB can build a hash index on a prefix of any length of the key defined for the B-tree, depending on the pattern of searches that InnoDB observes for the B-tree index. A hash index can be partial; the whole B-tree index does not need to be cached in the buffer pool. InnoDB builds hash indexes on demand for those pages of the index that are often accessed.
In a sense, the adaptive hash index configures InnoDB at runtime to take advantage of ample main memory, coming closer to the architecture of main-memory databases.
See also B-tree.
Acronym for asynchronous I/O. You might see this acronym in InnoDB messages or keywords.
See also asynchronous I/O.
The code name for the original InnoDB file format. It supports the redundant and compact row formats, but not the newer dynamic and compressed row formats available in the Barracuda file format.
You can select the file format to use through the
innodb_file_format option.
The default InnoDB file format switches to Barracuda in MySQL 5.5.5.
See also Barracuda.
See also compact row format.
See also compressed row format.
See also dynamic row format.
See also file format.
See also ibdata file.
See also innodb_file_format.
See also redundant row format.
See also row format.
A set of functions or procedures. An API provides a stable set of names and types for functions, procedures, parameters, and return values.
When a backup produced by the MySQL
Enterprise Backup product does not include the most
recent changes that occurred while the backup was underway, the
process of updating the backup files to include those changes is
known as the apply step. It is
specified by the --apply-log option of the
ibbackup or innobackup
commands.
Before the changes are applied, we refer to the files as a raw backup. After the changes are applied, we refer to the files as a prepared backup. The changes are recorded in the ibbackup_logfile file; once the apply step is finished, this file is no longer necessary.
See also hot backup.
See also ibbackup_logfile.
See also MySQL Enterprise Backup.
See also prepared backup.
See also raw backup.
Archive storage engine metadata. Files with this extension are
always included in backups produced by the
innobackup command of the
MySQL Enterprise Backup
product.
See also .ARZ file.
See also innobackup command.
See also MySQL Enterprise Backup.
Archive storage engine data. Files with this extension are
always included in backups produced by the
innobackup command of the
MySQL Enterprise Backup
product.
See also .ARM file.
See also innobackup command.
See also MySQL Enterprise Backup.
A type of I/O operation that allows other processing to proceed before the I/O is completed. Also known as non-blocking I/O and abbreviated as AIO. InnoDB uses this type of I/O for certain operations that can run in parallel without affecting the reliability of the database, such as reading pages into the buffer pool that have not actually been requested, but might be needed soon.
Historically, InnoDB has used asynchronous I/O on Windows
systems only. Starting with the InnoDB Plugin 1.1, InnoDB uses
asynchronous I/O on Linux systems. This change introduces a
dependency on libaio. On other Unix-like
systems, InnoDB uses synchronous I/O only.
See also buffer pool.
See also non-blocking I/O.
A property of a table column (specified by the
AUTO_INCREMENT keyword) that automatically
adds an ascending sequence of values in the column. InnoDB
supports auto-increment only for primary
key columns.
An auto-increment column saves the developer work, removing the need to produce new unique values when inserting new rows. It provides useful information for the query optimizer, because the column is known to be not null and to contain only unique values. The values from such a column can be used as lookup keys in various contexts, and because they are auto-generated there is no reason to ever change them; for this reason, primary key columns are often specified as auto-incrementing.
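For example, a hypothetical table with an auto-increment primary key might be declared like this:

```sql
CREATE TABLE t (
  id BIGINT UNSIGNED NOT NULL AUTO_INCREMENT,
  payload VARCHAR(100),
  PRIMARY KEY (id)
) ENGINE=InnoDB;

-- id values are generated automatically for each inserted row.
INSERT INTO t (payload) VALUES ('a'), ('b');
```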
Auto-increment columns can be problematic with statement-based
replication, because replaying the statements on a slave might
not produce the same set of column values as on the master, due
to timing issues. When you have an auto-incrementing primary
key, you can use statement-based replication only with the
setting innodb_autoinc_lock_mode=1. If you
have innodb_autoinc_lock_mode=2, which allows
higher concurrency for insert operations, use
row-based replication rather
than statement-based
replication. The setting
innodb_autoinc_lock_mode=0 is the previous
(traditional) default setting and should not be used except for
compatibility purposes.
See also auto-increment locking.
See also innodb_autoinc_lock_mode.
See also primary key.
See also row-based replication.
See also statement-based replication.
The convenience of an
auto-increment primary key
involves some tradeoff with concurrency. In the simplest case,
if one transaction is inserting values into the table, any other
transactions must wait to do their own inserts into that table,
so that rows inserted by the first transaction receive
consecutive primary key values. InnoDB includes optimizations,
and the innodb_autoinc_lock_mode option, so
that you can choose how to trade off between predictable
sequences of auto-increment values and maximum
concurrency for insert
operations.
See also auto-increment.
See also concurrency.
See also innodb_autoinc_lock_mode.
A setting that causes a commit operation after each SQL statement. This mode is not usually recommended for working with InnoDB tables, where transactions typically span several statements. It is more appropriate for working with MyISAM tables where transactions are not applicable, or in specialized situations with InnoDB to minimize overhead from locking and generation of undo data.
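As a sketch, turning autocommit off for a session lets several statements accumulate into a single transaction:

```sql
SET autocommit = 0;          -- statements now join one open transaction
UPDATE t1 SET c1 = c1 + 1;
DELETE FROM t2 WHERE c2 = 5;
COMMIT;                      -- nothing is permanent until this point
```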
See also commit.
See also locking.
See also SQL.
See also transaction.
See also undo.
B
A tree data structure that is popular for use in database
indexes. The structure is kept sorted at all times, enabling
fast lookup for exact matches (equals operator) and ranges (for
example, greater than, less than, and BETWEEN
operators). This type of index is available for most storage
engines, such as InnoDB and MyISAM.
Because B-tree nodes can have many children, a B-tree is not the same as a binary tree, which is limited to 2 children per node.
Contrast with hash index, which is only available in the MEMORY storage engine. The MEMORY storage engine can also use B-tree indexes, and you should choose B-tree indexes for MEMORY tables if some queries use range operators.
See also hash index.
Identifiers within MySQL SQL statements must be quoted using the
backtick character (`) if they contain
special characters or reserved words. For example, to refer to a
table named FOO#BAR or a column named
SELECT, you would specify the identifiers as
`FOO#BAR` and `SELECT`.
Since the backticks provide an extra level of safety, they are
used extensively in program-generated SQL statements, where the
identifier names might not be known in advance.
Many other database systems use double quotation marks
(") around such special names. For
portability, you can enable ANSI_QUOTES mode
in MySQL and use double quotation marks instead of backticks to
qualify identifier names.
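A brief sketch of both quoting styles, using the identifiers from the example above:

```sql
-- Default mode: backticks quote special names and reserved words.
SELECT `SELECT` FROM `FOO#BAR`;

-- With ANSI_QUOTES enabled, double quotation marks work as well.
SET sql_mode = 'ANSI_QUOTES';
SELECT "SELECT" FROM "FOO#BAR";
```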
See also SQL.
The code name for an InnoDB file
format that supports compression for table data. This
file format was first introduced in the InnoDB Plugin. It
supports the dynamic and
compressed row formats. You can
select it through the innodb_file_format
option.
The default InnoDB file format switches to Barracuda in MySQL 5.5.5.
The MySQL Enterprise Backup product version 3.5 and above supports backing up tablespaces that use the Barracuda file format.
See also Antelope.
See also compact row format.
See also compressed row format.
See also dynamic row format.
See also file format.
See also ibdata file.
See also innodb_file_format.
See also row format.
An early stage in the life of a software product, when it is available only for evaluation, typically without a definite release number or a number less than 1. InnoDB does not use the beta designation, preferring an early adopter phase that can extend over several point releases, leading to a GA release.
See also early adopter.
See also GA.
A file containing a record of all statements that attempt to change table data. These statements can be replayed to bring slave servers up to date in a replication scenario, or to bring a database up to date after restoring table data from a backup. The binary logging feature can be turned on and off, although we recommend always enabling it if you use replication or perform backups.
You can examine the contents of the binary log, or replay those
statements during replication or recovery, by using the
mysqlbinlog command.
For the MySQL Enterprise Backup
product, the file name of the binary log and the current
position within the file are important details. To record this
information for the master server when taking a backup in a
replication context, you can specify the
--slave-info option.
Prior to MySQL 5.0, a similar capability was available, known as the update log. In MySQL 5.0 and higher, the binary log replaces the update log.
See also binlog.
An informal name for the binary log file. For example, you might see this abbreviation used in certain messages or forum discussions.
See also binary log.
A portion of a system that is constrained in size or capacity, that has the effect of limiting overall throughput. For example, a memory area might be smaller than necessary; access to a single required resource might prevent multiple CPU cores from running simultaneously; or waiting for disk I/O to complete might prevent the CPU from running at full capacity.
See also buffer pool.
See also concurrency.
A memory or disk area used for temporary storage. Data is buffered in memory so that it can be written to disk efficiently, with a few large I/O operations rather than many small ones. Data is buffered on disk for greater reliability, so that it can be recovered even when a crash or other failure occurs at the worst possible time. The main types of buffers used by InnoDB are the buffer pool, the doublewrite buffer, and the insert buffer.
See also buffer pool.
See also doublewrite buffer.
See also insert buffer.
The memory area that holds cached InnoDB data, read from both tables and indexes. For efficiency of high-volume read operations, the buffer pool is divided into pages that can potentially hold multiple rows. For efficiency of cache management, the buffer pool is implemented as a linked list of pages, so that data that is rarely used can be aged out of the cache, using a variation of the LRU algorithm. On systems with large memory, you can improve concurrency by dividing the buffer pool into multiple instances.
See also LRU.
See also page.
The built-in InnoDB storage engine within MySQL is the original form of distribution for the storage engine. Contrast with the InnoDB Plugin. Starting with MySQL 5.5, the InnoDB Plugin is merged back into the MySQL code base as the built-in InnoDB storage engine (known as InnoDB 1.1).
See also plugin.
The relationships and sequences of actions that form the basis of business software, used to run a commercial company. Sometimes these rules are dictated by law, other times by company policy. Careful planning ensures that the relationships encoded and enforced by the database, and the actions performed through application logic, accurately reflect the real policies of the company and can handle real-life situations.
For example, an employee leaving a company might trigger a sequence of actions from the human resources department. The human resources database might also need the flexibility to represent data about a person who has been hired, but not yet started work. Closing an account at an online service might result in data being removed from a database, or the data might be moved or flagged so that it could be recovered if the account is re-opened. A company might establish policies regarding salary maximums, minimums, and adjustments, in addition to basic sanity checks such as the salary not being a negative number. A retail database might not allow a purchase with the same serial number to be returned more than once, or might not allow credit card purchases above a certain value, while a database used to detect fraud might allow these kinds of things.
See also relational.
See also trigger.
C
The general term for any memory area that stores copies of data for frequent or high-speed retrieval. In InnoDB, the primary kind of cache structure is the buffer pool.
See also buffer.
See also buffer pool.
The number of different values in a table column. When queries refer to columns that have an associated index, the cardinality of each column influences which access method is most efficient. If the values in a column have a very uneven distribution, the cardinality might not be a good way to determine the best query plan. (http://bugs.mysql.com/bug.php?id=36513)
Cardinality can also apply to the number of distinct values present in multiple columns, as in a composite index.
See also composite index.
See also index.
See also random dive.
See also selectivity.
The general term for the features involving the insert buffer, consisting of insert buffering, delete buffering, and purge buffering. Index changes resulting from SQL statements, which could normally involve random I/O operations, are held back and performed periodically by a background thread. This sequence of operations can write the disk blocks for a series of index values more efficiently than if each value were written to disk immediately.
See also delete buffering.
See also insert buffer.
See also insert buffering.
See also purge buffering.
As changes are made to data pages that are cached in the buffer pool, those changes are written to the data files sometime later, a process known as flushing. The checkpoint is a record of the latest changes (represented by an LSN value) that have been successfully written to the data files.
See also buffer pool.
See also data files.
See also flush.
See also LSN.
In a foreign key relationship,
a child table is one whose rows refer (or point) to rows in
another table with an identical value for a specific column. The
corresponding row in the parent
table must exist before the row can be created in the
child table. The values in the child table can prevent delete or
update operations on the parent table, or can cause automatic
deletion or updates in the child table, based on the ON
CASCADE option used when creating the foreign key.
See also foreign key.
See also parent table.
A type of program that sends requests to a server, and
interprets or processes the results. The client software might
run only some of the time (such as a mail or chat program), and
might run interactively (such as the mysql
command processor).
See also mysql.
See also server.
The InnoDB term for a primary key index. InnoDB table storage is organized based on the values of the primary key columns, to speed up queries and sorts involving the primary key columns. For best performance, choose the primary key columns carefully based on the most performance-critical queries. Because modifying the columns of the clustered index is an expensive operation, choose primary key columns that are rarely or never updated.
In the Oracle Database product, this type of table is known as an index-organized table.
See also index.
See also primary key.
See also secondary index.
A backup taken while the database is shut down. For busy applications and web sites, this might not be practical, and you might prefer a warm backup or a hot backup.
See also hot backup.
See also warm backup.
An index on a single column.
See also composite index.
See also index.
When an index is created with a length specification, such as
CREATE INDEX idx ON t1 (c1(N)), only the
first N characters of the column value are stored in the index.
Keeping the index prefix small makes the index compact, and the
memory and disk I/O savings help performance. (Although making
the index prefix too small can hinder query optimization by
making rows with different values appear to the query optimizer
to be duplicates.)
For columns containing binary values or long text strings, where sorting is not a major consideration and storing the entire value in the index would waste space, the index automatically uses the first N (typically 768) characters of the value to do lookups and sorts.
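As a sketch, an index on only the first 10 characters of a hypothetical column:

```sql
-- Only the first 10 characters of c1 are stored in the index,
-- keeping it compact; rows whose values share a 10-character
-- prefix look identical to the optimizer.
CREATE INDEX idx_c1_prefix ON t1 (c1(10));
```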
See also index.
A SQL statement that ends a transaction, making permanent any changes made by the transaction. It is the opposite of rollback, which undoes any changes made in the transaction.
InnoDB uses an optimistic mechanism for commits, so that changes can be written to the data files before the commit actually occurs. This technique makes the commit itself faster, with the tradeoff that more work is required in case of a rollback.
By default, MySQL uses the autocommit setting, which automatically issues a commit following each SQL statement.
See also autocommit.
See also optimistic.
See also rollback.
See also SQL.
See also transaction.
The default InnoDB row format since MySQL 5.0.3. It has a more compact representation for nulls and variable-length fields than the prior default (redundant row format).
Because of the B-tree indexes that make row lookups so fast in InnoDB, there is little if any performance benefit to keeping all rows the same size.
See also redundant row format.
See also row format.
An index that includes multiple columns.
See also index.
See also index prefix.
A set of files produced by the MySQL
Enterprise Backup product, where each
tablespace is compressed. The
compressed files are renamed with a .ibz file
extension.
Applying compression right at the start of the backup process helps to avoid storage overhead during the compression process, and to avoid network overhead when transferring the backup files to another server. The process of applying the binary log takes longer, and requires uncompressing the backup files.
See also apply.
See also binary log.
See also compression.
See also hot backup.
See also MySQL Enterprise Backup.
See also tablespace.
A row format introduced in the InnoDB Plugin, available as part of the Barracuda file format. Large fields are stored away from the page that holds the rest of the row data, as in dynamic row format. Both index pages and the large fields are compressed, yielding memory and disk savings. Depending on the structure of the data, the decrease in memory and disk usage might or might not outweigh the performance overhead of uncompressing the data as it is used.
See also Barracuda.
See also dynamic row format.
See also row format.
A feature with wide-ranging benefits from using less disk space, performing less I/O, and using less cache memory. InnoDB products include more than one feature related to compression: table data can be kept in a compressed format during database operation; data can be compressed as part of a backup operation with the MySQL Enterprise Backup product.
When InnoDB table data is compressed, the compression applies to the table itself, any associated index data, and the pages loaded into the buffer pool. Compression does not apply to pages in the undo buffer.
The table compression feature requires using MySQL 5.5 or higher, or the InnoDB Plugin in MySQL 5.1 or earlier, and creating the table using the Barracuda file format and compressed row format, with the innodb_file_per_table setting turned on.
The compression feature of the MySQL
Enterprise Backup product makes a compressed copy of
each tablespace, changing the extension from
.ibd to .ibz. Compressing
the backup data allows you to keep more backups on hand, and
reduces the time to transfer backups to a different server. The
data is uncompressed during the restore operation. When a
compressed backup operation processes a table that is already
compressed, it skips the compression step for that table,
because compressing again would result in little or no space
savings.
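Putting the table-compression requirements above together, a hypothetical compressed table might be created like this (InnoDB Plugin / MySQL 5.5 syntax; the table definition is illustrative):

```sql
SET GLOBAL innodb_file_per_table = 1;
SET GLOBAL innodb_file_format = Barracuda;

CREATE TABLE compressed_t (
  id INT PRIMARY KEY,
  doc TEXT
) ENGINE=InnoDB ROW_FORMAT=COMPRESSED KEY_BLOCK_SIZE=8;
```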
See also Barracuda.
See also buffer pool.
See also compressed row format.
See also hot backup.
See also index.
See also plugin.
See also table.
See also undo buffer.
See composite index.
The ability of multiple operations (in database terminology, transactions) to run simultaneously, without interfering with each other. Concurrency is also involved with performance, because ideally the protection for multiple simultaneous transactions works with a minimum of performance overhead, using efficient mechanisms for locking.
See also ACID.
See also locking.
See also transaction.
The file that holds the option
values used by MySQL at startup. Traditionally, on Linux and
UNIX this file is named my.cnf, and on
Windows it is named my.ini. You can set a
number of options related to InnoDB under the
[mysqld] section of the file.
When you use the MySQL Enterprise Backup product, you typically use two configuration files: one that specifies where the data comes from and how it is structured (which could be the original configuration file for your real server), and a stripped-down one containing only a small set of options that specify where the backup data goes and how it is structured. The configuration files used with the MySQL Enterprise Backup product must contain certain options that are typically left out of regular configuration files, so you might need to add some options to your existing configuration file for use with MySQL Enterprise Backup.
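A minimal sketch of the [mysqld] section of a configuration file (the paths and sizes here are illustrative):

```ini
[mysqld]
datadir = /var/lib/mysql
innodb_buffer_pool_size = 1G
innodb_file_per_table = 1
```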
See also my.cnf.
See also option file.
A read operation that uses snapshot information to present query results based on a point in time, regardless of changes performed by other transactions running at the same time. If queried data has been changed by another transaction, the original data is reconstructed based on the contents of the undo log. This technique avoids some of the locking issues that can reduce concurrency by forcing transactions to wait for other transactions to finish.
With the repeatable read isolation level, the snapshot is based on the time when the first read operation is performed. With the read committed isolation level, the snapshot is reset to the time of each consistent read operation.
Consistent read is the default mode in which InnoDB processes
SELECT statements in
READ COMMITTED and
REPEATABLE READ isolation
levels. Because a consistent read does not set any locks on the
tables it accesses, other sessions are free to modify those
tables while a consistent read is being performed on the table.
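A sketch of repeatable read behavior, assuming a hypothetical table t that another session modifies concurrently:

```sql
SET SESSION TRANSACTION ISOLATION LEVEL REPEATABLE READ;
START TRANSACTION;
SELECT c1 FROM t;  -- the snapshot is established at this first read
-- ... another session updates a row in t and commits ...
SELECT c1 FROM t;  -- still returns the original snapshot data
COMMIT;            -- a new transaction would see the committed change
```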
See also ACID.
See also concurrency.
See also isolation level.
See also locking.
See also MVCC.
See also read committed.
See also read uncommitted.
See also repeatable read.
See also serializable read.
See also transaction.
See also undo log.
An automatic test that can block database changes to prevent data from becoming inconsistent. (In computer science terms, a kind of assertion related to an invariant condition.) Constraints are a crucial component of the ACID philosophy, to maintain data consistency. Constraints supported by MySQL include foreign key constraints and unique constraints.
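For example, a foreign key constraint and a unique constraint on hypothetical tables:

```sql
CREATE TABLE parent (id INT PRIMARY KEY) ENGINE=InnoDB;

CREATE TABLE child (
  id INT PRIMARY KEY,
  parent_id INT,
  email VARCHAR(100),
  UNIQUE (email),                               -- unique constraint
  FOREIGN KEY (parent_id) REFERENCES parent(id) -- foreign key constraint
) ENGINE=InnoDB;
```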
See also ACID.
See also foreign key.
See also relational.
See also unique constraint.
An index that includes all the columns retrieved by a query. Instead of using the index values as pointers to find the full table rows, the query returns values from the index structure, saving disk I/O. InnoDB can apply this optimization technique to more indexes than MyISAM can, because InnoDB secondary indexes also include the primary key columns. InnoDB cannot apply this technique for queries against tables modified by a transaction, until that transaction ends.
Any column index or composite index could act as a covering index, given the right query. Design your indexes and queries to take advantage of this optimization technique wherever possible.
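As a sketch, a composite index that covers a query on hypothetical columns:

```sql
CREATE INDEX idx_cover ON t1 (c1, c2);

-- Every selected column is present in the index, so InnoDB can
-- answer this query from the index alone, without reading rows.
SELECT c1, c2 FROM t1 WHERE c1 = 50;
```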
See also column index.
See also composite index.
See also index.
See also secondary index.
The cleanup activities that occur when InnoDB is started again after a crash. Changes that were committed before the crash, but not yet written into the tablespace files, are reconstructed from the doublewrite buffer. When the database is shut down normally, this type of activity is performed during shutdown by the purge operation.
During normal operation, committed data can be stored in the insert buffer for a period of time before being written to the tablespace files. There is always a tradeoff between keeping the tablespace files up-to-date, which introduces performance overhead during normal operation, and buffering the data, which can make shutdown and crash recovery take longer.
See also doublewrite buffer.
See also insert buffer.
See also purge.
An internal data structure that is used to represent the result
set of a query, or other operation that performs a search using
a SQL WHERE clause. It works like an iterator
in other high-level languages, producing each value from the
result set as requested.
Although usually SQL handles the processing of cursors for you, you might delve into the inner workings when dealing with performance-critical code.
See also query.
D
See DDL.
A set of tables, controlled by the InnoDB storage engine, that keeps track of InnoDB-related objects such as tables, indexes, and table columns. These tables are part of the InnoDB system tablespace.
Because the MySQL Enterprise Backup product always backs up the system tablespace, all backups include the contents of the data dictionary.
See also hot backup.
See also system tablespace.
The files that physically contain the InnoDB table and index data. There can be a one-to-many relationship between data files and tables, as in the case of the system tablespace, which can hold multiple InnoDB tables as well as the data dictionary. There can also be a one-to-one relationship between data files and tables, as when the file-per-table setting is enabled, causing each newly created table to be stored in a separate tablespace.
See also data dictionary.
See also file-per-table.
See also system tablespace.
See also tablespace.
See DML.
A database system or application that primarily runs large queries. The read-only or read-mostly data might be organized in denormalized form for query efficiency. Contrast with OLTP.
See also OLTP.
See also query.
An InnoDB database is largely defined by its data files.
For long-time MySQL users, a database is a familiar notion. Users coming from an Oracle background will find that the MySQL meaning of a database is closer to what Oracle calls a schema.
See also data files.
Data definition language, a set of
SQL statements for manipulating
the database itself rather than individual table rows. Includes
all forms of the CREATE,
ALTER, and DROP
statements. Also includes the TRUNCATE
statement, because it works differently than a DELETE
FROM table_name statement,
even though the ultimate effect is similar.
DDL statements automatically commit the current transaction; they cannot be rolled back.
See also commit.
See also rollback.
See also SQL.
See also transaction.
A situation where different transactions are unable to proceed, because each holds a lock that the other needs. Because both transactions are waiting for a resource to become available, neither will ever release the locks it holds.
A deadlock can occur when the transactions acquire locks on
multiple tables, but in the opposite order. A deadlock can also
occur when statements such as UPDATE or
SELECT ... FOR UPDATE lock ranges of index
records and gaps, with each
transaction acquiring some locks but not others due to a timing
issue.
To reduce the possibility of deadlocks, use transactions rather
than LOCK TABLE statements; keep transactions
that insert or update data small enough that they do not stay
open for long periods of time; when different transactions
update multiple tables or large ranges of rows, use the same
order of operations (such as SELECT ... FOR
UPDATE) in each transaction; create indexes on the
columns used in SELECT ... FOR UPDATE and
UPDATE ... WHERE statements. The possibility
of deadlocks is not affected by the
isolation level, because the
isolation level changes the behavior of read operations, while
deadlocks occur because of write operations.
If a deadlock does occur, InnoDB detects the condition and
rolls back one of the
transactions (the victim).
Thus, even if your application logic is perfectly correct, you
must still handle the case where a transaction must be retried.
To monitor how frequently deadlocks occur, use the command
SHOW ENGINE INNODB STATUS.
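As a sketch of the advice above, transactions that lock rows in the same table order avoid the classic deadlock pattern:

```sql
-- Every transaction that needs both tables locks t1 first, then t2,
-- so no two transactions can wait for each other's locks.
START TRANSACTION;
SELECT * FROM t1 WHERE id = 1 FOR UPDATE;
SELECT * FROM t2 WHERE id = 1 FOR UPDATE;
-- ... make changes ...
COMMIT;
```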
See also concurrency.
See also gap.
See also isolation level.
See also lock.
See also locking.
See also rollback.
See also transaction.
See also victim.
A mechanism that automatically detects when a deadlock occurs, and automatically rolls back one of the transactions involved (the victim).
See also deadlock.
See also rollback.
See also transaction.
See also victim.
When InnoDB processes a DELETE statement, the
rows are immediately marked for deletion and no longer are
returned by queries. The storage is reclaimed sometime later,
during the periodic garbage collection known as the
purge operation, performed by a
separate thread.
See also purge.
The technique of storing index changes due to
DELETE operations in the
insert buffer rather than
writing them immediately, so that the physical writes can be
performed to minimize random I/O. (Because delete operations are
a two-step process, this operation buffers the write that
normally marks an index record for deletion.) It is one of the
types of change buffering; the
others are insert buffering and
purge buffering.
See also change buffering.
See also insert buffer.
See also insert buffering.
See also purge buffering.
See also index.
A page in the InnoDB buffer pool that has been updated in memory, where the changes are not yet written (flushed) to the data files.
See also buffer pool.
See also flush.
See also page.
See also write combining.
An operation that retrieves unreliable data, data that was updated by another transaction but not yet committed. It is only possible with the isolation level known as read uncommitted.
This kind of operation does not adhere to the ACID principle of database design. It is considered very risky, because the data could be rolled back, or updated further before being committed; then, the transaction doing the dirty read would be using data that was never confirmed as accurate.
Its polar opposite is consistent read, where InnoDB goes to great lengths to ensure that a transaction does not read information updated by another transaction, even if the other transaction commits in the meantime.
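As a rough illustration of the difference, the two read behaviors can be modeled with a committed store and a set of pending, uncommitted changes; all structures here are invented for the sketch.

```python
# Toy model: committed rows vs. pending (uncommitted) changes made by
# another transaction.
committed = {"row1": 100}
pending = {"row1": 150}   # updated by another transaction, not yet committed

def read_uncommitted(key):
    # Dirty read: sees pending changes that might later be rolled back.
    return pending.get(key, committed.get(key))

def consistent_read(key):
    # Consistent read: sees only committed data.
    return committed.get(key)

dirty = read_uncommitted("row1")       # observes the unconfirmed value
consistent = consistent_read("row1")   # observes the committed value
```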
See also ACID.
See also commit.
See also consistent read.
See also isolation level.
See also read committed.
See also read uncommitted.
See also rollback.
A kind of database that primarily organizes data on disk storage (hard drives or equivalent). Data is brought back and forth between disk and memory to be operated upon. It is the opposite of an in-memory database. InnoDB is disk-based.
See also in-memory database.
Data manipulation language, a set of
SQL statements for performing
insert, update, and delete operations. The
SELECT statement is sometimes considered as a
DML statement, because the SELECT ... FOR
UPDATE form is subject to the same considerations for
locking as
INSERT, UPDATE, and
DELETE.
DML statements operate in the context of a transaction, so their effects can be committed or rolled back as a single unit.
See also commit.
See also locking.
See also rollback.
See also SQL.
See also transaction.
InnoDB uses a novel file flush technique called doublewrite. Before writing pages to a data file, InnoDB first writes them to a contiguous area called the doublewrite buffer. Only after the write and the flush to the doublewrite buffer have completed, does InnoDB write the pages to their proper positions in the data file. If the operating system crashes in the middle of a page write, InnoDB can later find a good copy of the page from the doublewrite buffer during crash recovery.
Although data is always written twice, the doublewrite buffer
does not require twice as much I/O overhead or twice as many I/O
operations. Data is written to the buffer itself as a large
sequential chunk, with a single fsync call to
the operating system.
The doublewrite buffer can be turned off by specifying the
option innodb_doublewrite=0.
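The recovery idea can be sketched as follows. This toy model (invented names, CRC-32 standing in for InnoDB's page checksum) writes each page to a scratch area before its final position, so a torn final write can be repaired during crash recovery.

```python
# Toy sketch of the doublewrite idea: write the page to a scratch area
# first, then to its final position; recover a torn page from the copy.
import zlib

def page_with_checksum(data: bytes) -> bytes:
    return data + zlib.crc32(data).to_bytes(4, "big")

def is_intact(page: bytes) -> bool:
    data, stored = page[:-4], page[-4:]
    return zlib.crc32(data).to_bytes(4, "big") == stored

doublewrite = {}
datafile = {}

def write_page(page_no, data, crash_mid_write=False):
    page = page_with_checksum(data)
    doublewrite[page_no] = page     # step 1: sequential write + fsync
    if crash_mid_write:
        datafile[page_no] = page[:4]  # only part of the page reached disk
        return
    datafile[page_no] = page        # step 2: write to final position

def crash_recover(page_no):
    if not is_intact(datafile[page_no]):
        datafile[page_no] = doublewrite[page_no]  # good copy survives

write_page(7, b"hello page", crash_mid_write=True)
crash_recover(7)
```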
See also crash recovery.
See also purge.
A row format introduced in the InnoDB Plugin, available as part
of the Barracuda file format.
Because TEXT and BLOB
fields are stored outside of the rest of the page that holds the
row data, this row format is very efficient for rows that include
large objects, resulting in fewer I/O operations when the large
fields do not need to be accessed.
See also Barracuda.
See also row format.
E
A stage similar to beta, when a software product is typically evaluated for performance, functionality, and compatibility in a non-mission-critical setting. InnoDB uses the early adopter designation rather than beta, through a succession of point releases leading up to a GA release.
See also beta.
See also GA.
A product that allows the capabilities of the InnoDB storage engine to be embedded within an application program, the same as any other library, without the need for a full MySQL installation. Instead of using SQL, the application manipulates data using C API calls. It is intended for applications that require maximal performance and minimal configuration and setup, such as in the embedded systems market.
See also cursor.
The process of removing an item from a cache or other temporary storage area. Often, but not always, uses the LRU algorithm to determine which item to remove.
See also LRU.
A kind of lock that prevents any other transaction from locking the same row. Depending on the transaction isolation level, this kind of lock might block other transactions from writing to the same row, or might also block other transactions from reading the same row. The default InnoDB isolation level, REPEATABLE READ, enables higher concurrency by allowing transactions to read rows that have exclusive locks, a technique known as consistent read.
See also concurrency.
See also consistent read.
See also isolation level.
See also lock.
See also repeatable read.
See also shared lock.
See also transaction.
F
A capability, first introduced in the InnoDB Plugin, that speeds up creation of secondary indexes by avoiding the need to completely rewrite the associated table. The speedup applies to dropping secondary indexes also.
Because index maintenance can add performance overhead to many
data transfer operations, consider doing operations such as
ALTER TABLE ... ENGINE=INNODB or
INSERT INTO ... SELECT * FROM ... without any
secondary indexes in place, and creating the indexes afterward.
Even if you do not use the InnoDB Plugin as your primary storage engine, you can take advantage of this capability by enabling the Plugin temporarily, just to create or drop indexes, and then switch back to the built-in InnoDB storage engine for normal use.
See also index.
See also secondary index.
A shutdown procedure that is required before installation of the InnoDB Plugin. From the MySQL command line, issue the following command before performing the shutdown:
SET GLOBAL innodb_fast_shutdown=0;
To make this type of shutdown the default, specify by the
configuration parameter
innodb_fast_shutdown=0.
See also shutdown.
See also slow shutdown.
The format used by InnoDB for its data files named
ibdata1, ibdata2, and so
on. Each file format supports one or more row formats.
See also Antelope.
See also Barracuda.
See also ibdata file.
See also row format.
A general name for the setting controlled by the
innodb_file_per_table option. For each table
created while this setting is in effect, the data is stored in a
separate file rather than in the system tablespace. When table
data is stored in a separate file, you have more flexibility to
choose non-default file formats
and row formats, which are
required for features such as data compression. The
TRUNCATE TABLE operation is also much faster,
and the reclaimed space can be used by the operating system
rather than remaining reserved for InnoDB.
This mode is the default setting in MySQL 5.5.5 and higher.
The MySQL Enterprise Backup product is more flexible for tables that are in their own files. For example, tables can be excluded from a backup, but only if they are in separate files. Thus, this setting is suitable for tables that are backed up less frequently or on a different schedule.
See also file format.
See also ibdata file.
See also innodb_file_per_table.
See also row format.
See also system tablespace.
This row format is used by the MyISAM storage engine, not by
InnoDB. If you create an InnoDB table with the option
row_format=fixed, InnoDB translates this
option to use the compact row
format instead, although the fixed
value might still show up in output such as SHOW TABLE
STATUS reports.
See also compact row format.
See also row format.
To write changes to the database files that had been buffered in a memory area or a temporary disk storage area. The InnoDB storage structures that are periodically flushed include the redo log, the undo log, and the buffer pool.
Flushing can happen because a memory area becomes full and the system needs to free some space, because a commit operation means the changes from a transaction can be finalized, or because a shutdown operation means that all outstanding work should be finalized.
See also buffer pool.
See also commit.
See also redo log.
See also undo log.
An internal InnoDB data structure that tracks pages in the buffer pool that have been changed and need to be written back out to disk. This data structure is updated frequently by InnoDB's internal mini-transactions, and so is protected by its own mutex to allow concurrent access to the buffer pool.
See also buffer pool.
See also mini-transaction.
See also mutex.
See also page.
A type of pointer relationship, between rows in separate InnoDB tables. The foreign key relationship is defined on one column in both the parent table and the child table.
In addition to enabling fast lookup of related information, foreign keys help to enforce referential integrity, by preventing any of these pointers from becoming invalid as data is inserted, updated, and deleted. This enforcement mechanism is a type of constraint. A row that points to another table cannot be inserted if the associated foreign key value does not exist in the other table. If a row is deleted or its foreign key value changed, and rows in another table point to that foreign key value, the foreign key can be set up to prevent the deletion, cause the corresponding column values in the other table to become null, or automatically delete the corresponding rows in the other table.
One of the stages in designing a normalized database is to identify data that is duplicated, separate that data into a new table, and set up a foreign key relationship so that the multiple tables can be queried like a single table, using a join operation.
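The enforcement choices described above (prevent the delete, set the child pointers to null, or cascade the delete) can be modeled in a few lines; the function and row layout here are invented for illustration.

```python
# Toy model of foreign key enforcement on delete. RESTRICT refuses the
# delete, SET NULL clears child pointers, CASCADE removes child rows too.
def delete_parent(parents, children, parent_id, action):
    referencing = [c for c in children if c["parent_id"] == parent_id]
    if action == "RESTRICT" and referencing:
        raise ValueError("cannot delete: child rows reference this parent")
    if action == "SET NULL":
        for c in referencing:
            c["parent_id"] = None
    elif action == "CASCADE":
        children[:] = [c for c in children if c["parent_id"] != parent_id]
    parents.discard(parent_id)

parents = {1, 2}
children = [{"id": 10, "parent_id": 1}, {"id": 11, "parent_id": 2}]
delete_parent(parents, children, 1, "CASCADE")   # child row 10 goes too
```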
See also child table.
See also foreign key constraint.
See also join.
See also normalized.
See also NULL.
See also parent table.
See also relational.
The type of constraint that
maintains database consistency through a
foreign key relationship.
Unlike other kinds of constraints that prevent data from being
inserted or updated, foreign key constraints can also cause data
in child rows to be deleted,
changed to different values, or set to
null, based on the ON UPDATE and
ON DELETE clauses specified when creating the foreign
key.
See also child table.
See also constraint.
See also foreign key.
See also NULL.
The file containing the metadata, such as the table definition, of a MySQL table.
For backups, you must always keep the full set of
.FRM files along with the backup data, to be
able to restore tables that are altered or dropped after the
backup. Files with this extension are always included in backups
produced by the innobackup
command of the MySQL Enterprise
Backup product. If you use the
ibbackup command instead, you
must copy the .FRM files yourself.
Although each InnoDB table has a .FRM file,
InnoDB maintains its own table metadata in the system
tablespace; the .FRM files are not needed for
InnoDB to operate on InnoDB tables.
These files are backed up by the MySQL
Enterprise Backup product. These files must not be
modified by an ALTER TABLE operation while
the backup is taking place, which is why backups that include
non-InnoDB tables perform a FLUSH TABLES WITH READ
LOCK operation to freeze such activity while backing
up the .FRM files. Restoring a backup can
result in .FRM files being created, changed,
or removed to match the state of the database at the time of the
backup.
See also ibbackup command.
See also innobackup command.
See also MySQL Enterprise Backup.
An operation that requires reading the entire contents of a table, rather than just selected portions using an index. Typically performed either with small lookup tables, or in data warehousing situations with large tables where all available data is aggregated and analyzed. How frequently these operations occur, and the sizes of the tables relative to available memory, have implications for the algorithms used in query optimization and managing the buffer pool.
The purpose of indexes is to allow lookups for specific values or ranges of values within a large table, thus avoiding full table scans when practical.
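The contrast can be sketched with an in-memory table: a full scan touches every row, while a lookup through a sorted index needs only O(log n) probes. Names and layout here are invented for the sketch.

```python
# A full scan reads every row; an index lookup uses binary search over a
# sorted structure.
import bisect

rows = [(i, "name%d" % i) for i in range(1000)]   # (id, value), sorted by id
index = [r[0] for r in rows]                      # sorted list of ids

def full_scan(target):
    for row in rows:                              # touches the entire table
        if row[0] == target:
            return row
    return None

def index_lookup(target):
    pos = bisect.bisect_left(index, target)       # O(log n) probes
    if pos < len(index) and index[pos] == target:
        return rows[pos]
    return None

found_by_scan = full_scan(637)
found_by_index = index_lookup(637)
```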
See also buffer pool.
See also index.
See also LRU.
G
"Generally available", the stage when a software product leaves beta and is available for sale, official support, and production use.
See also beta.
See also early adopter.
A place in an InnoDB index data
structure where new values could be inserted. When you lock a
set of rows with a statement such as SELECT ... FOR
UPDATE, InnoDB can create locks that apply to the gaps
as well as the actual values in the index. For example, if you
select all values greater than 10 for update, a gap lock
prevents another transaction from inserting a new value that is
greater than 10.
Gap locks are part of the tradeoff between performance and concurrency, and are used in some transaction isolation levels and not others.
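A toy model of the example above, where a gap lock covers the open range above 10; the lock representation is invented for illustration.

```python
# Toy gap lock on the open range (10, infinity), as held by a transaction
# that selected all values greater than 10 FOR UPDATE.
import math

gap_locks = [(10, math.inf)]

def try_insert(value):
    for low, high in gap_locks:
        if low < value < high:
            return "blocked"    # a real server would make the insert wait
    return "inserted"

blocked = try_insert(42)    # falls inside the locked gap
allowed = try_insert(5)     # below the gap, unaffected
```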
See also concurrency.
See also index.
See also infimum record.
See also isolation level.
See also supremum record.
An InnoDB optimization that performs some
low-level I/O operations (log
write) once for a set of
commit operations, rather than
flushing and syncing separately for each commit.
When the binlog is enabled, you typically also set the
configuration option sync_binlog=0, because
group commit for the binary log is only supported if it is set
to 0.
See also commit.
See also log write.
See also plugin.
See also XA.
H
A type of index intended for
queries that use equality operators rather than range operators
such as greater-than or BETWEEN. It is
available in only the MEMORY storage engine. Although hash
indexes are the default for MEMORY tables for historic reasons,
for general-purpose queries, B-tree indexes are often a better
choice.
The InnoDB storage engine includes a variant of this index type, the adaptive hash index, that is constructed automatically if needed based on runtime conditions.
See also adaptive hash index.
See also B-tree.
See also index.
A periodic message that is sent to indicate that a system is functioning properly. In a replication context, if the master stops sending such messages, one of the slaves can take its place. Similar techniques can be used between the servers in a cluster environment, to confirm that all of them are operating properly.
See also master database.
See also replication.
See also slave database.
A condition where a row, table, or internal data structure is accessed so frequently, in a context that requires some form of locking or mutual exclusion, that it results in a performance or scalability issue.
Although “hot” typically indicates an undesirable condition, a hot backup is the preferred kind of backup.
See also hot backup.
A backup taken while the database is running and applications are reading and writing to it. The backup involves more than simply copying data files: it must include any data that was inserted or updated while the backup was in process; it must exclude any data that was deleted while the backup was in process; and it must ignore any changes that were not committed.
The Oracle product that performs hot backups, of InnoDB tables especially but also tables from MyISAM and other storage engines, is known as MySQL Enterprise Backup.
The hot backup process consists of two stages. The initial copying of the data files produces a raw backup. The apply step incorporates any changes to the database that happened while the backup was running. Applying the changes produces a prepared backup; these files are ready to be restored whenever necessary.
See also apply.
See also MySQL Enterprise Backup.
See also prepared backup.
See also raw backup.
I
The set of files managed by InnoDB within a MySQL database: the system tablespace, any file-per-table tablespaces, and the (typically 2) log files. Used sometimes in detailed discussions of InnoDB file structures and formats, to avoid ambiguity between the meanings of database across different DBMS products, and to distinguish these files from the non-InnoDB files that may be part of a MySQL database.
See also database.
See also schema.
A set of files, typically named ib_logfile0
and ib_logfile1, that form the
redo log. These files record
changes made to data in InnoDB tables. On startup following a
crash, these changes are replayed automatically to correct data
written by incomplete transactions.
This data cannot be used for manual recovery; for that type of operation, use the binary log.
See also binary log.
See also redo log.
The fundamental command-line tool of the MySQL Enterprise Backup product. It performs a hot backup operation for InnoDB tables. You use this command directly if all your data is in InnoDB tables, if all the important data that you need to back up is in InnoDB tables, or if you are running on the Windows platform. If you also need to back up tables from MyISAM or other storage engines, you use the innobackup command instead (available on UNIX and Linux systems only).
See also hot backup.
See also innobackup command.
See also MySQL Enterprise Backup.
A supplemental backup file created by the
MySQL Enterprise Backup product
during a hot backup operation.
It contains information about any data changes that occurred
while the backup was running. The initial backup files,
including ibbackup_logfile, are known as a
raw backup, because the changes
that occurred during the backup operation are not yet
incorporated. After you perform the
apply step to the raw backup
files, the resulting files do include those final data changes,
and are known as a prepared
backup. At this stage, the
ibbackup_logfile file is no longer necessary.
See also apply.
See also hot backup.
See also MySQL Enterprise Backup.
See also prepared backup.
See also raw backup.
Each InnoDB tablespace created
using the file-per-table
setting has a filename with a .ibd extension.
This extension does not apply to the
system tablespace, which is
made up of files named ibdata1,
ibdata2, and so on.
See also file-per-table.
See also ibz file.
See also system tablespace.
See also tablespace.
A set of files with names such as ibdata1,
ibdata2, and so on, that make up the InnoDB
system tablespace. These files
contain metadata about InnoDB tables, and can contain some or
all of the table data also (depending on whether the
file-per-table option is in effect when each table is created).
See also Antelope.
See also Barracuda.
See also file format.
See also file-per-table.
See also system tablespace.
When the MySQL Enterprise
Backup product performs a
compressed backup, it
transforms each tablespace file
that is created using the
file-per-table setting from a
.ibd extension to a .ibz
extension.
The compression applied during backup is distinct from the compressed row format that keeps table data compressed during normal operation. A compressed backup operation skips the compression step for a tablespace that is already in compressed row format, as compressing a second time would slow down the backup but produce little or no space savings.
See also compressed backup.
See also compressed row format.
See also file-per-table.
See also ibd file.
See also MySQL Enterprise Backup.
See also tablespace.
See also locking.
A type of database system that maintains data in memory, to avoid overhead due to disk I/O and translation between disk blocks and memory areas. Some in-memory databases sacrifice durability (the "D" in the ACID design philosophy) and are vulnerable to hardware, power, and other types of failures, making them more suitable for read-only operations. Other in-memory databases do use durability mechanisms such as logging changes to disk or using non-volatile memory.
See also ACID.
See also disk-based.
A type of hot backup, performed by the MySQL Enterprise Backup product, that only saves data changed since some point in time. Having a full backup and a succession of incremental backups lets you reconstruct backup data over a long period, without the storage overhead of keeping several full backups on hand. You can restore the full backup and then apply each of the incremental backups in succession, or you can keep the full backup up-to-date by applying each incremental backup to it, then perform a single restore operation.
The granularity of changed data is at the page level. A page might actually cover more than one row. Each changed page is included in the backup.
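The page-level granularity can be sketched by tagging each page with the LSN (log sequence number) of its last change and copying only pages changed since the previous backup's LSN; this is a simplified model, not the MySQL Enterprise Backup file format.

```python
# Toy model of page-level incremental backup: copy only pages whose last
# change is newer than the LSN recorded by the previous backup.
pages = {
    1: {"lsn": 100, "data": "a"},
    2: {"lsn": 250, "data": "b"},
    3: {"lsn": 400, "data": "c"},
}

def incremental_backup(pages, since_lsn):
    return {no: p["data"] for no, p in pages.items() if p["lsn"] > since_lsn}

backup = incremental_backup(pages, since_lsn=200)   # only pages 2 and 3
```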
See also hot backup.
See also MySQL Enterprise Backup.
See also page.
A data structure that provides a fast lookup capability for rows of a table, typically by forming a tree structure representing all the values of a particular column or set of columns.
See also B-tree.
See also clustered index.
See also column index.
See also composite index.
See also covering index.
See also partial index.
See also primary key.
See also secondary index.
In an index that applies to multiple columns (known as a composite index), the initial or leading columns of the index. A query that references the first 1, 2, 3, and so on columns of a composite index can use the index, even if the query does not reference all the columns in the index.
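The leftmost-prefix rule can be illustrated with a sorted list of tuples standing in for a composite index on (last_name, first_name); the column names are invented for the sketch.

```python
# A sorted list of (last_name, first_name) tuples stands in for a composite
# index. Queries on the leading column can use the sort order; queries on
# first_name alone cannot.
import bisect

index = sorted([("smith", "alice"), ("smith", "bob"), ("young", "carol")])

def rows_with_prefix(prefix):
    """Index range scan: all entries whose leading columns equal prefix."""
    lo = bisect.bisect_left(index, prefix)
    hi = bisect.bisect_right(index, prefix + (chr(0x10FFFF),))
    return index[lo:hi]

smiths = rows_with_prefix(("smith",))            # usable index access
bobs = [r for r in index if r[1] == "bob"]       # no prefix: must scan
```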
See also composite index.
See also index.
A pseudo-record in an index,
representing the gap below the
smallest value in that index. If a transaction has a statement
such as SELECT ... FOR UPDATE ... WHERE col <
10;, and the smallest value in the column is 5, it is
a lock on the infimum record that prevents other transactions
from inserting even smaller values such as 0, -10, and so on.
See also gap.
See also pseudo-record.
See also supremum record.
The actual name of the schema
that contains the data
dictionary. To examine information (metadata) about
the database, you can query tables such as
INFORMATION_SCHEMA.TABLES and
INFORMATION_SCHEMA.COLUMNS.
The InnoDB Plugin introduced
some tables into the information schema that are specific to
InnoDB, such as INNODB_LOCKS and
INNODB_TRX. You use these tables not to see
how the database is structured, but to get real-time information
about the workings of the InnoDB storage engine that can help
with performance monitoring, tuning, and troubleshooting. In
particular, these tables provide information about the InnoDB
features related to
compression, and transactions
and their associated locks.
See also compression.
See also data dictionary.
See also locking.
See also schema.
See also transaction.
A command-line tool of the MySQL
Enterprise Backup product. It performs a
hot backup operation for tables
from InnoDB, MyISAM, and other storage engines. It calls the
ibbackup command to back up the
InnoDB tables; if you only need to back up InnoDB tables, it can
be more efficient to call ibbackup directly.
This command is not available on the Windows platform.
See also hot backup.
See also ibbackup command.
See also MySQL Enterprise Backup.
A storage engine for MySQL that combines high performance with transactional capability (that is, reliability, robustness, and concurrent access). Tables created under the InnoDB storage engine are ideally suited for hot backups.
See also ACID.
See also hot backup.
See also storage engine.
See also transaction.
A licensed backup product, superseded in MySQL 5.1 and above by MySQL Enterprise Backup.
See also MySQL Enterprise Backup.
The option that controls the algorithm used for
auto-increment locking. When
you have an auto-incrementing primary key, you can use
statement-based replication only with the setting
innodb_autoinc_lock_mode=1. This setting is
known as consecutive lock mode,
because multi-row inserts within a transaction receive
consecutive auto-increment values. If you have
innodb_autoinc_lock_mode=2, which allows
higher concurrency for insert operations, use row-based
replication rather than statement-based replication. This
setting is known as interleaved
lock mode, because multiple multi-row insert statements running
at the same time can receive autoincrement values that are
interleaved. The setting
innodb_autoinc_lock_mode=0 is the previous
(traditional) default setting and should not be used except for
compatibility purposes.
See also auto-increment locking.
A setting that determines the file format for all tablespaces created after you specify a value for this option. To create tablespaces other than the system tablespace, you must use the file-per-table option also.
See also Antelope.
See also Barracuda.
See also file format.
See also file-per-table.
See also innodb_file_per_table.
See also system tablespace.
See also tablespace.
The option that allows you to use the file-per-table setting, which stores newly created tables in their own data files, outside the system tablespace. This option is needed to take full advantage of many other features, such as table compression in the InnoDB Plugin, or backups of named tables in MySQL Enterprise Backup.
This option was once static, but can now be set using the
SET GLOBAL command.
See also data files.
See also file-per-table.
See also system tablespace.
A setting that provides an alternative to automatic deadlock detection. Rolls back any transaction that waits more than a specified time to acquire a lock. Especially useful if deadlocks are caused by updates to multiple tables controlled by different storage engines; such deadlocks are not detected automatically.
See also deadlock.
See also deadlock detection.
See also victim.
See also locking.
The option that controls whether InnoDB operates in strict mode, where conditions that are normally treated as warnings cause errors instead (and the underlying statements fail).
This mode is the default setting in MySQL 5.5.5 and higher.
See also strict mode.
See also locking.
One of the primary DML operations in SQL. The performance of inserts is a key factor in data warehouse systems that load millions of rows into tables, and OLTP systems where many concurrent connections might insert rows into the same table, in arbitrary order. If insert performance is important to you, you should learn about InnoDB features such as the insert buffer used in change buffering, and auto-increment columns.
See also auto-increment.
See also change buffering.
See also DML.
See also insert buffer.
See also SQL.
A special index data structure that records changes to
pages in secondary indexes.
These changes could result from SQL INSERT,
UPDATE, or DELETE
statements. The set of features involving the insert buffer is
known collectively as change
buffering, consisting of insert
buffering, delete
buffering, and purge
buffering.
Changes are only recorded in the insert buffer when the relevant page from the secondary index is not in the buffer pool. When the relevant index page is brought into the buffer pool while associated changes are still in the insert buffer, the changes for that page are applied in the buffer pool using the data from the insert buffer. Periodically, the purge operation that runs during times when the system is mostly idle, or during a slow shutdown, writes the new index pages to disk. The purge operation can write the disk blocks for a series of index values more efficiently than if each value were written to disk immediately.
Physically, the insert buffer is part of the system tablespace, so that the index changes remain buffered across database restarts. The changes are only applied when the pages are brought into the buffer pool due to some other read operation.
To see information about the current data in the insert buffer,
issue the SHOW ENGINE INNODB STATUS command.
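The buffering-and-merge behavior can be sketched with a toy model; the structures here are invented for illustration, not InnoDB's on-disk format.

```python
# Toy model of change buffering: index changes for pages not resident in
# the buffer pool are queued, then merged when the page is finally read in.
buffer_pool = {}        # page_no -> list of index entries in memory
insert_buffer = {}      # page_no -> pending buffered changes
disk = {5: ["k1", "k3"]}   # secondary index pages on disk

def apply_change(page_no, entry):
    if page_no in buffer_pool:
        buffer_pool[page_no].append(entry)   # page resident: apply now
    else:
        insert_buffer.setdefault(page_no, []).append(entry)  # else buffer

def read_page(page_no):
    page = list(disk[page_no])
    for entry in insert_buffer.pop(page_no, []):   # merge buffered changes
        page.append(entry)
    buffer_pool[page_no] = page
    return page

apply_change(5, "k2")   # page 5 not in the buffer pool: change is buffered
page = read_page(5)     # the read triggers the merge
```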
See also buffer pool.
See also change buffering.
See also delete buffering.
See also insert buffering.
See also page.
See also purge.
See also purge buffering.
See also system tablespace.
The technique of storing secondary index changes due to
INSERT operations in the
insert buffer rather than
writing them immediately, so that the physical writes can be
performed to minimize random I/O. It is one of the types of
change buffering; the others
are delete buffering and
purge buffering.
Insert buffering is not used if the secondary index is unique, because the uniqueness of new values cannot be verified before the new entries are written out. Other kinds of change buffering do work for unique indexes.
See also change buffering.
See also delete buffering.
See also insert buffer.
See also purge buffering.
See also gap.
See also intention lock.
See also lock.
See also locking.
See intention lock.
A kind of lock that applies to the table level, used to indicate what kind of lock the transaction intends to acquire on rows in the table. Different transactions can acquire different kinds of intention locks on the same table, but the first transaction to acquire an intention exclusive (IX) lock on a table prevents other transactions from acquiring any S or X locks on the table. Conversely, the first transaction to acquire an intention shared (IS) lock on a table prevents other transactions from acquiring any X locks on the table. The two-phase process allows the lock requests to be resolved in order, without blocking locks and corresponding operations that are compatible.
See also insert-intention gap lock.
See also lock.
See also lock mode.
See also locking.
See intention lock.
One of the foundations of database processing. Isolation is the I in the acronym ACID; the isolation level is the setting that fine-tunes the balance between performance and reliability, consistency, and reproducibility of results when multiple transactions are making changes and performing queries at the same time.
From highest amount of consistency and protection to the least, the isolation levels supported by InnoDB are: serializable read, repeatable read, read committed, and read uncommitted.
With the built-in InnoDB storage engine and the InnoDB Plugin, many users can keep the default isolation level (repeatable read) for all operations. Expert users might choose the read committed level as they push the boundaries of scalability with OLTP processing, or during data warehousing operations where minor inconsistencies do not affect the aggregate results of large amounts of data. The levels on the edges (serializable read and read uncommitted) change the processing behavior to such an extent that they are rarely used.
See also ACID.
See also consistent read.
See also read uncommitted.
See also repeatable read.
See also serializable read.
See also transaction.
J
A query that retrieves data from more than one table, by referencing columns in the tables that hold identical values. These columns are frequently part of a foreign key relationship.
See also foreign key.
See also query.
K
An option to specify the size of data pages within a table that uses compressed row format. The default is 8 kilobytes. Lower values risk hitting internal limits that depend on the combination of row size and compression percentage.
See also compressed row format.
L
A lightweight structure used by InnoDB to implement a lock for its own internal memory structures, typically held for a brief time measured in milliseconds or microseconds. A general term that includes both mutexes (for exclusive access) and rw-locks (for shared access). Certain latches are the focus of performance tuning within the InnoDB storage engine, such as the data dictionary mutex. Statistics about latch use and contention are available through the Performance Schema interface.
See also data dictionary.
See also lock.
See also locking.
See also mutex.
See also Performance Schema.
See also rw-lock.
The InnoDB buffer pool is represented as a list of memory pages. The list is reordered as new pages are accessed and enter the buffer pool, as pages within the buffer pool are accessed again and are considered newer, and as pages that are not accessed for a long time are replaced in the buffer pool.
See also buffer pool.
See also eviction.
See also LRU.
See also sublist.
The high-level notion of an object that controls access to a resource, such as a table, row, or internal data structure, as part of a locking strategy. For intensive performance tuning, you might delve into the actual structures that implement locks, such as mutexes and latches.
See also latch.
See also lock mode.
See also locking.
See also mutex.
See also Pthreads.
An operation used in some database systems that converts many row locks into a single table lock, saving memory space but reducing concurrent access to the table. InnoDB uses a space-efficient representation for row locks, so that lock escalation is not needed.
See also locking.
See also row lock.
See also table lock.
A shared (S) lock allows a transaction to read a row. Multiple transactions can acquire an S lock on that same row at the same time.
An exclusive (X) lock allows a transaction to update or delete a row. No other transaction can acquire any kind of lock on that same row at the same time.
Intention locks apply to the table level, and are used to indicate what kind of lock the transaction intends to acquire on rows in the table. Different transactions can acquire different kinds of intention locks on the same table, but the first transaction to acquire an intention exclusive (IX) lock on a table prevents other transactions from acquiring any S or X locks on the table. Conversely, the first transaction to acquire an intention shared (IS) lock on a table prevents other transactions from acquiring any X locks on the table. The two-phase process allows the lock requests to be resolved in order, without blocking locks and corresponding operations that are compatible.
See also intention lock.
See also lock.
See also locking.
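The two row lock modes can also be requested explicitly within a transaction; for example (table name illustrative):

```sql
-- Acquire shared (S) locks: other transactions can still read
-- these rows, but cannot modify them until this transaction ends.
SELECT * FROM accounts WHERE id = 1 LOCK IN SHARE MODE;

-- Acquire exclusive (X) locks: other transactions can neither
-- lock nor modify these rows until this transaction ends.
SELECT * FROM accounts WHERE id = 1 FOR UPDATE;
```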
See also deadlock.
See also lock.
See also locking.
The system of protecting a transaction from seeing or changing data that is being queried or changed by other transactions. The locking strategy must balance reliability and consistency of database operations (the principles of the ACID philosophy) against the performance needed for good concurrency. Fine-tuning the locking strategy often involves choosing an isolation level and ensuring all your database operations are safe and reliable for that isolation level.
See also ACID.
See also concurrency.
See also isolation level.
See also latch.
See also lock.
See also mutex.
See also transaction.
See also group commit.
A backup that reproduces table
structure and data, without copying the actual data files. For
example, the mysqldump command produces a
logical backup, because its output contains statements such as
CREATE TABLE and INSERT
that can re-create the data. Contrast with
physical backup.
See also physical backup.
A prefix added to InnoDB configuration options when installing the InnoDB Plugin after server startup, so any new configuration options not recognized by the current level of MySQL do not cause a startup failure. MySQL processes configuration options that start with this prefix, but gives a warning rather than a failure if the part after the prefix is not a recognized option. For details, see the MySQL documentation.
An acronym meaning “least recently used”, a common
method for managing storage areas. The items that have not been
used recently are removed, when space is needed to cache newer
items. InnoDB uses the LRU mechanism by default to manage the
pages within the
buffer pool, but makes
exceptions in cases where a page might be read only a single
time, such as during a full-table scan. The amount that the
buffer cache policy differs from the strict LRU algorithm is
governed by the options innodb_old_blocks_pct
and innodb_old_blocks_time.
See also buffer pool.
See also eviction.
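In releases where these options are dynamic (the InnoDB Plugin and later), the deviation from strict LRU can be tuned at runtime; for example:

```sql
-- Reserve about 30% of the buffer pool for the "old" sublist, and
-- require a page to stay in the pool for 1000 milliseconds before a
-- second access can move it to the "new" sublist. This helps keep a
-- full-table scan from evicting frequently used pages.
SET GLOBAL innodb_old_blocks_pct = 30;
SET GLOBAL innodb_old_blocks_time = 1000;
```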
Acronym for log sequence number. This arbitrary, ever-increasing value represents a point in time corresponding to operations recorded in the redo log. (This point in time is regardless of transaction boundaries; it can fall in the middle of one or more transactions.) It is used internally by InnoDB during crash recovery and for managing the buffer pool.
In the MySQL Enterprise Backup
product, you can specify an LSN to represent the point in time
from which to take an incremental
backup. The relevant LSN is displayed by the output
of the ibbackup command. Once you have the
LSN corresponding to the time of a full backup, you can specify
that value to take a subsequent incremental backup, whose output
contains another LSN for the next incremental backup.
See also hot backup.
See also incremental backup.
See also redo log.
M
In a replication scenario, the database that processes the initial insert, update, and delete requests for data. These changes are propagated to, and repeated on, other databases known as slave databases.
See also replication.
See also slave database.
See also thread.
The technique of initially bringing blocks into the buffer pool
not at the "newest" end of the list, but instead somewhere in
the middle. The exact location of the mid-point can vary, based
on the setting of the innodb_old_blocks_pct
option. The intent is that blocks that are only read once, such
as during a full table scan, can be aged out of the buffer pool
sooner than with a strict LRU algorithm.
See also buffer pool.
See also full table scan.
See also LRU.
An internal phase of InnoDB processing, when making changes to the data dictionary during a DDL operation. The changes to the data dictionary are made without affecting the transaction that is being processed by MySQL.
See also data dictionary.
See also DDL.
See also transaction.
See also lock monitor.
A file containing references to other tables, used by the MERGE
storage engine. Files with this extension are always included in
backups produced by the innobackup command of
the MySQL Enterprise Backup
product.
See also innobackup command.
See also MySQL Enterprise Backup.
A type of processor that can take advantage of multi-threaded programs, such as the MySQL server.
See MVCC.
Informal abbreviation for “mutex variable”. (Mutex itself is short for “mutual exclusion”.) The low-level object that InnoDB uses to represent and enforce exclusive-access locks to internal in-memory data structures. Once the lock is acquired, any other process, thread, and so on is prevented from acquiring the same lock. Contrast with rw-locks, which allow shared access. Mutexes and rw-locks are known collectively as latches.
See also latch.
See also lock.
See also Performance Schema.
See also Pthreads.
See also rw-lock.
Acronym for multiversion concurrency control. This technique allows InnoDB transactions with certain isolation levels to perform consistent read operations; that is, to query rows that are being updated by other transactions, and to see the values from before those updates occurred. This is a powerful technique to increase concurrency, by allowing queries to proceed without waiting for locks held by other transactions.
This technique is not universal in the database world. Some other database products, and some other storage engines within MySQL, do not support it.
See also ACID.
See also concurrency.
See also consistent read.
See also isolation level.
See also lock.
The name, on UNIX or Linux systems, of the MySQL option file.
See also my.ini.
See also option file.
The name, on Windows systems, of the MySQL option file.
See also my.cnf.
See also option file.
The files that MySQL uses to store the data for MyISAM tables.
Files with this extension are always included in backups
produced by the innobackup command of the
MySQL Enterprise Backup
product.
See also innobackup command.
See also .MYI files.
See also MySQL Enterprise Backup.
The files that MySQL uses to store the indexes for MyISAM
tables. Files with this extension are always included in backups
produced by the innobackup command of the
MySQL Enterprise Backup
product.
See also innobackup command.
See also .MYD files.
See also MySQL Enterprise Backup.
The mysql program is the command-line
interpreter for the MySQL database. It processes SQL statements,
and also MySQL-specific commands such as SHOW
TABLES.
See also mysqld.
A licensed product, superseding InnoDB Hot Backup, that performs hot backups of MySQL databases. It offers the most efficiency and flexibility when backing up InnoDB tables, but can also back up MyISAM and other kinds of tables.
See also hot backup.
See also InnoDB.
The mysqld program is the database engine for
the MySQL database. It runs as a UNIX daemon or Windows service,
constantly waiting for requests and performing maintenance work
in the background.
See also mysql.
See also hot backup.
N
An industry term that means the same as asynchronous I/O.
See also asynchronous I/O.
The occurrence when a query retrieves data, and a later query within the same transaction retrieves what should be the same data, but the queries return different results (changed by another transaction committing in the meantime).
This kind of operation goes against the ACID principle of database design. Within a transaction, data should be consistent, with predictable and stable relationships.
Among different isolation levels, non-repeatable reads are prevented by the serializable read and repeatable read levels, and allowed by the read committed and read uncommitted levels.
See also ACID.
See also consistent read.
See also isolation level.
See also locking.
See also read uncommitted.
See also repeatable read.
See also serializable read.
See also transaction.
The condition where data is split into multiple tables, and duplicate values condensed into single rows represented by an ID, to avoid storing, querying, and updating redundant or lengthy values.
For example, an address might be given a unique ID, so that a census database could represent the relationship lives at this address by associating that ID with each member of a family, rather than storing multiple copies of a complex value such as 123 Main Street, Anytown, USA.
For another example, although a simple address book application might store each phone number in the same table as a person's name and address, a phone company database might give each phone number a special ID, and store the numbers and IDs in a separate table. This normalized representation could simplify large-scale updates when area codes split apart.
Normalization is not always recommended. Data that is primarily queried, and only updated by deleting entirely and reloading, is often kept in fewer, larger tables with redundant copies of duplicate values. This data representation is referred to as denormalized, and is frequently found in data warehousing applications.
See also foreign key.
See also relational.
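The phone-number example might be normalized like this (the schema is purely illustrative):

```sql
-- Each person and each phone number is stored exactly once.
CREATE TABLE person (
  person_id INT PRIMARY KEY,
  name      VARCHAR(100)
);
CREATE TABLE phone (
  phone_id INT PRIMARY KEY,
  number   VARCHAR(20)
);
-- An association table records which numbers belong to which people,
-- instead of repeating the number text in every person row.
CREATE TABLE person_phone (
  person_id INT,
  phone_id  INT,
  PRIMARY KEY (person_id, phone_id)
);
```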
A special value in SQL,
indicating the absence of data. Any arithmetic operation or
equality test involving a NULL value, in turn
produces a NULL result. (Thus it is similar
to the IEEE floating-point concept of NaN,
not a number.) Any aggregate
calculation such as AVG() ignores rows with
NULL values, when determining how many rows
to divide by. The only test that works with
NULL values uses the SQL idioms IS
NULL or IS NOT NULL.
NULL values play a part in index operations,
because for performance a database must minimize the overhead of
keeping track of missing data values. Typically,
NULL values are not represented in an index,
because a query that tests an indexed column using a standard
comparison operator could never match a row with a
NULL value for that column. For the same
reason, unique indexes do not prevent NULL
values; those values simply are not represented in the index.
Declaring a NOT NULL constraint on a column
provides reassurance that there are no rows left out of the
index, allowing for better query optimization (accurate counting
of rows and estimation of whether to use the index).
Because the primary key must be
able to uniquely identify every row in the table, a
single-column primary key cannot contain any
NULL values, and no column of a
multi-column primary key can contain
NULL values.
Although the Oracle database allows a NULL
value to be concatenated with a string, InnoDB treats the result
of such an operation as NULL.
See also index.
See also primary key.
See also SQL.
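A few statements that illustrate these rules (table and column names illustrative):

```sql
-- Any comparison with NULL yields NULL, which is not true.
SELECT 1 = NULL, NULL = NULL;        -- both results are NULL

-- Only the IS NULL / IS NOT NULL idioms match missing values.
SELECT * FROM t WHERE c IS NULL;

-- Aggregates such as AVG() ignore rows where the column is NULL
-- when determining how many rows to divide by.
SELECT AVG(c) FROM t;
```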
O
A column containing variable-length data (such as
BLOB and VARCHAR) that are
too long to fit on a B-tree
page. The data is stored in overflow
pages.
See also B-tree.
See also overflow page.
Acronym for "Online Transaction Processing". A database system, or a database application, that runs a workload with many transactions, with frequent writes as well as reads, typically affecting small amounts of data at a time. For example, an airline reservation system or an application that processes bank deposits. The data might be organized in normalized form for a balance between DML (insert/update/delete) efficiency and query efficiency. Contrast with data warehouse.
See also data warehouse.
See also DML.
See also query.
See also transaction.
A file containing database configuration information. Files with
this extension are always included in backups produced by the
innobackup command of the
MySQL Enterprise Backup
product.
See also innobackup command.
See also MySQL Enterprise Backup.
A methodology that guides low-level implementation decisions for a relational database system. The requirements of performance and concurrency in a relational database mean that operations must be started or dispatched quickly. The requirements of consistency and referential integrity mean that any operation could fail: a transaction might be rolled back, a DML operation could violate a constraint, a request for a lock could cause a deadlock, a network error could cause a timeout. An optimistic strategy is one that assumes most requests or attempts will succeed, so that relatively little work is done to prepare for the failure case. When this assumption is true, the database does little unnecessary work; when requests do fail, extra work must be done to clean up and undo changes.
InnoDB uses optimistic strategies for operations such as locking and commits. For example, data changed by a transaction can be written to the data files before the commit occurs, making the commit itself very fast, but requiring more work to undo the changes if the transaction is rolled back.
The opposite of an optimistic strategy is a pessimistic one, where a system is optimized to deal with operations that are unreliable and frequently unsuccessful. This methodology is rare in a database system, because so much care goes into choosing reliable hardware, networks, and algorithms.
See also commit.
See also locking.
See also pessimistic.
A configuration parameter for MySQL.
In the context of the InnoDB storage engine (either the built-in
InnoDB or the InnoDB Plugin), each option name starts with the
prefix innodb_.
See also built-in.
See also option file.
See also plugin.
Traditionally, on Linux and UNIX this file is named
my.cnf, and on Windows it is named
my.ini.
See also configuration file.
See also my.cnf.
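A minimal option file might contain settings such as these (the paths and sizes are purely illustrative):

```ini
[mysqld]
datadir=/var/lib/mysql
innodb_buffer_pool_size=512M
innodb_file_per_table=1
```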
Separately allocated disk pages
that hold variable-length columns (such as
BLOB and VARCHAR) that are
too long to fit on a B-tree
page. The associated columns are known as
off-page columns.
See also B-tree.
See also off-page column.
See also page.
P
A unit representing how much data InnoDB transfers at any one time between disk (the data files) and memory (the buffer pool). A page can contain one or more rows, depending on how much data is in each row. If a row does not fit entirely into a single page, InnoDB sets up additional pointer-style data structures so that the information about the row can be stored in one page.
One way to fit more data in each page is to use compressed row format.
See also buffer pool.
See also compressed row format.
See also data files.
See also page size.
Currently, this value is fixed at 16 kilobytes. This is considered a reasonable compromise: large enough to hold the data for most rows, yet small enough to minimize the performance overhead of transferring unneeded data to memory. Other values are not tested or supported.
See also page.
A file containing partition definitions. Files with this
extension are always included in backups produced by the
innobackup command of the
MySQL Enterprise Backup
product.
See also innobackup command.
See also MySQL Enterprise Backup.
The table in a foreign key
relationship that holds the initial column values pointed to
from the child table. The
consequences of deleting, or updating rows in the parent table
depend on the ON UPDATE and ON
DELETE clauses in the foreign key definition. Rows
with corresponding values in the child table could be
automatically deleted or updated in turn, or those columns could
be set to NULL, or the operation could be
prevented.
See also child table.
See also foreign key.
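For example (table names illustrative), a foreign key definition can specify that deleting a parent row automatically deletes the corresponding child rows:

```sql
CREATE TABLE parent (
  id INT PRIMARY KEY
) ENGINE=InnoDB;

CREATE TABLE child (
  id        INT PRIMARY KEY,
  parent_id INT,
  -- Deleting a parent row deletes its child rows in turn.
  -- Without an ON DELETE clause, the delete would be prevented
  -- if matching child rows exist.
  FOREIGN KEY (parent_id) REFERENCES parent (id)
    ON DELETE CASCADE
) ENGINE=InnoDB;
```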
An index that represents only
part of a column value, typically the first N characters (the
prefix) of a long
VARCHAR value.
See also index.
See also index prefix.
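For example, an index on only the first 10 characters of a long VARCHAR column (the table and column names are illustrative):

```sql
-- Only the 10-character prefix of each value is stored in the index,
-- keeping the index compact while still speeding up lookups such as
-- WHERE city LIKE 'Spring%'.
CREATE INDEX idx_city_prefix ON addresses (city(10));
```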
The performance_schema schema, in MySQL 5.5
and up, presents a set of tables that you can query to get
detailed information about the performance characteristics of
many internal parts of the MySQL server.
See also latch.
See also mutex.
See also rw-lock.
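For example, one way to look for InnoDB mutex contention (the table and column names shown are those found in MySQL 5.5; details vary by release):

```sql
-- List the instrumented InnoDB mutex wait events with the
-- largest accumulated wait times.
SELECT event_name, count_star, sum_timer_wait
  FROM performance_schema.events_waits_summary_global_by_event_name
 WHERE event_name LIKE 'wait/synch/mutex/innodb/%'
 ORDER BY sum_timer_wait DESC
 LIMIT 5;
```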
A methodology that sacrifices performance or concurrency in favor of safety. It is appropriate if a high proportion of requests or attempts might fail, or if the consequences of a failed request are severe. InnoDB uses what is known as a pessimistic locking strategy, to minimize the chance of deadlocks. At the application level, you might avoid deadlocks by using a pessimistic strategy of acquiring all locks needed by a transaction at the very beginning.
Many built-in database mechanisms use the opposite optimistic methodology.
See also deadlock.
See also locking.
See also optimistic.
A row that appears in the result set of a query, but not in the
result set of an earlier query. For example, if a query is run
twice within a transaction, and in the meantime, another
transaction commits after inserting a new row or updating a row
so that it matches the WHERE clause of the
query.
This occurrence is known as a phantom read. It is harder to guard against than a non-repeatable read, because locking all the rows from the first query result set does not prevent the changes that cause the phantom to appear.
Among different isolation levels, phantom reads are prevented by the serializable read level, and allowed by the repeatable read, read committed, and read uncommitted levels.
See also consistent read.
See also isolation level.
See also non-repeatable read.
See also read uncommitted.
See also repeatable read.
See also serializable read.
See also transaction.
A backup that copies the actual
data files. For example, the MySQL
Enterprise Backup command produces a physical backup,
because its output contains data files that can be used directly
by the mysqld server. Contrast with
logical backup.
See also logical backup.
See also MySQL Enterprise Backup.
Acronym for point-in-time recovery.
See also point-in-time recovery.
While the InnoDB storage engine is included with the MySQL database, it is also separately installable as a plugin. The InnoDB Plugin offers performance and functional enhancements over the built-in InnoDB storage engine in MySQL. The plugin can replace the built-in InnoDB storage engine, and can be upgraded independently of the full database server. The flexibility to upgrade can help to roll out performance improvements and new features with less migration and testing effort than switching to a whole new database release.
See also built-in.
The process of restoring a backup to recreate the state of the database at a specific date and time. Commonly abbreviated PITR. Because it is unlikely that the specified time corresponds exactly to the time of a backup, this technique usually requires a combination of a physical backup and a logical backup. For example, with the MySQL Enterprise Backup product, you restore the last backup that you took before the specified point in time, then replay changes from the binary log between the time of the backup and the PITR time.
See also logical backup.
See also MySQL Enterprise Backup.
See also physical backup.
See also PITR backup.
See also locking.
See index prefix.
A set of backup files, produced by the MySQL Enterprise Backup product, after all the stages of applying binary logs and incremental backups are finished. The resulting files are ready to be restored. Prior to the apply steps, the files are known as a raw backup.
See also binary log.
See also hot backup.
See also incremental backup.
See also MySQL Enterprise Backup.
See also raw backup.
See also restore.
A set of columns -- and by implication, the index based on this
set of columns -- that can uniquely identify every row in a
table. As such, it must be a unique index that does not contain
any NULL values.
InnoDB requires that every table have such an index (also called the clustered index or cluster index), and organizes the table storage based on the column values of the primary key.
See also clustered index.
See also index.
An artificial record in an index, used for locking key values or ranges that do not currently exist.
See also infimum record.
See also locking.
See also supremum record.
The POSIX threads standard, which defines an API for threading and locking operations on UNIX and Linux systems. On UNIX and Linux systems, InnoDB uses this implementation for mutexes.
See also mutex.
A type of garbage collection performed by a separate thread,
running on a periodic schedule. The purge includes these
actions: removing obsolete values from indexes; physically
removing rows that were marked for deletion by previous
DELETE statements.
See also crash recovery.
See also delete.
See also doublewrite buffer.
The technique of storing index changes due to
DELETE operations in the
insert buffer rather than
writing them immediately, so that the physical writes can be
performed to minimize random I/O. (Because delete operations are
a two-step process, this operation buffers the write that
normally purges an index record that was previously marked for
deletion.) It is one of the types of
change buffering; the others
are insert buffering and
delete buffering.
See also change buffering.
See also delete buffering.
See also insert buffer.
See also insert buffering.
A thread within the InnoDB process that is dedicated to performing the periodic purge operation.
See also purge.
See also thread.
Q
An operation that reads information from one or more tables in a database. Depending on the organization of data and the parameters of the query, the lookup might be optimized by consulting an index.
Normally with MySQL, queries are coded using SQL. (Even if you are using some other language for the main application logic.)
See also database.
See also index.
See also SQL.
See also table.
R
Acronym for "Redundant Array of Inexpensive Drives". Spreading I/O operations across multiple drives enables greater concurrency at the hardware level, and improves the efficiency of low-level write operations that otherwise would be performed in sequence.
See also concurrency.
A technique for quickly estimating the number of different values in a column (the column's cardinality). InnoDB samples pages at random from the index and uses that data to estimate the number of different values. This operation occurs when each table is first opened.
Originally, the number of sampled pages was fixed at 8; now, it
is determined by the setting of the
innodb_stats_sample_pages parameter.
The way the random pages are picked depends on the setting of the innodb_use_legacy_cardinality_algorithm parameter. The default setting (OFF) has better randomness than in older releases.
See also cardinality.
The initial set of backup files produced by the MySQL Enterprise Backup product, before the changes reflected in the binary log and any incremental backups are applied. At this stage, the files are not ready to restore. After these changes are applied, the files are known as a prepared backup.
See also binary log.
See also hot backup.
See also ibbackup_logfile.
See also incremental backup.
See also MySQL Enterprise Backup.
See also prepared backup.
See also restore.
An isolation level that uses a locking strategy that relaxes some of the protection between transactions, in the interest of performance. Transactions cannot see uncommitted data from other transactions, but they can see data that is committed by another transaction after the current transaction started. Thus, a transaction never sees any bad data, but the data that it does see may depend to some extent on the timing of other transactions.
When a transaction with this isolation level performs
UPDATE ... WHERE or DELETE ...
WHERE operations, other transactions might have to
wait. The transaction can perform SELECT ... FOR
UPDATE, and LOCK IN SHARE MODE
operations without making other transactions wait.
See also ACID.
See also isolation level.
See also locking.
See also repeatable read.
See also serializable read.
See also transaction.
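The isolation level can be changed for a single session; for example (table name illustrative):

```sql
-- Use read committed for this session only. Each consistent read
-- within a transaction then sees a fresh snapshot, including rows
-- committed by other transactions after this transaction started.
SET SESSION TRANSACTION ISOLATION LEVEL READ COMMITTED;
START TRANSACTION;
SELECT * FROM t;
COMMIT;
```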
The isolation level that provides the least amount of protection between transactions. Queries employ a locking strategy that allows them to proceed in situations where they would normally wait for another transaction. However, this extra performance comes at the cost of less reliable results, including data that has been changed by other transactions and not committed yet (known as dirty read). Use this isolation level only with great caution, and be aware that the results might not be consistent or reproducible, depending on what other transactions are doing at the same time. Typically, transactions with this isolation level do only queries, not insert, update, or delete operations.
See also ACID.
See also dirty read.
See also isolation level.
See also locking.
See also transaction.
A type of I/O request that prefetches pages into the buffer cache asynchronously, in anticipation that these pages will be needed soon.
See also locking.
A set of files, typically named ib_logfile0
and ib_logfile1, that record statements that
attempt to change data in InnoDB tables. These statements are
replayed automatically to correct data written by incomplete
transactions, on startup following a crash.
See also ib_logfile.
The oldest InnoDB row format. Prior to MySQL 5.0.3, it was the only row format available in InnoDB. In MySQL 5.0.3 and later, the default is compact row format. You can still specify redundant row format for compatibility with older InnoDB tables.
See also compact row format.
See also row format.
The technique of maintaining data always in a consistent format, part of the ACID philosophy. In particular, data in different tables is kept in synch through the use of foreign keys, which can prevent changes from happening or automatically propagate those changes to all related tables.
See also ACID.
See also foreign key.
An important aspect of modern database systems.
In a mathematical context, the relations within a database are
derived from set theory. For example, the OR
and AND operators of a
WHERE clause represent the notions of union
and intersection.
In everyday terms, the database encodes and enforces relationships such as one-to-one, one-to-many, many-to-one, and uniqueness. For example, a person might have zero, one, or many phone numbers in an address database; a single phone number might be associated with several family members. In a financial database, a person might be required to have exactly one taxpayer ID, and any taxpayer ID could only be associated with one person.
At the database level, these relationships are expressed through
SQL features such as columns within a table, unique and
NOT NULL constraints, foreign keys, and
different kinds of join operations. Complex relationships
typically involve data split between more than one table. Often,
the data is normalized, so that
duplicate values in one-to-many relationships are stored only
once.
See also ACID.
See also foreign key.
See also normalized.
The default isolation level for InnoDB. It prevents any rows that are queried from being changed by other transactions, thus blocking non-repeatable reads but not phantom reads. It uses a moderately strict locking strategy so that all queries within a transaction see data from the same snapshot, that is, the data as it was at the time the transaction started.
When a transaction with this isolation level performs
UPDATE ... WHERE, DELETE ...
WHERE, SELECT ... FOR UPDATE, and
LOCK IN SHARE MODE operations, other
transactions might have to wait.
See also ACID.
See also consistent read.
See also isolation level.
See also locking.
See also serializable read.
See also transaction.
The practice of sending changes from a master database, to one or more slave databases, so that all databases have the same data. This technique has a wide range of uses, such as load-balancing for better scalability, disaster recovery, and testing software upgrades and configuration changes. The changes can be sent between the databases by methods called row-based replication and statement-based replication.
See also master database.
See also row-based replication.
See also slave database.
See also statement-based replication.
The process of putting a set of backup files from the
MySQL Enterprise Backup product
in place for use by MySQL. This operation can be performed to
fix a corrupted database, to return to some earlier point in
time, or (in a replication
context) to set up a new slave
database. In the MySQL
Enterprise Backup product, this operation is
performed by the --copy-back option of the
innobackup command.
See also hot backup.
See also innobackup command.
See also MySQL Enterprise Backup.
See also prepared backup.
See also replication.
See also slave database.
A SQL statement that ends a transaction, undoing any changes made by the transaction. It is the opposite of commit, which makes permanent any changes made in the transaction.
By default, MySQL uses the autocommit setting, which automatically issues a commit following each SQL statement. You must change this setting before you can use the rollback technique.
See also ACID.
See also commit.
See also transaction.
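For example (table name illustrative):

```sql
-- Disable autocommit so that statements group into a transaction
-- instead of being committed individually.
SET autocommit = 0;
UPDATE accounts SET balance = balance - 100 WHERE id = 1;
-- Undo the change; the original balance is restored.
ROLLBACK;
```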
The storage area containing the undo log, part of the system tablespace.
See also system tablespace.
See also undo log.
The disk storage format for a row from an InnoDB table. As InnoDB gains new capabilities such as compression, new row formats are introduced to support the resulting improvements in storage efficiency and performance.
Each table has its own row format, specified through the
ROW_FORMAT option. To see the row format for
each InnoDB table, issue the command SHOW TABLE
STATUS. Because all the tables in the system
tablespace share the same row format, to take advantage of other
row formats typically requires setting the
innodb_file_per_table option, so that each
table is stored in a separate tablespace.
See also compact row format.
See also compressed row format.
See also dynamic row format.
See also fixed row format.
See also redundant row format.
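For example, to check the row format of existing tables and create a table with a non-default format (names illustrative; in the InnoDB Plugin, the dynamic row format also requires innodb_file_format=Barracuda and innodb_file_per_table):

```sql
-- The Row_format column of the output shows each table's format.
SHOW TABLE STATUS LIKE 't%';

CREATE TABLE t_dynamic (
  id  INT PRIMARY KEY,
  doc TEXT
) ENGINE=InnoDB ROW_FORMAT=DYNAMIC;
```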
A lock that prevents a row from being accessed in an incompatible way by another transaction.
See also lock.
See also lock mode.
See also transaction.
This form of replication is safe to use for all settings of the innodb_autoinc_lock_mode option.
See also auto-increment locking.
See also innodb_autoinc_lock_mode.
See also replication.
See also statement-based replication.
The low-level object that InnoDB uses to represent and enforce shared-access locks to internal in-memory data structures. Once the lock is acquired, any other process, thread, and so on can read the data structure, but no one else can write to it. Contrast with mutexes, which enforce exclusive access. Mutexes and rw-locks are known collectively as latches.
See also latch.
See also lock.
See also mutex.
See also Performance Schema.
S
Savepoints help to implement nested transactions. They can be used to provide scope to operations on tables that are part of a larger transaction. For example, scheduling a trip in a reservation system might involve booking several different flights; if a desired flight is unavailable, you might roll back the changes involved in booking that one leg, without rolling back the earlier flights that were successfully booked.
See also rollback.
See also transaction.
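The reservation example above can be sketched with a hypothetical bookings table:

```sql
START TRANSACTION;
INSERT INTO bookings (leg) VALUES ('flight 1');  -- first leg booked
SAVEPOINT leg2;
INSERT INTO bookings (leg) VALUES ('flight 2');  -- tentative second leg
-- Undo only the work since the savepoint; 'flight 1' is still pending.
ROLLBACK TO SAVEPOINT leg2;
COMMIT;  -- commits 'flight 1' only
```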
The ability to add more work and issue more simultaneous requests to a system, without a sudden drop in performance due to exceeding the limits of system capacity. Software architecture, hardware configuration, application coding, and type of workload all play a part in scalability. When the system reaches its maximum capacity, popular techniques for increasing scalability are scale up (increasing the capacity of existing hardware or software) and scale out (adding new hardware or more instances of server software).
Conceptually, a schema is a set of interrelated database objects, such as tables, table columns, data types of the columns, indexes, foreign keys, and so on. These objects are connected through SQL syntax, because the columns make up the tables, the foreign keys refer to tables and columns, and so on. Ideally, they are also connected logically, working together as part of a unified application or flexible framework.
In MySQL, physically, a schema is analogous to a database. Typically, all the objects for a MySQL application are contained within a single database.
Some other database products draw a distinction. For example, in the Oracle Database product, a schema represents only a part of a database: the tables and other objects owned by a single user.
See also database.
See also ib-file set.
See also information schema.
A type of InnoDB index that represents a subset of table columns. An InnoDB table can have zero, one, or many secondary indexes. (Contrast with the clustered index, which is required for each InnoDB table, and stores the data for all the table columns.)
A secondary index can be used to satisfy queries that only require values from the indexed columns. For more complex queries, it can be used to identify the relevant rows in the table, which are then retrieved through lookups using the clustered index.
Creating and dropping secondary indexes has traditionally involved significant overhead from copying all the data in the InnoDB table. The fast index creation feature of the InnoDB Plugin makes both CREATE INDEX and DROP INDEX statements much faster for InnoDB secondary indexes.
See also clustered index.
See also fast index creation.
See also index.
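A sketch of the statements involved (table and column names are hypothetical):

```sql
-- Fast index creation applies to secondary indexes such as this one;
-- the clustered index (the primary key) is not affected.
CREATE INDEX i_c2 ON t1 (c2);

-- Dropping a secondary index is likewise a fast operation.
DROP INDEX i_c2 ON t1;
```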
A property of data distribution: the number of distinct values in a column divided by the number of records in the table. High selectivity means that the column values are relatively unique, and can be retrieved efficiently through an index. If you (or the query optimizer) can predict that a test in a WHERE clause only matches a small number (or proportion) of rows in a table, the overall query tends to be efficient if it evaluates that test first, using an index.
See also cardinality.
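The ratio can be estimated directly with a query (hypothetical table t1, column c2):

```sql
-- Approximate selectivity of column c2: distinct values / total rows.
-- A result close to 1.0 suggests c2 is a good candidate for an index.
SELECT COUNT(DISTINCT c2) / COUNT(*) AS selectivity FROM t1;
```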
A type of read operation used for UPDATE statements, that is a combination of read committed and consistent read. When an UPDATE statement examines a row that is already locked, InnoDB returns the latest committed version to MySQL so that MySQL can determine whether the row matches the WHERE condition of the UPDATE. If the row matches (must be updated), MySQL reads the row again, and this time InnoDB either locks it or waits for a lock on it. This type of read operation can only happen when the transaction has the read committed isolation level, or when the innodb_locks_unsafe_for_binlog option is enabled.
See also consistent read.
See also isolation level.
See also read committed.
The isolation level that uses the most conservative locking strategy, to prevent any other transactions from inserting or changing data that was read by this transaction, until it is finished. This way, the same query can be run over and over within a transaction, and be certain to retrieve the same set of results each time. Any attempt to change data that was committed by another transaction since the start of the current transaction causes the current transaction to wait.
This is the default isolation level specified by the SQL standard. In practice, this degree of strictness is rarely needed, so the default isolation level for InnoDB is the next most strict, repeatable read.
See also ACID.
See also consistent read.
See also isolation level.
See also locking.
See also repeatable read.
See also transaction.
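A sketch of switching a session to this isolation level (table t1 is hypothetical):

```sql
-- Make subsequent transactions in this session use the most
-- conservative isolation level.
SET SESSION TRANSACTION ISOLATION LEVEL SERIALIZABLE;

START TRANSACTION;
SELECT * FROM t1;  -- repeated queries return the same result set
COMMIT;
```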
A type of program that runs continuously, waiting to receive and act upon requests from another program (the client). Because often an entire computer is dedicated to running one or more server programs (such as a database server, a web server, an application server, or some combination of these), the term server can also refer to the computer that runs the server software.
See also client.
See also mysqld.
A kind of lock that allows other transactions to read the locked object, and to also acquire other shared locks on it, but not to write to it. The opposite of exclusive lock.
See also exclusive lock.
See also lock.
See also transaction.
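In MySQL 5.5, a transaction requests shared row locks with the LOCK IN SHARE MODE clause; a sketch (hypothetical table t1):

```sql
START TRANSACTION;
-- Acquire shared locks on the matching rows; other transactions
-- can read these rows, but cannot modify them until COMMIT.
SELECT * FROM t1 WHERE c1 = 10 LOCK IN SHARE MODE;
COMMIT;  -- releases the locks
```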
Another way of referring to the system tablespace.
See also system tablespace.
The process of stopping the InnoDB storage engine. This process can do various cleanup operations, so it is slow to shut down but fast to start up later; or it can skip the cleanup operations, so it is fast to shut down but must do the cleanup the next time it starts.
The shutdown mode is controlled by the innodb_fast_shutdown option.
See also fast shutdown.
See also slow shutdown.
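For example, to request the slow (full-cleanup) variant before stopping the server:

```sql
-- 0 = slow shutdown (full cleanup now, faster startup later);
-- 1 = fast shutdown (the default: skip some cleanup at shutdown).
SET GLOBAL innodb_fast_shutdown = 0;
```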
A slave database, frequently shortened to "slave", is a database in a replication scenario that receives changes from another database (the master database) and applies those same changes. Thus it maintains the same contents as the master, although it might lag somewhat behind.
In MySQL, slave databases are commonly used in disaster recovery, to take the place of a master database that fails. They are also commonly used for testing software upgrades and new settings, to ensure that database configuration changes do not cause problems with performance or reliability.
See also master database.
See also replication.
An option for the Hot Backup product that reduces the I/O overhead of the backup operation. (The backup takes longer as a result.)
A type of shutdown that does additional flushing operations before completing. Specified by the configuration parameter innodb_fast_shutdown=0. Although the shutdown itself can take longer, that time will be saved on the subsequent startup.
See also fast shutdown.
See also shutdown.
The Structured Query Language that is standard for performing database operations. Often divided into the categories DDL, DML, and queries.
See also DDL.
See also DML.
See also query.
Acronym for “solid-state drive”. A type of storage device with different performance characteristics than a traditional hard drive: smaller storage capacity, faster for random reads, no moving parts, and with a number of considerations affecting write performance.
See also disk-based.
This form of replication requires some care with the setting for the innodb_autoinc_lock_mode option, to avoid potential timing problems with auto-increment locking.
See also auto-increment locking.
See also innodb_autoinc_lock_mode.
See also replication.
See also row-based replication.
A component of the MySQL database that performs the low-level work of storing, updating, and querying data. InnoDB is one such storage engine. Different storage engines are designed with different tradeoffs between factors such as memory usage versus disk usage, read speed versus write speed, and speed versus robustness.
The MySQL Enterprise Backup product is optimized for backing up tables produced by the InnoDB storage engine. It can also back up tables produced by MyISAM and other storage engines.
See also InnoDB.
See also MySQL Enterprise Backup.
The general name for the setting controlled by the innodb_strict_mode option. Turning on this setting causes certain conditions that are normally treated as warnings to be considered errors. For example, certain invalid combinations of options related to file format and row format, which normally produce a warning and continue with default values, now cause the CREATE TABLE operation to fail.
MySQL also has a server-level strict mode, which is a separate feature controlled by the sql_mode setting.
See also innodb_strict_mode.
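A sketch of the behavior (the table name is hypothetical; the exact error depends on the option combination):

```sql
SET innodb_strict_mode = ON;

-- With strict mode on, a conflicting option combination such as
-- KEY_BLOCK_SIZE with an incompatible row format fails outright,
-- instead of producing a warning and falling back to defaults.
CREATE TABLE t_bad (c1 INT) ENGINE=InnoDB
  ROW_FORMAT=COMPACT KEY_BLOCK_SIZE=8;  -- error under strict mode
```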
Within the list structure that represents the buffer pool, pages that are relatively old and relatively new are represented by different portions of the list. A set of parameters control the size of these portions and the dividing point between the new and old pages.
See also buffer pool.
See also eviction.
See also list.
See also LRU.
A pseudo-record in an index, representing the gap above the largest value in that index. If a transaction has a statement such as SELECT ... FOR UPDATE ... WHERE col > 10; and the largest value in the column is 20, a lock on the supremum record prevents other transactions from inserting even larger values such as 50, 100, and so on.
See also gap.
See also infimum record.
See also pseudo-record.
By default, the system tablespace is a single data file that stores all the table data for a database, as well as all the metadata for InnoDB-related objects (the data dictionary).
Turning on the innodb_file_per_table option causes each newly created table to be stored in its own tablespace, reducing the size of, and dependencies on, the system tablespace.
Keeping all table data in the system tablespace has implications for the MySQL Enterprise Backup product (backing up one large file rather than several smaller files), and prevents you from using certain InnoDB features that require the newer Barracuda file format.
See also Barracuda.
See also data dictionary.
See also file format.
See also file-per-table.
See also ibdata file.
See also innodb_file_per_table.
See also tablespace.
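A sketch of switching to separate tablespaces (the table name is hypothetical; existing tables remain in the system tablespace):

```sql
-- New tables created after this point each get their own .ibd file
-- instead of being stored inside the system tablespace.
SET GLOBAL innodb_file_per_table = 1;

CREATE TABLE t_separate (c1 INT PRIMARY KEY) ENGINE=InnoDB;
```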
T
Although a table is a distinct, addressable object in the context of SQL, for backup purposes we are often concerned with whether the table is part of the system tablespace, or was created under the file-per-table setting and so resides in its own tablespace.
See also clustered index.
See also file-per-table.
See also system tablespace.
See also table.
A lock that prevents any other transaction from accessing a table. InnoDB makes considerable effort to make such locks unnecessary, by using techniques such as row locks and consistent reads for processing DML statements and queries. You can create such a lock through SQL using the LOCK TABLE statement; one of the steps in migrating from other database systems or MySQL storage engines is to remove such statements wherever practical.
See also consistent read.
See also DML.
See also lock.
See also locking.
See also query.
See also row lock.
See also table.
See also transaction.
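A sketch of the kind of statement to look for during such a migration (table t1 is hypothetical):

```sql
-- An explicit table lock; a candidate for removal when migrating
-- to InnoDB, which handles this workload with row locks instead.
LOCK TABLES t1 WRITE;
UPDATE t1 SET c1 = c1 + 1;
UNLOCK TABLES;
```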
A data file that can hold data for one or more tables. The system tablespace contains the tables that make up the data dictionary, and by default holds all the other InnoDB tables. Turning on the innodb_file_per_table option allows newly created tables to each have their own tablespace, with a separate data file for each table.
Tablespaces created by the built-in InnoDB storage engine are upward compatible with the InnoDB Plugin. Tablespaces created by the InnoDB Plugin are downward compatible with the built-in InnoDB storage engine, if they use the Antelope file format.
See also Antelope.
See also Barracuda.
See also compressed row format.
See also data dictionary.
See also file-per-table.
See also ibdata file.
See also innodb_file_per_table.
See also system tablespace.
A representation of the data dictionary metadata for a table, within the InnoDB tablespace. This metadata can be checked against the .frm file for consistency when the table is opened, to diagnose errors resulting from out-of-date .frm files. This information is present for InnoDB tables that are part of the system tablespace, as well as for tables that have their own .ibd file because of the file-per-table option.
See also data dictionary.
See also file-per-table.
See also .FRM file.
See also ibd file.
See also system tablespace.
See also tablespace.
A table whose data does not need to be truly permanent. For example, temporary tables might be used as storage areas for intermediate results in complicated calculations or transformations; this intermediate data would not need to be recovered after a crash. Database products can take various shortcuts to improve the performance of operations on temporary tables, by being less scrupulous about writing data to disk and other measures to protect the data across restarts.
Sometimes, the data itself is removed automatically at a set time, such as when the transaction ends or when the session ends. With some database products, the table itself is removed automatically too.
See also table.
Transactions are atomic units of work that can be committed or rolled back. When a transaction makes multiple changes to the database, either all the changes succeed when the transaction is committed, or all the changes are undone when the transaction is rolled back.
Database transactions, as implemented by InnoDB, have properties that are collectively known by the acronym ACID, for atomicity, consistency, isolation, and durability.
See also ACID.
See also commit.
See also isolation level.
See also lock.
See also rollback.
An internal field associated with each row. This field is physically changed by INSERT, UPDATE, and DELETE operations to record which transaction has locked the row.
See also implicit row lock.
A feature that allows a tablespace to be moved from one database to another. Traditionally, this has not been possible for InnoDB tablespaces because all table data was part of the system tablespace. Now, with the file-per-table setting, an InnoDB table can reside in its own .ibd file that you can copy individually. However, the original .ibd file contains information such as the space ID that ties the tablespace to its original database. The transportable tablespace feature involves rewriting some information inside the tablespace to make it compatible with the destination database.
See also file-per-table.
See also ibd file.
See also space ID.
See also system tablespace.
See also tablespace.
A file containing trigger parameters. Files with this extension are always included in backups produced by the innobackup command of the MySQL Enterprise Backup product.
See also innobackup command.
See also MySQL Enterprise Backup.
See also .TRN file.
See also stored procedure.
A file containing trigger namespace information. Files with this extension are always included in backups produced by the innobackup command of the MySQL Enterprise Backup product.
See also innobackup command.
See also MySQL Enterprise Backup.
See also .TRG file.
Resources for troubleshooting InnoDB reliability and performance issues include the Information Schema tables.
A DDL operation that removes the entire contents of a table. Although conceptually it has the same result as a DELETE statement with no WHERE clause, it operates differently behind the scenes: InnoDB creates a new empty table, drops the old table, then renames the new table to take the place of the old one. Because this is a DDL operation, it cannot be rolled back.

If the table being truncated contains foreign keys that reference another table, the truncation operation uses a slower method of operation, deleting one row at a time so that corresponding rows in the referenced table can be deleted as needed by any ON DELETE CASCADE clause.
See also DDL.
See also rollback.
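The statement itself is simple (table t1 is hypothetical):

```sql
-- Removes all rows. Implemented as drop-and-recreate behind the
-- scenes, so as a DDL operation it cannot be rolled back.
TRUNCATE TABLE t1;
```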
A technical term designating an ordered set of elements. It is an abstract notion, used in formal discussions of database theory. In the database field, tuples are usually represented by the columns of a table row. They could also be represented by the result sets of queries, for example, queries that retrieved only some columns of a table, or columns from joined tables.
See also cursor.
An operation that is part of a distributed transaction, under the XA specification. (Sometimes abbreviated as 2PC.) When multiple databases participate in the transaction, either all databases commit the changes, or all databases roll back the changes.
See also commit.
See also rollback.
See also transaction.
See also XA.
U
Data that is maintained throughout the life of a transaction, recording all changes so that they can be undone in case of a rollback operation.
See also rollback.
See also transaction.
See also undo log.
See undo log.
A storage area that holds copies of data modified by active transactions. If another transaction needs to see the original data (as part of a consistent read operation), the unmodified data is retrieved from this storage area. This area is physically part of the system tablespace. It is split into separate portions, the insert undo buffer and the update undo buffer. Collectively, these parts are also known as the rollback segment, a familiar term for Oracle DBAs.
See also consistent read.
See also rollback segment.
See also system tablespace.
See also transaction.
A kind of constraint that asserts that a column cannot contain any duplicate values. In terms of relational algebra, it is used to specify 1-to-1 relationships. For efficiency in checking whether a value can be inserted (that is, whether the value already exists in the column), a unique constraint is supported by an underlying unique index.
See also constraint.
See also relational.
See also unique index.
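A sketch of declaring such a constraint (table and column names are hypothetical):

```sql
CREATE TABLE accounts (
  id    INT PRIMARY KEY,
  email VARCHAR(100),
  UNIQUE KEY u_email (email)  -- backed by an underlying unique index
) ENGINE=InnoDB;

-- Inserting a second row with the same email value is rejected
-- with a duplicate-key error.
```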
An index on a column or set of columns that have a unique constraint. Because the index is known not to contain any duplicate values, certain kinds of lookups and count operations are more efficient than in the normal kind of index. Most of the lookups against this type of index are simply to determine if a certain value exists or not. The number of values in the index is the same as the number of rows in the table, or at least the number of rows with non-null values for the associated columns.
See also cardinality.
See also unique constraint.
See also unique key.
The set of columns (one or more) comprising a unique index. When you can define a WHERE condition that matches exactly one row, and the query can use an associated unique index, the lookup and error handling can be performed very efficiently.
See also cardinality.
See also unique constraint.
See also unique index.
V
The transaction that is automatically chosen to be rolled back when a deadlock is detected. InnoDB rolls back the transaction that has updated the fewest rows.
See also deadlock.
See also deadlock detection.
See also innodb_lock_wait_timeout.
W
When an operation, such as acquiring a lock, mutex, or latch, cannot be completed immediately, InnoDB pauses and tries again. The mechanism for pausing is elaborate enough that this operation has its own name, the wait. Individual threads are paused using a combination of internal InnoDB scheduling, operating system wait() calls, and short-duration spin loops.

On systems with heavy load and many transactions, you might use the output from the SHOW INNODB STATUS command to determine whether threads are spending too much time waiting, and if so, how you can improve concurrency.
See also concurrency.
See also latch.
See also lock.
See also mutex.
A backup taken while the database is running, but that restricts some database operations during the backup process. For example, tables might become read-only. For busy applications and web sites, you might prefer a hot backup.
See also cold backup.
See also hot backup.
To run a system under a typical workload for some time after startup, so that the buffer pool and other memory regions are filled as they would be under normal conditions. Typically done during performance testing, to ensure consistent results across multiple runs; otherwise, performance might be artificially low during the first run.
See also buffer pool.
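A crude warm-up can be sketched as a set of full scans run before measurements begin (table t1 is hypothetical; real workloads warm the buffer pool more representatively):

```sql
-- Full scans pull table and index pages into the buffer pool,
-- so the first measured run is not penalized by cold-cache I/O.
SELECT COUNT(*) FROM t1;
```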
The built-in InnoDB storage engine and the InnoDB Plugin are supported on all the same Microsoft Windows versions as MySQL.
The MySQL Enterprise Backup product is available on Windows, although the innobackup command is not part of the Windows edition of the product.
The combination and volume of SQL and other database operations, performed by a database application during typical or peak usage. You can subject the database to a particular workload during performance testing to identify bottlenecks, or during capacity planning.
See also bottleneck.
See also SQL.
An optimization technique that reduces write operations when dirty pages are flushed from the InnoDB buffer pool. If a row in a page is updated multiple times, or multiple rows on the same page are updated, all of those changes are stored to the data files in a single write operation rather than one write for each change.
See also buffer pool.
See also dirty page.
See also flush.
X
A standard interface for coordinating distributed transactions, allowing multiple databases to participate in a transaction while maintaining ACID compliance.
XA Distributed Transaction support is turned on by default. If you are not using this feature, note that it adds an extra fsync for each transaction, which may adversely affect performance.
See also commit.
See also transaction.
See also two-phase commit.
See also two-phase locking.
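A sketch of one branch of a distributed transaction using MySQL's XA syntax (the transaction identifier 'xid1' and table t1 are hypothetical):

```sql
XA START 'xid1';
UPDATE t1 SET c1 = c1 + 1;
XA END 'xid1';
XA PREPARE 'xid1';  -- phase one: each participant prepares
XA COMMIT 'xid1';   -- phase two: the coordinator commits everywhere
```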
This is the User’s Guide for the InnoDB storage engine 1.1 for MySQL 5.5, generated on 2010-12-23 (revision: 24527).
