In computer science, ACID (Atomicity, Consistency, Isolation,
Durability) is a set of properties that guarantee that database transactions
are processed reliably. In the context of databases, a single logical operation on the data is
called a transaction.
An example of a transaction is a transfer of funds from one account to another, even though it might consist of multiple
individual operations (such as debiting one account and crediting another).
Atomicity
Atomicity refers to the ability of the DBMS to guarantee that either all of the tasks of a transaction are performed or none of them
are. The transfer of funds can be completed or it can fail for a multitude of reasons, but atomicity guarantees that one account
won't be debited if the other is not credited as well.
Consistency
Consistency refers to the database being in a legal state when the
transaction begins and when it ends. This means that a transaction cannot break the rules, or integrity constraints, of the database. If an integrity constraint states that all accounts
must have a positive balance, then any transaction violating this rule will be aborted.
Isolation
-
Isolation refers to the ability of the application to make operations in a transaction appear isolated from all other
operations. This means that no operation outside the transaction can ever see the data in an intermediate state; a bank manager
can see the transferred funds on one account or the other, but never on both — even if he ran his query while the transfer was
still being processed. More formally, isolation means the transaction history (or schedule) is serializable. For performance reasons,
this ability is the most often relaxed constraint.
Durability
Durability refers to the guarantee that once the user has been notified
of success, the transaction will persist, and not be undone. This means it will survive system failure, and that the
database system has checked the integrity constraints and won't need to abort the
transaction. Many databases implement durability by writing all transactions into a log
that can be played back to recreate the system state right before the failure. A transaction can only be deemed committed after
it is safely in the log.
Implementation
Implementing the ACID properties correctly is not simple. Processing a transaction often requires a number of small changes to
be made, including updating indices that are used by the system to speed up
searches. This sequence of operations is subject to failure for a number of reasons; for instance, the system may have no room
left on its disk drives, or it may have used up its allocated CPU time.
ACID suggests that the database be able to perform all of these operations at once. In fact this is difficult to arrange.
There are two popular families of techniques: write ahead logging and
shadow paging. In both cases, locks must
be acquired on all information that is updated, and depending on the implementation, on all data that is being read. In write
ahead logging, atomicity is guaranteed by ensuring that information about all changes is written to a log before it is written to
the database. That allows the database to return to a consistent state in the event of a crash. In shadowing, updates are applied
to a copy of the database, and the new copy is activated when the transaction commits. The copy refers to unchanged parts of the
old version of the database, rather than being an entire duplicate.
Until recently almost all databases relied upon locking to provide ACID capabilities. This means that a lock must always be
acquired before processing data in a database, even on read operations. Maintaining a large number of locks, however, results in
substantial overhead as well as hurting concurrency. If user A is running a transaction that has read a row of data that user B
wants to modify, for example, user B must wait until user A's transaction is finished.
An alternative to locking is multiversion concurrency control, in
which the database maintains separate copies of any data that is modified. This allows users to read data without acquiring any
locks. Going back to the example of user A and user B, when user A's transaction gets to data that user B has modified, the
database is able to retrieve the exact version of that data that existed when user A started their transaction. This ensures that
user A gets a consistent view of the database even if other users are changing data that user A needs to read. A natural
implementation of this idea results in a relaxation of the isolation property, namely snapshot isolation.
It is difficult to guarantee ACID properties in a network environment. Network connections might fail, or two users might want
to use the same part of the database at the same time.
Two-phase commit is typically applied in distributed transactions to ensure that each participant in the transaction agrees on whether
the transaction should be committed or not.
Care must be taken when running transactions in parallel. Two phase locking is typically applied to guarantee full isolation.
See also
References
This entry is from Wikipedia, the leading user-contributed encyclopedia. It may not have been reviewed by professional editors (see full disclaimer)