The polymerase chain reaction (PCR) is a laboratory technique for "amplifying" a specific DNA sequence. PCR is extremely efficient and sensitive; it can make millions or billions of copies of any specific sequence of DNA, even when the sequence is in a complex mixture. Because of this power, researchers can use it to amplify sequences even if they only have a minute amount of DNA. A single hair root, or a microscopic blood stain left at a crime scene, for example, contains ample DNA for PCR.
PCR has revolutionized the field of molecular biology. It has enabled researchers to perform experiments easily that previously had been unthinkable. Before the mid-1980s, when PCR was developed, molecular biologists had to use laborious and time-consuming methods to identify, clone, and purify DNA sequences they wanted to study. Kary Mullis was awarded the 1993 Nobel Prize in Chemistry for inventing PCR.
PCR is based on the way cells replicate their DNA. During DNA replication, the two strands of each DNA molecule separate, and DNA polymerase, an enzyme, assembles nucleotides to form two new partner strands for each of the original strands. The original strands serve as templates for the new strands. The new strands are assembled such that each nucleotide in the new strand is determined by the corresponding nucleotide in the template strand. The nucleotides adenine (A) and thymine (T) always lie opposite each other, as do cytosine (C) and guanine (G). Because of this base-pairing specificity, each newly synthesized partner strand has the same sequence as the original partner strand, and replication produces two identical copies of the original double-stranded DNA molecule.
In PCR, a DNA sequence that a researcher wants to amplify, called the "target" sequence, undergoes about thirty rounds of replication in a small reaction tube. During each replication cycle, the number of molecules of the target sequence doubles, because the products and templates of one round of replication all become the templates for the next round. After n rounds of replication, 2n copies of the target sequence are theoretically produced. After thirty cycles, PCR can produce 230 or more than ten billion copies of a single target DNA sequence. This is called a polymerase chain reaction because DNA polymerase catalyzes a chain reaction of replication.
Designing Primers
To replicate DNA, DNA polymerases require not only a template, but also a primer. A primer is a sequence of single-stranded DNA that "anneals," or binds, to the template by specific base-pairing. An automated apparatus called an oligonucleotide synthesizer, sometimes nickname a "gene machine," can produce primers of any chosen sequence.
Primers for PCR are typically short sequences, around twenty nucleotides long. It is the primers' sequences that are responsible for PCR's enormous specificity. Researchers design primers so they are likely to bind to sequences on either side of the target DNA. They do so by making the primers complementary to the appropriate sequences and by making them long enough that they are unlikely to bind elsewhere.
The longer the primer, the more likely it is that it will be complementary only to the target sequence. Because any single position in a DNA sequence can be occupied by either an A, T, C, or G, there is a one in four chance that any position will contain an A, for example. (This is an approximation, because the nucleotides are not distributed equally or randomly in DNA.) The odds that any specific DNA sequence that is n nucleotides long would be present at a given spot in a DNA sequence is therefore 1 in 4. The chance that a particular twenty-nucleotide sequence (a typical length for a PCR primer) would occur in a given spot at random is less than one in one trillion (10−12). The human genome has only about three billion (3 × 109) nucleotide pairs, so any twenty-nucleotide-long sequence is very unlikely to occur more than once by chance in the human genome.
Researchers design two primers that will bind to opposite strands of the DNA on either side of the target sequence. They design them to "point" the right way, so that the section of DNA between, not outside of them, is copied. Designing the primers to "point" in the right direction simply requires building them so that their 3′ ends lie toward the target DNA and their 5′ ends lie away from it. The ends of any segment of DNA, including the complete strand, are chemically different.
One end is called the 5′ (pronounced "5-prime") end. The other is called the 3′ end. In DNA replication, nucleotides are always added to the 3′ end of a growing strand of DNA. DNA synthesis is said to proceed in a 5′ to 3′ direction. The two complementary strands of DNA are anti-parallel, which means that they run in opposite directions. The 5′ end of one strand lies next to the 3′ end of the other, as shown in the diagram.
A Typical Pcr Reaction
A typical PCR reaction consists of the following components, mixed together in a solution with a total volume of between 25 and 100 microliters. The solution must include the template DNA, the primers, nucleotides to serve as building blocks for the newly forming DNA, DNA polymerase to catalyze the synthesis, and buffers and salts, usually including magnesium, that are required for optimal activity of the DNA polymerase. The template can be an unpurified mixture of DNA, such as DNA extracted from a swab of cheek cells from a patient or crime suspect.
To perform the PCR reaction, the tube containing the solution is placed into a machine called a DNA thermal cycler. Thermal cyclers are basically programmable heating blocks. They usually contain a thick aluminum block with holes in which PCR reaction tubes can fit snugly. The block can be rapidly cooled or heated to specific temperatures, for specific lengths of time, under programmable computer control. Each cycle in a PCR reaction is controlled by changing the temperature of the block and, therefore, of the reaction mixture.
The first step in PCR is to heat the mixture to a high temperature, usually 94 to 95 °C, for about five minutes. The hydrogen bonds that hold together the two strands of a double helix are broken at these temperatures, and the DNA separates into single strands. This process is termed denaturation.
In the second step, the PCR mixture is cooled to a lower temperature, typically between about 50 °C and 65 °C. This allows the primers to anneal to their specific complementary sequences in the template DNA. The temperature for this step is chosen carefully to be just low enough to allow the primers to bind, but no cooler. A lower annealing temperature might allow the primers to bind to regions in the template DNA that are not perfect complements, which could lead to the amplification of non-specific sequences.
The optimal annealing temperature for a set of primers can be determined by a formula that is based on the nucleotide composition of the primers, but it is often a matter of trial and error to find the best annealing temperature. The annealing step usually takes about fifteen to thirty seconds, an amazingly short time considering that the primers must "scan" through the template DNA to find their proper binding sites.
In the third step, the reaction is heated again, usually to about 72 °C, the temperature at which the DNA polymerase is most active. Most enzymes are destroyed at 72 °C. In the early days of PCR, scientists used a DNA polymerase that was derived from the bacterium Escherichia coli, which itself is most active at human body temperature, 37 °C. But the E. coli polymerase was destroyed at the high temperatures required for the denaturation and annealing steps, and the polymerase therefore had to be added anew to the reaction, during each PCR cycle.
To solve this problem, scientists purified DNA polymerases from microorganisms that live in hot springs or in deep-sea thermal vents. These organisms' enzymes are most active at high temperatures. The most commonly used enzyme for PCR is called Taq DNA polymerase, which was originally purified from the hot-spring bacterium Thermus aquaticus. (Most commercially available preparations today are recombinant versions, produced in engineered E. coli strains.)
At 72 °C, Taq DNA polymerase adds nucleotides to the 3′ ends of annealed primers at the rate of about two thousand nucleotides per minute. Therefore, to amplify a sequence that is one thousand nucleotides long, the primer extension step must last about thirty seconds at 72 °C. By the end of this step, each template strand has a new complementary strand. This completes the first cycle of the PCR reaction.
The cycle can be repeated, at that point, by restarting the denaturation step. In the next cycle, the original two DNA strands will serve again as templates, as will the two newly synthesized strands. In this way, the number of templates has doubled, and it will double again with each successive cycle.
At the end of the reaction, the tube contains DNA fragments that are almost solely copies of the target DNA. The original template DNA mixture is still present, but for the purpose of most applications (with the exception of subsequent PCR experiments), it is present in negligible amounts compared to the PCR product. The amplified DNA can be analyzed by gel electrophoresis, ligated into a cloning vector, labeled for use as a hybridization probe, or used in numerous other experimental procedures.
Contamination in Pcr Reactions
The extreme sensitivity of PCR for amplifying rare DNA sequences is a mixed blessing. Just as PCR can easily amplify any sequence that a researcher wants to amplify, it can also amplify other sequences.
Amplifying a minute amount of DNA isolated from an ancient mosquito preserved in amber, for instance, could be extremely difficult. DNA from other sources could contaminate the sample during every step, including during the recovery of the amber, while researchers are drilling into it, and while the needle is prepared to remove the mosquito tissue. Contamination of the sample by even a single cell from another source can lead to amplification of that DNA, along with or instead of the mosquito's DNA. Especially if there are segments of contaminating DNA that are similar to the target DNA, primers may bind to the wrong segments.
Or consider a human geneticist who has designed a PCR assay to detect a particular genetic disease. Imagine that a positive result, amplification of the disease allele from a patient's DNA sample, would indicate the patient is a carrier for the disease. If even a trace of DNA from a disease carrier contaminates any of the PCR reagents, then assays performed on samples from non-disease carriers are likely to produce the diagnostic PCR amplification product. It isn't difficult to imagine that a lab that routinely performs this PCR assay might have lots of the tell-tale DNA contaminating benches, pipettes, and even lab coats.
Two approaches address the contamination problem. First, laboratory practices for PCR aim to ensure the utmost cleanliness. Whole new industries have been created to produce contamination-resistant supplies, including micropipette tips. The second solution, as in all carefully planned experiments, is to use controls. Negative controls, including mixtures that have not had any template added or that contain a template known to lack the target sequence, are particularly important for PCR experiments.
Pcr Applications and Variations
PCR is such a powerful, easy, and relatively inexpensive technique that it seems that molecular biologists are always looking for ways to use PCR in their research. Every month, scientific journals describe modifications to tailor the basic PCR approach to new applications.
One variation that has proved very fruitful in gene identification is the use of "degenerate primers." Many genes tend to be highly conserved among different species. Homologues, which are genes from different organisms whose protein products have similar functions, tend to have very similar, but not necessarily identical, sequences. The differences in sequence make it challenging to design standard PCR primers to search for homologues.
However, by comparing the DNA sequences of the gene as it occurs in many different species and finding portions of the sequence that are the same in all the species, a researcher can make an educated guess regarding which nucleotides in an unidentified homologue are likely to be identical to those in a known homologue.
The researcher can design a set of "degenerate" PCR primers, which are primers whose nucleotide sequence is fixed only in those positions where the nucleotides are presumed to be known. In the other positions, nucleotides are allowed to incorporate at random. This makes it likely that at least one of the primers will amplify the unknown target. By conducting PCR with degenerate primer sets, and by using primer annealing temperatures that are lower than normal to allow for less-than-perfect base-pairing, a researcher can often amplify a gene in a single experiment, thus isolating the new homologue and allowing it to be sequenced and studied.
Another important variation on PCR is reverse-transcription PCR. This technique involves first copying RNA into DNA molecules, using the enzyme reverse transcriptase, and subsequently using the standard PCR technique to amplify this complementary DNA (cDNA). Because the messenger RNA content of a cell or tissue represents only the genes that are actively being expressed, this technique provides a powerful method of analyzing gene expression.
Bibliography
Alberts, Bruce, et al. Molecular Biology of the Cell, 4th ed. New York: Garland Publishing, 2002.
Lodish, Harvey, et al. Molecular Cell Biology, 4th ed. New York: W. H. Freeman, 2000.
Micklos, David A., and Greg A. Freyer. DNA Science: A First Course in Recombinant DNA Technology. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press, 1990.
Watson, James D., et al. Recombinant DNA, 2nd ed. New York: Scientific AmericanBooks, 1992.
—Paul J. Muhlrad