Scientific method is a body of techniques for investigating phenomena, acquiring
new knowledge, or correcting and integrating previous knowledge. It is based on gathering
observable, empirical and measurable evidence subject to specific
principles of reasoning,[1] the collection of data through observation and experimentation, and the formulation and testing of hypotheses.[2]
Although procedures vary from one field of inquiry to another, identifiable
features distinguish scientific inquiry from other methodologies of knowledge. Scientific researchers propose hypotheses as explanations of phenomena, and design experimental
studies to test these hypotheses. These steps must be repeatable in order to predict dependably
any future results. Theories that encompass wider domains of inquiry may bind many hypotheses
together in a coherent structure. This in turn may help form new hypotheses or place groups of hypotheses into context.
Among other facets shared by the various fields of inquiry is the conviction that the process must be objective to reduce a biased interpretation of the results. Another
basic expectation is to document, archive and share all data and methodology so it is available for careful scrutiny by other scientists, thereby
allowing other researchers the opportunity to verify results by attempting to reproduce
them. This practice, called full disclosure, also allows statistical measures of the reliability of these data to be established.
Introduction to scientific method
From Alhacen (Ibn Al-Haytham 965 – 1039, a
pioneer of scientific method) to the present day, the emphasis has been on seeking truth: "Truth is sought for its own sake. And those who are engaged upon the quest for anything for its own sake are not
interested in other things. Finding the truth is difficult, and the road to it is rough. ..." [3]
"How does light travel through transparent bodies? Light travels through transparent bodies in straight lines only. ... We
have explained this exhaustively in our Book of Optics. But let us now mention
something to prove this convincingly: the fact that light travels in straight lines is clearly observed in the lights which enter
into dark rooms through holes. ... the entering light will be clearly observable in the dust which fills the air." --
Alhacen[4]
Alhacen (1000): light travels in straight lines
The conjecture that "Light travels through transparent bodies in straight lines only", was corroborated by Alhacen only after
years of effort. His demonstration of the conjecture was to place a straight stick or a taut thread next to the light
beam[5], to prove that light travels in a straight
line.
Thus scientific method has been practiced by some for at least one thousand years. There are difficulties in a formulaic
statement of method, however. As William Whewell (1794-1866) noted in his History of Inductive Science (1837) and in
Philosophy of Inductive Science (1840), "invention, sagacity, genius" are required at every step in scientific method. It is not enough to base scientific method on experience alone[6]; multiple steps are
needed in scientific method, ranging from our experience to our imagination, back and forth.
In the twentieth century, a hypothetico-deductive model for scientific
method was formulated (For a more formal discussion, see below.):
- 1. Use your experience - consider the problem and try to make sense
of it. Look for previous explanations; if this is a new problem to you, then do
- 2. Conjecture an explanation - when nothing else is yet known, try
to state your explanation, to someone else, or to your notebook.
- 3. Deduce a prediction from that explanation- if 2
were true, then state a consequence of that explanation.
- 4. Test - look for the opposite of that consequence in order to
disprove 2. It is a logical error to seek 3 directly as proof of 2. This error is
called affirming the consequent.
This model underlies the scientific revolution. One thousand years ago, Alhacen
demonstrated the importance of steps 1 and 4. Galileo (1638) also showed the importance of step
4 (also called Experiment) in Two New
Sciences. One possible sequence in this model would be 1, 2, 3,
4. If the outcome of 4 holds, and 3 is not yet disproven, you may continue with
3, 4, 1, and so forth; but if the outcome of 4 shows 3 to
be false, you will have go back to 2 and try to invent a new 2, deduce a new 3, look
for 4, and so forth. Note that 2 can never be shown to be absolutely true by scientific
method[7]; only that 2 can be shown to be
absolutely false by scientific method. (This is what Einstein meant when he said "No amount of
experimentation can ever prove me right; a single experiment can prove me wrong.")
In the twentieth century, Ludwik Fleck (1896-1961) and others found that we need to
consider our experiences more carefully, because our experience may be biased, and that we need to be more exact when describing
our experiences. These considerations are discussed below.
Flying horse depiction: disproven; see below
Truth and belief
-
A belief need not be true (although a belief can be true, even if its origins were myth).[8]
Needham's Science and Civilisation in China uses the 'flying horse' image as an example of observation: in it, a
horse's legs are depicted as splayed, when the stop-action picture by Eadweard
Muybridge shows otherwise. Note that the moment that no hoof is touching the ground, the horse's legs are gathered
together and are not splayed.
Earlier paintings depict the incorrect flying horse observation. This demonstrates Ludwik
Fleck's caution that we see what we expect to observe, until shown otherwise; our beliefs will affect our observations
(and therefore our subsequent actions). But repeated application of scientific method can help us solve our problems by exposing
those parts of our beliefs which are false. A scientific community will have the
same interests, which allows it to help solve problems together.
Elements of scientific method
There are many ways of outlining the basic method shared by all fields of scientific inquiry. The following examples are
typical classifications of the most important components of the method on which there is wide agreement in the scientific community and among philosophers of
science. There are, however, disagreements about some aspects.
The following set of methodological elements and organization of procedures tends to be more characteristic of natural
sciences and experimental psychology than of social sciences. In the social sciences mathematical and statistical methods of
verification and hypotheses testing may be less stringent. Nonetheless the cycle of hypothesis, verification and formulation of
new hypotheses will resemble the cycle described below.
Imre Lakatos and Thomas Kuhn had done extensive
work on the "theory laden" character of observation. Kuhn (1961) said the scientist generally has a theory in mind before
designing and undertaking experiments so as to make empirical observations, and that the "route from theory to measurement can
almost never be traveled backward". This implies that the way in which theory is tested is dictated by the nature of the theory
itself, which led Kuhn (1961, p. 166) to argue that "once it has been adopted by a profession ... no theory is recognized to be
testable by any quantitative tests that it has not already passed".
Each element of a scientific method is subject to peer review for possible
mistakes. These activities do not describe all that scientists do (see below) but apply
mostly to experimental sciences (e.g., physics, chemistry). The elements above are often taught in the educational system.[21]
Scientific method is not a recipe: it requires intelligence, imagination, and creativity[22]. It is also an ongoing cycle, constantly developing more useful, accurate and
comprehensive models and methods. For example, when Einstein developed the Special and General Theories of Relativity, he did not
in any way refute or discount Newton's Principia. On the contrary, if the astronomically large, the vanishingly small, and
the extremely fast are reduced out from Einstein's theories — all phenomena that Newton could not have observed — Newton's
equations remain. Einstein's theories are expansions and refinements of Newton's theories, and observations that increase our
confidence in them also increase our confidence in Newton's approximations to them.
A linearized, pragmatic scheme of the four points above is sometimes offered as a guideline for proceeding:[citation needed]
The iterative cycle inherent in this step-by-step methodology goes from point 3 to 6 back to 3 again.
While this schema outlines a typical hypothesis/testing method,[23] it should also be noted that a number of philosophers, historians and sociologists of science
(perhaps most notably Paul Feyerabend) claim that such descriptions of scientific method
have little relation to the ways science is actually practiced.
The "operational" model combines the concepts of factory-style processing, operational definition, and utility:
The Keystones of Science project, sponsored by the journal Science, has
selected a number of scientific articles from that journal and annotated them, illustrating how different parts of each article
embody scientific method. Here is an annotated example of this scientific method example titled Microbial Genes in the Human Genome: Lateral Transfer or Gene Loss?.
DNA example
- Each element of scientific method is illustrated below by an example from the
discovery of the structure of DNA:
- DNA-characterizations: in this case, although the significance of the gene had
been established, the mechanism was unclear to anyone, as of 1950.
- DNA-hypotheses: Crick and Watson hypothesized that the gene had a physical basis - it
was helical.
- DNA-predictions: from earlier work on tobacco
mosaic virus, Watson was aware of the significance of Crick's formulation of the transform of a helix.[24] Thus he was primed for the significance of the X-shape in photo 51.
- DNA-experiments: Watson sees photo 51.
- The examples are continued in "Evaluations and iterations" with DNA-iterations.
Characterizations
Scientific method depends upon increasingly more sophisticated characterizations of subjects of the investigation. (The
subjects can also be called unsolved problems or the unknowns). For example,
Benjamin Franklin correctly characterized St. Elmo's
fire as electrical in nature, but it has taken a long
series of experiments and theory to establish this. While seeking the pertinent properties of the subjects, this careful thought
may also entail some definitions and observations; the observations often demand careful
measurements and/or counting.
The systematic, careful collection of measurements or counts of relevant quantities is often the critical difference between
pseudo-sciences, such as alchemy, and a science, such as chemistry or biology. Scientific measurements taken are usually
tabulated, graphed, or mapped, and statistical manipulations, such as correlation and
regression, performed on them. The measurements might be made in a controlled setting, such
as a laboratory, or made on more or less inaccessible or unmanipulatable objects such as stars or human populations. The
measurements often require specialized scientific instruments such as thermometers, spectroscopes, or voltmeters, and the
progress of a scientific field is usually intimately tied to their invention and development.
Uncertainty
Measurements in scientific work are also usually accompanied by estimates of their uncertainty. The uncertainty is often estimated by making repeated measurements of the desired quantity.
Uncertainties may also be calculated by consideration of the uncertainties of the individual underlying quantities that are used.
Counts of things, such as the number of people in a nation at a particular time, may also have an uncertainty due to limitations
of the method used. Counts may only represent a sample of desired quantities, with an uncertainty that depends upon the sampling
method used and the number of samples taken.
Definition
Measurements demand the use of operational definitions of relevant
quantities. That is, a scientific quantity is described or defined by how it is measured, as opposed to some more vague, inexact
or "idealized" definition. For example, electrical current, measured in amperes, may be
operationally defined in terms of the mass of silver deposited in a certain time on an electrode in an electrochemical device
that is described in some detail. The operational definition of a thing often relies on comparisons with standards: the
operational definition of "mass" ultimately relies on the use of an artifact, such as a certain kilogram of platinum-iridium kept
in a laboratory in France.
The scientific definition of a term sometimes differs substantially from its natural
language usage. For example, mass and weight overlap in
meaning in common discourse, but have distinct meanings in mechanics. Scientific quantities
are often characterized by their units of measure which can later be described in
terms of conventional physical units when communicating the work.
New theories sometimes arise upon realizing that certain terms had not previously been sufficiently clearly defined. For
example, Albert Einstein's first paper on relativity begins by defining simultaneity and
the means for determining length. These ideas were skipped over by Isaac Newton with, "I do not define time, space, place and
motion, as being well known to all." Einstein's paper then demonstrates that they
(viz., absolute time and length independent of motion) were approximations. Francis Crick
cautions us that when characterizing a subject, however, it can be premature to define something when it remains
ill-understood.[25] In Crick's study of consciousness, he
actually found it easier to study awareness in the visual system, rather than to study
Free Will, for example. His cautionary example was the gene; the gene was much more poorly understood before Watson and Crick's
pioneering discovery of the structure of DNA; it would have been counterproductive to spend much time on the definition of the
gene, before them.
DNA-characterizations
- The history of the discovery of the structure of DNA is a classic
example of the elements of scientific method: in 1950 it was known that
genetic inheritance had a mathematical description, starting with the studies of
Gregor Mendel. But the mechanism of the gene was unclear. Researchers in Bragg's laboratory at Cambridge University made
X-ray diffraction pictures of various molecules, starting with crystals of salt, and
proceeding to more complicated substances. Using clues which were painstakingly assembled over the course of decades, beginning
with its chemical composition, it was determined that it should be possible to characterize the physical structure of DNA, and
the X-ray images would be the vehicle. ..2. DNA-hypotheses
Precession of Mercury
- The characterization element can require extended and extensive study, even centuries. It took thousands of years of
measurements, from the Chaldean, Indian, Persian, Greek, Arabic and European astronomers, to record the motion of planet Earth. Newton was able to condense these measurements into consequences of his laws of motion. But the perihelion of the planet Mercury's orbit exhibits a precession which is not fully explained by
Newton's laws of motion. The observed difference for Mercury's precession, between Newtonian
theory and relativistic theory (approximately 43 arc-seconds per century), was one of the things that occurred to Einstein as a
possible early test of his theory of General Relativity.
Hypothesis development
A hypothesis is a suggested explanation of a phenomenon, or alternately a reasoned
proposal suggesting a possible correlation between or among a set of phenomena.
Normally hypotheses have the form of a mathematical model. Sometimes, but not
always, they can also be formulated as existential statements, stating that
some particular instance of the phenomenon being studied has some characteristic and causal explanations, which have the general
form of universal statements, stating that every instance of the phenomenon has
a particular characteristic.
Scientists are free to use whatever resources they have — their own creativity, ideas from other fields, induction, Bayesian inference, and so on — to imagine
possible explanations for a phenomenon under study. Charles Sanders Peirce, borrowing a
page from Aristotle (Prior Analytics,
2.25) described the incipient stages of inquiry, instigated by
the "irritation of doubt" to venture a plausible guess, as abductive reasoning. The
history of science is filled with stories of scientists claiming a "flash of inspiration", or a hunch, which then motivated them
to look for evidence to support or refute their idea. Michael Polanyi made such
creativity the centrepiece of his discussion of methodology.
Karl Popper, following others, developing and inverting the views of the Austrian
logical positivists, has argued that a hypothesis must be falsifiable, and that a proposition or theory cannot be called scientific if it does not admit the
possibility of being shown false. It must at least in principle be possible to make an observation that would show the
proposition to be false, even if that observation had not yet been made.
William Glen observes that
- the success of a hypothesis, or its service to science, lies not simply in its perceived "truth", or power to displace,
subsume or reduce a predecessor idea, but perhaps more in its ability to stimulate the research that will illuminate … bald
suppositions and areas of vagueness.[26]
In general scientists tend to look for theories that are "elegant" or "beautiful". In contrast to the usual English use of these terms, they here refer to a theory in accordance
with the known facts, which is nevertheless relatively simple and easy to handle. Occam's
Razor serves as a rule of thumb for making these determinations.
DNA-hypotheses
- Linus Pauling proposed that DNA was a triple helix. Francis Crick and James Watson learned of Pauling's hypothesis,
understood from existing data that Pauling was wrong and realized that Pauling would soon realize his mistake. So the race was on
to figure out the correct structure. Except that Pauling did not realize at the time that he was in a race! ..3. DNA-predictions
Predictions from the hypothesis
Any useful hypothesis will enable predictions, by reasoning including deductive reasoning. It might predict the
outcome of an experiment in a laboratory setting or the observation of a phenomenon in nature. The prediction can also be
statistical and only talk about probabilities.
It is essential that the outcome be currently unknown. Only in this case does the eventuation increase the probability that
the hypothesis be true. If the outcome is already known, it's called a consequence and should have already been considered while
formulating the hypothesis.
If the predictions are not accessible by observation or experience, the hypothesis is not yet useful for the method, and must
wait for others who might come afterward, and perhaps rekindle its line of reasoning. For example, a new technology or theory
might make the necessary experiments feasible.
DNA-predictions
- When Watson and Crick hypothesized that DNA was a double helix, Francis Crick predicted that an X-ray diffraction image of DNA would show an X-shape. Also in their first
paper they predicted that the double helix structure that they discovered would prove
important in biology, writing "It has not escaped our notice that the specific pairing we have postulated immediately suggests a
possible copying mechanism for the genetic material". ..4. DNA-experiments
General relativity
- Einstein's theory of General Relativity makes several specific predictions about
the observable structure of space-time, such as a prediction that light bends in a gravitational field and that the amount of bending
depends in a precise way on the strength of that gravitational field. Arthur
Eddington's observations made during a 1919 solar eclipse supported General
Relativity rather than Newtonian gravitation.
Experiments
-
The control is very important.
Once predictions are made, they can be tested by experiments. If test results contradict predictions, then the hypotheses are
called into question and explanations may be sought. Sometimes experiments are conducted incorrectly and are at fault. If the
results confirm the predictions, then the hypotheses are considered likely to be correct but might still be wrong and are subject
to further testing.
Depending on the predictions, the experiments can have different shapes. It could be a classical experiment in a laboratory
setting, a double-blind study or an archaeological excavation. Even taking a plane from New York to Paris is an experiment which tests the aerodynamical
hypotheses used for constructing the plane.
Scientists assume an attitude of openness and accountability on the part of those conducting an experiment. Detailed record
keeping is essential, to aid in recording and reporting on the experimental results, and providing evidence of the effectiveness
and integrity of the procedure. They will also assist in reproducing the experimental results. This tradition can be seen in the
work of Hipparchus (190 BCE - 120 BCE), when determining a value for the precession of the
Earth over 2100 years ago, and 1000 years before Al-Batani (853
CE – 929 CE).
DNA-experiments
- Before proposing their model Watson and Crick had previously seen x-ray diffraction images by Rosalind Franklin, Maurice Wilkins, and Raymond Gosling. However, they later reported that Franklin initially rebuffed their suggestion that DNA
might be a double helix. Franklin had immediately spotted flaws in the initial hypotheses about the structure of DNA by Watson and Crick. The X-shape in X-ray images helped confirm the helical structure of DNA[27]. ..1.
DNA-characterizations
Evaluation and iteration
Testing and improvement
The scientific process is iterative. At any stage it is possible that some consideration will lead the scientist to repeat an
earlier part of the process. Failure to develop an interesting hypothesis may lead a scientist to re-define the subject they are
considering. Failure of a hypothesis to produce interesting and testable predictions may lead to reconsideration of the
hypothesis or of the definition of the subject. Failure of the experiment to produce interesting results may lead the scientist
to reconsidering the experimental method, the hypothesis or the definition of the subject.
Other scientists may start their own research and enter the process at any stage. They might adopt the characterization and
formulate their own hypothesis, or they might adopt the hypothesis and deduce their own predictions. Often the experiment is not
done by the person who made the prediction and the characterization is based on experiments done by someone else. Published
results of experiments can also serve as a hypothesis predicting their own reproducibility.
DNA-iterations
- After considerable fruitless experimentation, being discouraged by their superior from continuing, and numerous false starts,
Watson and Crick were able to infer the essential structure of DNA by concrete modeling of the physical shapes of the nucleotides which comprise it. They were guided by the bond lengths which had been deduced by
Linus Pauling and Rosalind Franklin's X-ray
diffraction images. ..DNA Example
Confirmation
Science is a social enterprise, and scientific work tends to be accepted by the community when it has been confirmed.
Crucially, experimental and theoretical results must be reproduced by others within the science community. Researchers have given
their lives for this vision; Georg Wilhelm Richmann was killed by
lightning (1753) when attempting to replicate the 1752 kite-flying experiment of
Benjamin Franklin.[28]
To protect against bad science and fraudulent data, government research granting agencies like NSF and science journals like
Nature and Science have a policy that researchers must archive their data and methods so other researchers can access it, test
the data and methods and build on the research that has gone before. Scientific data
archiving can be done at a number of national archives in the U.S. or in the World
Data Center.
Models of scientific inquiry
-
Classical model
The classical model of scientific inquiry derives from Aristotle[29], who distinguished the forms of approximate and exact reasoning, set out the threefold scheme of
abductive, deductive, and
inductive inference, and also treated the compound forms such as reasoning by
analogy.
Pragmatic model
-
Charles Peirce considered scientific inquiry to be a species of the genus
inquiry, which he defined as any means of fixing belief, that is, any means of arriving at a settled opinion on a matter
in question. He observed that inquiry in general begins with a state of uncertainty and moves toward a state of certainty,
sufficient at least to terminate the inquiry for the time being. He graded the prevalent forms of inquiry according to their
evident success in achieving their common objective, scoring scientific inquiry at the high end of this scale. At the low end he
placed what he called the method of tenacity, a die-hard attempt to deny uncertainty and fixate on a favored belief. Next
in line he placed the method of authority, a determined attempt to conform to a chosen
source of ready-made beliefs. After that he placed what might be called the method of congruity, also called the a
priori, the dilettante, or the what is agreeable to reason method. Peirce observed the fact of human nature
that almost everybody uses almost all of these methods at one time or another, and that even scientists, being human, use the
method of authority far more than they like to admit. But what recommends the specifically scientific method of inquiry above all
others is the fact that it is deliberately designed to arrive at the ultimately most secure beliefs, upon which the most
successful actions can be based.[30]
Computational approaches
Many subspecialties of applied logic and computer
science, to name a few, artificial intelligence, machine learning, computational learning theory,
inferential statistics, and knowledge
representation, are concerned with setting out computational, logical, and statistical frameworks for the various types of
inference involved in scientific inquiry, in particular, hypothesis formation,
logical deduction, and empirical
testing. Some of these applications draw on measures of complexity from algorithmic information theory to
guide the making of predictions from prior distributions of experience, for
example, see the complexity measure called the speed prior from which a computable
strategy for optimal inductive reasoning can be derived.
Philosophy and sociology of science
-
- Further information: Sociology of science
While the philosophy of science has limited direct impact on day-to-day
scientific practice, it plays a vital role in justifying and defending the scientific approach. Philosophy of science looks at
the underpinning logic of the scientific method, at what separates science from
non-science, and the ethic that is implicit in science.
We find ourselves in a world that is not directly understandable. We find that we sometimes disagree with others as to the
facts of the things we see in the world around us, and we find that there are things in the world
that are at odds with our present understanding. The scientific method attempts to provide a way in which we can reach agreement
and understanding. A "perfect" scientific method might work in such a way that rational
application of the method would always result in agreement and understanding; a perfect method would arguably be algorithmic, and so not leave any room for rational agents to disagree. As with all philosophical topics, the search has been neither straightforward nor simple. Logical Positivist, empiricist, falsificationist, and other theories have claimed to give a definitive account of the logic of science,
but each has in turn been criticized.
Thomas Samuel Kuhn examined the history of science in his The Structure of Scientific Revolutions, and found that the actual method
used by scientists differed dramatically from the then-espoused method.
Paul Feyerabend similarly examined the history of science, and was led to deny that
science is genuinely a methodological process. In his book Against Method he
argues that scientific progress is not the result of applying any particular method. In essence, he says that "anything
goes", by which he meant that for any specific methodology or norm of science, successful science has been done in violation of
it. Criticisms such as his led to the strong programme, a radical approach to the
sociology of science.
In his 1958 book, Personal Knowledge, chemist and philosopher Michael Polanyi
(1891-1976) criticized the common view that the scientific method is purely objective and generates objective knowledge. Polanyi
cast this view as a misunderstanding of the scientific method and of the nature of scientific inquiry, generally. He argued that
scientists do and must follow personal passions in appraising facts and in determining which scientific questions to investigate.
He concluded that a structure of liberty is essential for the advancement of science - that the freedom to pursue science for its
own sake is a prerequisite for the production of knowledge through peer review and the scientific method.
The postmodernist critiques of science have themselves been the subject of intense
controversy and heated dialogue. This ongoing debate, known as the science wars, is the
result of the conflicting values and assumptions held by the postmodernist and
realist camps. Whereas postmodernists assert
that scientific knowledge is simply another discourse and not representative of any form of fundamental truth, realists in the scientific community maintain that scientific knowledge does reveal real and
fundamental truths about reality. Many books have been written by scientists which take on this problem and challenge the
assertions of the postmodernists while defending science as a legitimate method of
deriving truth.[31][32][33][34][35]
Communication, community, culture
Frequently the scientific method is not employed by a single person, but by several people cooperating directly or indirectly.
Such cooperation can be regarded as one of the defining elements of a scientific
community. Various techniques have been developed to ensure the integrity of the scientific method within such an
environment.
Peer review evaluation
Scientific journals use a process of peer review, in which scientists'
manuscripts are submitted by editors of scientific journals to (usually one to three) fellow (usually anonymous) scientists
familiar with the field for evaluation. The referees may or may not recommend publication, publication with suggested
modifications, or, sometimes, publication in another journal. This serves to keep the scientific literature free of unscientific
or crackpot work, helps to cut down on obvious errors, and generally otherwise improve the quality of the scientific literature.
Work announced in the popular press before going through this process is generally frowned upon. Sometimes peer review inhibits
the circulation of unorthodox work, especially if it undermines the establishment in the particular field, and at other times may
be too permissive. Other drawbacks includes cronyism and favoritism. The peer review process is not always successful, but has
been very widely adopted by the scientific community.