Share on Facebook Share on Twitter Email
Answers.com

random variable

 
Dictionary: random variable
 

n.

A variable whose values are random but whose statistical distribution is known.


Search unanswered questions...
Enter a word or phrase...
All Community Q&A Reference topics
Statistics Dictionary: random variable
 

Variant: r.v.

When the value of a variable is subject to random variation, or when it is the value of a randomly chosen member of a population, it is described as a random variable — though the adjective 'random' may be omitted. See probability distribution.



 
Business Dictionary: Random Variable
Top

A statistical term meaning any data.

 
Britannica Concise Encyclopedia: random variable
Top

In statistics, a function that can take on either a finite number of values, each with an associated probability, or an infinite number of values, whose probabilities are summarized by a density function. Used in studying chance events, it is defined so as to account for all possible outcomes of the event. When these are finite (e.g., the number of heads in a three-coin toss), the random variable is called discrete and the probabilities of the outcomes sum to 1. If the possible outcomes are infinite (e.g., the life expectancy of a light bulb), the random variable is called continuous and corresponds to a density function whose integral over the entire range of outcomes equals 1. Probabilities for specific outcomes are determined by summing probabilities (in the discrete case) or by integrating the density function over an interval corresponding to that outcome (in the continuous case).

For more information on random variable, visit Britannica.com.

 
Philosophy Dictionary: random variable
Top

Intuitively, a variable, such as height, that can take various values in a population, and such that some values have some probability of occurrence, and others a different probability (for example, there might be a probability of one in ten that a person is 5′ 8ʺ tall, but only one in one hundred that they are 6′ 8ʺ tall). In probability theory a random variable is a variable X that can take any one of a finite or countably infinite range of values, each with a probability. The distribution of a random variable is the set of pairs (xi, Prob X = xi), giving the probability associated with each value in the range. In the example, (5′ 8ʺ, 0.1) would be one member of the distribution of the random variable of height in a population, if 10% of the population is that tall.

 
Wikipedia: Random variable
Top

In mathematics, random variables are used in the study of probability. They were developed to assist in the analysis of games of chance, stochastic events, and the results of scientific experiments by capturing only the mathematical properties necessary to answer probabilistic questions. Further formalizations have firmly grounded the entity in the theoretical domains of mathematics by making use of measure theory.

Fortunately, the language and structure of random variables can be grasped at various levels of mathematical fluency. Set theory and calculus are fundamental.

There are two types of random variables — discrete and continuous. A discrete random variable takes values from a countable set of specific values, each with some probability greater than zero. A continuous random variable takes values from an uncountable set, and the probability of any one value is zero, but a set of values can have positive probability. Random variables can also be "mixed", having attributes of both discrete and continuous random variables.

A random variable has an associated probability distribution and frequently also a probability density function. Probability density functions are commonly used for continuous variables.

Contents

Intuitive definition

A random variable can be thought of as an unknown value that may change every time it is inspected. Thus, a random variable can be thought of as a function mapping the sample space of a random process to the real numbers. A few examples will highlight this.

Examples

For a coin toss, the possible events are heads or tails. The number of heads appearing in one fair coin toss can be described using the following random variable:

X = \begin{cases}1,& \text{if heads} ,\\
0,& \text{if tails} .\end{cases}

with probability mass function given by:

\rho_X(x) = \begin{cases}\frac{1}{2},& \text{if }x=0,\\
\frac{1}{2},& \text{if }x=1,\\
0,& \text{otherwise} .\end{cases}

A random variable can also be used to describe the process of rolling a fair die and the possible outcomes. The most obvious representation is to take the set {1, 2, 3, 4, 5, 6} as the sample space, defining the random variable X as the number rolled. In this case ,

X = \begin{cases}1,& \text{if a 1 is rolled} ,\\
2,& \text{if a 2 is rolled} ,\\
3,& \text{if a 3 is rolled} ,\\
4,& \text{if a 4 is rolled} ,\\
5,& \text{if a 5 is rolled} ,\\
6,& \text{if a 6 is rolled} .\end{cases}


\rho_X(x) = \begin{cases}\frac{1}{6},& \text{if }x=1,2,3,4,5,6,\\

0,& \text{otherwise} .\end{cases}

An example of a continuous random variable would be one based on a spinner that can choose a real number from the interval [0, 2π), with all values being "equally likely". In this case, X = the number spun. Any real number has probability zero of being selected. But a positive probability can be assigned to any range of values. For example, the probability of choosing a number in [0, π] is ½. Instead of speaking of a probability mass function, we say that the probability density of X is 1/2π. The probability of a subset of [0, 2π) can be calculated by multiplying the measure of the set by 1/2π. In general, the probability of a set for a given continuous random variable can be calculated by integrating the density over the given set.

An example of a random variable of mixed type would be based on an experiment where a coin is flipped and the spinner is spun only if the result of the coin toss is heads. If the result is tails, X = −1; otherwise X = the value of the spinner as in the preceding example. There is a probability of ½ that this random variable will have the value −1. Other ranges of values would have half the probability of the last example.

Formal definition

Let (\Omega, \mathcal{F}, P) be a probability space and (\mathcal{}Y, \Sigma) be a measurable space. Then a random variable X is formally defined as a measurable function X: \Omega \rightarrow Y. An interpretation of this is that the preimages of the "well-behaved" subsets of Y (the elements of Σ) are events (elements of \mathcal{F}), and hence are assigned a probability by P.

Real-valued random variables

Typically, the measurable space is the measurable space over the real numbers. In this case, let (\Omega, \mathcal{F}, P) be a probability space. Then, the function X: \Omega \rightarrow \mathbb{R} is a real-valued random variable if

\{ \omega : X(\omega) \le r \} \in \mathcal{F} \qquad \forall r \in \mathbb{R}

This definition is a special case of the above because \{(-\infty, r]: r \in \R\} generates the Borel sigma-algebra on the real numbers, and it is enough to check measurability on a generating set. (Here we are using the fact that \{ \omega : X(\omega) \le r \} = X^{-1}((-\infty, r]).)

Distribution functions of random variables

Associating a cumulative distribution function (CDF) with a random variable is a generalization of assigning a value to a variable. If the CDF is a (right continuous) Heaviside step function then the variable takes on the value at the jump with probability 1. In general, the CDF specifies the probability that the variable takes on particular values.

If a random variable X: \Omega \to \mathbb{R} defined on the probability space (\Omega, \mathcal{F}, P) is given, we can ask questions like "How likely is it that the value of X is bigger than 2?". This is the same as the probability of the event \{ \omega : X(\omega) > 2 \}\,\! which is often written as P(X > 2)\,\! for short, and easily obtained since P(X > 2)=1-P(X \le 2)

Recording all these probabilities of output ranges of a real-valued random variable X yields the probability distribution of X. The probability distribution "forgets" about the particular probability space used to define X and only records the probabilities of various values of X. Such a probability distribution can always be captured by its cumulative distribution function

F_X(x) = \operatorname{P}(X \le x)

and sometimes also using a probability density function. In measure-theoretic terms, we use the random variable X to "push-forward" the measure P on Ω to a measure dF on R. The underlying probability space Ω is a technical device used to guarantee the existence of random variables, and sometimes to construct them. In practice, one often disposes of the space Ω altogether and just puts a measure on R that assigns measure 1 to the whole real line, i.e., one works with probability distributions instead of random variables.

Moments

The probability distribution of a random variable is often characterised by a small number of parameters, which also have a practical interpretation. For example, it is often enough to know what its "average value" is. This is captured by the mathematical concept of expected value of a random variable, denoted E[X]. In general, E[f(X)] is not equal to f(E[X]). Once the "average value" is known, one could then ask how far from this average value the values of X typically are, a question that is answered by the variance and standard deviation of a random variable.

Mathematically, this is known as the (generalised) problem of moments: for a given class of random variables X, find a collection {fi} of functions such that the expectation values E[fi(X)] fully characterize the distribution of the random variable X.

Functions of random variables

If we have a random variable X on Ω and a Borel measurable function f: RR, then Y = f(X) will also be a random variable on Ω, since the composition of measurable functions is also measurable. (Warning: this is not true if f is Lebesgue measurable.) The same procedure that allowed one to go from a probability space (Ω, P) to (R, dFX) can be used to obtain the distribution of Y. The cumulative distribution function of Y is

F_Y(y) = \operatorname{P}(f(X) \le y).

Example 1

Let X be a real-valued, continuous random variable and let Y = X2.

F_Y(y) = \operatorname{P}(X^2 \le y).

If y < 0, then P(X2y) = 0, so

F_Y(y) = 0\qquad\hbox{if}\quad y < 0.

If y ≥ 0, then

\operatorname{P}(X^2 \le y) = \operatorname{P}(|X| \le \sqrt{y})
 = \operatorname{P}(-\sqrt{y} \le  X \le \sqrt{y}),

so

F_Y(y) = F_X(\sqrt{y}) - F_X(-\sqrt{y})\qquad\hbox{if}\quad y \ge 0.

Example 2

Suppose \scriptstyle X is a random variable with a cumulative distribution

 F_{X}(x) = P(X \leq x) = \frac{1}{(1 + e^{-x})^{\theta}}

where \scriptstyle \theta > 0 is a fixed parameter. Consider the random variable  \scriptstyle Y = \mathrm{log}(1 + e^{-X}). Then,

 F_{Y}(y) = P(Y \leq y) = P(\mathrm{log}(1 + e^{-X}) \leq y) = P(X > -\mathrm{log}(e^{y} - 1)).\,

The last expression can be calculated in terms of the cumulative distribution of X, so

 F_{Y}(y) = 1 - F_{X}(-\mathrm{log}(e^{y} - 1)) \,
 = 1 - \frac{1}{(1 + e^{\mathrm{log}(e^{y} - 1)})^{\theta}}
 = 1 - \frac{1}{(1 + e^{y} - 1)^{\theta}}
 = 1 - e^{-y \theta}.\,

Equivalence of random variables

There are several different senses in which random variables can be considered to be equivalent. Two random variables can be equal, equal almost surely, equal in mean, or equal in distribution.

In increasing order of strength, the precise definition of these notions of equivalence is given below.

Equality in distribution

Two random variables X and Y are equal in distribution if they have the same distribution functions:

\operatorname{P}(X \le x) = \operatorname{P}(Y \le x)\quad\hbox{for all}\quad x.

Two random variables having equal moment generating functions have the same distribution. This provides, for example, a useful method of checking equality of certain functions of i.i.d. random variables.

d(X,Y)=\sup_x|\operatorname{P}(X \le x) - \operatorname{P}(Y \le x)|,

which is the basis of the Kolmogorov–Smirnov test.

Equality in mean

Two random variables X and Y are equal in p-th mean if the pth moment of |XY| is zero, that is,

\operatorname{E}(|X-Y|^p) = 0.

As in the previous case, there is a related distance between the random variables, namely

d_p(X, Y) = \operatorname{E}(|X-Y|^p).\,

This is equivalent to the following:

Almost sure equality

Two random variables X and Y are equal almost surely if, and only if, the probability that they are different is zero:

\operatorname{P}(X \neq Y) = 0.

For all practical purposes in probability theory, this notion of equivalence is as strong as actual equality. It is associated to the following distance:

d_\infty(X,Y)=\sup_\omega|X(\omega)-Y(\omega)|,

where 'sup' in this case represents the essential supremum in the sense of measure theory.

Equality

Finally, the two random variables X and Y are equal if they are equal as functions on their probability space, that is,

X(\omega)=Y(\omega)\qquad\hbox{for all}\quad\omega.

Convergence

Much of mathematical statistics consists in proving convergence results for certain sequences of random variables; see for instance the law of large numbers and the central limit theorem.

There are various senses in which a sequence (Xn) of random variables can converge to a random variable X. These are explained in the article on convergence of random variables.

Literature

See also

This article incorporates material from Random variable on PlanetMath, which is licensed under the GFDL.


 
Best of the Web: random variable
Top

Some good "random variable" pages on the web:


Math
mathworld.wolfram.com
 
 
 

 

Copyrights:

Dictionary. The American Heritage® Dictionary of the English Language, Fourth Edition Copyright © 2007, 2000 by Houghton Mifflin Company. Updated in 2007. Published by Houghton Mifflin Company. All rights reserved.  Read more
Statistics Dictionary. A Dictionary of Statistics. Second edition revised. Copyright © Oxford University Press, 2008. All rights reserved.  Read more
Business Dictionary. Dictionary of Business Terms. Copyright © 2000 by Barron's Educational Series, Inc. All rights reserved.  Read more
Britannica Concise Encyclopedia. Britannica Concise Encyclopedia. © 2006 Encyclopædia Britannica, Inc. All rights reserved.  Read more
Philosophy Dictionary. The Oxford Dictionary of Philosophy. Copyright © 1994, 1996, 2005 by Oxford University Press. All rights reserved.  Read more
Wikipedia. This article is licensed under the GNU Free Documentation License. It uses material from the Wikipedia article "Random variable" Read more