Share on Facebook Share on Twitter Email
Answers.com

Multidimensional scaling

 
Statistics Dictionary: multidimensional scaling

A multivariate method, resembling principal components analysis, that gives a graphical representation of the main characteristics of a proximity matrix using only a few dimensions. The success of the method is assessed by comparing the 'distances' between individuals as recorded by the proximity matrix with the (suitably scaled) Euclidean distances in the full-dimensional space; the resulting measure of goodness-of-fit is called the stress (also called Kruskal stress) of the solution — a low value suggests a good solution.

The proximity matrix shows, for seven European countries, the numbers of food products (out of twenty)

great britainirelandnether- landsbelgiumfranceitalyspain
Great Britain2011116726
Ireland11201110111010
Netherlands111120121166
Belgium610122012119
France7111112201012
Italy210611102013
Spain61069121320




Multidimensional scaling of countries and food products. The results may appear surprising, but they reflect the (very crude) information from which they were derived. The (unlabelled) axes reflect different aspects of eating behaviour.
that were found to similar extents (i.e. common in both countries or scarce in both countries) in each of the pairs of countries. The biggest difference is between Great Britain and Italy: this is reflected in a two-dimensional graph of the positions of the countries.



Search unanswered questions...
Enter a question here...
Search: All sources Community Q&A Reference topics
Geography Dictionary: multidimensional scaling
Top

A family of statistical methods whereby the information in a data set is represented by a set of points in multidimensional space. MDS mapping can be used to find a configuration of n points in m-dimensional space such that the distances between points in the configuration match the experimental dissimilarities as closely as possible. It can be conducted in any number of dimensions up to (n - 1), where n is the number of objects, such as provinces, cities, or islands, to be represented in the space.

Geographers tend to use ‘lower order spaces’ (1 = m = 3) rather than mapping based on space, time, or time-space; for example, in the case of infection rates using MDS, two places with similar rates will be mapped close together, even if they are geographically remote. MDS has been particularly useful in epidemiological studies (A. D. Cliff et al., 2000).

Multidimensional scaling is a technique that gives a graphical representation of the main characteristics of a proximity matrix using only a few dimensions. The results are evaluated by comparing the ‘distances’ between individuals on the proximity matrix with the Euclidean distances suggested by the method; and a low value of the stress of the solution (a measure of goodness-of-fit) suggests a good solution.

Wikipedia: Multidimensional scaling
Top

Multidimensional scaling (MDS) is a set of related statistical techniques often used in information visualization for exploring similarities or dissimilarities in data. MDS is a special case of ordination. An MDS algorithm starts with a matrix of item–item similarities, then assigns a location to each item in N-dimensional space, where N is specified a priori. For sufficiently small N, the resulting locations may be displayed in a graph or 3D visualisation.

Contents

Types

MDS algorithms fall into a taxonomy, depending on the meaning of the input matrix:

Classical multidimensional scaling 
also known as Torgerson Scaling or Torgerson-Gower scaling – takes an input matrix giving dissimilarities between pairs of items and outputs a coordinate matrix whose configuration minimizes a loss function called strain.[1] (pp. 207–212).
Metric multidimensional scaling
A superset of classical MDS that generalizes the optimization procedure to a variety of loss functions and input matrices of known distances with weights and so on. A useful loss function in this context is called stress which is often minimized using a procedure called Stress Majorization.
Non-metric multidimensional scaling
In contrast to metric MDS, non-metric MDS both finds a non-parametric monotonic relationship between the dissimilarities in the item-item matrix and the Euclidean distance between items, and the location of each item in the low-dimensional space. The relationship is typically found using isotonic regression.
Generalized multidimensional scaling
An extension of metric multidimensional scaling, in which the target space is an arbitrary smooth non-Euclidean space. In case when the dissimilarities are distances on a surface and the target space is another surface, GMDS allows finding the minimum-distortion embedding of one surface into another[2].

Details

The data to be analyzed is a collection of I objects (colors, faces, stocks, ...) on which a distance function is defined,

δi,j := distance between i th and j th objects.

These distances are the entries of the dissimilarity matrix

\Delta := 
\begin{pmatrix}
\delta_{1,1} & \delta_{1,2} & \cdots & \delta_{1,I} \\
\delta_{2,1} & \delta_{2,2} & \cdots & \delta_{2,I} \\
\vdots & \vdots & & \vdots \\
\delta_{I,1} & \delta_{I,2} & \cdots & \delta_{I,I}
\end{pmatrix}.

The goal of MDS is, given Δ, to find I vectors x_1,\ldots,x_I \in \mathbb{R}^N such that

\|x_i - x_j\| \approx \delta_{i,j} for all i,j\in I,

where \|\cdot\| is a vector norm. In classical MDS, this norm is the Euclidean distance, but more generally it may be a metric or arbitrary distance function.[3]

In other words, MDS attempts to find an embedding from the I objects into RN such that distances are preserved. If the dimension N is chosen to be 2 or 3, we may plot the vectors xi to obtain a visualization of the similarities between the I objects.

Procedure

There are several steps in conducting MDS research:

  1. Formulating the problem – What variables do you want to compare? How many variables do you want to compare? More than 20 is cumbersome. Less than 8 (4 pairs) will not give valid results. What purpose is the study to be used for?
  2. Obtaining Input Data – Respondents are asked a series of questions. For each product pair they are asked to rate similarity (usually on a 7 point Likert scale from very similar to very dissimilar). The first question could be for Coke/Pepsi for example, the next for Coke/Hires rootbeer, the next for Pepsi/Dr Pepper, the next for Dr Pepper/Hires rootbeer, etc. The number of questions is a function of the number of brands and can be calculated as Q = N(N − 1) / 2 where Q is the number of questions and N is the number of brands. This approach is referred to as the “Perception data : direct approach”. There are two other approaches. There is the “Perception data : derived approach” in which products are decomposed into attributes which are rated on a semantic differential scale. The other is the “Preference data approach” in which respondents are asked their preference rather than similarity.
  3. Running the MDS statistical program – Software for running the procedure is available in many software for statistics. Often there is a choice between Metric MDS (which deals with interval or ratio level data), and Nonmetric MDS (which deals with ordinal data).
  4. Decide number of dimensions – The researchers must decide on the number of dimensions they want the computer to create. The more dimensions, the better the statistical fit, but the more difficult it is to interpret the results.
  5. Mapping the results and defining the dimensions – The statistical program (or a related module) will map the results. The map will plot each product (usually in two dimensional space). The proximity of products to each other indicate either how similar they are or how preferred they are, depending on which approach was used. The dimensions must be labelled by the researcher. This requires subjective judgement and is often very challenging. The results must be interpreted ( see perceptual mapping).
  6. Test the results for reliability and Validity – Compute R-squared to determine what proportion of variance of the scaled data can be accounted for by the MDS procedure. An R-square of .6 is considered the minimum acceptable level. An R-square of .8 is considered good for metric scaling and .9 is considered good for non-metric scaling. Other possible tests are Kruskal’s Stress, split data tests, data stability tests (i.e., eliminating one brand), and test-retest reliability.

Applications

Applications include scientific visualisation and data mining in fields such as cognitive science, information science, psychophysics, psychometrics, marketing and ecology. New applications arise in the scope of autonomous wireless nodes which populate a space or an area. MDS may apply as a real time enhanced approach to monitoring and managing such populations.

Marketing

In marketing, MDS is a statistical technique for taking the preferences and perceptions of respondents and representing them on a visual grid, called perceptual maps.

Comparison and advantages

Potential customers are asked to compare pairs of products and make judgements about their similarity. Whereas other techniques (such as factor analysis, discriminant analysis, and conjoint analysis) obtain underlying dimensions from responses to product attributes identified by the researcher, MDS obtains the underlying dimensions from respondents’ judgments about the similarity of products. This is an important advantage. It does not depend on researchers’ judgments. It does not require a list of attributes to be shown to the respondents. The underlying dimensions come from respondents’ judgments about pairs of products. Because of these advantages, MDS is the most common technique used in perceptual mapping.

See also

Implementations

Bibliography

  1. ^ Borg, I. and Groenen, P.: "Modern Multidimensional Scaling: theory and applications" (2nd ed.), Springer-Verlag New York, 2005
  2. ^ Bronstein, A. M, Bronstein, M.M, and Kimmel, R. (2006), Generalized multidimensional scaling: a framework for isometry-invariant partial surface matching, Proc. National Academy of Sciences (PNAS), Vol. 103/5, pp. 1168–1172
  3. ^ Kruskal, J. B., and Wish, M. (1978), Multidimensional Scaling, Sage University Paper series on Quantitative Application in the Social Sciences, 07-011. Beverly Hills and London: Sage Publications.
  • Cox, T.F., Cox, M.A.A., (2001), Multidimensional Scaling, Chapman and Hall.
  • Coxon, Anthony P.M. (1982): "The User's Guide to Multidimensional Scaling. With special reference to the MDS(X) library of Computer Programs." London: Heinemann Educational Books.
  • Green, P. (1975) Marketing applications of MDS: Assessment and outlook, Journal of Marketing, vol 39, January 1975, pp 24–31.
  • Torgerson, W. S. (1958). Theory & Methods of Scaling. New York: Wiley.

External links


 
 

 

Copyrights:

Statistics Dictionary. A Dictionary of Statistics. Second edition revised. Copyright © Oxford University Press, 2008. All rights reserved.  Read more
Geography Dictionary. A Dictionary of Geography. Copyright © Susan Mayhew 1992, 1997, 2004. All rights reserved.  Read more
Wikipedia. This article is licensed under the Creative Commons Attribution/Share-Alike License. It uses material from the Wikipedia article "Multidimensional scaling" Read more