Map Catalog

Monday, December 8, 2008

Star plot


A star plot is a graphical method of displaying multivariate data. Each star represents a single observation. In this star plot from NASA the center represents the most desirable design results represented.

Correlation matrix


A correlation matrix is a matrix giving the correlations between all pairs of data sets. THe above matrix is a calculated protein correlation matrix for phage T7. Correlated behavior ranges from high(red) to low(blue) and the triangular block of red reflects proteins involved in phage assembly.

Similarity matrix


A similarity matrix is a matrix of scores which represent the similarity between two data points. This matrix explores the use of behavioral attribute clustering as a method to automatically categorize common malware patterns under one forensic model description, and to help rapidly identify new malware behavioral patterns.

Stem and leaf plot


A stem and leaf plot is a display that organizes data to show the shape and distribution. The data is organized by place value. Typically the "leaf" is the last number of the digit and all the numbers to the left are the "stem". For example - 90 (9 is the stem, 0 the leaf).

Box plot


A box plot, also called a box and whisker plot, was created in 1977 by John Tukey. The plot is an efficient way of showing a five-number data summary. The box gives the middle 50% of the data. The upper edge gives the 75th percentile of the data set and the lower the 25th percentile. The line in the middle of the box represents the median value. The whiskers represent the minimum and maximum data values.

Histogram


A histogram is graphical display of tabulated frequencies shown as bars.

Parallel coordinate graph


A parallel coordinate graph is used to plot large multivariate datasets. Each variable in the data plot is represented as its own Y Axis on the graph. A maximum point for each Y axis is selected, and they are scaled relatively to each other so that each variable takes up the same area in the graph space. This graph was used to plot baseball statistics.