Map Catalog
Monday, December 8, 2008
Similarity matrix
A similarity matrix is a matrix of scores which represent the similarity between two data points. This matrix explores the use of behavioral attribute clustering as a method to automatically categorize common malware patterns under one forensic model description, and to help rapidly identify new malware behavioral patterns.
Box plot
A box plot, also called a box and whisker plot, was created in 1977 by John Tukey. The plot is an efficient way of showing a five-number data summary. The box gives the middle 50% of the data. The upper edge gives the 75th percentile of the data set and the lower the 25th percentile. The line in the middle of the box represents the median value. The whiskers represent the minimum and maximum data values.
Parallel coordinate graph
A parallel coordinate graph is used to plot large multivariate datasets. Each variable in the data plot is represented as its own Y Axis on the graph. A maximum point for each Y axis is selected, and they are scaled relatively to each other so that each variable takes up the same area in the graph space. This graph was used to plot baseball statistics.