20100202

Dimensionality of Visible Data

Human eye has own Curse of Dimensionality (term suggested in 1961 by R.Bellman and described independently by G. Hughes in 1968). In most cases the data (before they visualized) usually organized in multidimensional Cubes (n-Cubes) and/or Data Warehouses and/or speaking more cloudy - in Data Cloud - need to be projected into less-dimensional datasets (small-dimensional Cubes, e.g. 3d-Cubes) before they can be exposed through (preferably  interactive  and  synchronized set of charts, sometimes called dashboards) 2-dimensional surface of computer monitor in form of Charts.

[caption id="attachment_1155" align="aligncenter" width="510"] Projection of DataCloud to DataCubes and then to Charts[/caption]

During last 200+ years people kept inventing all type of charts to be printed on paper or shown on screen, so most charts showing 2- or 3-dimensional datasets. Prof. Hans Rosling led Gapminder.org to create the web-based, animated 6-dimensional Color Bubble Motion Chart (Trendalyzer):

tumblr_mssaaxhajz1stz40uo1_500

ansd screenshot of it here:



which he used in his famous demos: http://www.gapminder.org/world/ , where 6 dimensions in this specific Chart are (almost a record for 2-dimensional chart to carry):

  • X coordinate of the Bubble = Income per person,

  • Y coordinate of the Bubble = Life expectancy,

  • Size of the Bubble = Population of the Country,

  • Color of the Bubble = Continent of the Country,

  • Name of the Bubble = Country,

  • Year = animated 6th Dimension/Parameter as time-stamp of the Bubble.


Trendalyzer was bought from Gapminder in 2007 by Google and was converted into Google Motion Chart, but Google somehow is not in rush to enter the Data Visualization (DV) market.

Dimensionality of this Motion Chart can be pushed even further to 7 dimensions (dimension as an expression of measurement without units) if we will use different Shapes (in addition to filled Circles we can use Triangles, Squares etc.) but it will be literally pushing the limit of what human eye can handle. If you will add to the consideration a tendency of DV Designers to squeeze more than one chart on a screen (how about overcrowded Dashboards with multiple synchronized interactive Charts?), we are literally approaching the limits of both human eye and human brain, regardless of the dimensionality of the Data Warehouse in backend.

Below I approximately assessed the dimensionality of datasets for some popular charts (please feel free to send me the corrections). For each Dataset and respective Chart I estimated the number of measures (usually real or integer number, can be a calculation from other dimensions of dataset), the number of attributes (in many cases they are categories, enumerations or have string as datatype) and 0 or 1 parameter (presenting a well-ordered set, like time (for time series), date, year, sequence (can be used for Data Slicing), natural, integer or real  number) and Dimensionality (the number of Dimensions) as a total number of measures, attributes and parameters in a given dataset.




































































































































































ChartMeasuresAttributesParameterDimensionality
Gauge, Bullet, KPI00
Monochromatic Pie11
Colorful Pie112
Bar/Column112
Sparkline112
Line112
Area112
Radar112
Stacked Line1113
Multiline1113
Stacked Area1113
Overlapped Radar1113
Stacked Bar/Column1113
Heatmap123
Combo123
Mekko213
Scatter (2-d set)213
Bubble (3-d set)314
Shaped Motion Bubble3115
Color Shaped Bubble325
Color Motion Bubble3216
Motion Chart3317




The diversity of Charts and their Dimensionality adding another complexity for DV Designer: what Chart(s) choose. You can find on web some good suggestions about that. Dr. Andrew Abela created Chart Chooser Diagram

[caption id="attachment_1145" align="aligncenter" width="510"] Choosing a good chart by Dr. Abela[/caption]

and it was even converted into online "application"!

Permalink: http://apandre.wordpress.com/2011/03/02/dimensionality/