02 February 2017 |
Using Palm Trees to Visualize Performance Across Multiple Dimensions (Egypt’s Scary Palm Tree)
Palm trees are my favorite visualization. They look great. They are easy to understand. There is no other visualization that is as effective at decomposing performance across multiple dimensions.
Palm trees perfectly align visual elements with the core underlying structure of the data. This encourages our brain to separate out additive effects from interactions – without our brains even needing to know what these terms mean!
A great looking, easy visualization to show multiple dimensions in your data
The palm trees below represent the concerns American travelers have about different destinations. The tallest palm tree is for Egypt. It has the worst performance (i.e., the most concerns). Hover your mouse over the fronds of the palm tree, and you will see the breakdown of these concerns.
Safety is clearly the top concern, although there are lots of other important concerns. Compare China. China has many issues in common with Egypt, but Not being understood is a bigger concern, and Safety is less of a concern.
If you want to play around with the the examples used in this post, or create Palm Tree Plots of your own, you can do so by clicking here.
There is no better way to visualize data with multiple dimensions
I am making a bold claim here. However, it is a claim I can back up.
There is a standard best-practice visualization for data with multiple dimensions: a line chart (I explain the logic of this below). Check out the line chart version of the same data, shown below. Yes, if you dig it is the same data and shows the same patterns. However, your brain has to work really hard to extract them.
Perfect alignment between visual elements and underlying structure of data
The visual elements of a palm tree plot mirror the structure of a two-way analysis of variance, and allow us to visualize the type of conclusions typically obtained via formal statistical testing: separation of the two main effects, and the interaction.
- Main effect 1. Three visual elements represent the average level of concern by country. 1. The height of the palm trees. 2. The average length of the fronds. 3. The order of the palm trees. In the example, it is easy to see that Egypt has marginally more concerns than China, and the UK and Australia are close to tied.
- Main effect 2. Three visual elements encode information regarding the frequency of the concerns. 1. The ordering of the concerns in the legend. 2. The ordering of the fronds (clockwise from midnight). 3. The average length of each type of frond. In our example, reproduced below, we can readily see that Safety is a huge concern, whereas Boredom is largely irrelevant. Although gross patterns can be seen here, this is the weakest aspect of the palm trees(e.g., length of frond is used to encode the prevalence of concerns, but some users will make inferences from area). For this reason, such information is better represented via a separate visualization.
- Interactions. This is where the palm trees have no equal. The statistical literature generally recommends using line charts to represent interactions. Line charts only really work for very simple data sets. The palm trees are so much better for bigger examples, such as this one. Our brains are primed to quickly spot differences in shapes. This allows us to quickly see the relationships between concerns and countries. We can readily see that, for example:
- The UK and Australia are, in terms of concerns, essentially identical: people are concerned about cost, but little else.
- People are also concerned about Cost with France, but Friendliness and Not being understood are also factors (which is a bit sad for France, as my own experience is that these have not been real issues when traveling in France for many years).
- With Mexico, concerns relate to Safety, Health, and Cleanliness.
If you want to play around with the examples used in this post, or create Palm Tree Plots of your own, you can do so by clicking here.
The OECD’s wonderful Better Life Index inspired the visualizations in this post.
Author: Tim Bock
Tim Bock is the founder of Displayr. Tim is a data scientist, who has consulted, published academic papers, and won awards, for problems/techniques as diverse as neural networks, mixture models, data fusion, market segmentation, IPO pricing, small sample research, and data visualization. He has conducted data science projects for numerous companies, including Pfizer, Coca Cola, ACNielsen, KFC, Weight Watchers, Unilever, and Nestle. He is also the founder of Q www.qresearchsoftware.com, a data science product designed for survey research, which is used by all the world’s seven largest market research consultancies. He studied econometrics, maths, and marketing, and has a University Medal and PhD from the University of New South Wales (Australia’s leading research university), where he was an adjunct member of staff for 15 years.