Word clouds are one of the simplest of data visualizations to create and understand. Everyone instinctively understands that the size of words in a word cloud corresponds to the frequency in which that word has been mentioned, with larger words receiving more prominence. In this post, I'm going to show you how you can create a word cloud.

 

Create your own Word Cloud

A long time ago, in a galaxy far, far, away, Tom Cruise jumped over a couch and landed on everybody else's nose. Most people seem to have forgiven him, but at the time, he was unpopular. He'd just broken up with Nicole Kidman. This post is going to show you why Displayr is the best tool for word clouds using data from that time and galaxy. That data contains 300 responses of people telling us why they dislike Tom Cruise. The raw data is below.

How to create a word cloud

Word clouds are created in Displayr by:

  1. Importing a data set into Displayr (Home > Data Set (Data)).
  2. Dragging a text variable from the data tree.
  3. Selecting Home > Chart (Charts) > Word cloud.

If you want to try this for yourself using this data, click here.

Create your own Word Cloud

Automatic removal of uninteresting words

Displayr automatically performs some basic text analytics when you create a word cloud, by choosing to ignore words that it regards as uninteresting. You can drag additional words into the Ignore bucket, or, drag words back.

StopWordsWordCloud

 

Drag and drop to merge words

You can also click on words and drag them onto other words. For example, dragging arrogent onto arrogance. Displayr shows us that the new word is arrogance, which is used by 5 people. We can edit this label if we wish.

MergedWordCloud

 

Viewing the original text

By clicking on the magnifying glass next to the words (e.g., to the right of arrogent, in the image above), we can see the original text. This allows us to confirm if it is appropriate to merge together the words.

OriginalTextWordCloud

Adding phrases to the word cloud

Often when we are trying to create a word cloud we need to add a phrase. For example, in the word cloud, you can see that Tom and Cruise are appearing as separate words. If you click on Tom, you will see that 23 of the appearances are Tom Cruise. The way that we get Displayr to include a phrase is to click on the word we want to change (e.g., Tom) and then edit the name, in the field on the top-left, remembering to press Enter on your keyboard. Below, you can see what happens, we have automatically created a phrase called Tom Cruise, and it contains the 23 Tom Cruise responses and the one Tom response. If we want to split off the solo Tom as its own word on the word cloud, we click the cross next to it.

TomCruiseRenaming

Create your own Word Cloud

 

In my next post, I discuss how you can automate the whole tidying up process of adding phrases and the like using text analytics.