R is one of the most powerful coding languages for analyzing data. It’s used by millions of people across the globe, and is free to…

How to Dynamically Change a Question Based on a Control Box

Control boxes are a popular way for users to change things on a Displayr page. This post will show you how to use a control...

4 Visualizations For Your Customer Satisfaction Data

Customer satisfaction is a valuable customer feedback metric. Here are four visualizations for finding stories in your customer satisfaction data.

Using R in Displayr Video Series

What is Driver Analysis?

Driver analysis, which is also known as key driver analysis, importance analysis, and relative importance analysis, quantifies the importance of a series of predictor variables...

What is Data Merging?

Data merging is the process of combining two or more data sets into a single data set. Most often, this process is necessary when you...

How to Interpret Logistic Regression Coefficients

This post describes how to interpret the coefficients, also known as parameter estimates, from logistic regression (aka binary logit and binary logistic regression). It does...

Adding Logos to Scatter Plots in Displayr

Adding labels to a scatter plot is a great way to increase understanding. By using logos we can take that process a step further. Users…

Create and Update PowerPoint Reports using R

In my sordid past, I was a data science consultant. One thing about data science that they don’t teach you at school is that senior managers…

Layered Data Visualizations Using R, Plotly, and Displayr

If you have tried to communicate research results and data visualizations using R, there is a good chance you will have come across one of…

Analyzing Google Trends Data in Displayr

Using Google Trends data can add further texture to your analysis by providing a history of how popular a topic is (or was) on the…

Analyzing Google Trends Data in R

Google Trends shows the changes in the popularity of search terms over a given time (i.e., number of hits over time). It can be used...

Adding a Combo Box to a Displayr Dashboard

A combo box can be added to a Displayr document by selecting Insert > Control (More), which causes a combo box to appear in the middle of the…

How to Create an Online Choice Simulator by Hand

This post is for the purist who wants to learn all the mechanics of building a choice simulator from scratch.

Adding Supplementary Points to a Correspondence Analysis

Retrospectively adding supplementary points to a correspondence analysis can greatly assist in the interpretation of results. In other words, including supplementary row or column points…

Moonplots: A Better Visualization for Brand Maps

Moonplots are a better way to visualize brand maps than standard correspondence analysis outputs, which are often difficult to read correctly. The Moonplot resolves the...

Normalization and Scaling in Correspondence Analysis

This post gives recommendations for the best approach to normalization for different situations, making correspondence plots less misleading.

Understanding the Math of Correspondence Analysis

If you've ever wanted a deeper understanding of what's going on behind the scenes of correspondence analysis, then this post is for you. Correspondence analysis...

11 Tips for your own MaxDiff Analysis

MaxDiff analysis is one of those advanced techniques that can be run by any good quantitative researcher. In this post, I share 11 tips to...

Singular Value Decomposition (SVD) Tutorial Using Examples in R

If you have ever looked with any depth at statistical computing for multivariate analysis, there is a good chance you have come across the singular value decomposition...

8 Tips for Interpreting R-Squared

Hopefully, if you have landed on this post you have a basic idea of what the R-Squared statistic means. The R-Squared statistic is a number...

How to Check an Experimental Design (MaxDiff, Choice Modeling)

In this post, I explain the basic process that I tend to follow when doing a rough-and-ready check of an experimental design. The last step,…

Correspondence Analysis of Square Tables

Square tables are data tables where the rows and columns have the same labels, commonly seen as a crosstab of brand switching or brand repertoire data.…

Automatically Fitting the Support Vector Machine Cost Parameter

In an earlier post I discussed how to avoid overfitting when using Support Vector Machines. This was achieved using cross validation. In cross validation, prediction accuracy is…

Put PowerPoint into Cruise Control: How to Automatically Update Your Reports

The ability to automatically update PowerPoint slides with new data can save time, money, error, and your sanity. Some analysis software packages, such as Displayr,...

Customization of Bubble Charts for Correspondence Analysis in Displayr

When you insert a bubble chart in Displayr (Insert > Visualization > Bubbleplot), you can customize some aspects of its appearance from the controls that appear in the object…

Using Bubble Charts to Show Significant Relationships and Residuals in Correspondence Analysis

While correspondence analysis does a great job at highlighting relationships in large tables, a practical problem is that correspondence analysis only shows the strongest relationships, and sometimes…

Why Capability Trumps Character for Supporters of the US President

American supporters of President Donald Trump believe that financial skills are more important in a president than decency and ethics, a new survey shows. Data…

Machine Learning: Pruning Decision Trees

In machine learning and data mining, pruning is a technique associated with decision trees. Pruning reduces the size of decision trees by removing parts of...

Comparing Partial Least Squares to Johnson’s Relative Weights

In this post I explore two different methods for computing the relative importance of predictors in regression: Johnson's Relative Weights and Partial Least Squares (PLS) regression. Both techniques solve a problem with...

Using Partial Least Squares to Conduct Relative Importance Analysis in R

Partial Least Squares (PLS) is a popular method for relative importance analysis in fields where the data typically includes more predictors than observations. Relative importance analysis...

The Problem with Using Multiple Linear Regression for Key Driver Analysis: a Case Study of the Cola Market

A key driver analysis investigates the relative importance of predictors against an outcome variable, such as brand preference. Many techniques have been developed for key…

Using Partial Least Squares to Conduct Relative Importance Analysis in Displayr

Partial Least Squares (PLS) is a popular method for relative importance analysis in fields where the data typically includes more predictors than observations. Relative importance analysis…

The Magic Trick that Highlights Interesting Results on Any Table

This post describes the single biggest time saving technique that I know about for highlighting significant results on a table. The table below, which shows…

How to Link Documents in Displayr

Sometimes it is helpful if one Displayr document can refer to information in another document. For example, one document may contain an analysis of sales…

Gradient Boosting Explained – The Coolest Kid on The Machine Learning Block

Gradient boosting is a technique attracting attention for its prediction speed and accuracy, especially with large and complex data.

Using Support Vector Machines in Displayr

Support vector machines (SVMs) are a great machine learning tool for predictive modeling. In this post, I illustrate how to use them. For most problems SVMs…

Using Cross-Validation to Measure MaxDiff Performance

This post compares various approaches to analyzing MaxDiff data using a method known as cross-validation. Before you read this post, make sure you first read How MaxDiff…

How to Analyze MaxDiff Data in Displayr

This post discusses a number of options that are available in Displayr for analyzing data from MaxDiff experiments. For a more detailed explanation of how…

How MaxDiff Analysis Works (Simplish, but Not for Dummies)

This post explains the basic mechanics of how preferences can be measured using the data collected in a MaxDiff experiment. Before you read this post, make sure you...

When to Use, and Not Use, Correspondence Analysis

Correspondence analysis is one of those rare data science tools which make things simpler. You start with a big table that is too hard to…

Correspondence Analysis Versus Multiple Correspondence Analysis: Which to Use and When?

Let me cut to the chase. Multiple correspondence analysis sounds better than correspondence analysis. But, for 99% of real-world data problems, correspondence analysis is the...

How Correspondence Analysis Works (A Simple Explanation)

Correspondence analysis is a data science tool for summarizing tables. This post explains the basics of how it works. It focuses on how to understand…

How to Interpret Correspondence Analysis Plots (It Probably Isn’t the Way You Think)

Correspondence analysis is a popular data science technique. It takes a large table, and turns it into a seemingly easy-to-read visualization. Unfortunately, it is not quite…

Easily Add Images to a Correspondence Analysis Map in Displayr

You can take your correspondence analysis plots to the next level by including images. Better still, you don’t need to paste in the images after…

How to Create a MaxDiff Experimental Design in Displayr

Creating the experimental design for a MaxDiff experiment is easy in Displayr. This post describes how you can create and check the design yourself. If you…

Easily Add Images to a Correspondence Analysis Plot in R

You can take your correspondence analysis plots to the next level by including images. Better still, you don’t need to paste in the images after…

An Introduction to MaxDiff

MaxDiff is a research technique for measuring relative preferences. It is typically used in situations where more traditional question types are problematic. Consider the problem...

5 Ways to Deal with Missing Data in Cluster Analysis

If you have ever tried to perform cluster analysis when you have missing data, there is a good chance your experience was ugly. Most cluster analysis...

Where Pictographs Beat Bar Charts: Proportional Data

Pictographs are exceptionally good for some types of data. In my earlier post, I discussed how they are great for showing counts. In this post, I show...

Where Pictographs Beat Bar Charts: Count Data

Don’t forget you can create free pictographs using Displayr’s pictograph maker. Pictographs are often subject to ridicule. They are seen to compromise interpretability in favor…

Ranking Plots: Illustrating Data with Different Magnitudes

A Ranking Plot, also known as a Rank Flow Plot, is particularly useful for comparing data that differs in magnitude.

Creating tables with multiple variables (filters and multiway tables)

It is super-simple to create a table involving one variable in Displayr: just drag it from Data Sets (bottom-left of the screen) onto a page,…

5 Ways to Visualize Relative Importance Scores from Key Driver Analysis

Key driver analysis techniques, such as Shapley Value, Kruskal Analysis, and Relative Weights, are useful for working out the most important predictor variables for some outcome...

Labeled Scatter Plots and Bubble Charts in R

This post explores how the R package for labeled scatterplots tries to solve the problem of scatterplots and bubble plots or bubble charts in R.

When to Use Relative Weights Over Shapley

Shapley regression is a popular method for estimating the importance of predictor variables in linear regression. This method can deal with highly correlated predictor variables that are...

The Difference Between Shapley Regression and Relative Weights

Shapley regression and Relative Weights are two methods for estimating the importance of predictor variables in linear regression. Studies have shown that the two, despite being constructed in very different…

Too Hot to Handle? The Problem with Heatmaps

Heatmaps are cool. Most people like them. They are so much prettier than a bar chart. The one below, created in Making your data hot:…

The Secret of “Chartjunk”: Why Misleading Visualizations Aren’t Always Bad

There is a war in the world of visualization. It is about chartjunk. Designers like to create charts like the one above. Many data viz experts…

The NPS Recoding Trick: The Smart Way to Compute the Net Promoter Score

The Net Promoter Score is most people's go-to measure for evaluating companies, brands, and business units. However, the standard way of computing the NPS - subtract the...

Assigning Respondents to Clusters/Segments in New Data Files in Displayr

Once you have created segments or clusters, it is often useful to assign people in other data sets to the segments (this is also known as segment…

Creating Custom Sankey Diagrams Using R

I have previously shown how Sankey or alluvial diagrams can easily be used to visualize response patterns in surveys and to display decision trees. Following…

Visualizing Response Patterns and Survey Flow With Sankey Diagrams

If you have spent much time analyzing customer feedback survey data, then you have probably spent a lot of time validating it. This normally entails…

Making Your Data Hot: Heatmaps for the Display of Large Tables

Don’t forget that you can easily use Displayr’s heatmap maker to create your free heatmap! Sometimes tables are just too big to read. The table below shows…

It Is Not the Size That Counts: Small Visualizations Are Preferable to Large Visualizations

All else being equal, small visualizations are better than big visualizations. There is no need to take my word for it. You can prove it…

A Pie Chart for Pi Day: The Data Scientist Pie Eating Challenge

Today is National Pi Day. The number, not the food. As mentioned in a previous post, I love pie charts. And, as luck would have it, I recently chanced...

Decision Tree Visualizations using Sankey Diagrams or Charts

Sankey diagrams are perfect for displaying decision trees (e.g., CART, CHAID). I used to think that Sankey diagrams were just one of those cool visualizations…