R is one of the most powerful coding languages for analyzing data. It’s used by millions of people across the globe, and is free to…

How to Dynamically Change a Question Based on a Control Box

Control boxes are a popular way for users to change things on a Displayr page. This post will show you how to use a control...

4 Visualizations For Your Customer Satisfaction Data

Customer satisfaction is a valuable customer feedback metric. Here are the four visualizations to find stories in your customer satisfaction data.

Using R in Displayr Video Series

What is Driver Analysis?

Driver analysis, which is also known as key driver analysis, importance analysis, and relative importance analysis, quantifies the importance of a series of predictor variables...

What is Data Merging?

Data merging is the process of combining two or more data sets into a single data set. Most often, this process is necessary when you...

How to Interpret Logistic Regression Coefficients

This post describes how to interpret the coefficients, also known as parameter estimates, from logistic regression (aka binary logit and binary logistic regression). It does...

Visualized: Can we Quantify the Most Popular Music?

Music is an integral part of our pop culture. But can data science reveal the most popular music? Join us as we find out!

How to Calculate an Average Value from Categorical Data in Displayr

The traditional means for outputting categorical data is with percentages representing each category's share of the total sample. However, there is a range of cases where...

The Efficient Algorithm for Choice Model Experimental Designs

In this blog post, I describe the Efficient algorithm for generating choice model designs. The Efficient algorithm is actually a special case of the more general Partial…

The Partial Profiles Algorithm for Experimental Designs

In this blog post, I shall describe the Partial Profiles algorithm for generating choice model designs, which is described in a recent research paper (listed in the references). The…

Using R to Create Free Online Dashboards

It is now possible to create public dashboards, based on R code, for free! To illustrate how it works, I’ve used the free version of Displayr…

Comparing Experimental Designs Based on Model Fit

There are many ways to generate an experimental design. Perhaps the simplest is to randomly assign levels to alternatives in a design. However, other methods…

How to Compute D-Error for a Choice Experiment Using Displayr

D-error is a way of summarizing how good a design is at extracting information from respondents in a choice experiment.

How to Sort Data in R

Sorting is any process in which data is arranged into some meaningful order that makes it easier to interpret and analyze the data. I'll show...

What is R-Squared?

The R-squared statistic quantifies the predictive accuracy of a statistical model. It shows the proportion of variance in the outcome variable that is explained by...

What is the Chi-Square Frequency Test?

The chi-square frequency test gauges whether the observed number of people with different values of a variable is consistent with expectations. Most commonly it is...

How to Create a Top 2 Box Score in Displayr

Top 2 Box Scores are a tidy way of representing favorable responses to scaled questions. In today's post, I'll show you how you can easily do...

What are Survey Quotas?

A survey quota is the number of observations needed to meet a specified requirement, such as the number of men and the number of women...

How to Split Text Strings in Displayr

String splitting is the process of breaking up a text string in a systematic way so that the individual parts of the text can be...

How to Create a Bar Chart in Displayr

There are two methods for creating a chart in Displayr: using the chart menu or the visualization menu. Here we will look at how to...

How to Compute D-Error for a Sawtooth Software CBC Experiment

D-error is a way of summarizing how good a design is at extracting information from respondents in a choice experiment.

Algorithms to Create your Choice Model Experimental Design

In a stated preference discrete choice experiment, respondents are asked a number of questions. Each question asks them to choose between a number of alternatives...

What is Data Filtering?

Data filtering is the process of choosing a smaller part of your data set and using that subset for viewing or analysis. Filtering is generally...

What are Data Measurement Scales?

In psychology and many disciplines that draw on psychology, data is classified as having one of four measurement scale types: nominal, ordinal, interval, and ratio.…

How to Reuse Chart Formatting in Displayr

When building a report from your data, it's a great time saver to be able to re-use formatting on your charts. This also helps you keep...

How to Stack Data in Displayr Using R

Data stacking is a data preparation step where a data set is split into subsets, and the subsets are merged by case (or stacked on...

How to Create Customized Tables in Displayr Using R

You can create tables with custom formatting in Displayr by writing some R code. This lets you go beyond the table styles that are available...

How to Create a Pseudo-Package in Displayr

A great way to improve your efficiency when using R and Displayr is by creating custom functions that you can reuse over and over. One...

Who is the Greatest Supervillain in Cinema History According to Data?

The chilly menace of Darth Vader as he threatens the entire universe. The creepy, psychotic smile of the Joker. The soulless evil of Lord Voldemort....

Can Data Science Craft the Perfect Tinder Bio?

Let's be honest. You're on Tinder. So is practically everyone else, from the barista who made your morning coffee to that cutie at the gym....

Using Code to Show Significant Results with Arrows in Displayr

The tables in Displayr include arrows and font colors to highlight interesting results. These arrows are also available on many of the charts. Adding similar...

Adding Icons to Dashboards Using Font Awesome

In this post I show how to add novel symbols, from ambulances through to arrows, to Displayr documents. These symbols are part of the Font...

What is a Crosstab?

A crosstab is a table showing the relationship between two or more variables. Where the table only shows the relationship between two categorical variables, a crosstab is…

How to Create a Dynamic Bullet-Point List in Displayr Using R

When building dashboards using Displayr, users tend to focus on data tables and various visualizations (for understandable reasons). But suppose we want to display key metrics...

What are Dummy Variables?

A dummy variable is a variable that takes values of 0 and 1, where the values indicate the presence or absence of something (e.g., a…

How to Calculate Penalty Analysis in Displayr

Want to get the jump on your colleagues by learning how to calculate penalty analysis? I'll show you how you can easily create a way...

How to Set Up a Choice-Based Conjoint Analysis in Qualtrics

In this post, we'll see how we can use Displayr to automate the setup of a Choice-Based Conjoint (CBC) analysis in Qualtrics. A CBC analysis...

What are Small Multiples?

A small multiple is a data visualization that consists of multiple charts arranged in a grid. This makes it easy to compare the entirety of...

Using Code to Automatically Update Text and Icons in Dashboards

In this post I show how to write code to create automatically-updating text boxes, where the text and icons update when data changes.

How to Create a Sankey Diagram From a Table in Displayr

In this post, I show you how to use Displayr to create a Sankey diagram when your data is set up as a table (e.g.,...

What is Logistic Regression?

Logistic regression — also known as logit regression, binary logit, or binary logistic regression — is a type of regression analysis used when the dependent...

How to Compute D-error for a Choice Experiment

D-error is a way of summarizing how good or bad a design is at extracting information from respondents in a choice experiment. A design with...

What is D-Error?

D-error is a measure that quantifies how good or bad a design is at extracting information from respondents in an experiment. A lower D-error indicates...

How to Connect Filters to a Combo Box (Control)

With Displayr, you can show different patterns in your data by applying filters to analyses. One way to do this is by creating filters that…

Installing R Packages from GitHub

Often interesting new R packages are only available on GitHub. This post shows how to install an R package from GitHub, in each of R,...

How to Create Sankey Diagrams From Tables (Data Frames) Using R

In this post I show how you can use R to create a Sankey Diagram when your data is set up as a table (data...

What is a Correlation Matrix?

A correlation matrix is a table showing correlation coefficients between variables. Each cell in the table shows the correlation between two variables. A correlation matrix...

How to Create Alternative-Specific Choice Model Designs in Displayr

In a standard choice experiment, respondents are presented with alternatives which have a common set of attributes. Alternative-specific designs relax this requirement and are designed to...

How to Analyze Free-Form Text Data from Surveys

You want to give your survey respondents the opportunity to answer open-ended questions or elaborate on their responses. But how do you analyze the free-form...

How to Guarantee* Your Kickstarter Will Succeed!

*We can't actually guarantee the success of your Kickstarter. Sorry! But we can show you what the data says about past Kickstarters, and the factors...

What is Deep Learning?

Deep learning is a subset of machine learning. Like other machine-learning techniques, deep learning creates a mapping from input data to a target outcome. After...

What are the Different Types of Missing Data?

There are four qualitatively distinct types of missing data. Missing data is either: structurally missing, missing completely at random (MCAR), missing at random, or nonignorable...

What are the Alternatives to Random Sampling?

A random sample is a subset of individuals selected at random from a larger population, where each individual in the population has a known and…

How to Create a Bump Chart (Ranking Plot) from a Table Using Displayr

Bump charts, also known as ranking plots, are a great way to compare rankings of information by groups. Displayr assumes that your data is in...

Factor Analysis and Principal Component Analysis: A Simple Explanation

Factor analysis and principal component analysis identify patterns in the correlations between variables. These patterns are used to infer the existence of underlying latent variables...

What is MaxDiff?

MaxDiff is a survey research technique for working out relative preferences. What do people like most? Second-most? Etc. It is useful in situations when simpler...

What is Spurious Correlation?

A spurious correlation occurs when two variables are statistically related but not directly causally related. These two variables falsely appear to be related to each...

What is Heteroscedasticity?

Heteroscedasticity (also spelled “heteroskedasticity”) refers to a specific type of pattern in the residuals of a model, whereby for some subsets of the residuals the...

What is the Replication Crisis?

The replication crisis, also known as the replicability crisis or the reproducibility crisis, refers to the growing belief that the results of many scientific studies...

Statistics vs Data Science: What’s the Difference?

There is a great deal of overlap between the fields of statistics and data science, to the point where many definitions of one discipline could...

Using ‘Small Multiples’ Visualizations for Big Success

What's better than one big chart? Lots of small charts! The most common name for a series of small charts in a table layout is...

How Many Rats Does it Take to Power a Lightbulb?

Have you ever wondered how many rats it would take to power a light bulb? No, we hadn't either. But now we can't stop thinking...

How Random Forests Fit to Data

A random forest is a collection of decision trees. The forest learns patterns in data and makes predictions based on those patterns. In this post,...

What is Data Stacking?

Data stacking involves splitting a data set up into smaller data files, and stacking the values for each of the variables into a single column....

What are Residuals?

Residuals in a statistical or machine learning model are the differences between observed and predicted values of data. They are a diagnostic measure used when...

How is Splitting Decided for Decision Trees?

Decision trees are a machine learning technique for making predictions. They are built by repeatedly splitting training data into smaller and smaller samples. This post...