Data Science.

What’s in the Future for Predictive Lead Scoring?
28 November 2018 | by Lucy Li
It was only a few years ago that people were proclaiming that the future of B2B marketing had arrived in the form of predictive lead...

Continue reading

Focusing the Results of Correspondence Analysis in Displayr
29 October 2018 | by Jake Hoare
Correspondence analysis is often used to visualize a table of data. The goal is to represent as much information as possible, as accurately as possible. However,...

Continue reading

Decision Trees Are Usually Better Than Logistic Regression
25 October 2018 | by Tim Bock

If you’ve studied a bit of statistics or machine learning, there is a good chance you have come across logistic regression (aka binary logit). It

Continue reading

How to Interpret Logistic Regression Coefficients
25 October 2018 | by Tim Bock

This post describes how to interpret the coefficients, also known as parameter estimates, from logistic regression (aka binary logit and binary logistic regression). It does

Continue reading

What is Feature Engineering?
25 October 2018 | by Tim Bock
Feature engineering refers to a process of selecting and transforming variables when creating a predictive model using machine learning or statistical modeling (such as deep...

Continue reading

Using Displayr
Feature Engineering in Displayr
25 October 2018 | by Tim Bock

Feature engineering refers to the process of manipulating predictor variables (features) with the goal of improving a predictive model. In this post I outline some of the

Continue reading

Feature Engineering for Categorical Variables
24 October 2018 | by Tim Bock

When creating a predictive model, there are two types of predictors (features): numeric variables, such as height and weight, and categorical variables, such as occupation

Continue reading

Using Displayr
How to do Logistic Regression in Displayr
24 October 2018 | by Tim Bock
In this post I describe how to perform a logistic regression in Displayr. I illustrate the basics using a data set on customer churn for...

Continue reading

Feature Engineering for Numeric Variables
24 October 2018 | by Tim Bock

When building a predictive model, it is often practical to improve predictive performance by modifying the numeric variables in some way. In statistics, this is

Continue reading

How to Interpret Logistic Regression Outputs
24 October 2018 | by Tim Bock

Logistic regression, also known as binary logit and binary logistic regression, is a particularly useful predictive modeling technique, beloved in both the machine learning and the statistics communities. It

Continue reading

Using Q
How to Calculate Sentiment Scores for Open-Ended Responses in Q
18 October 2018 | by Matt Steele
Sentiment analysis is a way to quantify the feeling or tone of written text. In a survey context, this is a useful technique for gauging...

Continue reading

3D Correspondence Analysis Plots in Q
27 September 2018 | by Tim Bock
The default correspondence analysis charts in Q are two-dimensional scatterplots (scroll down to see an example). However, you can create a three-dimensional plot by writing...

Continue reading

3d correspondence analysis
3D Correspondence Analysis Plots in Displayr
14 September 2018 | by Tim Bock
In this post I show you to create 3D visualizations of correspondence analysis using Displayr. This allows you to view an extra dimension of your...

Continue reading

How to Compute D-error for a Choice Experiment
21 August 2018 | by Justin Yap
D-error is a way of summarizing how good or bad a design is at extracting information from respondents in a choice experiment. A design with...

Continue reading

What is Deep Learning?
14 August 2018 | by Jake Hoare
Deep learning is a subset of machine learning. Like other machine-learning techniques, deep learning creates a mapping from input data to a target outcome. After...

Continue reading

view of a forest
How Random Forests Fit to Data
06 August 2018 | by Jake Hoare
A random forest is a collection of decision trees. The forest learns patterns in data and makes predictions based on those patterns. In this post,...

Continue reading

tree on a hill
How is Splitting Decided for Decision Trees?
02 August 2018 | by Jake Hoare
Decision trees are a machine learning technique for making predictions. They are built by repeatedly splitting training data into smaller and smaller samples. This post...

Continue reading

What is a Decision Tree?
01 August 2018 | by Jake Hoare
A decision tree makes predictions based on a series of questions. The outcome of each question determines which branch of the tree to follow. They...

Continue reading