Blog.

R.

correspondence analysis example Understanding the Math of Correspondence Analysis with Examples in R
08 August 2017 | by Tim Bock

Correspondence analysis is a popular tool for visualizing the patterns in large tables. To many practitioners it is probably a black box. Table goes in, chart comes out. In this post I explain the mathematics…

Continue reading

singular value decomposition Singular Value Decomposition (SVD): Tutorial Using Examples in R
02 August 2017 | by Tim Bock

If you have ever looked with any depth at statistical computing for multivariate analysis, there is a good chance you have come across the singular value decomposition (SVD). It is a workhorse for techniques that decompose data, such as correspondence analysis and principal…

Continue reading

preview Show information about the snippet editorYou can click on each element in the preview to jump to the Snippet Editor. SEO title preview: Displayr | Automatically Fitting the Support Vector Machine Cost Parameter Automatically Fitting the Support Vector Machine Cost Parameter
18 July 2017 | by Jake Hoare

In an earlier post I discussed how to avoid overfitting when using Support Vector Machines. This was achieved using cross validation. In cross validation, prediction accuracy is maximized by varying the cost parameter. Importantly, prediction accuracy is…

Continue reading

comparing relative weights and partial least squares Comparing Partial Least Squares to Johnson’s Relative Weights
19 June 2017 | by Tim Bock

In this post I explore two different methods for computing the relative importance of predictors in regression: Johnson’s Relative Weights and Partial Least Squares (PLS) regression. Both techniques solve a problem with Multiple Linear Regression, which can perform poorly when there are correlations…

Continue reading

Partial Least Squares can be used find drivers of consumer preference Using Partial Least Squares to Conduct Relative Importance Analysis in R
19 June 2017 | by Jake Hoare

Partial Least Squares (PLS) is a popular method for relative importance analysis in fields where the data typically includes more predictors than observations. Relative importance analysis is a general term applied to any technique used for…

Continue reading

Using Displayr Using Partial Least Squares to Conduct Relative Importance Analysis in Displayr
16 June 2017 | by Jake Hoare

Partial Least Squares (PLS) is a popular method for relative importance analysis in fields where the data typically includes more predictors than observations. Relative importance analysis is a general term applied to any technique used for…

Continue reading

statistical significance trick The Magic Trick that Highlights Interesting Results on Any Table
12 June 2017 | by Tim Bock

This post describes the single biggest time saving technique that I know about for highlighting significant results on a table. The table below, which shows the results of a MANOVA, illustrates the trick. The coloring…

Continue reading

Gradient boosting Gradient Boosting – The Coolest Kid on The Machine Learning Block
06 June 2017 | by Jake Hoare

Gradient boosting is a technique attracting attention for its prediction speed and accuracy, especially with large and complex data. Don’t just take my word for it, the chart below shows the rapid growth of Google…

Continue reading

max-diff-preferences How to Analyze MaxDiff Data in R
23 May 2017 | by Chris Facer

This post discusses a number of options that are available in R for analyzing data from MaxDiff experiments, using the package flipMaxDiff. For a more detailed explanation of how to analyze MaxDiff, and what the outputs…

Continue reading

correspondence analysis movements Using Correspondence Analysis to Compare Sub-Groups and Understand Trends
22 May 2017 | by Tim Bock

This post shows how to use correspondence analysis to compare sub-groups. It focuses on one of the most interesting types of sub-groups: data at different points in time. This is variously known as trend, tracking, longitudinal and time series data. The end-goal…

Continue reading

correspondence analysis movements How to Interpret Correspondence Analysis Plots (It Probably Isn’t the Way You Think)
19 May 2017 | by Tim Bock

Correspondence analysis is a popular data science technique. It takes a large table, and turns it into a seemingly easy-to-read visualization. Unfortunately, it is not quite as easy to read as most people assume. In How…

Continue reading

Max Diff design output How to Create a MaxDiff Experimental Design in R
17 May 2017 | by Tim Bock

Creating the experimental design for a max-diff experiment is easy in R. This post describes how to create and check a max-diff experimental design. If you are not sure what this is, it would be best to read A beginner’s guide to max-diff first.

Continue reading

Keep updated with the latest in data science.