The Role of Imputation in Clustering BIB Data

Sara King/ July 28, 2014/ Tutorial/ 0 comments

Clustering consumer data reveals important information to help refine products for specific market segments. There are compelling reasons to use incomplete block designs to collect consumer data; however, this presents the challenge of dealing with missing data. The purpose of this workshop is to investigate the effect of different imputation techniques on the results of cluster analysis of balanced-incomplete-block (BIB) data.

BIB designs and possible ways to incorporate information from sensory data into the design (cf. Browne et al., 2013) will be reviewed. Then, commonly used imputation and clustering approaches will be discussed. After this background material has been covered, expectation-maximization (EM) algorithm-based imputation will be presented.

The remainder of the workshop will be interactive and hands on. Participants will be given two data sets (provided by Compusense Inc.) resulting from sensory informed BIB designs. Using R software, which will also be provided, participants will be invited to use EM algorithm-based imputation and compare the results to other approaches.

Browne, R.P., Franczak, B., Findlay, C.J., & McNicholas, P.D. (2014). The Role of Imputation in Clustering BIB Data. In: 12th Sensometrics Meeting – Tutorial Day. 28 July. Chicago, Illinois, USA.

Share this Post

Leave a Comment

Your email address will not be published. Required fields are marked *