|
|
Home | About Us | Services | Articles | Online Demos | Contact Us |
| Key Word Search: |
|
|
|
Abbreviated Task Sets:
|
| Data Set | Beverages | Games | Books | Portals |
|---|---|---|---|---|
| Sample Size | 2,367 | 3,276 | 1,794 | 2,298 |
| Attributes | 15 | 18 | 5 | 6 |
| Levels | 57 | 42 | 16 | 24 |
| Parameters | 43 | 25 | 11 | 19 |
| Random Tasks | 4 | 6 | 4 | 12 |
| Fixed Tasks | 1 | 2 | 2 | 2 |
| Alternatives per Task (excluding no-buy) |
8 | 3 | 3 | 2 |
| No buy alternative | no | yes | no | yes |
| Data Collection | In-person | Online | Online | Online |
| Survey Versions | 3 | 999 | 999 | 999 |
| Analytic Method | Constant Sum Choice |
Partial Profile | Discrete Choice | Discrete Choice |
Note that all HB runs were made with 100,000 iterations burned and every 10th of the next 10,000 saved. This number of burned iterations is much larger than typical. It was discovered early on in the analysis that HB does not converge as quickly when the number of tasks is reduced and that 100,000 burned iterations was a safe number to use for all data sets. This finding is reflected in Huber et al. (1998).
It should also be noted that computer run time was substantially lengthened by the increase in number of iterations. Run times for this analysis varied from two hours to 30 hours depending on study parameters and computer capabilities. Computers used in this analysis had clock speeds that ranged from 333 MHz to 1.2 GHz.
Also, monotonicity constraints were employed during utility estimation for three of the studies: Beverages, Books and Portals.
In all cases, the experimental designs were tested prior to field by estimating an aggregate multinomial logit (MNL) model using random data. Model convergence was confirmed and the standard errors of all partworths were examined for uniformity and magnitude.
Reducing the number of choice tasks will increase measurement error. Increasing sample size will decrease sample error. One hypothesis is that, by employing HB, we can use the increased precision from a larger sample size to counteract, at least to some degree, the decreased precision due to fewer choice tasks. Admittedly, there is substantial risk in sacrificing measurement precision for the sake of choice task parsimony. A critical issue examined in this paper is whether or not increased sample size can adequately compensate for this lost precision.
HB builds two models, an upper model, which is based on the total sample data, and a set of lower models, each one of which is based on a single person’s data. HB “borrows” information from the upper model whenever the lower model does not fit the data particularly well. In this way, HB is able to estimate individual level choice utilities from data that is often too thin to estimate individual level utilities directly.
Table 2 below lists MAEs, for a study we recently conducted, Portals, calculated for a variety of sample size and number of tasks combinations. There appears to be an obvious trade-off between sample size and number of tasks. A sample size of 500, using 12 tasks, yields an MAE roughly equivalent to a sample size of 1,000 using 6 tasks, for example. At the very least, one can observe that MAE increases as sample size decreases and as number of tasks decreases.
The pattern observed below is consistent with the observation by Johnson and Orme (1996) “Apparently, one gets approximately the same increase in precision from increasing the number of tasks as from proportionately increasing the number of respondents.” Although Johnson and Orme (1996) were encouraging the use of more tasks to increase precision, the data examined here suggest the reverse is also true. That is, increased sample size will also increase precision.
Portals
The Portals Study was an online conjoint study among registered users of an extremely large, well-known Internet portal. Registered users were sent an email inviting them to participate in an online study. For respondent convenience, a hyperlink to the online survey was embedded in the email invitation.
Respondents were shown 14 choice tasks. Each choice task contained two alternative portal configurations and a no-buy option. 12 of the choice tasks were used in the estimation of utilities and two were used as holdouts.
The MAEs in Table 2 below suggest that this study could have been conducted with as few as 3 random choice tasks (plus holdouts) if the sample were 2,298 and with as few as 4 random tasks if the sample were 1,000. All 12 random tasks were necessary to generate acceptable MAEs from a sample size of 500.
| n= | 2,298 | 1,000 | 500 | 200 |
|---|---|---|---|---|
| Tasks | ||||
| 12 | 3.0 | 3.42 | 3.87 | 4.63 |
| 6 | 3.45 | 3.56 | 4.54 | |
| 4 | 3.96 | 4.3 | ||
| 3 | 4.06 | 4.87 | ||
| 2 | 5.49 | |||
| 1 | 6.57 |
In Table 3, notice that the hit rates drop gradually until the number of tasks is reduced to one. The hit rate of 1 task per respondent drops more sharply than at any other task reduction step.
| Tasks | Hit Rate (n=2,298) |
| 12 | 86.1% |
| 6 | 83.8% |
| 4 | 82.1% |
| 3 | 81.2% |
| 2 | 79.9% |
| 1 | 71.7% |
For the original study design, the average standard error in the aggregate MNL model was 0.02 and the maximum was 0.03. As reference, the average and maximum standard errors for an aggregate MNL model with 3 tasks per respondent and sample size of 2,298 were 0.04 and 0.06, respectively. The average and maximum standard errors for an aggregate MNL model with 4 tasks per respondent and sample size of 1,000 are 0.06 and 0.07, respectively.
When using an abbreviated task set, you need to take advantage of every opportunity to increase your model accuracy. Many experimental design software programs provide design diagnostics, typically an efficiency estimate. We used Sawtooth Software’s CBC Advanced Design Module for designing all the studies reported here.
If you design a choice study with one set of tasks (one version) and compare the efficiency numbers from that design to another design using a large number of sets of tasks, say 999, you may find that the design efficiency is enhanced by the increased number of versions.
It is not always the case that an increased number of task set versions will increase design efficiency, of course, as some simple, small designs may achieve perfect balance and orthogonality with relatively few choice tasks. But in practice, most complex study designs can be noticeably improved by increasing the number of versions of choice task sets employed.
For that reason, when using the abbreviated task set approach outlined here, use as many questionnaire versions (number of task sets) as is practical.
Even with a complex study design, a large number of versions of the choice tasks is not necessarily a critical factor, however. We recently designed a study, Beverages, with only three survey versions. It was a paper and pencil study with expensive visual exhibits which limited the number of survey versions we could employ. We used four random tasks, estimated 43 parameters (57 total attribute levels) and had an MAE of 1.05 (see Table 6 below).
When designing your choice tasks, most experimental design software uses a randomized search algorithm. The software creates thousands of possible choice tasks and efficiently searches through them, selecting and then discarding tasks until an adequate set is found. When you have a large number of tasks, and if your experimental design is straightforward, whatever the software generates is probably going to be adequate. But if you’re using a limited number of choice tasks, like we are here, we suggest changing the random seed your software starts with 15 or 20 times. You’ll get a different design with each new random seed. The design efficiency of those 15-20 designs may vary noticeably. Select the design with the best minimal partworth efficiency.
Design efficiency is not sufficient to guarantee that your experimental design will yield an adequate choice model, however. For example, if you create a design with partworth efficiencies all equal to 1 but you have an extremely small sample size, you may not be able to estimate your logit model. Design efficiency calculations have nothing to do with sample size but sample size has a great deal to do with whether or not a model is estimable.
One simple way to determine if your model will successfully generate an adequate disaggregate choice model is to test it using synthetic data. To do this, create a data set of random choice data. Make as many records as you expect to have sample. Estimate an aggregate MNL model. First, check to see if the model converges. Assuming it converges, check the standard errors of each partworth. Our experience suggests that if all the partworth standard errors are 0.05 or less, a good model will result when the real data comes in.
Games
The Games Study was an online conjoint study among registered users of a particular online gaming site. Registered users were sent an email inviting them to participate in an online study. For respondent convenience, a hyperlink to the online survey was embedded in the email invitation.
Respondents were shown eight choice tasks. Each task contained three alternative gaming sites and a no-buy option. Six choice tasks were used in utility estimation and two tasks were used as holdouts.
The Games study was designed using partial profile choice. Approximately one-third of the total number of attributes were represented at any one time. Thus, a sample of 1,000 in the Games data set is roughly equivalent to a sample size of 330 using a full-profile data set, in terms of attribute level exposure. The robustness of HB is evident in its ability to estimate good models with sample sizes as low as 200 and as few as 3 tasks (Table 4) for this partial profile design.
Note that where larger sample size or greater number of tasks yields MAEs of 4 or greater, further MAEs are not calculated. Also note there is some slight instability in MAE estimates due to sampling error in the subsample draws.
| n= | 3,276 | 2,000 | 1,000 | 500 | 200 |
| Tasks | |||||
| 6 | 1.79 | 1.93 | 1.87 | 2.41 | 2.56 |
| 4 | 1.85 | 2.51 | 2.83 | 3.33 | 3.59 |
| 3 | 2.63 | 2.64 | 2.75 | 3.4 | 3.85 |
| 2 | 3.52 | 3.36 | 3.34 | 3.61 | 5.37 |
| 1 | 4.47 | 5.89 | 4.94 | 14.19 |
Hit rates are unusually high (see Table 5). This is most likely due to a large no-buy share. What is noteworthy, however, is the modest decline in hit rate as task number decreases.
| Tasks | Hit Rate (n=2,298) |
| 6 | 81.4% |
| 4 | 79.9% |
| 3 | 78.4% |
| 2 | 78.3% |
| 1 | 76.3% |
This study was originally designed for 6 tasks per respondent, with 999 versions of the choice task set, yielding virtually unique questionnaires per respondent. Sample size was expected to approach 2,000. By examining the standard errors of the partworths in the aggregate MNL model estimated with synthetic, i.e., random, data of the size n=2,000, the design was expected to provide an accurate model with 6 tasks per respondent.
For the original study design (6 tasks and 2,000 respondents), the average standard error in the aggregate MNL model was 0.02 and the maximum was 0.03. As reference, the average and maximum standard errors for an aggregate MNL model with 1 task per respondent and sample size of 3,276 were 0.04 and 0.06, respectively. The average and maximum standard errors for an aggregate MNL model with 2 tasks per respondent and sample size of 200 were 0.13 and 0.16, respectively.
Beverages
The Beverages Study was conducted among grocery shoppers in a South American country. Respondents were shown a series of 5 boards depicting 8 different beverage products they might buy in a grocery store. Four of the boards were used to estimate individual level choice utilities. The fifth board was used as a holdout task.
The interviews were personal, one-on-one interviews conducted in six regions within the South American country. Sample size was approximately 400 per region. Respondents were shown a board of 8 alternative beverages and asked how many of each they would buy if these were the beverages available to them in the grocery store they typically frequented. This numeric data was converted to constant sum for the purpose of utility estimation.
One unusual aspect of this study was the high quality of exhibits shown to respondents. Products were depicted visually. Most study attributes did not need to be listed explicitly on a card but were incorporated into the visual shown. For example, one attribute was container structure. Rather than listing the word “can” or “bottle” on a card and asking preference, respondents were shown some products in cans and others in bottles.
For the Beverage study in Table 6 below, MAEs of under 3 were obtained using just 1 choice task per person with a sample size of 2,367 or 2 tasks per person with a sample size of 500. This is particularly remarkable considering the large number of parameters to be estimated (43) and the small number of questionnaire versions available (3).
| n= | 2,367 | 1,000 | 500 | 200 |
|---|---|---|---|---|
| Tasks | ||||
| 4 | 1.05 | 2.11 | 2.42 | 3.94 |
| 2 | 1.86 | 2.2 | 2.9 | 3.75 |
| 1 | 2.89 | 4.33 | 6.28 | 10.29 |
This study was originally designed for 4 tasks per respondent, with three versions of the choice task set, yielding 12 different choice tasks. Sample size was expected to approach 2,400. By examining the standard errors of the partworths in the aggregate MNL model estimated with synthetic, i.e., random, data of 2,400 cases, the design was expected to provide an accurate model with 4 tasks per respondent.
The average standard error in the aggregate MNL model was 0.06 and the maximum was 0.11. As reference, the average and maximum standard errors for an aggregate MNL model with 2 tasks per respondent and sample size of 200 were 1.3 and 6.9, respectively.
Note that to construct the aggregate MNL model, while it was not necessary to assume the Beverages Study design was that of a full-profile choice-based conjoint, rather than constant sum, we did so for reasons of convenience. It could be assumed the true standard errors are somewhat smaller since constant sum choice provides more information per task than discrete choice.
It is possible that the large number of alternatives per task, the constant sum choice analytic method and/or the visual expression of choice tasks contributed to the success of the study.
Hit rates were not calculated for the Beverages data set because hit rates are not appropriate for constant sum data.
Books
The Books Study was an online conjoint study. Respondents were shoppers of a particular bookstore. Shoppers were sent an email inviting them to participate in an online study. For respondent convenience, a hyperlink to the online survey was embedded in the email invitation.
Respondents were shown six choice tasks. Each choice task contained three alternative bookstores. There was no no-buy option. Four of the choice tasks were used in the estimation of individual-level utilities and the remaining two choice tasks were used as holdouts.
The Books model, as shown in Table 7, has the poorest untuned MAEs of any data set examined. However, with only 4 tasks per person and given a fairly large sample size of 1,794, the MAE of 4.32 is marginally acceptable. Note that the MAE estimates at smaller sample sizes n=500 and n=200 were extremely volatile and not reported. Asterisks were inserted to denote instability. This is most probably due to a combination of sampling error and relatively poor model performance.
| n= | 1,794 | 1,000 | 500 | 200 |
|---|---|---|---|---|
| Tasks | ||||
| 4 | 4.32 | 5.03 | ** | ** |
| 3 | 5.66 | 6.45 | ** | ** |
| 2 | 7.96 | ** | ** |
Hit rates were again extremely high, most likely due to the dominance of one brand in the holdout tasks. However, notice the very modest declines in hit rates as number of tasks decreases.
| Tasks | Hit Rate (n=1,794) |
| 4 | 87.7% |
| 3 | 87.04% |
| 2 | 86.73% |
As was the case with the other studies, the Books design was tested prior to field by estimating its MNL model using synthetic data.
For the original study design (4 tasks and 1,794 respondents), the average standard error in the aggregate MNL model was 0.02 and the maximum was 0.03. As reference, the average and maximum standard errors for an aggregate MNL model with 3 tasks per respondent and sample size of 1,794 were 0.03 and 0.04, respectively. The average and maximum standard errors for an aggregate MNL model with 4 tasks per respondent and sample size of 1,000 were 0.03 and 0.04, respectively.
The relatively large MAE at 4 tasks per respondent may be due to the small number of attributes in the study failing to model respondents’ choice behavior and/or the failure to include the most relevant attributes to respondent choice behavior in the study. A qualitative examination of the attributes would suggest the latter alternative as a potential explanation for the relatively large MAE value.
A third possible explanation for the relatively poor performance of the Books model is that there may have been excessive heterogeneity in the data set which, given a small number of choice tasks, would have put undue pressure on HB’s upper model. The lower model could not estimate accurate utilities because there were too few tasks. The upper model could not act as an adequate surrogate because of sample heterogeneity.
Sentis and Li (2001) examined seven different commercial data sets and found that segmenting the data, via Latent Class, prior to HB did not improve the HB results for any of the seven data sets. However, the commercial studies they examined were designed with a more orthodox number of tasks than the studies examined here. When using an extremely small number of choice tasks, it appears plausible that the LC/HB combination might indeed provide better utility estimates than HB alone.
To test this third hypothesis, we went back to the Books data and, using the 4 random tasks as basis, identified a four-segment Latent Class solution. We reran HB within each segment and pooled the resulting utilities. Similarly for Portals, using only 2 random tasks, we created a two-segment Latent Class solution, reran HB within segment and pooled the resulting utilities.
For both Latent Class solutions, we used monotonicity constraints, evaluated one through 10 segment solutions, replicated each solution five times and set the iterations limit to 250. In both cases, we selected the number of segments solution with minimum CAIC.
To insure valid comparability across models, each model in Tables 9, 10 and 12 were exponentially tuned to minimize average MAE.
It is clear from the data in Table 9 below that, for these specific studies, the combination of LC and HB did not generate better share preference estimates than did HB alone. Further, hit rates in both cases were slightly poorer for the LC/HB approach than for HB alone.
| Study | Tasks Used In | Tasks Used In | Error Term | NB within LC | Tuning Scalar | HB Alone | Tuning Scalar |
| Books | 4 | 4 | MAE Hit Rate | 1.36 85.5% | 0.48 | 1.14 87.7% | 0.54 |
| Portals | 2 | 2 | MAE Hit Rate | 1.53 76.9% | 0.50 | 1.44 79.9% | 0.34 |
Perhaps LC/HB did not improve MAEs in these studies because there were a very small number of choice tasks on which to base any segmentation. It still seems possible, then, in the context of extremely few choice tasks per respondent, that HB run within highly homogeneous segments may outperform HB alone.
To test this hypothesis, we went back to the Portals study and, using all 12 original random tasks as basis, identified a six-segment Latent Class solution. We reran HB within each segment and pooled the resulting utilities. Table 10 shows the results of this new analysis compared to the data from Table 9.
| Study | Tasks Used In | Tasks Used In | Error Term | NB within LC | Tuning Scalar | HB Alone | Tuning Scalar |
| Portals | 2 | 2 | MAE Hit Rate | 1.53 76.9% | 0.50 | 1.44 79.9% | 0.34 |
| Portals | 2 | 12 | MAE Hit Rate | 1.04 80.2% | 0.29 | 1.44 79.9% | 0.34 |
The LC segmentation based on 12 tasks generated superior MAEs to both the original, unsegmented model and also the 2 task segmented model. Further, hit rates, while only slightly improved relative to the original estimates, are substantially better than the 2-task model. This seems to suggest that increased sample homogeneity can potentially improve HB estimates of both MAEs and hit rates.
These improvements in both MAEs and hit rates were achieved despite the decrease in sample size that resulted from using a six-segment solution rather than a two segment or no segment solution. This seems to add further credence to the hypothesis that sample homogeneity can potentially improve HB estimates when few choice tasks per respondent are employed.
Because HB is borrowing heavily from the upper model, more iterations are required to gain convergence. So, to be safe, set your burned iterations to 100,000, as did Huber, et al. (1998). When you use an abbreviated task set, you are not likely to get convergence as quickly as you would with a more conventional number of tasks.
As an aside, this result may shed some light on Sentis and Li (2000). In that paper, they observe that you only need burn as few as 1,000 HB iterations, not 20,000 iterations, as is generally assumed. We suspect that, because typical commercial data sets use a larger number of choice tasks than is minimally necessary, HB did not need to borrow as much from the upper model and, therefore, converged faster.
We subjectively reviewed 69 log files from HB runs made for this paper. Sawtooth provides four measures of fit in its HB log files. Based on years of experience, we’ve found that that the first two diagnostics, percent certainty and root likelihood seem to converge quicker than the last two, average variance and parameter RMS. So to simplify the task and yet remain conservative, we scanned the parameter RMS statistics from all 69 log files to qualitatively determine when convergence was achieved. We then regressed the number of iterations at which convergence was subjectively determined to occur against sample size and number of tasks.
Table 11 clearly indicates that sample size does not appear to affect how many iterations are needed for convergence but the number of tasks has a substantial impact on the number of runs needed for convergence. This finding is entirely consistent with the hypothesis that more iterations are needed to obtain convergence when the lower model has less individual-level information available.
| Variable | B | Std Error | Beta | t | Sig |
| Sample Size | -0.004 | 0.004 | -0.120 | -1.070 | 0.288 |
| Number of Tasks | -9.252 | 2.283 | -0.456 | -4.052 | 0.000 |
Of the 69 HB logs examined, the average convergence was at roughly 60,000 iterations. The range was from 4,000 to 100,000.
Although we have not verified its impact, we suggest you use monotonicity constraints when estimating your utilities. Orme and Johnson (1997) review numerous studies that indicate the use of constraints “can significantly improve the predictive ability of full-profile conjoint utilities.” Johnson (2000a) elaborates, “Constraints usually provide improvements in hit rates and sometimes provide improvements in share predictions.” Additionally, the absence of monotonicity constraints may undermine the confidence of the end-user of the study results if unconstrained partworths have incorrect slopes.
However, if the researcher is primarily concerned with share predictions, is relatively unconcerned with predicting individual choices and is also unconcerned with end-user confidence, he/she may want to consider not including monotonicity constraints because they do not always improve share predictions and may, in some cases, harm share predictions.
All the data reported here, except for that from the Games study, were based on estimations using constraints, both in the HB runs and the Latent Class segmentations.
As a final point, the reflective reader might ask what’s the difference between reducing the number of tasks to one to estimate a disaggregate model and simply running an aggregate model. The answer is a disaggregate model estimated with an abbreviated task set may still outperform the aggregate model. Simply put, even with a small number of choice tasks, we may still be capturing heterogeneity with the disaggregate model that we may not be capturing with the aggregate model.
| Aggregate Model | Disaggregate Model | ||||||
| Study | n | MAE | Tuning Scalar | Tasks | MAE | Tuning Scalar | Tasks |
| Books | 1,794 | 1.68 | 1.2 | 4 | 1.14 | 0.54 | 4 |
| Books | 2,298 | 0.87 | 0.94 | 12 | 0.52 | 0.38 | 4 |
This paper has examined the practicality of conducting commercial choice-based conjoint studies with extremely few tasks per respondent by manipulating sample size and number of tasks. The robustness of HB is evident in the quality of the results examined. In all of the data sets presented here, relatively accurate individual-level choice models were created using no more than four tasks per respondent, given sample size of 1,000 or more.
There are other factors, in addition to sample size and number of tasks, that may affect model error:
Which, if any, of these factors significantly affect model performance?
Nothing is known about the impact of the underlying distributions on the technique outlined here, either. Further research needs to be done on more commercial data sets as well as synthetic data sets to better understand the impact of these issues.
Results of this study may offer additional hypotheses concerning two findings recently published:
In both cases, these findings may be the result of the number of choice tasks used. HB may converge more quickly when there is an abundance of individual-level data. It appears clear that the reverse is also true, namely, that when fewer tasks are used, a larger number of iterations is required to reach convergence.
Similarly, Latent Class segmentation may not offer much assistance in those cases where the individual-level model is information rich. That is, where the upper level HB model does not contribute much to the lower level model.
Conversely, increasing sample homogeneity by segmentation prior to running HB does appear to potentially improve model performance, when very few tasks per respondent are employed and an adequate segmentation can be created. However, in practice, developing an optimal segmentation scheme given very few choice tasks per respondent appears somewhat problematic. If a large number of choice tasks per respondent are used, LC segmentation does not appear to be helpful. If very few tasks are used, there is little data on which to base the segmentation. Perhaps in the future other methods may be developed for constructing homogeneous segments that don’t rely exclusively on choice tasks. If so, the potential for the LC/HB approach may be substantially enhanced.
A related question is how well Latent Class alone estimates individual-level utilities, relative to HB, given an abbreviated task set and large sample size. Andrews, et al. (2001) demonstrated that Latent Class may out-perform HB under certain conditions.
The Beverages data set performed particularly well. The Beverages study differed from the other studies in several ways:
Which, if any, of these factors significantly contributed to the unusually strong performance of the Beverages model?
Several biases thought to be inherent in conjoint studies, namely number of level effect, order bias, learning bias, framing bias and respondent fatigue, may all be affected by an abbreviated task set. Further study needs to be undertaken to determine whether or not and if so, to what degree, any of these biases might be affected by the use of abbreviated task sets.
It appears that surprisingly good individual-level choice models can be constructed with as few as one to four choice tasks per respondent when using HB.
This approach requires a careful effort, involving these factors:
Computer run times can be significantly and adversely affected by the increase in sample size and burned iterations.
There are numerous practical situations where sample size is more easily attainable than a large number of choice tasks per respondent. In those situations, the reduced task set approach may prove valuable and useful.
Allenby, G. M., Arora, N., and Ginter, J. L. (1995) “Incorporating Prior Knowledge into the Analysis of Conjoint Studies,” Journal of Marketing Research (May), American Marketing Association, Chicago, IL.
Andrews, Rick L., Ansari, Asim, and Currim, Imran S. (2002) “Hierarchical Bayes Versus Finite Mixture Conjoint Analysis Models: A Comparison of Fit, Prediction, and Partworth Recovery,” Journal of Marketing Research (February), American Marketing Association, Chicago, IL.
Arora, Neeraj, and Huber, Joel (2001) “Improving Parameter Estimates and Model Prediction by Aggregate Customization in Choice Experiments,” Journal of Consumer Research (September).
Huber, J., Arora, N., and Johnson, R. (1998) “Capturing Heterogeneity in Consumer Choices,” 1998 Advanced Research Techniques Forum Proceedings, American Marketing Association, Chicago, IL.
Johnson, Richard M. (2000a) “Monotonicity Constraints in Choice-Based Conjoint with Hierarchical Bayes,” Sawtooth Software, Inc., Sequim, WA.
Johnson, Richard M. (2000b) “Understanding HB: An Intuitive Approach,” 2000 Sawtooth Software Conference Proceedings, Sawtooth Software, Inc., Sequim, WA.
Johnson, Richard M. (1999) “The Joys and Sorrows of Implementing HB Methods for Conjoint Analysis,” Sawtooth Software, Inc., Sequim, WA.
Johnson, Richard M. and Bryan K. Orme (1996) “How Many Questions Should You Ask In Choice-Based Conjoint Studies?” 1996 Advanced Research Techniques Forum Proceedings, American Marketing Association, Chicago, IL.
McCullough, Dick (2003) “Getting the Most Bang from the Fewest Questions: A New Approach to Designing and Analyzing Conjoint Studies,” 003 Professional Marketing Research Society Conference Proceedings, Vancouver, B.C.
McCullough, Dick (2002) “A User’s Guide To Conjoint Analysis”, Marketing Research (Summer), American Marketing Association, Chicago, IL.
McCullough, Dick (2001) “Trade-off Study Sample Size: How Low Can We Go?,” 2001 Sawtooth Software Conference Proceedings, Sawtooth Software, Inc., Sequim, WA.
Orme, Bryan (1998) “Sample Size Issues for Conjoint Analysis Studies,” Sawtooth Software, Inc., Sequim, WA.
Orme, Bryan and Richard M. Johnson (1997) “Using Utility Constraints to Improve the Predictability of Conjoint Analysis,” Sawtooth Software, Inc., Sequim, WA.
Sawtooth Software (1993) “The CBC User Manual,” Sawtooth Software, Inc., Sequim, WA.
Sawtooth Software (1998) “The CBC/HB Module for Hierarchical Bayes Estimation,” Technical Paper accessible from sawtoothsoftware.com web site.
Sentis, Keith and Lihua Li (2000) “HB Plugging and Chugging: How Much Is Enough?” 2000 Sawtooth Software Conference Proceedings, Sawtooth Software, Sequim, WA.
Sentis, Keith and Lihua Li (2001) “One Size Fits All or Custom Tailored: Which HB Fits Better?” 2001 Sawtooth Software Conference Proceedings, Sawtooth Software, Sequim, WA.
MACRO CONSULTING
Intelligent Decisions