Understanding Power Analysis

What is Statistical Power?

Statistical power is the probability of correctly detecting an effect when it truly exists. In choice experiments, power depends on:

Effect size: Larger effects are easier to detect
Sample size: More respondents provide more precision
Design efficiency: Better designs extract more information per respondent
Model complexity: More parameters require larger samples

Why Conduct Power Analysis?

Sample size planning: Determine minimum respondents needed
Budget planning: Estimate data collection costs
Design comparison: Choose between alternative experimental designs
Feasibility assessment: Check if research questions are answerable with available resources

Power vs. Precision

Power analysis in cbc_power() focuses on precision (standard errors) rather than traditional hypothesis testing power, because:

Provides more actionable information for sample size planning
Relevant for both significant and non-significant results
Easier to interpret across different effect sizes
More directly tied to practical research needs

Basic Power Analysis

Start with a basic power analysis using auto-detection of parameters:

# Basic power analysis with auto-detected parameters
power_basic <- cbc_power(
  data = choices,
  outcome = "choice",
  obsID = "obsID",
  n_q = 6,
  n_breaks = 10
)

# View the power analysis object
power_basic
#> CBC Power Analysis Results
#> ==========================
#> 
#> Sample sizes tested: 60 to 600 (10 breaks)
#> Significance level: 0.050
#> Parameters: price, typeGala, typeHoneycrisp, freshnessAverage, freshnessExcellent
#> 
#> Power summary (probability of detecting true effect):
#> 
#> n = 60:
#>   price       : Power = 0.376, SE = 0.1163
#>   typeGala    : Power = 0.882, SE = 0.2116
#>   typeHoneycrisp: Power = 0.999, SE = 0.2144
#>   freshnessAverage: Power = 0.797, SE = 0.2052
#>   freshnessExcellent: Power = 1.000, SE = 0.2244
#> 
#> n = 180:
#>   price       : Power = 0.983, SE = 0.0667
#>   typeGala    : Power = 0.993, SE = 0.1137
#>   typeHoneycrisp: Power = 1.000, SE = 0.1194
#>   freshnessAverage: Power = 1.000, SE = 0.1147
#>   freshnessExcellent: Power = 1.000, SE = 0.1205
#> 
#> n = 360:
#>   price       : Power = 1.000, SE = 0.0463
#>   typeGala    : Power = 1.000, SE = 0.0806
#>   typeHoneycrisp: Power = 1.000, SE = 0.0841
#>   freshnessAverage: Power = 1.000, SE = 0.0831
#>   freshnessExcellent: Power = 1.000, SE = 0.0872
#> 
#> n = 480:
#>   price       : Power = 1.000, SE = 0.0402
#>   typeGala    : Power = 1.000, SE = 0.0694
#>   typeHoneycrisp: Power = 1.000, SE = 0.0728
#>   freshnessAverage: Power = 1.000, SE = 0.0714
#>   freshnessExcellent: Power = 1.000, SE = 0.0741
#> 
#> n = 600:
#>   price       : Power = 1.000, SE = 0.0362
#>   typeGala    : Power = 1.000, SE = 0.0620
#>   typeHoneycrisp: Power = 1.000, SE = 0.0655
#>   freshnessAverage: Power = 1.000, SE = 0.0638
#>   freshnessExcellent: Power = 1.000, SE = 0.0668
#> 
#> Use plot() to visualize power curves.
#> Use summary() for detailed power analysis.

# Access the detailed results data frame
head(power_basic$power_summary)
#>   sample_size          parameter   estimate  std_error t_statistic     power
#> 1          60              price -0.1910977 0.11625005    1.643850 0.3761148
#> 2          60           typeGala  0.6653365 0.21160734    3.144203 0.8818410
#> 3          60     typeHoneycrisp  1.0970572 0.21442926    5.116173 0.9992008
#> 4          60   freshnessAverage  0.5729977 0.20520714    2.792289 0.7973884
#> 5          60 freshnessExcellent  1.4156784 0.22440157    6.308683 0.9999932
#> 6         120              price -0.2159881 0.08224018    2.626309 0.7474069
tail(power_basic$power_summary)
#>    sample_size          parameter   estimate  std_error t_statistic power
#> 45         540 freshnessExcellent  1.1001792 0.07013013   15.687682     1
#> 46         600              price -0.2827316 0.03621695    7.806610     1
#> 47         600           typeGala  0.5415617 0.06195998    8.740508     1
#> 48         600     typeHoneycrisp  1.0187001 0.06551582   15.548918     1
#> 49         600   freshnessAverage  0.6439791 0.06383966   10.087445     1
#> 50         600 freshnessExcellent  1.1218004 0.06676086   16.803264     1

Parameter Specification Options

Auto-Detection (Recommended)

By default, cbc_power() automatically detects all attribute parameters from your choice data:

# Auto-detection works with dummy-coded data
power_auto <- cbc_power(
  data = choices,
  outcome = "choice",
  obsID = "obsID",
  n_q = 6,
  n_breaks = 8
)

# Shows all parameters: price, typeGala, typeHoneycrisp, freshnessAverage, freshnessExcellent

Specify Dummy-Coded Parameters

You can explicitly specify which dummy-coded parameters to include:

# Focus on specific dummy-coded parameters
power_specific <- cbc_power(
  data = choices,
  pars = c(
    # Specific dummy variables
    "price",
    "typeHoneycrisp",
    "freshnessExcellent"
  ),
  outcome = "choice",
  obsID = "obsID",
  n_q = 6,
  n_breaks = 8
)

Use Decoded Data with Attribute Names

For easier interpretation, decode the choice data first to use original attribute names:

# Decode choice data to get back categorical variables
choices_decoded <- cbc_decode(choices)

# Now you can use attribute names instead of dummy variables
power_decoded <- cbc_power(
  data = choices_decoded,
  pars = c("price", "type", "freshness"), # Original attribute names
  outcome = "choice",
  obsID = "obsID",
  n_q = 6,
  n_breaks = 8
)

# Note: This approach estimates effects differently -
# it treats categorical variables as factors rather than separate dummy variables

When to Use Each Approach

Auto-detection: Best for comprehensive power analysis of all effects
Dummy-coded specification: When you want to focus on specific levels of categorical variables
Decoded data: When you want power analysis at the attribute level rather than level-specific effects, or for easier interpretation

Understanding Power Results

The power analysis returns a list object with several components:

power_summary: Data frame with sample sizes, coefficients, estimates, standard errors, t-statistics, and power
sample_sizes: Vector of sample sizes tested
n_breaks: Number of breaks used
alpha: Significance level used
choice_info: Information about the underlying choice simulation

The power_summary data frame contains:

sample_size: Number of respondents in each analysis
parameter: Parameter name being estimated
estimate: Coefficient estimate
std_error: Standard error of the estimate
t_statistic: t-statistic (estimate/std_error)
power: Statistical power (probability of detecting effect)

Visualizing Power Curves

Plot power curves to visualize the relationship between sample size and precision:


# Plot power curves
plot(
  power_basic,
  type = "power",
  power_threshold = 0.9
)

Power analysis chart showing statistical power vs sample size for 5 parameters. A red dashed line marks 90% power threshold. Most parameters achieve adequate power by 100 respondents, though freshnessAverage and typeGala require larger sample sizes than price and other freshness/type parameters.


# Plot standard error curves
plot(
  power_basic,
  type = "se"
)

Standard error chart showing decreasing standard errors as sample size increases from 100 to 600 respondents for 5 parameters. All parameters show the expected decline in standard error with larger samples, with price having consistently lower standard errors than the freshness and type parameters.

Interpreting Results

# Sample size requirements for 90% power
summary(
  power_basic,
  power_threshold = 0.9
)
#> CBC Power Analysis Summary
#> ===========================
#> 
#> Sample size requirements for 90% power:
#> 
#> price          : n >= 180 (achieves 98.3% power, SE = 0.0667)
#> typeGala       : n >= 120 (achieves 97.1% power, SE = 0.1394)
#> typeHoneycrisp : n >= 60 (achieves 99.9% power, SE = 0.2144)
#> freshnessAverage: n >= 120 (achieves 98.9% power, SE = 0.1426)
#> freshnessExcellent: n >= 60 (achieves 100.0% power, SE = 0.2244)

From these results, you can determine:

Which parameters need the largest samples
Whether your planned sample size is adequate
How much precision improves with additional respondents

Mixed Logit Models

Conduct power analysis for random parameter models:

# Create choices with random parameters
priors_random <- cbc_priors(
  profiles = profiles,
  price = rand_spec(
    dist = "n",
    mean = -0.25,
    sd = 0.1
  ),
  type = rand_spec(
    dist = "n",
    mean = c(0.5, 1.0),
    sd = c(0.5, 0.5)
  ),
  freshness = c(0.6, 1.2)
)

choices_mixed <- cbc_choices(
  design,
  priors = priors_random
)

# Power analysis for mixed logit model
power_mixed <- cbc_power(
  data = cbc_decode(choices_mixed),
  pars = c("price", "type", "freshness"),
  randPars = c(price = "n", type = "n"), # Specify random parameters
  outcome = "choice",
  obsID = "obsID",
  panelID = "respID", # Required for panel data
  n_q = 6,
  n_breaks = 10
)

# Mixed logit models generally require larger samples
power_mixed
#> CBC Power Analysis Results
#> ==========================
#> 
#> Sample sizes tested: 60 to 600 (10 breaks)
#> Significance level: 0.050
#> Parameters: price, typeGala, typeHoneycrisp, freshnessAverage, freshnessExcellent, sd_price, sd_typeGala, sd_typeHoneycrisp
#> 
#> Power summary (probability of detecting true effect):
#> 
#> n = 60:
#>   price       : Power = 0.806, SE = 0.1315
#>   typeGala    : Power = 0.907, SE = 0.2277
#>   typeHoneycrisp: Power = 0.999, SE = 0.2483
#>   freshnessAverage: Power = 0.773, SE = 0.2155
#>   freshnessExcellent: Power = 1.000, SE = 0.2434
#>   sd_price    : Power = 0.131, SE = 0.3657
#>   sd_typeGala : Power = 0.137, SE = 0.5345
#>   sd_typeHoneycrisp: Power = 0.050, SE = 0.8205
#> 
#> n = 180:
#>   price       : Power = 0.976, SE = 0.0726
#>   typeGala    : Power = 1.000, SE = 0.1190
#>   typeHoneycrisp: Power = 1.000, SE = 0.1309
#>   freshnessAverage: Power = 0.917, SE = 0.1150
#>   freshnessExcellent: Power = 1.000, SE = 0.1255
#>   sd_price    : Power = 0.050, SE = 0.3522
#>   sd_typeGala : Power = 0.050, SE = 0.3891
#>   sd_typeHoneycrisp: Power = 0.051, SE = 0.9933
#> 
#> n = 360:
#>   price       : Power = 1.000, SE = 0.0479
#>   typeGala    : Power = 1.000, SE = 0.0837
#>   typeHoneycrisp: Power = 1.000, SE = 0.0902
#>   freshnessAverage: Power = 1.000, SE = 0.0820
#>   freshnessExcellent: Power = 1.000, SE = 0.0871
#>   sd_price    : Power = 0.050, SE = 0.1573
#>   sd_typeGala : Power = 0.050, SE = 0.4394
#>   sd_typeHoneycrisp: Power = 0.050, SE = 0.4378
#> 
#> n = 480:
#>   price       : Power = 1.000, SE = 0.0417
#>   typeGala    : Power = 1.000, SE = 0.0716
#>   typeHoneycrisp: Power = 1.000, SE = 0.0754
#>   freshnessAverage: Power = 1.000, SE = 0.0709
#>   freshnessExcellent: Power = 1.000, SE = 0.0744
#>   sd_price    : Power = 0.050, SE = 0.1469
#>   sd_typeGala : Power = 0.050, SE = 0.3126
#>   sd_typeHoneycrisp: Power = 0.050, SE = 0.2305
#> 
#> n = 600:
#>   price       : Power = 1.000, SE = 0.0384
#>   typeGala    : Power = 1.000, SE = 0.0691
#>   typeHoneycrisp: Power = 1.000, SE = 0.0672
#>   freshnessAverage: Power = 1.000, SE = 0.0631
#>   freshnessExcellent: Power = 1.000, SE = 0.0667
#>   sd_price    : Power = 0.051, SE = 0.2043
#>   sd_typeGala : Power = 0.050, SE = 0.5980
#>   sd_typeHoneycrisp: Power = 0.050, SE = 0.1851
#> 
#> Use plot() to visualize power curves.
#> Use summary() for detailed power analysis.

Comparing Design Performance

Design Method Comparison

Compare power across different design methods:

# Create designs with different methods
design_random <- cbc_design(
  profiles,
  n_alts = 2,
  n_q = 6,
  n_resp = 200,
  method = "random"
)
design_shortcut <- cbc_design(
  profiles,
  n_alts = 2,
  n_q = 6,
  n_resp = 200,
  method = "shortcut"
)
design_optimal <- cbc_design(
  profiles,
  n_alts = 2,
  n_q = 6,
  n_resp = 200,
  priors = priors,
  method = "stochastic"
)

# Simulate choices with same priors for fair comparison
choices_random <- cbc_choices(
  design_random,
  priors = priors
)
choices_shortcut <- cbc_choices(
  design_shortcut,
  priors = priors
)
choices_optimal <- cbc_choices(
  design_optimal,
  priors = priors
)

# Conduct power analysis for each
power_random <- cbc_power(
  choices_random,
  n_breaks = 8
)
power_shortcut <- cbc_power(
  choices_shortcut,
  n_breaks = 8
)
power_optimal <- cbc_power(
  choices_optimal,
  n_breaks = 8
)


# Compare power curves
plot_compare_power(
  Random = power_random,
  Shortcut = power_shortcut,
  Optimal = power_optimal,
  type = "power"
)

Advanced Analysis

Returning Full Models

Access complete model objects for detailed analysis:

# Return full models for additional analysis
power_with_models <- cbc_power(
  data = choices,
  outcome = "choice",
  obsID = "obsID",
  n_q = 6,
  n_breaks = 5,
  return_models = TRUE
)

# Examine largest model
largest_model <- power_with_models$models[[length(power_with_models$models)]]
summary(largest_model)
#> =================================================
#> 
#> Model estimated on: Mon Jul 14 10:45:47 2025 
#> 
#> Using logitr version: 1.1.2 
#> 
#> Call:
#> logitr::logitr(data = data_subset, outcome = outcome, obsID = obsID, 
#>     pars = pars, randPars = randPars, panelID = panelID)
#> 
#> Frequencies of alternatives:
#>       1       2 
#> 0.48722 0.51278 
#> 
#> Exit Status: 3, Optimization stopped because ftol_rel or ftol_abs was reached.
#>                                 
#> Model Type:    Multinomial Logit
#> Model Space:          Preference
#> Model Run:                1 of 1
#> Iterations:                   12
#> Elapsed Time:        0h:0m:0.01s
#> Algorithm:        NLOPT_LD_LBFGS
#> Weights Used?:             FALSE
#> Panel ID:                 respID
#> Robust?                    FALSE
#> 
#> Model Coefficients: 
#>                     Estimate Std. Error z-value  Pr(>|z|)    
#> price              -0.282732   0.036217 -7.8066 5.773e-15 ***
#> typeGala            0.541562   0.061960  8.7405 < 2.2e-16 ***
#> typeHoneycrisp      1.018700   0.065516 15.5489 < 2.2e-16 ***
#> freshnessAverage    0.643979   0.063840 10.0874 < 2.2e-16 ***
#> freshnessExcellent  1.121800   0.066761 16.8033 < 2.2e-16 ***
#> ---
#> Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
#>                                      
#> Log-Likelihood:         -2193.8145652
#> Null Log-Likelihood:    -2495.3298500
#> AIC:                     4397.6291305
#> BIC:                     4428.5726000
#> McFadden R2:                0.1208318
#> Adj McFadden R2:            0.1188281
#> Number of Observations:  3600.0000000

Best Practices

Power Analysis Workflow

Start with literature: Base effect size assumptions on previous studies
Use realistic priors: Conservative estimates are often better than optimistic ones
Test multiple scenarios: Conservative, moderate, and optimistic effect sizes
Compare designs: Test different design methods and features
Plan for attrition: Add 10-20% to account for incomplete responses
Document assumptions: Record all assumptions for future reference