20 Analytical Tools Every Market Researcher Should Know

Light

post-banner
Space

 

 

In the age of data science, data is no longer scarce – despite it often being a challenge to wrangle. Even with warehouses and lakes of it, data doesn’t drive results unless it’s used correctly to interpret its implications. To fully activate your data, you need the right statistical data analysis tools.
Here’s a look at 20 analytical tools that all researchers should have within reach, ranging from the basic foundational pieces (“that rings a bell from seventh grade math”) to some more sophisticated tools (“graduate-level stat geek”). While you can’t necessarily hold these tools in your hand, they’re still critical instruments that have a distinct purpose and application.

 

 

Master brand tracking and CX with these words

  1. Mean
    A simple mathematical average. It’s a quick and easy way to determine the central tendency of your data set. Use it to get a benchmark of “typical” behaviors in your population.
  2. Median
    The value where half the data lies above and half the data falls below. Like the mean, it can give you an idea of central tendency, except a median won’t be distorted by outliers. It can help you identify typical benchmarks, without as much influence from extreme behaviors.
  3. Frequency Distribution
    A table showing all the responses to a survey question and how often those responses are given. It’s a convenient way to identify outlier responses you may want to clean out or cap.
  4. Histogram
    A bar chart that tallies how many responses take on each value (the frequency distribution in summary chart form). When the number of distinct response values is large, it’s a great way to visualize “normal” behaviors and tendencies in your data.
  1. Variance
    A measure of how dispersed the data is around the mean. It’s useful in statistical significance testing, which can help you tell whether results from two groups or waves of data are genuinely different or just appear different due to sampling error.
  1. Interquartile Range
    Like when Olympics judges throw out the highest and lowest scores, this identifies where the middle half of the data falls by excluding the top 25% and bottom 25% of the statistical spread. It tells you something similar to the variance, but without getting bogged down in complex statistical concepts.
  1. Crosstab
    A table that crosses responses to two separate questions. It shows you how the two responses relate to each other. Use it to compare sub-groups or to discover overlaps between target demographics or behaviors.
  1. Chi-Square
    A statistic applied to a crosstab that tests whether the distribution of responses to one question varies depending upon responses to another. Use it to determine whether the results from one subgroup or wave of data truly follow a different pattern from another subgroup’s results or simply appear to because of random variation.
  1. t-test
    A value that determines whether your sample results (means or proportions) from two sub-groups represent a true difference between the populations from which they were drawn. The larger the “t-score,” the greater the likelihood that the groups are truly different.
  1. Correlation
    A statistical analysis that indicates the strength of relationship between measures (how consistently does one move together with the other), or whether a (linear) relationship exists at all. Coupled with knowledge about the category or situation, it can be used to make assumptions about causality.
  1. Regression
    a.k.a. bivariate ordinary least squares (OLS)
    – A descriptive technique quantifying how much a dependent variable tends to change given an observed change in a single independent variable. Use it to get a relative sense of potential impact a marketing or product decision may have on customer engagement.
  1. Multiple Regression
    A predictive model that quantifies the effect on one dependent variable based on changes to two or more independent variables. A multiple regression analysis is useful to isolate the impact uniquely attributed to a given variable in the context of others. It can also be helpful to measure how multiple decisions can impact customer engagement when executed simultaneously – though more advanced techniques are required when independent variables are badly inter-correlated, as is frequently the case with survey data.
  1. Supervised Bayesian Network
    A predictive model that quantifies the effect on the dependent variable while taking into account that the independent variables impact each other. It’s helpful to understand how a single marketing or product decision can set off a chain reaction of outcomes for your business.
  1. Factor Analysis – Sometimes you can be swimming in too much data. Factor Analysis is a data reduction process where you can boil down your data to a smaller number of underlying dimensions. Use it to synthesize your results and simplify the story.
  1. Bayesian Network Community
    A group of closely inter-related perceptions in a Bayesian Network. It helps you understand the path by which changes in one perception can ripple through other perceptions to have a cumulative impact on your business, and it can inform which combination of levers you should try to pull.
  1. Perceptual Map
    A visual display of how a set of brands is perceived in the market. It can be used to more clearly identify a brand’s strengths and weaknesses relative to competitors, as well as “white space” in the market.
  1. Cluster Analysis
    The process of grouping respondents into “clusters” based on similar characteristics, behaviors, motivations or other criteria. It’s a great way to arrange potential customers into distinct market groups and select your target(s).
  1. Conjoint Analysis
    A technique used to determine the importance of product features and preferred levels of those attributes. It’s very useful for understanding how shoppers choose products and what trade-offs they’re willing to make, which can inform product and pricing decisions.
  1. MaxDiff
    A survey question format used to understand preference or importance scores for multiple items without overwhelming the respondent with a long list of features. This question format asks a few questions about “most” and “least” preferred or important items within systematically chosen subsets of your list. Use it when you need to prioritize features, benefits or claims and want to avoid “everything’s important” or cross-cultural scale biases that can distort the results.
  1. TURF (Total Unduplicated Reach & Frequency)
    An analysis that helps you identify the most efficient ways to reach a maximum audience with minimal redundancy. Use it when you’re choosing a set of flavors/varieties to put on-shelf or picking a set of claims to make.

 

Want to learn more about how Material’s industry-leading market research services can help you fully activate your data? Let’s start the conversation.