Survey research is one of the most common quantitative methods in social sciences, business, education, and healthcare. Surveys measure attitudes, beliefs, behaviors, experiences through questionnaires. Survey data analysis presents unique challenges distinct from experimental data: surveys often include Likert scales (ordinal data), categorical variables, and large numbers of items that need to be combined into scales. Analyzing survey data requires understanding how to treat Likert responses (as ordinal or interval?), compute scale scores (sum or average?), test scale reliability (internal consistency), and analyze the resulting data appropriately. Many students collect survey data but struggle with analysis—how to create composite scores, whether Cronbach's alpha is adequate, how to treat missing responses, what tests are appropriate for ordinal data. Survey data analysis help covers the full workflow: data cleaning and preparation, reliability testing, descriptive analysis, and inferential statistics—all with survey-specific considerations. This guide covers common survey analysis approaches, how to handle Likert scales, reliability testing, and how to analyze survey findings rigorously.
Survey data preparation
Likert scale coding and composites
- Likert items: Ordinal data (ordered but not equal intervals). 5-point scale: 1=Strongly Disagree to 5=Strongly Agree
- Composite scores: Sum or average items measuring the same construct. E.g., job satisfaction scale = average of 10 job satisfaction items
- Reverse-scoring: Some items worded negatively ("This organization does not value diversity"). Reverse-code these (1→5, 2→4, etc.) before summing
- Missing data handling: If respondent skips one item in a 10-item scale, options: (a) exclude that respondent, (b) use average of answered items, (c) impute mean value. Decide before analysis
- Scale normality: Treat Likert composite scores as approximately continuous (interval) if scale has many items (10+). Smaller scales may need non-parametric tests
Data cleaning checklist
- Check for missing data patterns (who didn't respond to certain questions?)
- Identify outliers or unusual response patterns (respondent selected same answer for all 50 items)
- Verify reverse-coded items are reverse-scored before computing composites
- Create composite/scale variables (don't analyze individual items separately if they form a scale)
- Check that all responses are within valid range (no "6" on 5-point scale)
- Verify skip logic was followed (if respondent selected "No" they shouldn't have answered follow-up questions)
Reliability testing (internal consistency)
Cronbach's alpha
- What it measures: Internal consistency of a scale—do all items measuring the construct correlate with each other?
- Range: 0–1. Higher is better. .70 acceptable (some use .60 for exploratory research)
- Interpretation: α = .82 means items are reasonably intercorrelated; items measure the same construct
- Calculation: SPSS: Analyze → Reliability → Alpha. Include all items in scale
- Note: High alpha alone doesn't prove the scale measures what you think it does (construct validity issue). It just shows internal consistency
Item-total correlations
- What they show: Does each item correlate with the overall scale? If an item doesn't correlate, it may not belong in the scale
- Adequate correlation: Item-total r ≥ .30 is typical. If item-total r < .20, consider removing that item
- What SPSS shows: Corrected Item-Total Correlation; Cronbach's Alpha if Item Deleted (what would alpha be if you removed this item)
- Using this information: If removing an item increases alpha substantially, that item doesn't fit the scale—consider removing it
Descriptive analysis
Frequency distributions
- For categorical items: How many respondents selected each response? E.g., "Gender: 60% Female, 40% Male"
- For continuous/composite scales: Mean, SD, range, skewness, kurtosis. Check distribution—is it normal or skewed?
- Presentation: Frequency tables (counts, percentages), histograms (visual distribution), summary statistics tables
Cross-tabulations (crosstabs)
- What they show: Relationship between two categorical variables. E.g., how many males vs. females in each job satisfaction category?
- Presentation: 2×2 table (row percentages or column percentages) showing cell counts and percentages
- Statistical test: Chi-square test (are the variables related?); report χ², df, p-value
Inferential analysis of survey data
Comparing groups on survey scales
- Two groups (t-test): Do males and females differ in job satisfaction? Compare means on the satisfaction scale
- Three+ groups (ANOVA): Do satisfaction levels differ across three age groups? F-test followed by post-hoc comparisons
- Note on Likert: Strictly, single Likert items are ordinal; use Mann-Whitney U or Kruskal-Wallis. But composite scales (average of items) approximate continuous data; t-test/ANOVA acceptable with large samples
Relationships between survey scales
- Correlation: Pearson's r for continuous/composite scales. Spearman's rho for ordinal items
- Regression: Predict one scale from others. E.g., does job satisfaction predict organizational commitment?
Common survey analysis mistakes
- Not reverse-coding negatively-worded items before summing: If you don't reverse-code, high scorers on positive items and low scorers on negative items won't sum correctly
- Analyzing individual items instead of composite scales: If 10 items form a "job satisfaction" scale, analyze the scale (average), not each item separately
- Low Cronbach's alpha but using the scale anyway: If α = .45, the items don't hang together. Either remove problematic items or don't use a composite score
- Treating Likert scales as continuous without justification: Single 5-point items are ordinal. If using parametric tests on single items, justify this choice or use non-parametric alternatives
- Ignoring missing data: Decide on missing data strategy before analysis. Listwise deletion (exclude anyone missing any item) can drastically reduce sample size
- p-hacking (trying multiple tests until one is significant): Run planned analyses based on your hypotheses. Don't run every possible test and report the ones that are significant
Survey analysis checklist
- ☐ Missing data strategy decided and documented
- ☐ Reverse-coding completed before computing scales
- ☐ Composite scales created (sum or average of items)
- ☐ Reliability tested (Cronbach's alpha ≥ .70)
- ☐ Item-total correlations reviewed (≥ .30)
- ☐ Scale normality checked (distribution appropriate for tests used)
- ☐ Descriptive statistics reported (N, M, SD, range)
- ☐ Appropriate statistical tests selected (t-test vs. Mann-Whitney)
- ☐ p-values and effect sizes reported
- ☐ Results presented in tables with clear labels
- ☐ Interpretation connected to research questions
Get survey data analysis help
From Likert scale composites to reliability testing to inferential analysis, survey analysis help ensures your data is properly prepared and analyzed rigorously.
Order survey analysis helpFAQ
Strictly, no—Likert items are ordinal. But composite scales (average of multiple items) approximate continuous data, especially with 10+ items. Single 5-point items should use non-parametric tests (Mann-Whitney U, Kruskal-Wallis, Spearman's rho). Justify your choice in your methods section
Typical threshold is .70. Some fields accept .60 for exploratory research. Above .90 may indicate redundancy (items too similar). Report the alpha you achieved; if it's lower than desired, discuss limitations
General rule: 30+ minimum (for basic analyses). For complex models (regression with multiple predictors), use power analysis: typically 10–20 participants per predictor. For descriptive surveys, larger is better (100+) to represent the population
Options: (1) exclude those respondents (listwise deletion), (2) use the average of responded items in the scale, or (3) impute the mean. Document your choice. For 10-item scales with 1–2 missing items, option 2 (average of responded items) is often reasonable