Sampling Methods In Research | Probability Vs Non-Probability

Quick answer. Sampling methods fall into two families. Probability sampling (simple random, stratified, cluster, systematic) gives every member of the population a known chance of selection — required for statistical generalisation. Non-probability sampling (purposive, snowball, quota, convenience) selects participants based on judgement or accessibility — standard in qualitative research and exploratory studies. Choose based on your research question: questions about population proportions or causal effects need probability sampling; questions about meaning, experience, or process work fine with purposive non-probability sampling.

Sampling is the bridge between your population of interest and the participants you can actually study. The choice of sampling method shapes what you can claim from your data, what statistical tests are valid, and how examiners will judge your methodology chapter. This guide walks through both families, names every common method, and gives you a decision tree for matching method to research question.

Population, Sampling Frame, Sample

Three terms you need to use precisely:

Population — everyone or everything your research question is about (“first-year undergraduates at Canadian universities”).
Sampling frame — the operational list from which you actually draw participants (“the student registries at five GTA universities”).
Sample — the people who consent and participate (“120 students”).

The gap between population and sampling frame is your coverage error. The gap between sampling frame and sample is your nonresponse error. Both matter; both need to be reported.

Probability Sampling Methods

Simple random sampling

Every member of the sampling frame has equal probability of selection. Implementation: list everyone, assign a random number, take the first N. Gold standard for unbiased estimation but rarely practical because you need a complete frame. Best for: small populations with an accessible list.

Stratified random sampling

Divide the population into mutually-exclusive strata (e.g., year of study, gender, institution). Randomly sample within each stratum. Strata can be sampled proportionally or oversampled for small groups. Gives more precise estimates per sub-group than simple random. Best for: studies where you need to compare across known sub-populations.

Systematic sampling

Pick every k-th member from an ordered list. If your list has 10,000 and you want 100, take every 100th (k = 100). Random starting point. Cheaper than simple random but vulnerable if there’s a periodicity in the list.

Cluster sampling

Randomly select clusters (whole groups), then study everyone or a sub-sample within selected clusters. Standard when individual sampling is impossible (e.g., randomly select 10 classrooms, then survey every student in those classrooms). Sacrifices some precision for cost. Best for: nationally-representative studies, school-based research, geographically dispersed populations.

Multistage sampling

Combine clustering with stratification across multiple stages (e.g., stage 1 random sample of provinces, stage 2 random sample of schools, stage 3 random sample of students). Standard in large national surveys (Statistics Canada PISA, Canadian Community Health Survey).

Non-Probability Sampling Methods

Purposive (judgemental) sampling

Researcher selects participants based on specific characteristics relevant to the research question. Variants: maximum-variation (deliberate diversity), homogeneous (deliberate similarity), critical-case (extreme or revelatory cases), typical-case. Standard in qualitative research.

Snowball sampling

Start with a few participants meeting your criteria; ask them to refer others. Useful for hidden populations (sex workers, undocumented migrants, members of stigmatised communities). Biased toward connected individuals; mitigated by respondent-driven sampling (RDS) which adjusts for network structure.

Quota sampling

Set quotas for sub-groups (e.g., 50 women, 50 men, 25 each from four provinces) then recruit until quotas are filled. Faster than stratified random but doesn’t allow probability-based inference because selection within quotas is convenience-based.

Convenience sampling

Recruit whoever is easily accessible (your university’s subject pool, social-media solicitations, MTurk, Prolific). Cheapest method. Generalisation is severely limited — results apply only to your sample, not to a broader population. Acceptable for pilot studies, exploratory research, and methodological development. Examiners flag overuse.

Hire A Professional Academic Writer

PhD-qualified Canadian writer
AI-free, plagiarism-free guaranteed
On-time delivery, every time
Unlimited free revisions 14 days
24/7 chat support

Order Now Learn More

ResearchProspect Canada academic writers

Probability vs Non-Probability: Decision Matrix

Question type	Recommended family	Specific method
What proportion of Canadian students use ChatGPT for essays?	Probability	Stratified or multistage
Does X intervention reduce anxiety scores?	Probability (RCT context)	Random assignment from accessible pool
How do trans students experience clinical placements?	Non-probability	Purposive + snowball
What themes characterise PhD-student burnout narratives?	Non-probability	Purposive (maximum variation)
How does adoption of AI tools spread through a Slack community?	Non-probability	Network sample / RDS
National prevalence of food insecurity among grad students	Probability	Multistage with stratification

Sample Size: How Many Is Enough?

For probability samples answering quantitative questions, calculate sample size via power analysis (see our hypothesis-testing guide). Typical Master’s thesis: N = 100–300. PhD: N = 300–1,000+ depending on effect size and design.

For non-probability qualitative samples, the criterion is saturation — you stop recruiting when new interviews produce no new themes. Empirical work suggests saturation typically occurs at:

12–15 interviews for homogeneous populations exploring a focused question.
20–30 interviews for diverse populations or broad questions.
6–8 focus groups (4–8 participants each).

Smaller numbers are acceptable for phenomenological studies (5–8 deep interviews) and case studies (1–6 cases).

Common Sampling Errors

Mismatched method — using convenience sampling and claiming national generalisation. Examiners will catch this.
Undefined frame — saying “a random sample of Canadian students” without naming the registry, list, or recruitment channel.
Self-selection bias unacknowledged — web-survey respondents differ from non-respondents in systematic ways. Report response rates and discuss potential bias.
Survivor bias — sampling only those who completed a programme misses dropouts who are often the more informative group.
WEIRD samples — Western, Educated, Industrialised, Rich, Democratic. Acknowledge if your sample is WEIRD; do not claim universal results.

Reporting Sampling in Your Dissertation

In the methodology chapter, dedicate one section to sampling. State:

Target population.
Sampling frame (specific list, registry, or platform).
Sampling method with full name and any sub-variant.
Inclusion and exclusion criteria.
Recruitment procedure (channels, incentives, timing).
Final sample size with justification (power analysis or saturation).
Response rate (probability) or saturation criterion (qualitative).
Demographic characteristics of the sample.
Discussion of bias and generalisation limits.

Frequently Asked Questions

Can I mix probability and non-probability sampling?

Yes, in mixed-methods designs. Quantitative arm uses probability; qualitative arm uses purposive. Document each separately; do not pool the samples for inference.

Is MTurk a probability or non-probability sample?

Non-probability. MTurk and Prolific are convenience pools that skew younger, more educated, and more US-based than the general population. Use them for pilots and method development, not population-level claims.

How do I report saturation in a qualitative thesis?

Track new codes added after each interview. When the last three interviews produce no new codes, you have reached thematic saturation. Report this explicitly: “Saturation was achieved after interview 18; interviews 19 and 20 confirmed no additional themes emerged.”

Does ethics approval differ by sampling method?

Yes for some. Snowball sampling raises confidentiality concerns (referring participants implies the referrer’s membership in the criterion group). Vulnerable populations (Indigenous communities, prisoners, minors) require additional ethics protocols. See TCPS 2 Chapter 9 for Indigenous research and Chapter 12 for human biospecimen sampling.

What sample size do I need for a survey?

For 95% confidence, ±5% margin of error, and an unknown population proportion (p = 0.50), n = 384. Adjust for finite populations using a correction factor. Stratification requires this minimum per stratum.

How do I report sample size in the abstract?

Always include the total N and, for grouped designs, the per-group N (e.g., “120 students, 60 per condition”). See our abstract-writing guide for full conventions.

About Jason White

Qualitative Research Specialist

Jason White writes on qualitative methods — interviewing, coding, grounded theory — and supports postgraduate students with NVivo, ATLAS.ti and Dedoose workflows.

Qualitative researchNVivoGrounded theory

Sampling Methods in Research: Probability vs Non-Probability