Skip to main content

Inferential Statistics

Population vs Sample

ConceptDescriptionProsConsExample
PopulationEntire group of interest (all individuals/objects)Complete source of truthUsually too large/infinite to study fullyAll voters in a country
SampleSubset of population chosen for studyPractical, manageable, enables inferenceMay mislead if non-representative1,000 voters surveyed

Probability Sampling

MethodDescriptionProsConsExample
Simple Random Sampling (SRS)Every individual has equal chance of selectionEasy, unbiasedImpractical for large groups; may miss subgroupsRandomly choosing 100 IDs
Stratified SamplingPopulation divided into homogeneous subgroups (strata); random sampling within eachEnsures subgroup representation; more preciseRequires population info; complexSampling students by grade level
Systematic SamplingSelect every k-th element after random startSimple, efficientBias risk if population has patternsEvery 10th store customer
Cluster SamplingDivide into clusters, randomly select clusters, then sample all in themCost-effective; useful for dispersed groupsHigher error if clusters heterogeneousSurvey all students in 10 selected schools

Non-Probability Sampling

MethodDescriptionProsConsExample
Convenience SamplingUse easily accessible participantsFast, cheapStrong bias risk, not representativeSurveying people in one street
Purposive SamplingResearcher selects based on judgment/criteriaGood for niche cases or expertsBias-prone, not generalizableInterviewing only industry experts
Quota SamplingFill quotas for subgroups without randomizationEnsures subgroup presenceStill biased; no random selection50 men, 50 women chosen by interviewer

Sampling Distribution & Estimation

ConceptDescriptionKey Points
Sampling DistributionDistribution of a statistic across all possible samplesCentral Limit Theorem ensures approximate normality for large n. Standard Error measures precision
Point EstimateSingle sample statistic used to estimate population parameterSimple, quick, but no reliability measure
Interval Estimate (Confidence Intervals)Range around point estimate with confidence levelCaptures uncertainty, widely used in decision-making. Not probability for one sample - reflects long-run accuracy