Probability: Basics

Overview

Definition
Use Cases
Examples
Concepts

Probability is a measure of the likelihood or chance that a certain event will occur. It is quantified as a number between 0 and 1. An event with a probability of 1 is considered a certainty, while an event with a probability of 0 is considered impossible. The probability of an event is often expressed as a fraction or decimal, and can also be expressed as a percentage.

Formally, probability can be defined in several ways:

Classical definition (also known as the mathematical definition) defines probability as the ratio of the number of favorable outcomes to the number of possible outcomes. This definition assumes that all outcomes are equally likely
Relative frequency definition of probability defines it as the limit of the frequency of a particular outcome as the number of trials approaches infinity
Axiomatic definition, which is the most general and abstract, defines probability based on a set of axioms related to the properties that probabilities should have

Venn Diagrams

Complement
Subset
Union
Intersection

Complement $\overline{A}$ ( $A'$ ): all elements of S that are not in A

Subset $A ⊂ B$ : all elements of A are also elements of B

Union $A ∪ B$ : all elements of S that are in A or B

Intersection $A ∩ B$ : all elements of S that are in A and B

Sample Spaces, Events, and Probability Axioms

Sample Space
Event
Probability Axioms

Sample space (denoted by S), is the set of all possible outcomes of a random experiment. It encompasses every conceivable outcome that could result from the experiment. For example, when rolling a 6-sided dice, the sample space is S = { 1, 2, 3, 4, 5, 6 }

Determining the sample space involves identifying all possible outcomes of a given random experiment. This process requires careful consideration of the experiment's nature and the potential outcomes it could yield. Various techniques can be employed to determine sample spaces, including enumeration, listing all possible outcomes explicitly, and logical reasoning based on the experiment's conditions and constraints.

For example, when flipping two coins successively, we can determine the sample space by considering all possible combinations of outcomes:

$S = { HH, HT, TH, TT }$

H heads
T tails

Event (denoted by E), is any subset of the sample space. It represents a specific outcome or a collection of outcomes of interest. Events can range from simple, consisting of a single outcome, to compound, containing multiple outcomes. For instance, in the context of rolling a dice, the event "rolling an even number" corresponds to the subset E = { 2, 4, 6 }

Classifications

Events in probability theory can be classified based on various criteria, including their complexity, relationship to the sample space, and interdependence.

Simple Events: consisting of a single outcome, such as rolling a specific number on a dice
Compound Events: comprising multiple outcomes, such as rolling an even number or drawing a red card from a deck
Exhaustive Events: that cover all possible outcomes of a random experiment, leaving no room for other outcomes
Mutually Exclusive events that cannot occur simultaneously, meaning if one event happens, the other event cannot occur
Independent Events: where the occurrence of one event does not influence the occurrence of the other event

Probability Rules and Laws

Addition
Multiplication
Conditional
Bayes' Theorem
Law of Total Probability

Addition rule of probability states that the probability of the union of 2 events B is equal to the sum of their individual probabilities minus the probability of their intersection:

$P(A∪B)=P(A)+P(B)-P(A∩B)$

This rule holds for both mutually exclusive and non-mutually exclusive events.

Example

Consider tossing a fair six-sided die. Let event A be rolling an even number (6) and event B be rolling a number less than 4 {1, 2, 3}. The probability of either rolling an even number or a number less than 4 is:

$P(A∪B)=P(A)+P(B)-P(A∩B)=\frac{3}{6}+\frac{3}{6}-\frac{1}{6}=\frac{5}{6}$

Multiplication rule of probability states that the probability of the intersection of two events A and B is equal to the probability of A multiplied by the conditional probability of B given A:

$P(A∩B)=P(A)×P(B∣A)$

This rule is applicable when events A and B are independent or when the conditional probability of B given A is known.

Example

Consider drawing two cards successively from a standard deck of 52 cards without replacement. Let event A be drawing a red card on the first draw and event B be drawing a red card on the second draw given that a red card was drawn on the first draw. The probability of drawing 2 red cards is:

$P(A∩B)=P(A)×P(B∣A)=\frac{26}{52}×\frac{25}{51}=\frac{25}{102}$

Conditional probability measures the likelihood of an event B occurring given that another event A has already occurred. It is denoted by P(B∣A) and can be calculated using the formula:

$P(B∣A)=\frac{P(A∩B)}{P(A)}$

This concept is crucial for understanding the relationship between events and updating probabilities based on new information.

Example

Given a standard deck of 52 cards, the probability of drawing a king from a shuffled deck given that the first card drawn was an ace:

$P(King∣Ace)=\frac{P(Ace and King)}{P(Ace)}=\frac{\frac{4}{52}}{\frac{4}{52}}=\frac{1}{13}$

Bayes' theorem provides a method for updating probabilities based on new evidence or information. It states that the probability of an event A occurring given that event B has occurred is proportional to the probability of B given A times the prior probability of A, divided by the probability of B:

$P(A∣B)=\frac{P(B∣A)×P(A)}{P(B)}$

Bayes' theorem is widely used in statistics, machine learning, and various fields for inference and decision-making.

Example

Consider a medical test for a rare disease that has a false positive rate of 5% and a false negative rate of 1%. If 0.1% of the population has the disease, what is the probability that a person has the disease given that they test positive?

$P(Disease|Positive)=\frac{P(Positive|Disease)×P(Disease)}{P(Positive)}$

The law of total probability states that the probability of an event B can be calculated by summing the probabilities of B given different outcomes of another event A, weighted by the probabilities of those outcomes occurring:

$P(B)=\sum_i P(B∣A_i)×P(A_i)$

This law is particularly useful when the sample space can be partitioned into mutually exclusive events.

Example

Suppose there are two factories producing a certain type of product. Factory A produces 60% of the products, and factory B produces 40%. The defect rates for products from factories A and B are 5% and 3%, respectively.The probability that a randomly selected product is defective:

$P(Defective)=P(Defective|A)×P(A)+P(Defective|B)×P(B)=(0.05×0.60)+(0.03×0.40)=0.033$

Probability Distributions

Definition
Discrete Probability Distributions
Continuous Probability Distributions

Probability distributions describe the likelihood of various outcomes in a given scenario. Understanding different probability distributions is essential for modeling real-world phenomena and making predictions.

Overview​

Venn Diagrams​

Sample Spaces, Events, and Probability Axioms​

Classifications​

Probability Rules and Laws​

Probability Distributions​

Overview

Venn Diagrams

Sample Spaces, Events, and Probability Axioms

Classifications

Probability Rules and Laws

Probability Distributions