auto-update(nvim): 2025-01-19 02:07:58
Some checks are pending
Deploy Quartz site to GitHub Pages using Nix / build (push) Waiting to run
Deploy Quartz site to GitHub Pages using Nix / deploy (push) Blocked by required conditions

This commit is contained in:
Youwen Wu 2025-01-19 02:07:58 -08:00
parent d3e0141da5
commit 503af18623
Signed by: youwen5
GPG key ID: 865658ED1FE61EC3
2 changed files with 173 additions and 8 deletions

View file

@ -1,7 +1,7 @@
#import "./dvd.typ": *
#import "@youwen/zen:0.1.0": *
#import "@preview/ctheorems:1.1.3": *
#show: dvdtyp.with(
#show: zen.with(
title: "PSTAT120A Course Notes",
author: "Youwen Wu",
date: "Winter 2025",
@ -10,6 +10,13 @@
#outline()
= Introduction
PSTAT 120A is an introductory course on probability and statistics. However, it
is a theoretical course rather an applied statistics course. You will not learn
how to read or conduct real-world statistical studies. Leave your $p$-values at
home, this ain't your momma's AP Stats.
= Lecture #datetime(day: 6, month: 1, year: 2025).display()
== Preliminaries
@ -237,6 +244,12 @@ Requires equally likely outcomes and finite sample spaces.
== Relative frequency approach
An approach done commonly by applied statisticians who work in the disgusting
real world. This is where we are generally concerned with irrelevant concerns
like accurate sampling and $p$-values and such. I am told this is covered in
PSTAT 120B, so hopefully I can avoid ever taking that class (as a pure math
major).
$
P(A) = (hash "of times" A "occurs in large number of trials") / (hash "of trials")
$
@ -252,14 +265,26 @@ its parlance to lend credibility to subjective judgements of confidence.
== Axiomatic approach
Our focus in PSTAT 120A. It seems rather silly to call this approach axiomatic
given we are essentially just defining a function with a few given properties
and deriving theorems from it while working atop our pre-existing (shaky,
non-rigorous) "axioms" of set theory, but this is the terminology that the
course uses.
Consider a random experiment. Then:
#definition[
Let $P : X -> RR$ be a function satisfying the following axioms (properties).
The *sample space* $Omega$ is the set of all possible outcomes of the
experiment.
]
#definition[
Elements of $Omega$ are called *sample points*.
]
#definition[
Subsets of $Omega$ are called *events*. The collection of events (in other
terms, the power set of $Omega$) in $Omega$ is denoted by $cal(F)$.
]
#definition[
The *probability measure*, or probability distribution, or simply probability s a function $P$.
Let $P : cal(F) -> RR$ be a function satisfying the following axioms (properties).
+ $P(A) >= 0, forall A$
+ $P(Omega) = 1$
@ -267,6 +292,15 @@ course uses.
$ P(union.big_(i=1)^infinity A_i) = sum_(i=1)^infinity P(A_i) $
]
The 3-tuple $(Omega, cal(F), P)$ is called a *probability space*.
#remark[
In more advanced texts you will see $Omega$ introduced as a so-called
$sigma$-algebra. A $sigma$-algebra on a set $Omega$ is a nonempty collection
$Sigma$ of subsets of $Omega$ that is closed under set complement, countable
unions, and as a corollary, countable intersections.
]
Now let us show various results with $P$.
#proposition[
@ -450,3 +484,111 @@ Properties of the #smallcaps[pdf]:
#example[
Waiting time for bus: $Omega = {s : s >= 0}$.
]
= Notes on counting
The cardinality of $A$ is given by $hash A$. Let us develop methods for finding
$hash A$ from a description of the set $A$ (in other words, methods for
counting).
== General multiplication principle
#fact[
Let $A$ and $B$ be finite sets, $k in ZZ^+$. Then let $f : A -> B$ be a
function such that each element in $B$ is the image of exactly $k$ elements
in $A$ (such a function is called _$k$-to-one_). Then $hash A = k dot hash
B$.
]<ktoone>
#example[
Four fully loaded 10-seater vans transported people to the picnic. How many
people were transported?
By @ktoone, we have $A$ is the set of people, $B$ is the set of vans, $f : A -> B$ maps a person to the van they ride in. So $f$ is a 10-to-one function, $hash A = 40$, $hash B = 4$, and clearly the answer is $10 dot 4 = 40$.
]
#definition[
An $n$-tuple is an ordered sequence of $n$ elements.
]
Many of our methods in probability rely on multiplying together multiple
outcomes to obtain their combined amount of outcomes. We make this explicit below in @tuplemultiplication.
#fact[
Suppose a set of $n$-tuples $(a_1, ..., a_n)$ obeys these rules:
+ There are $r_1$ choices for the first entry $a_1$.
+ Once the first $k$ entries $a_1, ..., a_k$ have been chosen, the number of alternatives for the next entry $a_(k+1)$ is $r_(k+1)$, regardless of the previous choices.
Then the total number of $n$-tuples is the product $r_1 dot r_2 dot r_2 dot dots dot r_n$.
]<tuplemultiplication>
#proof[
It is trivially true for $n = 1$ since you have $r_1$ choices of $a_1$ for a
1-tuple $(a_1)$.
Let $A$ be the set of all possible $n$-tuples and $B$ be the set of all
possible $(n+1)$-tuples. Now let us assume the statement is true for $A$.
Proceed by induction on $B$, noting that for each $n$-tuple in $A$, $(a_1,
..., a_n)$, we have $r_(n+1)$ tuples in $A$.
Let $f : B -> A$ be a function which takes each $(n+1)$-tuple and truncates the $a_(n+1)$ term, leaving us with just an $n$-tuple of the form $(a_1, a_2, ..., a_n)$.
$ f((a_1, ..., a_n, a_(n + 1))) = (a_1, ..., a_n) $
Now notice that $f$ is precisely a $r_(n+1)$-to-one function! Recall by
our assumption that @tuplemultiplication is true for $n$-tuples, so $A$ has $r_1 dot
r_2 dot ... dot r_n$ elements, or $hash A = r_1 dot ... dot r_n$. Then by
@ktoone, we have $hash B = hash A dot r_(n+1) = r_1 dot r_2 dot
... dot r_(n+1)$. Our induction is complete and we have proved @tuplemultiplication.
]
@tuplemultiplication is sometimes called the _general multiplication principle_.
We can use @tuplemultiplication to derive counting formulas for various
situations. Let $A_1, A_2, A_n$ be finite sets. Then as a corollary of
@tuplemultiplication, we can count the number of $n$-tuples in a finite
Cartesian product of $A_1, A_2, A_n$.
#fact[
Let $A_1, A_2, A_n$ be finite sets. Then
$
hash (A_1 times A_2 times ... times, A_n) = (hash A_1) dot (hash A_2) dot ... dot (hash A_n) = Pi^n_(i=1) (hash A_i)
$
]
#example[
How many distinct subsets does a set of size $n$ have?
The answer is $2^n$. Each subset can be encoded as an $n$-tuple with entries 0
or 1, where the $i$th entry is 1 if the $i$th element of the set is in the
subset and 0 if it is not.
Thus the number of subsets is the same as the cardinality of
$ {0,1} times ... times {0,1} = {0,1}^n $
which is $2^n$.
This is why given a set $X$ with cardinality $aleph$, we write the
cardinality of the power set of $X$ as $2^aleph$.
]
== Permutations
Now we can use the multiplication principle to count permutations.
#fact[
Consider all $k$-tuples $(a_1, ..., a_k)$ that can be constructed from a set $A$ of size $n, n>= k$ without repetition. The total number of these $k$-tuples is
$ (n)_k = n dot (n - 1) ... (n - k + 1) = n! / (n-k)! $
In particular, with $k=n$, each $n$-tuple is an ordering or _permutation_ of $A$. So the total number of permutations of a set of $n$ elements is $n!$.
]
#proof[
We construct the $k$-tuples sequentially. For the first element, we choose
one element from $A$ with $n$ alternatives. The next element has $n - 1$
alternatives. In general, after $j$ elements are chosen, there are $n - j +
1$ alternatives.
Then clearly after choosing $k$ elements for our $k$-tuple we have by
@tuplemultiplication the number of $k$-tuples being $n dot (n - 1) dot ...
dot (n - k + 1) = (n)_k$.
]

View file

@ -73,3 +73,26 @@
$ {{x_1, x_2, x_3, x_4} : x_i >= 0, i = 1,...,6 sum_(j=1)^4 x_j = 6} $
]
]
+ #[
#set enum(numbering: "a)", spacing: 2em)
+ #[
We want to determine how many ways to choose 8 people from 27 people, or $vec(27,8) = 2220075$.
]
+ #[
This is the same as the choosing 4 of the 12 men and 4 of the 15 women, and pairing each group of men with each group of women once. So,
$ vec(12,4) times vec(15, 4) = 675675 $
]
+ #[
First we determine the amount of ways to choose less than 2 women.
$ vec(15, 0) vec(12, 8) + vec(15, 1) times vec(12,7) $
Then the total amount of ways to choose 8 people, from part a, is $vec(27,8)$.
Then the chance of forming a committee with less than 2 women is
$ (vec(15, 0) vec(12, 8) + vec(15, 1) vec(12,7)) / vec(27,8) $
So our final answer is
$ 1 - (vec(15, 0) vec(12, 8) + vec(15, 1) vec(12,7)) / vec(27,8) $
]
]