auto-update(nvim): 2025-01-19 02:07:58

2025-01-19 02:07:58 -08:00 · 2025-01-19 02:07:58 -08:00 · 503af18623
commit 503af18623
parent d3e0141da5
2 changed files with 173 additions and 8 deletions
--- a/documents/by-course/pstat-120a/course-notes/main.typ
+++ b/documents/by-course/pstat-120a/course-notes/main.typ
@ -1,7 +1,7 @@
-#import "./dvd.typ": *
+#import "@youwen/zen:0.1.0": *
 #import "@preview/ctheorems:1.1.3": *

-#show: dvdtyp.with(
+#show: zen.with(
  title: "PSTAT120A Course Notes",
  author: "Youwen Wu",
  date: "Winter 2025",
@ -10,6 +10,13 @@

 #outline()

+= Introduction
+
+PSTAT 120A is an introductory course on probability and statistics. However, it
+is a theoretical course rather an applied statistics course. You will not learn
+how to read or conduct real-world statistical studies. Leave your $p$-values at
+home, this ain't your momma's AP Stats.
+
 = Lecture #datetime(day: 6, month: 1, year: 2025).display()

 == Preliminaries
@ -237,6 +244,12 @@ Requires equally likely outcomes and finite sample spaces.

 == Relative frequency approach

+An approach done commonly by applied statisticians who work in the disgusting
+real world. This is where we are generally concerned with irrelevant concerns
+like accurate sampling and $p$-values and such. I am told this is covered in
+PSTAT 120B, so hopefully I can avoid ever taking that class (as a pure math
+major).
+
 $
  P(A) = (hash "of times" A "occurs in large number of trials") / (hash "of trials")
 $
@ -252,14 +265,26 @@ its parlance to lend credibility to subjective judgements of confidence.

 == Axiomatic approach

-Our focus in PSTAT 120A. It seems rather silly to call this approach axiomatic
-given we are essentially just defining a function with a few given properties
-and deriving theorems from it while working atop our pre-existing (shaky,
-non-rigorous) "axioms" of set theory, but this is the terminology that the
-course uses.
+Consider a random experiment. Then:

 #definition[
-  Let $P : X -> RR$ be a function satisfying the following axioms (properties).
+  The *sample space* $Omega$ is the set of all possible outcomes of the
+  experiment.
+]
+
+#definition[
+  Elements of $Omega$ are called *sample points*.
+]
+
+#definition[
+  Subsets of $Omega$ are called *events*. The collection of events (in other
+  terms, the power set of $Omega$) in $Omega$ is denoted by $cal(F)$.
+]
+
+#definition[
+  The *probability measure*, or probability distribution, or simply probability s a function $P$.
+
+  Let $P : cal(F) -> RR$ be a function satisfying the following axioms (properties).

  + $P(A) >= 0, forall A$
  + $P(Omega) = 1$
@ -267,6 +292,15 @@ course uses.
    $ P(union.big_(i=1)^infinity A_i) = sum_(i=1)^infinity P(A_i) $
 ]

+The 3-tuple $(Omega, cal(F), P)$ is called a *probability space*.
+
+#remark[
+  In more advanced texts you will see $Omega$ introduced as a so-called
+  $sigma$-algebra. A $sigma$-algebra on a set $Omega$ is a nonempty collection
+  $Sigma$ of subsets of $Omega$ that is closed under set complement, countable
+  unions, and as a corollary, countable intersections.
+]
+
 Now let us show various results with $P$.

 #proposition[
@ -450,3 +484,111 @@ Properties of the #smallcaps[pdf]:
 #example[
  Waiting time for bus: $Omega = {s : s >= 0}$.
 ]
+
+= Notes on counting
+
+The cardinality of $A$ is given by $hash A$. Let us develop methods for finding
+$hash A$ from a description of the set $A$ (in other words, methods for
+counting).
+
+== General multiplication principle
+
+#fact[
+  Let $A$ and $B$ be finite sets, $k in ZZ^+$. Then let $f : A -> B$ be a
+  function such that each element in $B$ is the image of exactly $k$ elements
+  in $A$ (such a function is called _$k$-to-one_). Then $hash A = k dot hash
+  B$.
+]<ktoone>
+
+#example[
+  Four fully loaded 10-seater vans transported people to the picnic. How many
+  people were transported?
+
+  By @ktoone, we have $A$ is the set of people, $B$ is the set of vans, $f : A -> B$ maps a person to the van they ride in. So $f$ is a 10-to-one function, $hash A = 40$, $hash B = 4$, and clearly the answer is $10 dot 4 = 40$.
+]
+
+#definition[
+  An $n$-tuple is an ordered sequence of $n$ elements.
+]
+
+Many of our methods in probability rely on multiplying together multiple
+outcomes to obtain their combined amount of outcomes. We make this explicit below in @tuplemultiplication.
+
+#fact[
+  Suppose a set of $n$-tuples $(a_1, ..., a_n)$ obeys these rules:
+
+  + There are $r_1$ choices for the first entry $a_1$.
+  + Once the first $k$ entries $a_1, ..., a_k$ have been chosen, the number of alternatives for the next entry $a_(k+1)$ is $r_(k+1)$, regardless of the previous choices.
+
+  Then the total number of $n$-tuples is the product $r_1 dot r_2 dot r_2 dot dots dot r_n$.
+]<tuplemultiplication>
+
+#proof[
+  It is trivially true for $n = 1$ since you have $r_1$ choices of $a_1$ for a
+  1-tuple $(a_1)$.
+
+  Let $A$ be the set of all possible $n$-tuples and $B$ be the set of all
+  possible $(n+1)$-tuples. Now let us assume the statement is true for $A$.
+  Proceed by induction on $B$, noting that for each $n$-tuple in $A$, $(a_1,
+  ..., a_n)$, we have $r_(n+1)$ tuples in $A$.
+
+  Let $f : B -> A$ be a function which takes each $(n+1)$-tuple and truncates the $a_(n+1)$ term, leaving us with just an $n$-tuple of the form $(a_1, a_2, ..., a_n)$.
+  $ f((a_1, ..., a_n, a_(n + 1))) = (a_1, ..., a_n) $
+  Now notice that $f$ is precisely a $r_(n+1)$-to-one function! Recall by
+  our assumption that @tuplemultiplication is true for $n$-tuples, so $A$ has $r_1 dot
+  r_2 dot ... dot r_n$ elements, or $hash A = r_1 dot ... dot r_n$. Then by
+  @ktoone, we have $hash B = hash A dot r_(n+1) = r_1 dot r_2 dot
+  ... dot r_(n+1)$. Our induction is complete and we have proved @tuplemultiplication.
+]
+
+@tuplemultiplication is sometimes called the _general multiplication principle_.
+
+We can use @tuplemultiplication to derive counting formulas for various
+situations. Let $A_1, A_2, A_n$ be finite sets. Then as a corollary of
+@tuplemultiplication, we can count the number of $n$-tuples in a finite
+Cartesian product of $A_1, A_2, A_n$.
+
+#fact[
+  Let $A_1, A_2, A_n$ be finite sets. Then
+
+  $
+    hash (A_1 times A_2 times ... times, A_n) = (hash A_1) dot (hash A_2) dot ... dot (hash A_n) = Pi^n_(i=1) (hash A_i)
+  $
+]
+
+#example[
+  How many distinct subsets does a set of size $n$ have?
+
+  The answer is $2^n$. Each subset can be encoded as an $n$-tuple with entries 0
+  or 1, where the $i$th entry is 1 if the $i$th element of the set is in the
+  subset and 0 if it is not.
+
+  Thus the number of subsets is the same as the cardinality of
+  $ {0,1} times ... times {0,1} = {0,1}^n $
+  which is $2^n$.
+
+  This is why given a set $X$ with cardinality $aleph$, we write the
+  cardinality of the power set of $X$ as $2^aleph$.
+]
+
+== Permutations
+
+Now we can use the multiplication principle to count permutations.
+
+#fact[
+  Consider all $k$-tuples $(a_1, ..., a_k)$ that can be constructed from a set $A$ of size $n, n>= k$ without repetition. The total number of these $k$-tuples is
+  $ (n)_k = n dot (n - 1) ... (n - k + 1) = n! / (n-k)! $
+
+  In particular, with $k=n$, each $n$-tuple is an ordering or _permutation_ of $A$. So the total number of permutations of a set of $n$ elements is $n!$.
+]
+
+#proof[
+  We construct the $k$-tuples sequentially. For the first element, we choose
+  one element from $A$ with $n$ alternatives. The next element has $n - 1$
+  alternatives. In general, after $j$ elements are chosen, there are $n - j +
+  1$ alternatives.
+
+  Then clearly after choosing $k$ elements for our $k$-tuple we have by
+  @tuplemultiplication the number of $k$-tuples being $n dot (n - 1) dot ...
+  dot (n - k + 1) = (n)_k$.
+]
--- a/documents/by-course/pstat-120a/hw1/main.typ
+++ b/documents/by-course/pstat-120a/hw1/main.typ
@ -73,3 +73,26 @@
        $ {{x_1, x_2, x_3, x_4} : x_i >= 0, i = 1,...,6 sum_(j=1)^4 x_j = 6} $
      ]
  ]
+
+ #[
+    #set enum(numbering: "a)", spacing: 2em)
+
+    + #[
+        We want to determine how many ways to choose 8 people from 27 people, or $vec(27,8) = 2220075$.
+      ]
+    + #[
+        This is the same as the choosing 4 of the 12 men and 4 of the 15 women, and pairing each group of men with each group of women once. So,
+        $ vec(12,4) times vec(15, 4) = 675675 $
+      ]
+    + #[
+        First we determine the amount of ways to choose less than 2 women.
+
+        $ vec(15, 0) vec(12, 8) + vec(15, 1) times vec(12,7) $
+        Then the total amount of ways to choose 8 people, from part a, is $vec(27,8)$.
+
+        Then the chance of forming a committee with less than 2 women is
+        $ (vec(15, 0) vec(12, 8) + vec(15, 1) vec(12,7)) / vec(27,8) $
+        So our final answer is
+        $ 1 - (vec(15, 0) vec(12, 8) + vec(15, 1) vec(12,7)) / vec(27,8) $
+      ]
+  ]