Random variables#

What you will learn

  • Summing independent random variables produces another random variable called the sample sum. The sample mean computed from samples differs from the population mean, or expectation, which is an exact quantity we aim to approximate by sampling.

  • The Law of Large Numbers states that as the number of samples \(N\) grows, the sample mean approaches the population mean, with a standard deviation falling off as \(N^{-1/2}\).

  • The Central Limit Theorem (CLT) tells us that summing independent and identically distributed random variables with well-defined means and variances results in a Gaussian distribution, regardless of the distribution of the individual variables.

  • The random walk model describes the erratic, unpredictable motion of atoms and molecules, providing a fundamental model for diffusion processes and molecular motion in fluids. The number of steps to the right (or left) taken by a 1D random walker follows a binomial probability distribution. Following the CLT, in the large-\(N\) limit the binomial distribution is well approximated by a Gaussian with the same mean and variance.

Introducing random variables#


Fig. 4 A random variable is what we interact with in experiments and simulations to infer probability distributions over the sample space.#

  • A random variable \(X\) is a variable whose value depends on the outcome of an experiment or simulation.

    • \(X(\omega)\) is a function of the possible outcomes \(\omega \in \Omega\) of the sample space.

    • For a coin toss, \(\Omega=\{H,T\}\), \(X(H)=+1\), and \(X(T)=-1\). Every time the experiment is done, \(X\) returns either +1 or -1. We can also make functions of random variables, e.g., every time \(X=+1\) we earn 25 cents, etc.

  • Random variables are classified into two main types: discrete and continuous.

    • Discrete Random Variable: It takes a countable number of distinct values. Discrete random variables are used to model scenarios where outcomes can be counted, such as the number of particles emitted by a radioactive source in a given time interval or the number of photons hitting a detector in a certain period.

    • Continuous Random Variable: It can take any value within a continuous range. These variables describe quantities that can vary smoothly, such as the position of a particle in space, the velocity of a molecule in a gas, or the energy levels of an atom.

Random Numbers in Python#

  • The numpy.random module provides highly efficient random number generators, implemented in optimized C code for fast performance.

  • The most commonly used random number generators in NumPy are:

    • np.random.rand() – Generates uniform random numbers in the interval \([0, 1)\).

    • np.random.randn() – Generates standard normal (Gaussian) random numbers with a mean of 0 and variance of 1.

  • Since random numbers are inherently unpredictable, running the same code multiple times will produce different results. To ensure reproducibility, you can set a fixed random seed before generating random numbers using:

    np.random.seed(8376743)
    

    Setting the seed ensures that the same sequence of random numbers is generated each time the code runs.
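
As a minimal illustration, re-seeding the generator before each draw reproduces the same values:

import numpy as np

np.random.seed(8376743)
first = np.random.rand(3)

np.random.seed(8376743)   # reset to the same seed
second = np.random.rand(3)

print(np.allclose(first, second))  # True: the two sequences are identical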

import numpy as np
import matplotlib.pyplot as plt

X = np.random.rand(50)

print(X)
plt.plot(X, '-o')
[0.65172823 0.50155284 0.6786157  0.04338033 0.83599233 0.52137073
 0.17692283 0.96807033 0.52418201 0.13436554 0.80020977 0.46708702
 0.97218002 0.07619816 0.10884647 0.21221267 0.45563034 0.49128016
 0.48459624 0.43671982 0.1133161  0.49002044 0.80578872 0.31702386
 0.15459046 0.58393091 0.603391   0.65902294 0.4698574  0.94236801
 0.14999757 0.29081914 0.91419066 0.21310008 0.74203299 0.64148515
 0.04746033 0.04421533 0.51246646 0.1587099  0.98256732 0.97369909
 0.04917204 0.56436587 0.66931144 0.81625581 0.3898311  0.09385271
 0.76919043 0.85901324]

Probability Distribution of a Random Variable#

  • For any random variable \( X \), we are interested in finding the probability distribution over its possible values \( x \), denoted as \( p_X(x) \).

  • It is important to distinguish between:

    • \( x \), which represents a specific value the variable can take (e.g., \( 1,2, \dots, 6 \) for a die).

    • \( X \), which is the random variable itself, generating values \(x\) according to the probability distribution \(p(x)\).

What is a Histogram

  • A histogram provides an empirical estimate of a distribution by grouping data into bins and counting occurrences within each bin.

  • For continuous distributions, histograms approximate the probability density function (PDF).

  • For discrete distributions, histograms approximate the probability mass function (PMF).

  • The choice of bin width significantly impacts visualization:

    • Too few bins can obscure details. Too many bins can introduce noise, making patterns less clear.

Histogramming in numpy

  • np.histogram(X, bins=20): Divides the range of values of X into, e.g., 20 bins and counts how many data points fall into each bin. np.histogram returns, in order:

    • counts: The number of values in each bin.

    • bin_edges: The boundaries of each bin (one more edge than there are bins).

Visualization

  • plt.bar(...): Plots the histogram using a bar chart.

  • plt.hist(): Can directly compute and plot the histogram of a random variable.

  • The Seaborn library provides convenient visualization tools for random numbers. For example, sns.histplot(np.random.randn(1000), kde=True) can be used to visualize the distribution of 1000 normally distributed random numbers with a smooth density curve.

import numpy as np
import matplotlib.pyplot as plt

# Generate 1000 random values from a uniform distribution
X = np.random.rand(1000)

# Compute the histogram using NumPy
counts, bin_edges = np.histogram(X, bins=20)

# Print the histogram data
print("Bin edges:", bin_edges)
print("Counts per bin:", counts)

# Plot the histogram
plt.bar(bin_edges[:-1], counts, width=np.diff(bin_edges), edgecolor='black', alpha=0.7)
plt.xlabel("Value")
plt.ylabel("Count")
plt.title("Histogram of a Random Variable")
plt.show()
Bin edges: [9.27239968e-04 5.06536237e-02 1.00380008e-01 1.50106391e-01
 1.99832775e-01 2.49559159e-01 2.99285543e-01 3.49011926e-01
 3.98738310e-01 4.48464694e-01 4.98191078e-01 5.47917461e-01
 5.97643845e-01 6.47370229e-01 6.97096613e-01 7.46822997e-01
 7.96549380e-01 8.46275764e-01 8.96002148e-01 9.45728532e-01
 9.95454915e-01]
Counts per bin: [61 52 43 40 51 59 59 56 38 44 42 52 52 49 61 55 52 47 48 39]
import seaborn as sns

sns.histplot(np.random.rand(1000), kde=True)
import numpy as np
import matplotlib.pyplot as plt
from scipy.stats import norm, poisson

# Generate data for continuous distribution (Normal)
np.random.seed(42)
x_continuous = np.random.normal(loc=0, scale=1, size=1000)

# Generate data for discrete distribution (Poisson)
x_discrete = np.random.poisson(lam=3, size=1000)

# Define x values for theoretical curves
x_cont_range = np.linspace(-4, 4, 1000)
x_disc_range = np.arange(0, 10)

# Plot histograms
fig, axes = plt.subplots(1, 2, figsize=(12, 5))

# Continuous distribution (Normal)
axes[0].hist(x_continuous, bins=30, density=True, alpha=0.6, color='b', edgecolor='black', label="Histogram")
axes[0].plot(x_cont_range, norm.pdf(x_cont_range, loc=0, scale=1), 'r-', lw=2, label="PDF")
axes[0].set_title("Continuous Distribution (Normal)")
axes[0].set_xlabel("Value")
axes[0].set_ylabel("Density")
axes[0].legend()

# Discrete distribution (Poisson)
axes[1].hist(x_discrete, bins=np.arange(11)-0.5, density=True, alpha=0.6, color='g', edgecolor='black', label="Histogram")
axes[1].scatter(x_disc_range, poisson.pmf(x_disc_range, mu=3), color='r', label="PMF", zorder=3)
axes[1].set_title("Discrete Distribution (Poisson)")
axes[1].set_xlabel("Value")
axes[1].set_ylabel("Probability")
axes[1].legend()

plt.tight_layout()
plt.show()

Expectation and Variance#

  • The expectation of a random variable, \( E[x] \), represents the theoretical mean, distinguishing it from the sample mean computed in simulations.

  • For example, consider the difference between:

    • The average height of people computed from a sample of cities.

    • The true mean height of the entire world population.

  • As the sample size increases, the sample mean converges to the expectation.

  • Expectation can be applied to the variable itself or to any function \(f(x)\) of it.

  • An important expectation is that of the squared deviation from the mean, \((x - E[x])^2\), which defines the variance and quantifies the fluctuations of \(x\).

Expectation of a Random Variable

\[ E[f(x)] = \int f(x) \cdot p(x) \,dx \]
\[ E[x] = \int x \cdot p(x) \,dx = \mu \]

Variance as the Expectation of Mean Fluctuations

\[ V[x] = E[(x - E[x])^2] = E[x^2] - E[x]^2 = \sigma^2 \]
  • We often use the shorthand notation for variance:
    \(\sigma^2 = V[x]\), where \( \sigma \) is the standard deviation.
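
As a quick numerical sanity check of these definitions (a minimal sketch using uniform samples, for which the exact values are \(E[x] = 1/2\) and \(V[x] = 1/12\)):

import numpy as np

x = np.random.rand(100000)   # samples from U(0, 1)

# Sample estimates converge to the exact expectation and variance
print(x.mean(), 1/2)    # sample mean vs E[x]
print(x.var(), 1/12)    # sample variance vs V[x]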

Binomial#

  • An example of a discrete distribution is the binomial, defined by its Probability Mass Function (PMF):

\[P(n |p, N) = \frac{N!}{(N-n)! n!}p^n (1-p)^{N-n}\]
  • \(E[n] = Np\)

  • \(V[n] = Np(1-p)\)

Random Variable

  • \(B(N, p)\) is modeled by np.random.binomial(N, p, size)

r = np.random.binomial(n=1, p=0.6, size=2000) 

fig, ax = plt.subplots(ncols=2) 
ax[0].plot(r,  color='blue', label='trajectory')
ax[1].hist(r,  density=True, color='red',  label = 'histogram')


ax[0].set_xlabel('Samples of RN')
ax[0].set_ylabel('Values of RN')

ax[1].set_xlabel('Values of RN')
ax[1].set_ylabel('Probability Density')
fig.legend();
fig.tight_layout()
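A quick numerical check of the binomial mean and variance formulas (a minimal sketch; N=10 and p=0.6 are illustrative choices):

import numpy as np

N, p = 10, 0.6
r = np.random.binomial(N, p, size=100000)

print(r.mean(), N * p)            # E[n] = Np
print(r.var(), N * p * (1 - p))   # V[n] = Np(1-p)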

Gaussian#

  • An example of a continuous distribution is the Gaussian, defined by its Probability Density Function (PDF):

\[P(x |\mu, \sigma) = \frac{1}{\sigma \sqrt{2\pi}}e^{-\frac{(x-\mu)^2}{2\sigma^2}}\]
  • \(E[x] = \mu\)

  • \(V[x] = \sigma^2\)

Random Variable

  • \(N(\mu, \sigma^2)\) is modeled by np.random.normal(loc, scale, size=(N, M)), where scale is the standard deviation \(\sigma\), not the variance

  • \(N(0, 1)\) modeled by np.random.randn(N, M, P, ...)

# For a standard normal with sigma=1, mu=0
r = np.random.randn(200)

fig, ax = plt.subplots(ncols=2) 
ax[0].plot(r,  color='blue', label='trajectory')
ax[1].hist(r,  density=True, color='red',  label = 'histogram')


ax[0].set_xlabel('Samples of RN')
ax[0].set_ylabel('Values of RN')

ax[1].set_xlabel('Values of RN')
ax[1].set_ylabel('Probability Density')
fig.legend();
fig.tight_layout()

Uniform Distribution#

  • A simple example of a continuous distribution is the Uniform distribution, where all values within a given range are equally likely. It is defined by the Probability Density Function (PDF):

\[\begin{split} P(x | a, b) = \begin{cases} \frac{1}{b - a}, & a \leq x \leq b \\ 0, & \text{otherwise} \end{cases} \end{split}\]
  • Expectation and Variance:

    • \(E[x] = \frac{a + b}{2}\)

    • \(V[x] = \frac{(b - a)^2}{12}\)

Random Variable

  • \(U(a, b)\) is modeled by:

    np.random.uniform(low, high, size=(N, M))
    
  • \(U(0,1)\) (standard uniform) is modeled by:

    np.random.rand(N, M, P, ...)
    
# For a standard uniform
r = np.random.random(200)

fig, ax = plt.subplots(ncols=2) 
ax[0].plot(r,  color='blue', label='trajectory')
ax[1].hist(r,  density=True, color='red',  label = 'histogram')


ax[0].set_xlabel('Samples of RN')
ax[0].set_ylabel('Values of RN')

ax[1].set_xlabel('Values of RN')
ax[1].set_ylabel('Probability Density')
fig.legend();
fig.tight_layout()

Exact vs sampled probability distributions#

# Simplified and optimized version of the code
import numpy as np
import matplotlib.pyplot as plt
from scipy.stats import norm

# Generate x values and compute standard normal PDF
x = np.linspace(-4, 4, 200)
px = norm.pdf(x)

# Generate random samples
r = np.random.randn(1000)

# Plot histogram and theoretical normal distribution
plt.hist(r, bins=30, density=True, alpha=0.6, color='blue', edgecolor='black', 
         label=f'Sampled: mean={r.mean():.2f}, var={r.var():.2f}')
         
plt.plot(x, px, 'k-', linewidth=2, label='Exact: mean=0, var=1')

# Formatting
plt.legend(loc="upper left", fontsize=10)
plt.xlabel(r'$x$', fontsize=14)
plt.ylabel(r'$p(x)$', fontsize=14)
plt.title("Comparison of Sampled Data with Normal Distribution", fontsize=12)
plt.grid(alpha=0.3)
plt.show()

Transforming Random Variables#

  • When a random variable \(X \) is transformed by adding, multiplying by a constant, or applying a function \( Y = f(X) \), its probability distribution changes accordingly from \( p(x) \) to \( p(y) \).

  • Two commonly used transformations involve multiplying or adding constants to random variables:

    • Generating a Gaussian (Normal) distribution from a standard normal:

      \[ N(\mu, \sigma^2) = \mu + \sigma \cdot N(0,1) \]
    • Generating a Uniform distribution from a standard uniform:

      \[ U(a, b) = (b - a) \cdot U(0,1) + a \]
import numpy as np
import matplotlib.pyplot as plt

# Parameters
mu, sigma = 5, 2  # Mean and standard deviation for Gaussian
a, b = 2, 8       # Bounds for Uniform

# Generate standard distributions
std_normal = np.random.randn(10000)  # N(0,1)
std_uniform = np.random.rand(10000)  # U(0,1)

# Transform distributions
normal_dist = mu + sigma * std_normal  # N(mu, sigma^2)
uniform_dist = a + (b - a) * std_uniform  # U(a, b)

# Plot Distributions
fig, axes = plt.subplots(1, 2, figsize=(12, 5))

# Normal Distribution
axes[0].hist(normal_dist, bins=40, density=True, alpha=0.6, color='b', edgecolor='black')
axes[0].set_title(f"Transformed Normal Distribution N({mu}, {sigma}²)")
axes[0].set_xlabel("x")
axes[0].set_ylabel("Density")

# Uniform Distribution
axes[1].hist(uniform_dist, bins=20, density=True, alpha=0.6, color='g', edgecolor='black')
axes[1].set_title(f"Transformed Uniform Distribution U({a}, {b})")
axes[1].set_xlabel("x")
axes[1].set_ylabel("Density")

plt.tight_layout()
plt.show()

Sum of Two Random Variables#

  • Consider the sum of two random variables, such as:

    • The sum of numbers obtained from rolling two dice.

    • The sum of two coin flips (e.g., heads = 1, tails = 0).

    • The sum of kinetic energies of molecules in an ideal gas.

\[ X = X_1 + X_2 \]
  • The sum of random variables is itself a random variable!

  • We want to understand how to describe the properties of summed random variables, as they offer a prototype of how large systems emerge from small components.

  • Given the probability distributions of \(X_1\) and \(X_2\), how do we find the probability distribution of \(X\)?

Expectation and Variance of the Sum#

  • Expectation is always a linear operator, which follows from the definition of expectation and the linearity of integration:

\[ E[X_1 + X_2] = E[X_1] + E[X_2] \]
  • However, variance is not generally a linear operator. To see this, let us first write the explicit formula:

\[ V[X_1 + X_2] = E\left[(X_1 + X_2 - E[X_1 + X_2])^2\right] \]
  • Defining the mean-subtracted variables \(Y_i = X_i - E[X_i]\), we express the variance of the sum in terms of the variances of the component random variables:

\[ V[X_1 + X_2] = E\left[(X_1 - E[X_1] + X_2 - E[X_2])^2\right] = E\left[(Y_1 + Y_2)^2\right] \]
  • Since \( V[X_i] = E[Y_i^2] \), this simplifies to:

\[ V[X_1 + X_2] = E[Y^2_1] + E[Y^2_2] + 2E[Y_1 Y_2] = V[X_1] + V[X_2] + 2\,\text{Cov}[X_1, X_2] \]
  • The cross term is called the covariance, which measures the degree to which two random variables vary together.

  • To obtain a scale-independent measure, we define the correlation coefficient \(\text{Corr}\), whose sign shows whether the correlation is positive or negative.

Covariance and Correlation of Two Random Variables

\[ \text{Cov}[X_1, X_2] = E[(X_1 - E[X_1])(X_2 - E[X_2])] \]
\[ \text{Corr}[X_1, X_2] = \frac{\text{Cov}[X_1, X_2]}{\sigma_{X_1} \sigma_{X_2}} \]
  • In the special case where \(X_1\) and \(X_2\) are statistically independent, the covariance (and hence the correlation) is zero, and we have additivity of variances!

\[V[X_1+X_2] = V[X_1]+V[X_2]\]
  • This result is fundamental in statistical mechanics, probability theory, and the sciences, as it explains why variances add for independent random variables.
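
We can check the additivity of variances numerically, and see how the covariance term enters for correlated variables (a minimal sketch; the correlation of 0.8 is an illustrative choice):

import numpy as np

x1 = np.random.randn(100000)
x2 = np.random.randn(100000)   # independent of x1

# Independent case: variances simply add
print(np.var(x1 + x2), np.var(x1) + np.var(x2))

# Correlated case: the covariance term is needed
rho = 0.8
x3 = rho * x1 + np.sqrt(1 - rho**2) * np.random.randn(100000)
print(np.var(x1 + x3))                                       # larger than 2
print(np.var(x1) + np.var(x3) + 2 * np.cov(x1, x3)[0, 1])    # matches once Cov is included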

import seaborn as sns

# Generate random data with different correlation coefficients
np.random.seed(42)
num_points = 300

# Define correlation levels
correlations = [0.0, 0.3, 0.6, 0.9]
fig, axes = plt.subplots(1, len(correlations), figsize=(16, 4))

for i, corr in enumerate(correlations):
    mean = [0, 0]
    cov_matrix = [[1, corr], [corr, 1]]  # Covariance matrix based on correlation
    data = np.random.multivariate_normal(mean, cov_matrix, num_points)
    
    ax = axes[i]
    sns.scatterplot(x=data[:, 0], y=data[:, 1], alpha=0.6, ax=ax, edgecolor=None)
    ax.set_title(f'Correlation = {corr}', fontsize=12)
    ax.set_xlabel('X')
    ax.set_ylabel('Y')
    ax.set_xlim(-3, 3)
    ax.set_ylim(-3, 3)
    
# Overall title
plt.suptitle("Illustration of Correlation Coefficient Between Two Random Variables", fontsize=14)
plt.tight_layout(rect=[0, 0.03, 1, 0.95])

# Show the plot
plt.show()

Sum of \( N \) Random Variables#

  • Consider a sequence of independent and identically distributed (i.i.d.) random variables, \( X_1, X_2, \ldots, X_n \).

  • Since they are identically distributed, each variable has a well-defined mean \( \mu \) and variance \( \sigma^2 \).

  • Our goal is to understand how the sum and mean of these variables depend on the sample size \( n \).

  • For convenience, we also introduce the zero-mean random variables \(Y_i = X_i - E[X_i]\), which satisfy \(E[Y_i] = 0\).

Sample Sum and Sample Mean

\[ S_n = \sum_{i=1}^{n} X_i, \quad M_n = \frac{1}{n} \sum_{i=1}^{n} X_i \]
  • \( S_n \) is the sample sum, and \( M_n \) is the sample mean.

  • These quantities fluctuate with the sample size \( n \), but we expect them to converge to their expectations for large \( n \).

Mean and Variance of the Sum of i.i.d. Random Variables

  • Expectation of the Sum:

\[ E[S_n] = E\left[ \sum_{i=1}^{n} X_i \right] = \sum_{i=1}^{n} E[X_i] = n\mu \]
  • Variance of the Sum:

\[ V[S_n] = E\left[ (S_n - n\mu)^2 \right] = E\Bigg[\Bigg(\sum_{i=1}^{n} Y_i \Bigg)^2\Bigg] = \sum_{i=1}^{n} \sum_{j=1}^{n} E[Y_i Y_j] = \sum_{i=1}^{n} V[X_i] = n\sigma^2 \]
  • The cross terms \(E[Y_i Y_j]\) with \(i \neq j\) vanish by independence, leaving only the \(n\) variance terms.
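
A quick numerical check of both results (a minimal sketch using uniform variables, for which \(\mu = 1/2\) and \(\sigma^2 = 1/12\)):

import numpy as np

n, trials = 100, 50000
S = np.random.rand(trials, n).sum(axis=1)   # sample sums of n i.i.d. U(0,1) variables

print(S.mean(), n * 0.5)   # E[S_n] = n*mu
print(S.var(), n / 12)     # V[S_n] = n*sigma^2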

Law of Large Numbers#

  • For the sample mean, the result for the summation of i.i.d. variables implies

\[ E[M_n] = \frac{1}{n} E[S_n] = \mu \]
\[ V[M_n] = \frac{1}{n^2} V[S_n] = \frac{\sigma^2}{n} \]
  • Thus, the sample mean is an unbiased estimator of \( \mu \), and its variance decreases as \( 1/n \), meaning that the estimate becomes more stable as \( n \) increases.

Law of Large Numbers (LLN)

\[ E[M_n] \to \mu \]
\[V[M_n] \to \sigma^2 / n\]

Implication:

  • The sample mean provides a reliable estimate of \(\mu\) for large \(n\).

  • The variance of \(M_n\) decreases as \(1/n\), meaning fluctuations (the standard deviation) shrink as \(1/\sqrt{n}\).

  • This justifies ensemble averaging in statistical mechanics, ensuring macroscopic observables (e.g., temperature, pressure) are stable and predictable.

import numpy as np
import matplotlib.pyplot as plt

# Number of trials and runs
N, runs = int(1e5), 30

# Store fractions of heads for each trial in each run
fractions = np.zeros((runs, N))

# Simulate coin tosses
for run in range(runs):
    # Generate coin tosses (0 for tails, 1 for heads)
    tosses = np.random.randint(2, size=N)
    # Calculate cumulative sum to get the number of heads up to each trial
    cum_heads = np.cumsum(tosses)
    # Calculate fraction of heads up to each trial
    fractions[run, :] = cum_heads / np.arange(1, N+1)

# Plotting
plt.figure(figsize=(14, 8))

# Plot all runs with low opacity
for run in range(runs):
    plt.plot(fractions[run, :], color='grey', alpha=0.3)

# Highlight first run
plt.semilogx(fractions[0, :], color='blue', linewidth=2, label='Highlighted Run')

# Expected value line
plt.axhline(y=0.5, color='red', linestyle='--', label='Expected Value (0.5)')
plt.xlabel('Number of Trials')
plt.ylabel('Fraction of Heads')
plt.title('Law of Large Numbers: Fraction of Heads in Coin Tossing')
plt.legend()

The Central Limit Theorem (CLT)#

  • The Central Limit Theorem asserts that the probability density function (PDF) of a sum of i.i.d. random variables approaches a Gaussian distribution with mean \(n\mu\) and variance \(n\sigma^2\).

  • Note that the CLT rests on the assumption that the mean and variance, \(\mu\) and \(\sigma^2\), are finite! Thus, the CLT does not hold for certain power-law distributed random variables.

Central Limit Theorem (CLT)

  • The sum of any i.i.d. variables (even if they are not Gaussian) tends to a normally distributed random variable (the sum \(S_n\)):

\[X_1 +X_2+...+X_n \rightarrow N(n\mu, n\sigma^2)\]
  • The probability density function (PDF) of \(S_n\) approaches a Gaussian:

\[p(s) = \frac{1}{(2\pi n\sigma^2)^{1/2}}e^{-\frac{(s-n\mu)^2}{2 n\sigma^2}}\]
import numpy as np
import matplotlib.pyplot as plt
from mpl_toolkits.mplot3d import Axes3D

# Parameters
num_samples = 10000  # Number of samples per distribution
num_bins = 50  # Number of bins for histogram
sample_sizes = [1, 2, 5, 10, 30]  # Different sample sizes
bin_edges = np.linspace(0, 1, num_bins + 1)

# Create a figure with a 3D axis
fig = plt.figure(figsize=(10, 7))  # Initialize the figure
ax = fig.add_subplot(111, projection='3d')

# Generate distributions and plot stacked histograms
for i, n in enumerate(sample_sizes):
    samples = np.mean(np.random.uniform(0, 1, (num_samples, n)), axis=1)  # Compute sample means

    hist, bins = np.histogram(samples, bins=bin_edges, density=True)

    # Centers of bins
    bin_centers = (bins[:-1] + bins[1:]) / 2

    # Offset along the y-axis for stacking
    y_offset = i  

    # Plot histogram as bars
    ax.bar(bin_centers, hist, zs=y_offset, zdir='y', alpha=0.7, width=0.02, edgecolor='black', label=f'n={n}')

# Labels and title
ax.set_xlabel('Sample Mean Bins', fontsize=12)
ax.set_ylabel('Sample Size (n)', fontsize=12)
ax.set_zlabel('Probability Density', fontsize=12)
ax.set_yticks(range(len(sample_sizes)))
ax.set_yticklabels([f'n={n}' for n in sample_sizes])
ax.set_title("Illustrating CLT with sum of Uniform Random Numbers", fontsize=14)

plt.show()

Standardized random variables#

\[S_n = \sum^{n}_{i=1} X_i \rightarrow N(n\mu, n\sigma^2)\]
  • The CLT motivates us to consider subtracting the mean from the sum and dividing by \(\sigma n^{1/2}\), which gives another Gaussian-distributed variable, a simpler one with no parameters!

\[\frac{N(n\mu, n\sigma^2) - n\mu}{\sigma n^{1/2}} = N(0,1)\]
  • The process of de-meaning and scaling a random variable by \(\sigma\) is called standardization, which gives standard random variables defined by \(E[X]=0\) and \(V[X]=1\).

  • We can standardize variables either before or after summing:

\[Z_i = (X_i - \mu)/\sigma\]
\[S'_n = \frac{S_n - n\mu}{n^{1/2}\sigma}\]
  • Notice again that we are dividing the sum by \(n^{1/2}\). If we were instead to divide the sum by \(n\), we would obtain the sample mean, whose variance goes to zero; this is the Law of Large Numbers.

from scipy.stats import norm
import numpy as np
import matplotlib.pyplot as plt

# Number of coin tosses in each experiment, number of experiments
N, runs = 1000, 10000  

# Simulate coin tosses: num_experiments rows, num_tosses_per_experiment columns
tosses = np.random.randint(2, size=(N, runs))

# Calculate sums of each run
Sn = np.sum(tosses, axis=0)

# Compute numerical mean and standard deviation of Sn
mu_Sn = np.mean(Sn)  
sigma_Sn = np.std(Sn, ddof=1)  # Using sample standard deviation (ddof=1)

# Compute z-scores numerically
z = (Sn - mu_Sn) / sigma_Sn

# Plot histogram of z-scores
plt.figure()
plt.hist(z, density=True, bins=40, alpha=0.6, label="Simulated")
plt.title('Distribution of Standardized Sums of Coin Tosses')
plt.xlabel('Standardized Sum (Z-score)')
plt.ylabel('Density')

# Overlay standard normal distribution
zs = np.linspace(z.min(), z.max(), 1000)
plt.plot(zs, norm.pdf(zs), 'k', label='N(0,1)')
plt.legend()
plt.show()

Simulating a 1D unbiased random walk#

  • Each random walker is modeled by random variables \(X_i\) that take the values +1 or -1 at every step. We will run N random walkers (columns) over n steps (rows).

  • We then take the cumulative sum over the n steps, thereby summing n random variables for each of the N walkers. This is done via the convenient np.cumsum() function.

import numpy as np
import matplotlib.pyplot as plt

def rw_1d(n, N):
    """
    Simulates a 1D symmetric random walk.

    Parameters:
    n (int): Number of steps.
    N (int): Number of walkers.

    Returns:
    np.ndarray: A (n+1, N) array where each column represents a walker's trajectory (including the initial zero position).
    """
    
    # Generate random steps (-1 or +1) for all walkers
    steps = np.random.choice([-1, 1], size=(n, N))
    
    # Compute cumulative sum to get displacement
    rw = np.cumsum(steps, axis=0)

    # Ensure the initial position is zero
    rw = np.vstack([np.zeros(N), rw])  # Adds a row of zeros at the start

    return rw

# Example usage: Simulate and plot a few random walks
n_steps = 1000
n_walkers = 3
rw = rw_1d(n_steps, n_walkers)

plt.plot(rw)
plt.ylabel('X (displacement)')
plt.xlabel('n (steps)')
plt.title('1D Random Walk')
plt.show()
import numpy as np
import matplotlib.pyplot as plt
from scipy import stats

# Simulate 1D random walk
def rw_1d(n_max, N):
    """Generates a 1D random walk for N walkers over n_max steps."""
    steps = np.random.choice([-1, 1], size=(n_max, N))  # Random steps
    return np.cumsum(steps, axis=0)  # Cumulative sum gives positions

# Parameters
n_max = 1000  # Number of steps
N_max = 1000  # Number of walkers
rw = rw_1d(n_max, N_max)

# Define time snapshots
snapshots = [10, 100, 500, 900]

# Create subplots
fig, axes = plt.subplots(nrows=2, ncols=len(snapshots), figsize=(15, 6), constrained_layout=True)

for i, n in enumerate(snapshots):
    # Plot random walk trajectories
    ax = axes[0, i]
    ax.plot(rw[:, :50], alpha=0.3)  # Show 50 trajectories for clarity
    ax.axvline(x=n, color='black', lw=2, linestyle="--")  # Mark current time step
    ax.set_xlabel('Step (n)')
    ax.set_ylabel('Position (X)')
    ax.set_title(f'Step n = {n}')

    # Histogram of positions at step n
    ax_hist = axes[1, i]
    ax_hist.hist(rw[n - 1, :], bins=30, color='orange', density=True, alpha=0.6, label=f'n = {n}')
    
    # Gaussian overlay
    x = np.linspace(-100, 100, 1000)
    y = stats.norm.pdf(x, loc=0, scale=np.sqrt(n))
    ax_hist.plot(x, y, color='black', lw=2, label='N(0, n)')

    ax_hist.set_xlim([-100, 100])
    ax_hist.legend()
    ax_hist.set_title(rf'$\sigma^2$ / n = {np.var(rw[n - 1, :]) / n:.3f}')

# Show plot
plt.show()
import numpy as np
import matplotlib.pyplot as plt

def rw_2d(n, N):
    """
    Simulates a 2D symmetric random walk.

    Parameters:
    n (int): Number of steps.
    N (int): Number of trajectories.

    Returns:
    np.ndarray: A (n+1, N, 2) array where each trajectory is stored in the last dimension.
    """
    
    # Define possible step directions (right, up, left, down)
    steps = np.array([(1, 0), (0, 1), (-1, 0), (0, -1)])
    
    # Generate random step indices and map to step directions
    random_steps = steps[np.random.choice(4, size=(n, N))]
    
    # Prepend an initial position at (0,0) for all walkers
    rw = np.zeros((n + 1, N, 2), dtype=int)
    rw[1:] = np.cumsum(random_steps, axis=0)  # Compute displacement over time

    return rw

# Example usage: Simulate and plot first three random walkers
n_steps = 1000
n_walkers = 100
traj = rw_2d(n_steps, n_walkers)

plt.plot(traj[:, :3, 0], traj[:, :3, 1])  # Plot first three random walkers
plt.xlabel('X (displacement)')
plt.ylabel('Y (displacement)')
plt.title('2D Random Walk')
plt.show()
# Import libraries for the 3D random walk animation
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.animation as animation
from mpl_toolkits.mplot3d import Axes3D
from IPython.display import HTML

# Parameters
num_steps = 200  # Number of steps in the random walk
step_size = 1  # Step size

# Generate random steps in 3D
np.random.seed(42)
steps = np.random.choice([-step_size, step_size], size=(num_steps, 3))  # Random steps in x, y, z
positions = np.cumsum(steps, axis=0)  # Cumulative sum to get positions

# Create figure
fig = plt.figure(figsize=(8, 6))
ax = fig.add_subplot(111, projection='3d')
ax.set_xlim([positions[:, 0].min()-1, positions[:, 0].max()+1])
ax.set_ylim([positions[:, 1].min()-1, positions[:, 1].max()+1])
ax.set_zlim([positions[:, 2].min()-1, positions[:, 2].max()+1])
ax.set_xlabel('X')
ax.set_ylabel('Y')
ax.set_zlabel('Z')
ax.set_title('3D Random Walk')

# Initialize the line and point
line, = ax.plot([], [], [], 'b-', lw=2)
point, = ax.plot([], [], [], 'ro')

# Update function for animation
def update(frame):
    line.set_data(positions[:frame+1, 0], positions[:frame+1, 1])
    line.set_3d_properties(positions[:frame+1, 2])
    point.set_data([positions[frame, 0]], [positions[frame, 1]])
    point.set_3d_properties([positions[frame, 2]])
    return line, point

# Create animation
ani = animation.FuncAnimation(fig, update, frames=num_steps, interval=50, blit=False)

# Convert animation to HTML
html_anim = HTML(ani.to_jshtml())
plt.close(fig)  # Prevent duplicate display

# Display the animation
html_anim

Problems#

Problem 1 Binomial as generator of Gaussian and Poisson distributions#

  • Show that in the large-\(N\) limit the binomial distribution tends to a Gaussian. Do this by expanding the logarithm of the binomial distribution, \(\log p(n)\), in a power series around its maximum and showing that terms beyond quadratic can be ignored.

  • In the limit \(N\rightarrow \infty\) with very small \(p \rightarrow 0\) such that \(\lambda = pN = \text{const}\), there is another distribution that better approximates the binomial: \(p(k)=\frac{\lambda^k}{k!}e^{-\lambda}\). It is known as the Poisson distribution.
    The Poisson distribution is an excellent approximation for the probabilities of rare events, such as infrequently firing neurons in the brain, radioactive decay events of plutonium, or rains in the desert.
    Derive the Poisson distribution by taking the limit \(p\rightarrow 0\) of the binomial distribution.

  • Using numpy and matplotlib, plot the binomial probability distribution against the Gaussian and Poisson distributions for different values of N = (10, 100, 1000, 10000).

  • For N = 10000, make four plots with p = 0.0001, 0.001, 0.01, 0.1. You can use the subplot functionality to make a pretty 4-column plot (see the plotting module), as in the sketch below.

fig, ax =  plt.subplots(nrows=1, ncols=4)
ax[0].plot()
ax[1].plot()
ax[2].plot()
ax[3].plot()
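
A possible starting point for the four-panel comparison (a sketch, assuming scipy.stats; the parameter values are those given in the problem):

import numpy as np
import matplotlib.pyplot as plt
from scipy.stats import binom, norm, poisson

N = 10000
fig, ax = plt.subplots(nrows=1, ncols=4, figsize=(16, 4))

for axis, p in zip(ax, [0.0001, 0.001, 0.01, 0.1]):
    mu, var = N * p, N * p * (1 - p)
    # Plot over a window of a few standard deviations around the mean
    n = np.arange(max(0, int(mu - 4 * np.sqrt(var))), int(mu + 4 * np.sqrt(var)) + 1)
    axis.plot(n, binom.pmf(n, N, p), 'o', label='Binomial')
    axis.plot(n, poisson.pmf(n, mu), 'x', label='Poisson')
    axis.plot(n, norm.pdf(n, loc=mu, scale=np.sqrt(var)), '-', label='Gaussian')
    axis.set_title(f'p = {p}')
    axis.legend()

plt.tight_layout()
plt.show()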

Problem-2 Confined diffusion.#

Simulate a 2D random walk in a circular confinement. Rewrite the 2D random walk code to simulate diffusion of a particle confined inside a circle. Study how the root mean square deviation of the position scales with time.

  • Carry out simulations for different confinement sizes.

  • Make plots of simulated trajectories.

Problem-3 Return to the origin!#

  • Simulate random walks in 1D and 2D for different numbers of steps \(N=10, 10^2, 10^3, 10^4, 10^5\).

  • Compute the average number of returns to the origin, \(\langle n_{orig} \rangle\); that is, the number of times a random walker returns to the origin \(0\) in 1D or \((0,0)\) in 2D. You may want to use some 1000 trajectories to obtain the average.

  • Plot how \(\langle n_{orig} \rangle\) depends on the number of steps \(N\) for the 1D and 2D walkers.

Problem-4 Breaking the CLT; Cauchy vs Normal random walk in 2D#

For this problem we are going to simulate two kinds of random walks in continuum space (not on a lattice): Lévy flights and normally distributed random walks.

To simulate a 2D continuum-space random walk, we need to generate random step sizes \(r_x\), \(r_y\). You will also need uniform random numbers to sample the angles in 2D, giving a continuum random walk in 2D space: \(x = r_x \sin\theta\) and \(y = r_y \cos\theta\).

  • Normal steps: \(r\sim N(0,1)\)

  • Cauchy-distributed steps (long tails, infinite variance): \(r\sim \text{Cauchy}(0,1)\)

  • Uniform angles: \(\theta \sim U(0, 2\pi)\)

Visualize the random walks using matplotlib and study the statistics of the random walkers in the same way as done for the normal random walk/Brownian motion examples!

Problem-5 Continuous time random walk (CTRW)#

Simulate a 1D random walk, but instead of taking steps at regular time intervals, draw the waiting times between steps from an exponential distribution.
Hint: you may want to use random variables from scipy.stats:

scipy.stats.expon

Study the root mean square deviation as a function of the decay parameter \(\lambda\) of the exponential distribution \(p(t) = \lambda e^{-\lambda t}\).
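
For drawing the waiting times, a minimal sketch (note that scipy.stats.expon is parameterized by scale, which equals \(1/\lambda\); the value of \(\lambda\) below is illustrative):

from scipy.stats import expon

lam = 2.0                                          # decay parameter lambda (illustrative)
waiting_times = expon.rvs(scale=1/lam, size=1000)  # waiting times between steps
event_times = waiting_times.cumsum()               # times at which the walker steps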