QuantumInfo.ClassicalInfo.Entropy

Shannon entropy

Definitions and facts about the Shannon entropy function -x*ln(x), both on a single variable and on a distribution.

There is significant overlap with `Real.negMulLog` and `Real.binEntropy` in Mathlib, and probably these files could be combined in some form.

14 declarations

definition

One-event Shannon entropy $H_1(p) = -p \ln p$

#H₁

The one-event entropy function $H_1: [0, 1] \to \mathbb{R}$ maps a probability $p$ to its Shannon entropy, defined as $H_1(p) = -p \ln p$ . Here, $p$ is an element of the unit interval $[0, 1]$ , and by convention (inherited from the underlying `negMulLog` function), $0 \ln 0 = 0$ .

definition

$H_1(0) = 0$

#H₁_zero_eq_zero

For the Shannon entropy function $H_1: [0, 1] \to \mathbb{R}$ defined by $H_1(p) = -p \ln(p)$ , the value of the function at $p = 0$ is equal to $0$ .

definition

$H_1(1) = 0$

#H₁_one_eq_zero

The Shannon entropy function $H_1$ evaluated at the probability $p = 1$ is equal to $0$ .

theorem

$H_1(p) \ge 0$ for $p \in [0, 1]$

#H₁_nonneg

Let $p \in [0, 1]$ be a probability. The Shannon entropy of $p$ , denoted by $H_1(p)$ , is non-negative, i.e., $0 \le H_1(p)$ .

theorem

$H_1(p) < 1$ for all $p \in \text{Prob}$

#H₁_le_1

For any probability $p$ in the unit interval $[0, 1]$ , the Shannon entropy $H_1(p)$ is strictly less than 1.

theorem

$H_1(p) \le 1/e$

#H₁_le_exp_m1

For any probability $p \in [0, 1]$ , the Shannon entropy $H_1(p)$ satisfies the inequality $H_1(p) \le e^{-1}$ , where $e$ is Euler's number.

theorem

Concavity of the Shannon Entropy $H_1$

#H₁_concave

For any probabilities $x, y \in [0, 1]$ and any weight $p \in [0, 1]$ , the Shannon entropy function $H_1$ is concave, satisfying the inequality: $p \cdot H_1(x) + (1 - p) \cdot H_1(y) \le H_1(p \cdot x + (1 - p) \cdot y)$ where $p[a \leftrightarrow b]$ denotes the convex combination $p \cdot a + (1 - p) \cdot b$ .

definition

Shannon entropy $H_s$ of a discrete distribution

#Hₛ

Let $\alpha$ be a finite set and $d$ be a probability distribution on $\alpha$ . The Shannon entropy $H_s(d)$ of the distribution is defined as the sum over all elements $x \in \alpha$ of the individual entropy contributions $H_1(p_x)$ , where $p_x$ is the probability assigned to $x$ by $d$ . Mathematically, this is expressed as: $H_s(d) = \sum_{x \in \alpha} H_1(d(x))$ where $H_1(p) = -p \ln(p)$ for $p \in [0, 1]$ .

theorem

The Shannon entropy $H_s(d)$ is non-negative

#Hₛ_nonneg

Let $\alpha$ be a finite set and let $d$ be a probability distribution on $\alpha$ . The Shannon entropy of the distribution, denoted by $H_s(d)$ , is non-negative, i.e., $0 \leq H_s(d)$ .

theorem

$H_s(d) \le \ln |\alpha|$

#Hₛ_le_log_d

Let $\alpha$ be a finite set and $d$ be a probability distribution on $\alpha$ . The Shannon entropy of the distribution, denoted $H_s(d)$ , is less than or equal to the natural logarithm of the cardinality of $\alpha$ , i.e., $H_s(d) \le \ln |\alpha|$ .

theorem

The Shannon entropy of a constant distribution is $0$

#Hₛ_constant_eq_zero

Let $\alpha$ be a type and $i \in \alpha$ be an element. Let $d = \delta_i$ be the constant probability distribution centered at $i$ (where $\delta_i(y) = 1$ if $y = i$ and $0$ otherwise). Then the Shannon entropy of this distribution is zero, i.e., $H(d) = 0$ .

theorem

$H_s(\text{uniform}) = \ln |\alpha|$

#Hₛ_uniform

Let $\alpha$ be a non-empty finite set. For the uniform probability distribution $P$ on $\alpha$ , the Shannon entropy $H_s(P)$ is equal to the natural logarithm of the cardinality of $\alpha$ , denoted $\ln |\alpha|$ .

theorem

$H_s(\text{coin } p) = H_b(p)$

#Hₛ_coin

Let $p \in [0, 1]$ be a probability. The Shannon entropy $H_s$ of a coin distribution (a two-event distribution) with success probability $p$ , denoted by $\text{ProbDistribution.coin } p$ , is equal to the binary entropy of $p$ , denoted by $H_b(p)$ (or `Real.binEntropy p`).

theorem

Shannon entropy $H_s$ is invariant under reordering of probabilities

#Hₛ_eq_of_multiset_map_eq

Let $d_1$ and $d_2$ be probability distributions on finite sets $\alpha$ and $\beta$ , respectively. If the multiset of probabilities occurring in $d_1$ is equal to the multiset of probabilities occurring in $d_2$ , then the Shannon entropy of $d_1$ is equal to the Shannon entropy of $d_2$ , denoted $H_s(d_1) = H_s(d_2)$ .

QuantumInfo.ClassicalInfo.Entropy

Shannon entropy

One-event Shannon entropy H1(p)=−pln⁡pH_1(p) = -p \ln pH1​(p)=−plnp

H1(0)=0H_1(0) = 0H1​(0)=0

H1(1)=0 H_1(1) = 0 H1​(1)=0

H1(p)≥0H_1(p) \ge 0H1​(p)≥0 for p∈[0,1]p \in [0, 1]p∈[0,1]

H1(p)<1H_1(p) < 1H1​(p)<1 for all p∈Probp \in \text{Prob}p∈Prob

H1(p)≤1/eH_1(p) \le 1/eH1​(p)≤1/e

Concavity of the Shannon Entropy H1H_1H1​

Shannon entropy HsH_sHs​ of a discrete distribution

The Shannon entropy Hs(d)H_s(d)Hs​(d) is non-negative

Hs(d)≤ln⁡∣α∣H_s(d) \le \ln |\alpha|Hs​(d)≤ln∣α∣

The Shannon entropy of a constant distribution is 000

Hs(uniform)=ln⁡∣α∣H_s(\text{uniform}) = \ln |\alpha|Hs​(uniform)=ln∣α∣

Hs(coin p)=Hb(p)H_s(\text{coin } p) = H_b(p)Hs​(coin p)=Hb​(p)

Shannon entropy HsH_sHs​ is invariant under reordering of probabilities

One-event Shannon entropy $H_1(p) = -p \ln p$

$H_1(0) = 0$

$H_1(1) = 0$

$H_1(p) \ge 0$ for $p \in [0, 1]$

$H_1(p) < 1$ for all $p \in \text{Prob}$

$H_1(p) \le 1/e$

Concavity of the Shannon Entropy $H_1$

Shannon entropy $H_s$ of a discrete distribution

The Shannon entropy $H_s(d)$ is non-negative

$H_s(d) \le \ln |\alpha|$

The Shannon entropy of a constant distribution is $0$

$H_s(\text{uniform}) = \ln |\alpha|$

$H_s(\text{coin } p) = H_b(p)$

Shannon entropy $H_s$ is invariant under reordering of probabilities