VOOZH about

URL: https://en.wikipedia.org/wiki/Generalized_relative_entropy

⇱ Generalized relative entropy - Wikipedia


Jump to content
From Wikipedia, the free encyclopedia

Generalized relative entropy (πŸ‘ {\displaystyle \varepsilon }
-relative entropy) is a measure of dissimilarity between two quantum states. It is a "one-shot" analogue of quantum relative entropy and shares many properties of the latter quantity.

In the study of quantum information theory, we typically assume that information processing tasks are repeated multiple times, independently. The corresponding information-theoretic notions are therefore defined in the asymptotic limit. The quintessential entropy measure, von Neumann entropy, is one such notion. In contrast, the study of one-shot quantum information theory is concerned with information processing when a task is conducted only once. New entropic measures emerge in this scenario, as traditional notions cease to give a precise characterization of resource requirements. πŸ‘ {\displaystyle \varepsilon }
-relative entropy is one such particularly interesting measure.

In the asymptotic scenario, relative entropy acts as a parent quantity for other measures besides being an important measure itself. Similarly, πŸ‘ {\displaystyle \varepsilon }
-relative entropy functions as a parent quantity for other measures in the one-shot scenario.

Definition

[edit]

To motivate the definition of the πŸ‘ {\displaystyle \varepsilon }
-relative entropy πŸ‘ {\displaystyle D^{\varepsilon }(\rho \|\sigma )}
, consider the information processing task of hypothesis testing. In hypothesis testing, we wish to devise a strategy to distinguish between two density operators πŸ‘ {\displaystyle \rho }
and πŸ‘ {\displaystyle \sigma }
. A strategy is a POVM with elements πŸ‘ {\displaystyle Q}
and πŸ‘ {\displaystyle I-Q}
. The probability that the strategy produces a correct guess on input πŸ‘ {\displaystyle \rho }
is given by πŸ‘ {\displaystyle \operatorname {Tr} (\rho Q)}
and the probability that it produces a wrong guess is given by πŸ‘ {\displaystyle \operatorname {Tr} (\sigma Q)}
. πŸ‘ {\displaystyle \varepsilon }
-relative entropy captures the minimum probability of error when the state is πŸ‘ {\displaystyle \sigma }
, given that the success probability for πŸ‘ {\displaystyle \rho }
is at least πŸ‘ {\displaystyle \varepsilon }
.

For πŸ‘ {\displaystyle \varepsilon \in (0,1)}
, the πŸ‘ {\displaystyle \varepsilon }
-relative entropy between two quantum statesπŸ‘ {\displaystyle \rho }
and πŸ‘ {\displaystyle \sigma }
is defined as πŸ‘ {\displaystyle D^{\varepsilon }(\rho \|\sigma )=-\log {\frac {1}{\varepsilon }}\min \left\{\left\langle Q,\sigma \right\rangle |0\leq Q\leq I{\text{ and }}\left\langle Q,\rho \right\rangle \geq \varepsilon \right\}~.}

From the definition, it is clear that πŸ‘ {\displaystyle D^{\varepsilon }(\rho \|\sigma )\geq 0}
. This inequality is saturated if and only if πŸ‘ {\displaystyle \rho =\sigma }
, as shown below.

Relationship to the trace distance

[edit]

Suppose the trace distance between two density operators πŸ‘ {\displaystyle \rho }
and πŸ‘ {\displaystyle \sigma }
is πŸ‘ {\displaystyle {\left\|\rho -\sigma \right\|}_{1}=\delta ~.}

For πŸ‘ {\displaystyle 0<\varepsilon <1}
, it holds that πŸ‘ {\displaystyle \log {\frac {\varepsilon }{\varepsilon -(1-\varepsilon )\delta }}\quad \leq \quad D^{\varepsilon }(\rho \|\sigma )\quad \leq \quad \log {\frac {\varepsilon }{\varepsilon -\delta }}~.}

In particular, this implies the following analogue of the Pinsker inequality[1]

πŸ‘ {\displaystyle {\frac {1-\varepsilon }{\varepsilon }}{\left\|\rho -\sigma \right\|}_{1}\quad \leq \quad D^{\varepsilon }(\rho \|\sigma )~.}

Furthermore, the proposition implies that for any πŸ‘ {\displaystyle \varepsilon \in (0,1)}
, πŸ‘ {\displaystyle D^{\varepsilon }(\rho \|\sigma )=0}
if and only if πŸ‘ {\displaystyle \rho =\sigma }
, inheriting this property from the trace distance. This result and its proof can be found in Dupuis et al.[2]

Proof of inequality a)

[edit]

Upper bound: Trace distance can be written as

πŸ‘ {\displaystyle {\left\|\rho -\sigma \right\|}_{1}=\max _{0\leq Q\leq 1}\operatorname {Tr} (Q(\rho -\sigma ))~.}

This maximum is achieved when πŸ‘ {\displaystyle Q}
is the orthogonal projector onto the positive eigenspace of πŸ‘ {\displaystyle \rho -\sigma }
. For any POVM element πŸ‘ {\displaystyle Q}
we have πŸ‘ {\displaystyle \operatorname {Tr} (Q(\rho -\sigma ))\leq \delta }
so that if πŸ‘ {\displaystyle \operatorname {Tr} (Q\rho )\geq \varepsilon }
, we have πŸ‘ {\displaystyle \operatorname {Tr} (Q\sigma )~\geq ~\operatorname {Tr} (Q\rho )-\delta ~\geq ~\varepsilon -\delta ~.}

From the definition of the πŸ‘ {\displaystyle \varepsilon }
-relative entropy, we get πŸ‘ {\displaystyle 2^{-D^{\varepsilon }(\rho \|\sigma )}\geq {\frac {\varepsilon -\delta }{\varepsilon }}~.}

Lower bound: Let πŸ‘ {\displaystyle Q}
be the orthogonal projection onto the positive eigenspace of πŸ‘ {\displaystyle \rho -\sigma }
, and let πŸ‘ {\displaystyle {\bar {Q}}}
be the following convex combination of πŸ‘ {\displaystyle I}
and πŸ‘ {\displaystyle Q}
: πŸ‘ {\displaystyle {\bar {Q}}=\left(\varepsilon -\mu \right)I+\left(1-\varepsilon +\mu \right)Q}
where πŸ‘ {\displaystyle \mu ={\frac {(1-\varepsilon )\operatorname {Tr} (Q\rho )}{1-\operatorname {Tr} (Q\rho )}}~.}

This means πŸ‘ {\displaystyle \mu =(1-\varepsilon +\mu )\operatorname {Tr} (Q\rho )}
and thus πŸ‘ {\displaystyle \operatorname {Tr} ({\bar {Q}}\rho )~=~\left(\varepsilon -\mu \right)+\left(1-\varepsilon +\mu \right)\operatorname {Tr} (Q\rho )~=~\varepsilon \,.}
Moreover, πŸ‘ {\displaystyle \operatorname {Tr} ({\bar {Q}}\sigma )~=~\varepsilon -\mu +\left(1-\varepsilon +\mu \right)\operatorname {Tr} (Q\sigma )~.}
Using πŸ‘ {\displaystyle \mu =(1-\varepsilon +\mu )\operatorname {Tr} (Q\rho )}
, our choice of πŸ‘ {\displaystyle Q}
, and finally the definition of πŸ‘ {\displaystyle \mu }
, we can re-write this as πŸ‘ {\displaystyle {\begin{aligned}\operatorname {Tr} ({\bar {Q}}\sigma )&=\varepsilon -\left(1-\varepsilon +\mu \right)\operatorname {Tr} (Q\rho )+\left(1-\varepsilon +\mu \right)\operatorname {Tr} (Q\sigma )\\&=\varepsilon -{\frac {\left(1-\varepsilon \right)\delta }{1-\operatorname {Tr} (Q\rho )}}\\[1ex]&\leq \varepsilon -\left(1-\varepsilon \right)\delta ~.\end{aligned}}}

Hence πŸ‘ {\displaystyle D^{\varepsilon }(\rho \|\sigma )\geq \log {\frac {\varepsilon }{\varepsilon -\left(1-\varepsilon \right)\delta }}~.}

Proof of inequality b)

[edit]

To derive this Pinsker-like inequality, observe that πŸ‘ {\displaystyle \log {\frac {\varepsilon }{\varepsilon -\left(1-\varepsilon \right)\delta }}~=~-\log \left(1-{\frac {\left(1-\varepsilon \right)\delta }{\varepsilon }}\right)~\geq ~\delta {\frac {1-\varepsilon }{\varepsilon }}~.}

Alternative proof of the Data Processing inequality

[edit]

A fundamental property of von Neumann entropy is strong subadditivity. Let πŸ‘ {\displaystyle S(\sigma )}
denote the von Neumann entropy of the quantum state πŸ‘ {\displaystyle \sigma }
, and let πŸ‘ {\displaystyle \rho _{ABC}}
be a quantum state on the tensor product Hilbert space πŸ‘ {\displaystyle {\mathcal {H}}_{A}\otimes {\mathcal {H}}_{B}\otimes {\mathcal {H}}_{C}}
. Strong subadditivity states that πŸ‘ {\displaystyle S(\rho _{ABC})+S(\rho _{B})\leq S(\rho _{AB})+S(\rho _{BC})}
where πŸ‘ {\displaystyle \rho _{AB},\rho _{BC},\rho _{B}}
refer to the reduced density matrices on the spaces indicated by the subscripts. When re-written in terms of mutual information, this inequality has an intuitive interpretation; it states that the information content in a system cannot increase by the action of a local quantum operation on that system. In this form, it is better known as the data processing inequality, and is equivalent to the monotonicity of relative entropy under quantum operations:[3] πŸ‘ {\displaystyle S(\rho \|\sigma )-S({\mathcal {E}}(\rho )\|{\mathcal {E}}(\sigma ))\geq 0}
for every CPTP map πŸ‘ {\displaystyle {\mathcal {E}}}
, where πŸ‘ {\displaystyle S(\omega \|\tau )}
denotes the relative entropy of the quantum states πŸ‘ {\displaystyle \omega ,\tau }
.

It is readily seen that πŸ‘ {\displaystyle \varepsilon }
-relative entropy also obeys monotonicity under quantum operations:[4] πŸ‘ {\displaystyle D^{\varepsilon }(\rho \|\sigma )\geq D^{\varepsilon }({\mathcal {E}}(\rho )\|{\mathcal {E}}(\sigma ))}
, for any CPTP map πŸ‘ {\displaystyle {\mathcal {E}}}
. To see this, suppose we have a POVM πŸ‘ {\displaystyle (R,I-R)}
to distinguish between πŸ‘ {\displaystyle {\mathcal {E}}(\rho )}
and πŸ‘ {\displaystyle {\mathcal {E}}(\sigma )}
such that πŸ‘ {\displaystyle \langle R,{\mathcal {E}}(\rho )\rangle =\langle {\mathcal {E}}^{\dagger }(R),\rho \rangle \geq \varepsilon }
. We construct a new POVM πŸ‘ {\displaystyle ({\mathcal {E}}^{\dagger }(R),I-{\mathcal {E}}^{\dagger }(R))}
to distinguish between πŸ‘ {\displaystyle \rho }
and πŸ‘ {\displaystyle \sigma }
. Since the adjoint of any CPTP map is also positive and unital, this is a valid POVM. Note that πŸ‘ {\displaystyle \langle R,{\mathcal {E}}(\sigma )\rangle =\langle {\mathcal {E}}^{\dagger }(R),\sigma \rangle \geq \langle Q,\sigma \rangle }
, where πŸ‘ {\displaystyle (Q,I-Q)}
is the POVM that achieves πŸ‘ {\displaystyle D^{\varepsilon }(\rho \|\sigma )}
. Not only is this interesting in itself, but it also gives us the following alternative method to prove the data processing inequality.[2]

By the quantum analogue of the Stein lemma,[5]

πŸ‘ {\displaystyle {\begin{aligned}\lim _{n\to \infty }{\frac {1}{n}}D^{\varepsilon }\left(\rho ^{\otimes n}\|\sigma ^{\otimes n}\right)&=\lim _{n\to \infty }{\frac {-1}{n}}\log \min {\frac {1}{\varepsilon }}\operatorname {Tr} \left(\sigma ^{\otimes n}Q\right)\\&=D(\rho \|\sigma )-\lim _{n\to \infty }{\frac {1}{n}}\left(\log {\frac {1}{\varepsilon }}\right)\\&=D(\rho \|\sigma )~,\end{aligned}}}

where the minimum is taken over πŸ‘ {\displaystyle 0\leq Q\leq 1}
such that πŸ‘ {\displaystyle \operatorname {Tr} (Q\rho ^{\otimes n})\geq \varepsilon ~.}

Applying the data processing inequality to the states πŸ‘ {\displaystyle \rho ^{\otimes n}}
and πŸ‘ {\displaystyle \sigma ^{\otimes n}}
with the CPTP map πŸ‘ {\displaystyle {\mathcal {E}}^{\otimes n}}
, we get πŸ‘ {\displaystyle D^{\varepsilon }(\rho ^{\otimes n}\|\sigma ^{\otimes n})~\geq ~D^{\varepsilon }({\mathcal {E}}(\rho )^{\otimes n}\|{\mathcal {E}}(\sigma )^{\otimes n})~.}
Dividing by πŸ‘ {\displaystyle n}
on either side and taking the limit as πŸ‘ {\displaystyle n\rightarrow \infty }
, we get the desired result.

See also

[edit]

References

[edit]
  1. ^ Watrous, J. Theory of Quantum Information, Fall 2013. Ch. 5, page 194 https://cs.uwaterloo.ca/~watrous/CS766/DraftChapters/5.QuantumEntropy.pdf[permanent dead link]
  2. ^ a b Dupuis, F.; KrΓ€mer, L.; Faist, P.; Renes, J. M.; Renner, R. (2013). "Generalized Entropies". XVIIth International Congress on Mathematical Physics. WORLD SCIENTIFIC. pp. 134–153. arXiv:1211.3141. doi:10.1142/9789814449243_0008. ISBN 978-981-4449-23-6. S2CID 118576547.
  3. ^ Ruskai, Mary Beth (2002). "Inequalities for quantum entropy: A review with conditions for equality". Journal of Mathematical Physics. 43 (9). AIP Publishing: 4358–4375. arXiv:quant-ph/0205064. Bibcode:2002JMP....43.4358R. doi:10.1063/1.1497701. ISSN 0022-2488. S2CID 3051292.
  4. ^ Wang, Ligong; Renner, Renato (15 May 2012). "One-Shot Classical-Quantum Capacity and Hypothesis Testing". Physical Review Letters. 108 (20) 200501. arXiv:1007.5456. Bibcode:2012PhRvL.108t0501W. doi:10.1103/physrevlett.108.200501. ISSN 0031-9007. PMID 23003132. S2CID 3190155.
  5. ^ DΓ©nez Petz (2008). "8". Quantum Information Theory and Quantum Statistics. Theoretical and Mathematical Physics. Berlin, Heidelberg: Springer Berlin Heidelberg. Bibcode:2008qitq.book.....P. doi:10.1007/978-3-540-74636-2. ISBN 978-3-540-74634-8.