Professional Documents
Culture Documents
a probabilistic model with square-state space to reproduce e.g., the optimal success probabilities for
binary state discrimination and N numbers of symmetric quantum states. The existences of families
of ensembles in binary cases are shown both in classical and quantum theories in any generic cases.
Typeset by REVTEX
2
orthogonal states in general quantum systems. In partic- point of view for state < λ, s, t >= λs + (1 − λ)t, each
ular, we discuss the relation between our method and the maps oi of an observable o should be an affine functional:
one used in [18], and show that our method generalizes oi (λs + (1 − λ)t) = λoi (s) + (1 − λ)oi (t), since the right
the results in [18] to general probabilistic theories. hand side is a sum of probabilities to obtain ith outputs
The paper is organized as follows. In Sec. II, we give a for exclusive events of states s and t with probability λ
brief review of general probabilistic theories. In Sec. III, and 1 − λ, while oi (s), oi (t) are conditional probabilities
we introduce a Helstrom family of ensembles and show to obtain ith output conditioned that the states are s and
the relation with an optimal measurement in state dis- t, respectively. An effect e on S is an affine functional
crimination problem (Propositions 1, 2, Theorem 1). We from S to [0, 1]. There are two trivial effects, unit effect
also prove the existences of the families of ensembles for u and zero effect 0, defined by u(s) = 1, 0(s) = 0 for all
N = 2 in classical and quantum systems in generic cases s ∈ S . With this language, an N -valued observable o is
PN
(Theorems 2, 3). In Sec. IV, we illustrate our method a set of effects oi (i = 1, . . . , N ) satisfying i=1 oi = u,
in 2-level quantum systems, and reproduce the optimal meaning that oi (s) is the probability to obtain the ith
success probabilities for binary state discrimination and output when measuring the observable o in the state s.
N numbers of symmetric quantum states. As an example We denote by E and ON the sets of all the effects and
of neither classical nor quantum theories, we introduce a N -valued observables, respectively. While the output of
general probabilistic model with square-state space. Our an observable can be not only from real numbers but
method is also applied to this model to exemplify its us- also any symbols, like “head” or “tail”, hereafter we of-
ability. In Sec. V, we summarize our results. ten identify them with {1, . . . , N }. Physically natural
topology on S is given by the (weakest) topology so that
all the effects are continuous. Without loss of generality
II. BRIEF REVIEW OF GENERAL [22], S is assumed to be compact with respect to this
PROBABILISTIC THEORIES topology. Typical examples of the general probabilistic
theories will be classical and quantum systems. For sim-
plicity, the classical and quantum systems we consider in
In order to overview general probabilistic theories as
this paper will be finite systems:
the operationally most general theories of probability, let
us start from a very primitive consideration of physical
theories where a probability plays a fundamental role. In [Example 1: Classical Systems] Finite classical sys-
such a theory, a particular rule (like Borel rule in Quan- tem is described by a finite probability theory. Let
tum mechanics) to obtain a probability for some output Ω = {ω1 , . . . , ωd } be a finite sample space. A state is
when measuring an observable o under a state s should be a probability distribution p = (p1 , . . . , pd ), meaning that
provided. Therefore, states and observables are two fun- the probability to observe ωi is pi . Therefore, the P state
damental ingredients with an appropriate physical law space is Scl = {p = (p1 , . . . , pd ) ∈ Rd | pi ≥ 0, i pi =
to obtain probabilities in general probabilistic theories. 1} ⊂ Rd , and forms a (standard) simplex [29]. The set
Let us denote the set of states by S. In a simplified (i)
of extreme points is {p(i) }di=1 where pj = δij . An effect
view, an N -valued observable o [26] can be considered e is given
as an N numbers of maps oi on a state space S so that P by a random variable f : Ω → [0, 1] such that
e(p) = i f (ωi )pi (0 ≤ f (i) ≤ 1).
oi (s) ∈ [0, 1] provides a probability to obtain ith output
when measuring this observable under a state s ∈ S . It is
operationally natural to assume that if one can prepare [Example 2: Quantum Systems] d-level quantum sys-
states s ∈ S and t ∈ S , then there exists a probabilistic- tem is described by an d dimensional Hilbert space H.
mixture state < λ, s, t >∈ S for any λ ∈ [0, 1] which A state is described by a density operator ρ, an Hermi-
represents an ensemble of preparing state s with prob- tian positive operator on H with unit trace, and the state
ability λ and state t with probability 1 − λ. Further- space is given by Squ = {ρ ∈ LH (H) | ρ ≥ 0, tr ρ = 1} ⊂
more, it is natural to assume the so-called separating LH (H); here real vector space LH (H) is the set of all Her-
condition for states; namely, two states s1 and s2 should mitian operator on H. A pure state is a one dimensional
be identified when there are no observables to statisti- projection operator onto a unit vector ψ ∈ H, written as
cally distinguish them. Then, it has been shown [7, 17] ρ = |ψihψ| in Dirac notation. An effect e is described
that without loss of generality, the state space S is em- [17, 21] by a positive operator B such that 0 ≤ B ≤ I
bedded into a convex (sub)set in a real vector space V through e(ρ) = tr(Bρ), which is called an element of
such that a probabilistic-mixture state is given by a con- positive-operator-valued measure (POVM) [30].
vex combination < λ, s, t >= λs + (1 − λ)t [27]. Hence, In the following, we assume thatPall observables {oi }N
i=1
hereafter the state space S is assumed to be convex set composed of effects oi satisfying i oi = u are in princi-
in a real vector space V with the above mentioned in- ple measurable. Then, only the structure of state space
terpretation. An extreme point [28] of a state space S characterizes the general probabilistic theories. Roughly
is called a pure state, otherwise a mixed state. Physi- speaking, for each (compact) convex set one can consider
cally, a pure state is a state which cannot be prepared as the corresponding general probabilistic model. When we
an ensembles of different states. From the preparational consider a composition of state spaces, the so-called no-
3
signaling condition is usually required to keep the causal- Definition 1 Given N distinct states {si ∈ S }N i=1 and a
ity. prior probability distribution {pi }N i=1 , we call a family of
We refer [7, 11] for the details of general probabilistic N -numbers of ensembles {p̃i , si ; 1 − p̃i , ti } (i = 1, . . . , N )
theories and [8] where generalized No-broadcasting and a “weak Helstrom family of ensembles” (or simply a
No-cloning theorems have been shown in general proba- “weak Helstrom family”) for states {si } and a probability
bilistic theories. {pi } if there exist N -numbers of binary probability distri-
butions {p̃i , 1 − p̃i } (0 < p̃i ≤ 1) and N -numbers of states
{ti ∈ S }Ni=1 satisfying
III. HELSTROM FAMILY OF ENSEMBLES IN
GENERAL PROBABILISTIC THEORIES pi pj
(i) = ≤ 1, (5)
p̃i p̃j
As a state discrimination is one of the central problems (ii) p˜i si + (1 − p˜i )ti = p˜j sj + (1 − p˜j )tj , (6)
in quantum information theory, we consider a problem
to discriminate states in general probabilistic theories by for any i, j = 1, . . . , N
means of Bayesian strategy. Suppose Alice is given a Note that condition (6) means that N ensembles
state chosen from {si ∈ S }N i=1 with Pa prior probability {p̃i , si ; 1 − p̃i , ti } are statistically equivalent (among ob-
distribution {pi ∈ R}N i=1 (p i ≥ 0, i pi = 1), and her servables). Therefore, a weak Helstrom family is a family
goal is to guess the state. She wants to find an opti- of statistically equivalent ensembles which are mixtures
mal measurement to maximize the success probability. of states {si } and {ti } with weights p̃i and 1 − p̃i satisfy-
Without loss of generality, it is sufficient to consider an ing condition (5). We call ti a conjugate state to si . The
N -valued observable E = {ei }N i=1 ∈ ON from which she probabilistic-mixture state determined by N ensembles
decides the state was in si when obtaining the output i. {p̃i , si ; 1 − p̃i , ti } with condition (6) is called a reference
Then, the success probability is state and is denoted by s:
X
N
s := p˜i si + (1 − p˜i )ti (∀i = 1, . . . , N ). (7)
PS (E) = pi ei (si ). (1)
i=1 We call the ratio p ≤ 1 a Helstrom ratio defined by
The optimal success probability PS is given by optimizing pi
p := (∀i = 1, . . . , N ), (8)
PS (E) among all the N -valued observables: p̃i
PS = sup PS (E). (2) which turns out to play an important role in an optimal
E∈ON
state discrimination. We call a weak Helstrom family a
For a binary discrimination (N = 2), it can be written trivial (resp. nontrivial) family when p = 1 (resp. p < 1).
as Note that a weak Helstrom family always exists for any
distinct states {si } and a prior probability distribution
PS = p2 + sup[p1 e(s1 ) − p2 e(s2 )], (3) {pi }. For instance,
P it is easy to see that p˜i = pi (p = 1)
e∈E 1
and ti = 1−p i
( j6=i pj sj ) gives a weakP Helstrom family
where in the final expression we have used e1 + e2 = u. of ensembles with a reference state s = i pi si , although
This problem is well investigated in quantum mechanics, it is a trivial family. (See later examples for nontrivial
and the optimal success probability to discriminate two families.) Moreover, if {p̃i , si ; 1 − p̃i , ti } (i = 1, . . . , N ) is
distinct density operators ρ1 , ρ2 with a prior distribution a weak Helstrom family with a Helstrom ratio p < 1 and
p1 , p2 is given by a reference state s, then for any p < p′ ≤ 1, one can con-
struct another weak Helstrom family with a Helstrom ra-
(Q) 1−p̃i pi
PS = p2 + sup tr[E(p1 ρ1 − p2 ρ2 )] tio p′ . Indeed, since 0 ≤ 1− ′
p˜i ′ < 1 for p̃i = p′ (< 1), one
0≤E≤I 1−p̃i 1−p̃i
can take conjugate states as t′i := 1− p˜i ′ ti + (1 − 1−p˜i ′ )si .
1
= (1 + ||p1 ρ1 − p2 ρ2 ||1 ). (4) Then it is easy to see that the family of {si , p̃i ; ti , 1 − p̃′i }
′ ′
2
is a weak Helstrom family with a Helstrom ratio p′ and
Here, the √ norm is a trace norm defined by ||A||1 := the same reference state s.
tr |A| = tr A† A. Since this bound is sometimes referred Let us explain a geometrical meaning of a weak Hel-
as the Helstrom bound, let us call PS (2) also the Hel- strom family of ensembles which makes it easier to find
strom bound for any N and for any general probabilistic it. First we explain this for the most interesting cases
theories. in the context of state discrimination, i.e., those with
In order to obtain the Helstrom bound in general prob- the uniform probability distribution pi = 1/N . In these
abilistic theories, we shall introduce a family of ensembles cases, condition (5) tells that p˜i should give the same
which is later shown to be closely related to the optimiz- weights q := p˜i = N1p , and condition (6) geometrically
ing problem in Eq (2). In the following, we assume that means that all ti should located in S such that all si and
a prior probability distribution satisfy pi 6= 0, 1 removing ti have the common interior point s with the same ra-
trivial cases: tio q. Global picture for this is that one has to find ti
4
q s1
ratio 1−q . Fig. 1 [A] illustrates an example for N = 3
s s
with the uniform distribution. One immediately recog- t1 t1
t2
nizes the reverse homothethy between two polytopes (tri- t2 s3
s3
u1
angles) generated by {si } and {ti } with the internal ho-
mothetic center s. As is later shown, it is preferable to u2
t2
t1 [A℄ [B℄ t2 t02
Parallel s1
s1
S
Supporting s
Hyperplanes s2 s s0
distinguishable t1 t1 s2 t01
[C℄ t2
t2
s1
s
FIG. 2: Geometrical appearance of two distinguishable states
t1 , t2 . t1 s2
E = {ei }Ni=1 satisfying ei (ti ) = 0 for all i = 1, . . . , N . In FIG. 3: [A] A typical Helstrom family of ensembles in R2 ; [B]
this case, the observable E gives an optimal measurement a Helstrom family of ensembles is not unique; [C] A Helstrom
to discriminate {si } with a prior distribution {pi }. family of ensembles exists for models S with infinite numbers
of extreme points.
Two states t1 , t2 ∈ S are said to be distinguishable if
there exists an observable E = {e1 , e2 } which discrim-
inates t1 and t2 with certainty (for any prior distribu- with the uniform distribution p1 = p2 = 1/2 are ensem-
tions), or equivalently satisfy bles of a Helstrom family if t1 , t2 are distinguishable and
e1 (σ1 ) = 1, e1 (σ2 ) = 0 (⇔ e2 (σ1 ) = 0, e2 (σ2 ) = 1). s := qs1 + (1 − q)t1 = qs2 + (1 − q)t2 , (12)
(11)
Therefore, as a corollary of Proposition 2 for N = 2, we with some 0 ≤ q := p̃1 = p̃2 ≤ 1. From (12), s1 − s2 and
obtained the following theorem for a binary state dis- t1 − t2 should be parallel, and therefore one easy way
crimination (N = 2). to find Helstrom family is as follows: search conjugate
states t1 and t2 on the boundary of S which are on a line
Theorem 1 Let {p̃i , si ; 1 − p̃i , ti } (i = 1, 2) be a weak parallel to s1 −s2 such that there exist parallel supporting
Helstrom family of ensembles for states s1 , s2 ∈ S and a hyperplanes at t1 and t2 . Then, the crossing point is a
binary probability distribution p1 , p2 such that t1 and t2 reference state s while the ratio between s1 − s (s2 − s)
are distinguishable states. Then, {p̃i , si ; 1 − p̃i , ti } (i = and s − t1 (s − t2 ) determines the Helstrom ratio p = N1q .
1, 2) is a Helstrom family with the Helstrom ratio p = PS . In Fig. 3, Helstrom families for some models on V = R2
An optimal measurement to distinguish s1 and s2 is given are illustrated.
by an observable to distinguish t1 and t2 . Now it is important to ask whether a Helstrom family
of ensembles always exists for any general probabilistic
Proof The distinguishability of t1 and t2 satisfies the theories or not. In this paper, we show a Helstrom fam-
sufficient condition in Proposition 2. ily of ensembles for a binary state discrimination (N = 2)
Let us consider the case where S is a subset of fi- always exist in generic cases for both classical and quan-
nite dimensional real vector space V . From condition tum systems. (For the existence in more general general
(11), geometrical meaning of two distinguishable states probabilistic theories, see our forthcoming paper [22].)
t1 , t2 is that they are on the boundary of S which pos- Here, we mean by generic cases all the cases except for
sess parallel supporting hyperplanes (See Fig. 2). Here, trivial cases where PS = max[p1 , p2 ] with a trivial mea-
a supporting hyperplane at a point s ∈ S is a hyper- surement u, i.e., there are no improvement of guessing
plane H ⊂ V such that s ∈ H and S is contained in one which exceeds the prior knowledge.
of the two closed half-spaces of the hyperplane [23]. In- First, let us consider a quantum system Squ . For dis-
deed, if there exist two parallel supporting hyperplanes tinct density operators ρ1 , ρ2 with a prior probability
H1 and H2 at t1 ∈ S and t2 ∈ S respectively, one can distribution p1 , p2 , define
construct an affine functional f on V such that f (x) = 1 P an Hermitian operator X :=
p1 ρ1 − p2 ρ2 . Let X = i xi Pi be the spectral decom-
on x ∈ H1 and f (y) = 0 for y ∈ H2 . Then, the restriction position of X. The
of f to S is an effect which distinguishes t1 and t2 with P positive and negative parts
P of X are
given by X+ := i:xi ≥0 xi Pi and X− := i:xi <0 |xi |Pi
certainty since S is contained between H1 and H2 and satisfying
f (t1 ) = 1, f (t2 ) = 0. Then, to find a Helstrom family
of ensembles given in Theorem 1 is nothing but a simple X = X+ − X− . (13)
geometrical task. Here, we explain this in the uniform
distribution cases: From the definition of a (weak) Hel- Note that X+ , X− ≥ 0, X+ X− = 0, and ||X+ ||1 −
strom family of ensembles and Theorem 1, two ensembles ||X− ||1 = tr X+ − tr X− = p1 − p2 . X+ or X− might
{p̃i , si ; 1 − p̃i , ti } (i = 1, 2) for a distinct stats s1 , s2 ∈ S be zero operator [31], but in that case the optimization
6
problem is nothing but a trivial case. Indeed, suppose course, this is consistent with the results in [18] and our
that X− = 0. Then, for any POVM element E, it fol- result can be interpreted as a generalization of the results
lows tr EX = tr EX+ ≤ tr I X+ = tr X = p1 − p2 , and in [18] to any dimensional quantum mechanical systems
thus PS = p1 with a trivial measurement I from (4). The for any N states discrimination.
similar argument shows that the case X+ = 0 is again a
trivial case with PS = p2 . Therefore, for any generic case,
we can assume X+ , X− 6= 0, and this makes possible to IV. EXAMPLES
define two density operators by
In this section, we illustrate our method in quantum
1 1
σ1 := X− , σ2 := X+ . (14) 2-level systems (qubit), and also in a simple toy model
||X− ||1 ||X+ ||1 which is neither classical nor quantum.
Notice that they are orthogonal and thus are distinguish-
able with certainty. It follows that sup0≤E≤I tr EX = A. Quantum 2-level systems
tr X+ = ||X+ ||1 where the maximum
P is established by
the projection operator P = i;xi ≥0 Pi . From (4), we
have As is well known, any density operator ρ for qubit
is represented by the Bloch vector b ∈ D3 := {b ∈
PS
(Q)
= p2 + ||X+ ||1 = p1 + ||X− ||1 . (15) R3 | P ||b|| ≤ 1} through the map b 7→ ρ(b) =
1 3
(
2 I + i=1 bi σi ), where σi (i = 1, 2, 3) are Pauli Matri-
(Q)
Let p̃i = pi /PS (i = 1, 2). It follows 0 < p̃i ≤ 1 from ces. Notice that the trace distance between density op-
(15) and p̃p11 = p̃p22 by definition. Finally, direct calculation erators coincides with the Euclid distance in R3 between
using (13), (14) and (15) shows the equation (6). the corresponding Bloch vectors: ||ρ(b1 ) − ρ(b2 )||1 =
Therefore, we have obtained [32] ||b1 − b2 ||.
Theorem 2 For any quantum mechanical systems, a [Examples 3: Binary state discrimination] Let us con-
Helstrom family of ensembles for a binary state discrim- sider a state discrimination between ρ(b1 ) and ρ(b2 ) with
ination exists for any generic cases. a uniform distribution. Following a geometrical view of a
Helstrom family of ensembles in Theorem 1, one can find
As any classical systems is embeddable into quantum sys- it in the following manner: In order that states c1 ∈ D3
tems (into diagonal elements with a fixed base), we have and c2 ∈ D3 have parallel supporting hyperplanes so that
also they are distinguishable, they should be on the Bloch
Theorem 3 For any classical systems Scl , a Helstrom sphere (pure states) in opposite direction [34]. Moreover,
family of ensembles for a binary state discrimination ex- the line c1 −c2 has to be parallel to b1 −b2 from condition
ists for any generic cases. (12). Then, c1 and c2 are uniquely determined by points
on the intersection of the Bloch ball and the hyperplane
More concretely, for given distinct classical P states P s1 = determined by b1 − b2 and the origin (See Fig. 4). Then,
(xi )di=1 , s2 = (yi )di=1 ∈ Scl (xi , yi ≥ 0, i xi = i yi = it is an elementary geometric problem to obtain the ra-
1) with a prior probability distribution p1 , p2 , one can tio: q = 2+||b21 −b2 || . Since the Helstrom ratio is given
define t1 = ||X1− ||1 (− min[Xi , 0])di=1 , t2 = ||X1+ ||1 (max[Xi , by p = pi /p̃i = 1/2q, this reproduces a Helstrom bound
P
0])di=1 where Xi = p1 xi − p2 yi , ||X− ||1 = i:Xi <0 Xi
PS = 21 (1 + 21 ||b1 − b2 ||) by use of Theorem 1. Indeed,
P
and ||X+ ||1 = i:Xi ≥0 Xi . Finally p̃i is given by pi /PS from (4), the optimal success probability to discriminate
P two distinct ρ1 and ρ2 with a uniform prior distribution
= 2pi /(1 + i |Xi |).
In reference [18], a family of ensembles in Theorem 1 is
(and thus a Helstrom family of ensembles) has been used (Q) 1 1
PS = (1 + ||ρ1 − ρ2 ||1 ). (16)
in 2-level quantum systems for a binary state discrimi- 2 2
nation with a uniform prior distribution p0 = p1 = 1/2. (Recall that ||ρ(b1 ) − ρ(b2 )||1 = ||b1 − b2 ||).
The purpose there was to reproduce Helstrom bound (4)
in 2-level quantum systems (with p0 = p1 = 1/2) by [Examples 4: N -numbers of symmetric state discrim-
resorting to (A) remote state preparation and (B) no- ination] In quantum systems, discrimination of N num-
signaling condition [33]. Compared to their results, The- bers of state is much more difficult problem than bi-
orem 2 shows that a Helstrom family of ensembles exists nary cases. For symmetric quantum (pure) states {ρj =
not only in 2-level systems with uniform distributions but |ψj ihψj |}N
j=1 with uniform distribution pi = 1/N , where
also in any quantum systems for generic cases. Moreover, state vectors are given by |ψj i = V j−1 |ψi with a nor-
Theorem 1 implies that a logical connection with an opti- malized vector |ψi and a unitary operator satisfying
mal state discrimination has already appears through the V N = exp(iχ) I (χ ∈ R), Ban et al. [24] obtained the
existence of a Helstrom family of ensembles, resort to nei- optimal success probability:
ther (A) nor (B); and indeed this appears in any general
(Q)
probabilistic theories, not only in quantum systems. Of PS = |hψ|Φ|ψi|2 ,
7
z
[A℄ [B℄
6 5 4
7 3 D
1 2 A
b2 8 = N 1
C 1 q
q
1 q B
b1
b
E
[C℄
6 5 4
7 3
FIG. 4: 2-dimensional section of the Bloch Ball. 8 = N 1
2
PN 2 8
where Φ := 1
j=1 |ψj ihψj |. As a typical example, let 7
3
us consider N symmetric states in 2-level systems (as 4 6
illustrated in Fig. 5 [A] for the case N = 8). Let 5
π
V := exp(−i N σ3 ) be a unitary operator which rotates
Bloch vectors by the angle 2π/N around on the z axis
(V N = − I); and let |ψi := cos( θ2 )|0i + sin( θ2 )|1i where
|0i, |1i the eigenvectors of σ3 with eigenvalues 1, −1. The FIG. 5: [A] Symmetric quantum states ρ1 , . . . , ρ8 in the Bloch
corresponding Bloch vector to |ψi is b = (sin θ, 0, cos θ). ball; [B] 2-dimensional section where points A, B, C are ρj ,
sin θ
Then, it follows σj and ρ, respectively; (AC = cos ξ
, AB = 2 sin(θ + ξ), and
sin θ
thus q = 1 − AC = 1 − 2 sin(θ+ξ) cos ξ
.); [C] Helstrom family of
θ θ 2π(j−1) AB
|ψj i := V j−1 |ψi = cos( )|0i + sin( )ei N |1i, (17) ensembles with conjugate states σj and the reference state ρ.
2 2
for j = 1, . . . , N , with the corresponding Bloch vectors
Note that we have a bound PS ≤ p = N1qξ from Proposi-
b(j) = (sin θ cos 2π(j−1)
N , sin θ sin 2π(j−1)
N , cos θ). It is easy
N tion 1. Therefore, in order to obtain a tighter bound of
to show Φ = 2 (I + cos θσ3 ) [35], and the optimal success
PS , we would like to find a weak Helstrom family with
probability is
larger qξ as much as possible. It is again a simple ge-
sin θ
(Q) 1 ometric problem to obtain qξ = 1 − sin(θ+2ξ)+sin θ (see
PS = (1 + sin θ). (18)
N the caption of Fig. 5 [B]), which takes the maximum
1 π θ
qξM = 1+sin θ at ξM = 4 − 2 (= ∠DAE) (See Fig. 5
In the following, we apply our method and show that [C]). This attains the tight bound (18), and thus we have
there exists a Helstrom family of ensemble for this prob- demonstrated that our method reproduces the optimal
lem with any N and thus reproduce the success prob- success probability. Indeed, we can show that this weak
ability (18). (In the following, we identity the density Helstrom family of ensembles is a Helstrom family from
operator ρj , the state vector ψj , and its Bloch vector bj . Proposition 2: note that σj = |φj ihφj | at ξM is
First, from the symmetry and geometrical view point
of a weak Helstrom family of ensembles, it is clear that π π 2π(j−1)
|φj i = cos( )|0i + sin( )ei( N +π) |1i. (19)
a weak Helstrom family for {ρj = |ψj ihψj |}N j=1 and pi = 4 4
1/N can be constructed as follows: In the Bloch ball, 2π(j−1)
make lines from each ρj to a point on the z-axis, say Let |χj i := cos( π4 )|0i + sin( π4 )ei N |1i which is or-
point C, and extend the lines until they arrives at the thogonal to |φj i for all j = 1, . . . , N . Then, it follows
PN N 2
Bloch sphere, and let conjugate states σj be each end- j=1 |χi ihχi | = 2 I and thus {Ei := N |χi ihχi |} is a
points of the lines from ρj . Fig. 5 [B] shows one of the POVM which satisfies the condition tr Ei σi = 0 in Propo-
2-dimensional sections of the Bloch ball where the points sition 2. Consequently, we have found a Helstrom family
A and B are the corresponding ρj and σj . Then, we have of ensembles and thus obtained the Helstrom bound.
obtained a weak Helstrom family of ensembles {qξ , ρj ; 1−
qξ , σj } where qξ is a ratio CB
AB
, where we explicitly write
the dependence on the angle ξ = ∠DAB, so that the B. Probabilistic Model with square-state space
reference state ρ is the point C:
As an example which is not neither classical nor quan-
ρ = qξ ρj + (1 − qξ )σj (j = 1, . . . , N ). tum systems, let us consider a general probabilistic model
8
bility (Academic, Boston, 1988). with finitely many outputs, since it is enough to consider
[8] H. Barnum, J. Barrett, M. Leifer, and A. Wilce, Phys. for our purpose to discriminate N numbers of states. Note
Rev. Lett. 99 240501 (2007); arXiv: quant-ph/0611295. that it is straightforward to formalize general observables
[9] H. Barnum, J. Barrett, M. Leifer, and A. Wilce, with measure theoretic language.
arXiv:0805.3553. [27] A subset C in a real vector space V is called convex if
[10] C. M. Edwards, Comm. Math. Phys. 16, 207 (1970); E. λs + (1 − λ)t ∈ C for any s, t ∈ C and λ ∈ [0, 1].
B. Davies and J. T. Lewis, Comm. Math. Phys. 17 239 [28] If s ∈ C does not have a nontrivial convex combination
(1970); G. Ludwig, Foundations of Quantum Mechan- in C, i.e., if s = λt + (1 − λ)u for some t, u ∈ C and
ics Vol I and II, (Springer-Verlag, New York, 1983 and λ ∈ (0, 1) implies s = t = u, then s is called an extreme
1985). E.B. Davies, Quantum Theory of Open Systems point.
(Academic, London, 1976). [29] A convex polytope C = convi=1,...,N {ci } =
[11] A. S . Holevo, Probabilistic and Statistical Aspects of { N
P P
i=1 p i ci | p i ≥ 0, i p i = 1} ⊂ V with N num-
Quantum Theory (North-Holland, Amsterdam, 1982). bers of extreme points ci ∈ V is called a simplex if any
[12] C. W. Helstrom, Quantum Detection and Estimation element c ∈ C has the unique convex combinations with
Theory, (Academic, NewYork, 1976). respect to ci s. Equivalently, C is called a simplex iff the
[13] I. D. Ivanovic, Phys. Lett. A 123, 257 (1987);D. Dieks, affine dimension of C is N − 1.
Phys. Lett. A 126, 303 (1988); A. Peres, Phys. Lett. A [30] Note that the observable here will be the so-called POVM
128, 19 (1988); (positive-operator-valued measure), and therefore what
[14] H. P. Yuen, R. S. Kennedy, and M. Lax, IEEE is usually called observable in the standard textbook of
Trans. Inform. Theory IT-21, 125 (1975); See also A. quantum mechanics which is characterized by an Hermi-
S . Holevo, Statistical Structure of Quantum Theory tian operator is a special observable in this paper.
(Springer, Berlin, 2001). [31] Note that this happens even when 0 < p1 , p2 < 1.
[15] C. A. Fuchs, Distinguishability and Accessible Informa- [32] Although we explained in finite level quantum systems, it
tion in Quantum Theory, PhD thesis, The University of is straightforward to generalize to any quantum mechan-
New Mexico (1996). ical systems with Hilbert space with countably infinite
[16] C. H. Bennett and G. Brassard, Proc. of IEEE Int. dimension. (Notice that X is a trace class operator on H
Conf. on Computers, Systems and Signal Processing, 175 and thus has a discrete spectral decomposition.)
(1984). [33] [(A) Remote State Preparation] Let {pi ; ρi } and
[17] M. Ozawa, Rep. on Math. Phys. 18, 11 (1980). {qj , σj } be two ensembles in a quantum system (ρi , σj ∈
[18] W. Y. Hwang, Phys. Rev. A 71, 062315 (2005); J.
P P
S , pi , qj ≥ 0, i pi , j qj = 1) which satisfies
Bae, J. W. Lee, J. Kim, and W. Y. Hwang, arXiv: Pqu P
i pi ρj = j qj σj . Then, there exist a Hilbert space K,
quant-ph/0406032. a state τ on H ⊗K, and local measurements M1 and M2
[19] S. M. Barnett and E. Andersson, Phys. Rev. A 65, on K such that the ensembles {pi ; ρi } and {qj , σj } can be
044307 (2002); D. Qiu, Phys. Lett. A 303, 140 (2002); remotely prepared by measuring M1 and M2 under the
Y. Feng, S. Zhang, R. Duan, and M. Ying, Phys. Rev. A state τ . [(B) No-signaling condition] Any information
66, 062313 (2002). does not instantaneously transmit by local measurement.
[20] S. Croke, E. Andersson, and S. M. Barnett, Phys. Rev. In quantum systems, it is well known that both (A) and
A 77, 012113 (2008). (B) holds.
[21] M. Ozawa, Ann. Phys. 311, 350 (2004). [34] Indeed, this is equivalent to the orthogonality between
[22] K. Nuida, G. Kimura, T. Miyadera, and H. Imai (in ρ(c1 ) and ρ(c2 ).
preparation). [35] For instance, one can make use of the Bloch
[23] S. R. Lay, Convex Sets and Their Applications (Krieger, vector representation. Φ :=
PN
j=1 |ψj ihψj | =
Malabar, 1992). PN 1 P3 (j) N
[24] M. Ban, K. Kurokawa, R. Momose, and O. Hirota, Int. (I + k=1 bk σk ) = ( + cos θσ3 ), since
2 I
Pj=1(j)2 P (j)
J. Theor. Phys. 36, 1269 (1997). b
j 1 σ 1 = b
j 1 σ 1 vanish from the rotational
[25] G. Kimura, K. Imafuku, T. Miyadera, K. Nuida, and H. symmetry around z-axis.
Imai (in preparation).
[26] In this paper, we deal with only finite discrete observables