なぜ


15

私はそう思う

P(A|B)=P(A|B,C)P(C)+P(A|B,¬C)P(¬C)

正しいが、

P(A|B)=P(A|B,C)+P(A|B,¬C)

間違っている。

しかし、後者については「直観」があります。つまり、2つのケース(CまたはNot C)を分割することで確率P(A | B)を考慮します。なぜこの直感が間違っているのですか?


4
方程式をテストする簡単な例を次に示します。2つの独立した公正なコインを投げます。してみましょうA最初に頭を起動したことをイベントで、B第二は、頭をアップする、とした場合も両方とも頭を考え出すことをイベントで。あるいずれかのあなたが書いた式は正しいですか?C
A.レックス

4
総確率法則では、無条件の確率を条件付き確率の合計として表現する場合、条件付けイベントで重み付けする必要があります。たとえば、P(A)=P(A|B)P(B)+P(A|B¯)P(B¯)
AdamO

回答:


25

確率こと、容易カウンタ例として、仮定Aは、である1にかかわらずの値の、C。次に、誤った方程式をとると、次のようになります。P(A)A1C

P(A|B)=P(A|B,C)+P(A|B,¬C)=1+1=2

それは明らかに正しいことはできません。おそらくより大きくすることはできません。これは、2つのケースのそれぞれに重みを割り当てる必要があるという直感を構築するのに役立ちます。そのケースの可能性に比例して、最初の(正しい)方程式になります。1


これにより、最初の方程式に近づきますが、重みは完全には正しくありません。正しい重みについては、A。Rexのコメントをご覧ください。


1
"最初の(正しい)式"の重みがあるべきP ¬ C 、または彼らはする必要がありますP C | B P ¬ C | B P(C)P(¬C)P(CB)P(¬CB)
A.レックス

良い点だA.Rex @、完全な正しさのために、私はそれがあるべきだと思うP ¬ C | B 。方程式の左側のすべて(1つの用語)はBが与えられていることを前提としているため、追加の仮定なしで(BCが互いに独立していると仮定するなど)、右側の場合も同じです側P(C|B)P(¬C|B)BBC
デニススーマーズ

A | Bが確実に200%発生すると考えてください。
マークL.ストーン

@ MarkL.Stoneこれは、常に2回発生するということですか?;)
モニカの復活

9

デニスの答えには、間違った方程式を反証する素晴らしい反例があります。この答えは、次の式が正しい理由を説明しようとしています。

P(A|B)=P(A|C,B)P(C|B)+P(A|¬C,B)P(¬C|B).

すべての用語が上で条件付けされたよう、我々はによって全体の確率空間を置き換えることができますB&ドロップBの用語を。これにより、次のことができます。BBB

P(A)=P(A|C)P(C)+P(A|¬C)P(¬C).

この式があり、なぜ、あなたは求めているP ¬ C それに用語。P(C)P(¬C)

その理由は、あるの一部であるAにおけるC及びP A | ¬ C P ¬ Cがの一部であるA¬ Cと二つまで追加A。図を参照してください。一方、P A | C は、AP Aを含むCの割合ですP(A|C)P(C)ACP(A|¬C)P(¬C)A¬CAP(A|C)CAの割合である ¬ Cを含む A -それらを追加することは無意味であるので、それらは共通の分母を持っていないので、これらは、異なる領域の割合です。P(A|¬C)¬CA

pic


2
Not "everything is conditioned on B". In particular, P(C) and P(¬C) are not, so you can't just drop B. Moreover, this might suggest the equation is wrong!
A. Rex

@A.Rex Technically you're right, I should have said every term involving A is conditioned on B (I made a simple substitution A|BA). I will correct the answer.
Reinstate Monica

5
My objection wasn't a technicality. Your diagram correctly proves that P(A)=P(AC)P(C)+P(A¬C)P(¬C), which after conditioning on B becomes P(AB)=P(AB,C)P(CB)+P(AB,¬C)P(¬CB); note that the probabilities of C and ¬C are also conditioned on B. This is not the first equation given in the OP, which is good news, because the first equation given in the OP is not correct.
A. Rex

@A.Rex You are right once again, C must also conditioned on B as the proportion of the probability space contained in C might not be the same as the proportion of B contained in C. This point escaped me. I will revise again.
Reinstate Monica

7

I know you've already received two great answers to your question, but I just wanted to point out how you can turn the idea behind your intuition into the correct equation.

First, remember that P(XY)=P(XY)P(Y) and equivalently P(XY)=P(XY)P(Y).

To avoid making mistakes, we will use the first equation in the previous paragraph to eliminate all conditional probabilities, then keep rewriting expressions involving intersections and unions of events, then use the second equation in the previous paragraph to re-introduce the conditionals at the end. Thus, we start with:

P(AB)=P(AB)P(B)

We will keep rewriting the right-hand side until we get the desired equation.

The casework in your intuition expands the event A into (AC)(A¬C), resulting in

P(AB)=P(((AC)(A¬C))B)P(B)

As with sets, the intersection distributes over the union:

P(AB)=P((ABC)(AB¬C))P(B)

Since the two events being unioned in the numerator are mutually exclusive (since C and ¬C cannot both happen), we can use the sum rule:

P(AB)=P(ABC)P(B)+P(AB¬C)P(B)

We now see that P(AB)=P(ACB)+P(A¬CB); thus, you can use the sum rule on the event on the event of interest (the "left" side of the conditional bar) if you keep the given event (the "right" side) the same. This can be used as a general rule for other equality proofs as well.

We re-introduce the desired conditionals using the second equation in the second paragraph:

P(A(BC))=P(ABC)P(BC)
and similarly for ¬C.

We plug this into our equation for P(AB) as:

P(AB)=P(ABC)P(BC)P(B)+P(AB¬C)P(B¬C)P(B)

Noting that P(BC)P(B)=P(CB) (and similarly for ¬C), we finally get

P(AB)=P(ABC)P(CB)+P(AB¬C)P(¬CB)

Which is the correct equation (albeit with slightly different notation), including the fix A. Rex pointed out.

Note that P(ACB) turned into P(ABC)P(CB). This mirrors the equation P(AC)=P(AC)P(C) by adding the B condition to not only P(AC) and P(AC), but also P(C) as well. I think if you are to use familiar rules on conditioned probabilities, you need to add the condition to all probabilities in the rule. And if there's any doubt whether that idea works for a particular situation, you can always expand out the conditionals to check, as I did for this answer.


2
+1. I think you extracted the equation that OP tried to intuit: P(AB)=P(ACB)+P(A¬CB).
A. Rex

Thanks! That was the main point I wanted to make, but couldn't figure out a high-level explanation why the intersection goes on the left rather than the right, so I used formulas instead. Also, I just noticed you were the one who pointed out the mistake in OP's formula, so I credited you for that. (I probably wouldn't have noticed either, lol.)
YawarRaza7349

2

Probabilities are ratios; the probability of A given B is how often A happens within the space of B. For instance, P(rain|March) is the number of rainy days in March divided by the number of total days in March. When dealing with fractions, it makes sense to split up numerators. For instance,

P(rain or snow|March)=(number of rainy or snowy days in March)(total number of days in March)=(number of rainy days in March)(total number of days in March)+(number of snowy days in March)(total number of days in March)=P(rain|March)+P(snow|March)

This of course assumes that "snow" and "rain" are mutually exclusive. It does not, however, make sense to split up denominators. So if you have P(rain|February or March), that is equal to

(number of rainy days in February and March)(total number of days in February and March).

But that is not equal to

(number of rainy days in February)(total number of days in February)+(number of rainy days in March)(total number of days in March).

If you're having trouble seeing that, you can try out some numbers. Suppose there are 10 rainy days in February and 8 in March. Then we have

(number of rainy days in February and March)(total number of days in February and March)=(10+8)/(28+31)=29.5%

and

(number of rainy days in February)(total number of days in February)+(number of rainy days in March)(total number of days in March)=(10/28)+(8/31)=35.7%+25.8%=61.5%

The first number, 29.5%, is the average of 35.7% and 25.8% (with the second number weighted slightly more because there is are more days in March). When you say P(A|B)=P(A|B,C)+P(A|B,¬C) you're saying that x1+x2y1+y2=x1y1+x2y2, which is false.


1

If I go to Spain, I can get sunburnt.

P(sunburnt|Spain)=0.2
This tells me nothing about getting sunburnt if not going to Spain, let's say
P(sunburnt|¬Spain)=0.1
This year I'm going to Spain, so
P(sunburnt)=0.2
Letting B=Ω, this is, P(B)=1, your intuition would imply
P(A)=P(A|C)+P(A|¬C)
which by the previous argument, isn't neccesarily true.
弊社のサイトを使用することにより、あなたは弊社のクッキーポリシーおよびプライバシーポリシーを読み、理解したものとみなされます。
Licensed under cc by-sa 3.0 with attribution required.