2つの確率変数の合計としての一様確率変数


18

GrimmetおよびStirzakerから取得

そうでないことを示すU = X + Y Uは均一[0,1]上に分散され、XおよびYは独立しており、同一分布。あなたはないはず XとYが連続変数であることを前提としています。U=X+YUXY

場合の矛盾で十分によって簡単証明XXYは、Yそれが常に可能見つけることと主張することによって別個に仮定され、Uu及びU 'uその結果、P U U + U 'P U U P(Uu+u)P(Uu)一方、P X + Y U = P X + Y U + U 'PX+ YU = PX+ YU + U

ただし、この証明はX は拡張されずYバツYは完全に連続または特異連続です。ヒント/コメント/批評?


3
ヒント:特性関数はあなたの友達です。
2014年

1
XとYはiidなので、それらの特性関数は同一でなければなりません。ただし、モーメント生成関数ではなく特性関数を使用する必要があります-mgfはXに対して存在することが保証されていないため、mgfに不可能なプロパティがあることを示すことは、そのようなXがないことを意味しません。すべてのRVには特性関数があります。あなたはそれが不可能な性質を持って示している場合ので、その後はそのようなX.はありません
紙魚

1
分布の場合、X及びYは任意有する原子を、と言うP { X = A } = P { Y =を} = B > 0、次いでP { X + Y = 2 A } B 2 > 0 SOおよびX + Yは、均一に分散させることができない[ 0 1 ]バツYP{ X= a } = P{ Y= a } = b > 0P{ X+ Y=2a}b2>0X+Y[0,1]。したがって、原子を持つXYの分布の場合を考慮する必要はありません。XY
サルウェートディリップ14年

回答:


13

結果は写真で証明できます。目に見える灰色の領域は、均一な分布は2つの独立した同一に分布した変数の合計として分解できないことを示しています。

表記法

ましょX及びYは、そのようなことはIIDであるX + Yが上に均一な分布を有する[ 0 1 ]。すべてのそれはこの手段0 B 1XYX+Y[0,1]0ab1

PR < X + Y B = B -

Pr(a<X+Yb)=ba.

共通の分布の本質的な支持X及びY故には[ 0 1 / 2 ](別段のためにその正の確率が存在することになるX + Yの外側にある[ 0 1 ])。XY[0,1/2]X+Y[0,1]

してみましょう0 < ε < 1 / 4。ランダム変数の合計がどのように計算されるかを示すこの図を考えてください:0<ϵ<1/4

Figure

基礎となる確率分布は、X Y )の結合分布です。任意のイベントの確率は< X + Y Bは線間延伸対角線バンドによってカバーされる全確率で与えられるX + Y =およびX + Y = B。このような3つのバンドが示されている:から0までε、左下に小さな青い三角形として現れます。1 / 2 - ε1 / 2(X,Y)a<X+Ybx+y=ax+y=b0ϵ1/2ϵ +ϵ、2つの(黄色と緑の)三角形で覆われた灰色の長方形として表示されます。および 1 ϵから 1で、右上に小さな赤い三角形として表示されます。1/2+ϵ1ϵ1

写真が示すもの

図の左下の三角形をそれを含む左下の正方形と比較し、XおよびYの iid仮定を活用することにより、XY

ϵ=Pr(X+Yϵ)<Pr(Xϵ)Pr(Yϵ)=Pr(Xϵ)2.

ϵ=Pr(X+Yϵ)<Pr(Xϵ)Pr(Yϵ)=Pr(Xϵ)2.

Note that the inequality is strict: equality is not possible because there is some positive probability that both XX and YY are less than ϵϵ but nevertheless X+Y>ϵX+Y>ϵ.

Similarly, comparing the red triangle to the square in the upper right corner,

ε = のPr X + Y > 1 - ε < のPr X > 1 / 2 - ε 2

ϵ=Pr(X+Y>1ϵ)<Pr(X>1/2ϵ)2.

最後に、左上と右下にある2つの反対側の三角形を、それらを含む対角バンドと比較すると、別の厳密な不等式が得られます。

2 ε < 2 のPr X ε のPr X > 1 / 2 - ε < のPr 1 / 2 - ε < X + Y 1 / 2 + ε = 2 ε

2ϵ<2Pr(Xϵ)Pr(X>1/2ϵ)<Pr(1/2ϵ<X+Y1/2+ϵ)=2ϵ.

The first inequality ensues from the previous two (take their square roots and multiply them) while the second one describes the (strict) inclusion of the triangles within the band and the last equality expresses the uniformity of X+YX+Y. The conclusion that 2ϵ<2ϵ2ϵ<2ϵ is the contradiction proving such XX and YY cannot exist, QED.


3
(+1) I like this approach. Recovering my back-of-an-envelope from the wastepaper basket I can see I drew the same diagram, except that I didn't mark on the yellow and green triangles inside the band. I did obtain the inequalities for the blue and red triangles. I played around with them and a few other probabilities, but never thought to investigate the probability of the strip, which turns out to be the criticial step. I wonder what thought process might have motivated this insight?
Silverfish

In fact, where @whuber has yellow and green triangles, I did draw on squares (I'd effectively decomposed [0,0.5]2[0,0.5]2 into a grid). Looking at the step which "describes the (strict) inclusion of the triangles within the band", 2Pr(Xϵ)Pr(X>1/2ϵ)<Pr(1/2ϵ<X+Y1/2+ϵ)2Pr(Xϵ)Pr(X>1/2ϵ)<Pr(1/2ϵ<X+Y1/2+ϵ), I wonder whether this would actually be geometrically more natural with squares capping the band than triangles?
Silverfish

1
@Silver I was reminded of an analysis of sums of uniform distributions I posted a couple of years ago. That suggested visualizing the sum X+YX+Y geometrically. It was immediately evident that a lot of probability had to be concentrated near the corners (0,0)(0,0) and (1/2,1/2)(1/2,1/2) in order for the sum to be uniform and for relatively little probability to be near the center diagonal X+Y=1/2X+Y=1/2. That led to the diagram, which I redrew in Mathematica. At that point the answer wrote itself. Yes, using squares in the center band might be neater.
whuber

Thanks! "Note that the inequality is strict: equality is not possible because there is some positive probability that either of XX or YY is less than ϵϵ but nevertheless X+Y>ϵX+Y>ϵ." I'm not sure I follow this. It seems to me the aim here is to show Pr(X+Yϵ)<Pr(XϵYϵ)Pr(X+Yϵ)<Pr(XϵYϵ), doesn't this require a positive probability for some event AA in which both of XX and YY are less than or equal to ϵϵ and yet X+Y>ϵX+Y>ϵ? It is the "either of" vs "both of" I'm vacillating over.
Silverfish

@Silverfish Thank you; I did not express that as I had intended. You are correct: the language is intended essentially to describe the portion of a little square not inside the triangle.
whuber

10

I tried finding a proof without considering characteristic functions. Excess kurtosis does the trick. Here's the two-line answer: Kurt(U)=Kurt(X+Y)=Kurt(X)/2Kurt(U)=Kurt(X+Y)=Kurt(X)/2 since XX and YY are iid. Then Kurt(U)=1.2Kurt(U)=1.2 implies Kurt(X)=2.4Kurt(X)=2.4 which is a contradiction as Kurt(X)2Kurt(X)2 for any random variable.

Rather more interesting is the line of reasoning that got me to that point. XX (and YY) must be bounded between 0 and 0.5 - that much is obvious, but helpfully means that its moments and central moments exist. Let's start by considering the mean and variance: E(U)=0.5E(U)=0.5 and Var(U)=112Var(U)=112. If XX and YY are identically distributed then we have:

E(X+Y)=E(X)+E(Y)=2E(X)=0.5

E(X+Y)=E(X)+E(Y)=2E(X)=0.5

So E(X)=0.25E(X)=0.25. For the variance we additionally need to use independence to apply:

Var(X+Y)=Var(X)+Var(Y)=2Var(X)=112

Var(X+Y)=Var(X)+Var(Y)=2Var(X)=112

Hence Var(X)=124Var(X)=124 and σX=1260.204σX=1260.204. Wow! That is a lot of variation for a random variable whose support ranges from 0 to 0.5. But we should have expected that, since the standard deviation isn't going to scale in the same way that the mean did.

Now, what's the largest standard deviation that a random variable can have if the smallest value it can take is 0, the largest value it can take is 0.5, and the mean is 0.25? Collecting all the probability at two point masses on the extremes, 0.25 away from the mean, would clearly give a standard deviation of 0.25. So our σXσX is large but not impossible. (I hoped to show that this implied too much probability lay in the tails for X+YX+Y to be uniform, but I couldn't get anywhere with that on the back of an envelope.)

Second moment considerations almost put an impossible constraint on XX so let's consider higher moments. What about Pearson's moment coefficient of skewness, γ1=E(XμX)3σ3X=κ3κ3/22γ1=E(XμX)3σ3X=κ3κ3/22? This exists since the central moments exist and σX0σX0. It is helpful to know some properties of the cumulants, in particular applying independence and then identical distribution gives:

κi(U)=κi(X+Y)=κi(X)+κi(Y)=2κi(X)

κi(U)=κi(X+Y)=κi(X)+κi(Y)=2κi(X)

This additivity property is precisely the generalisation of how we dealt with the mean and variance above - indeed, the first and second cumulants are just κ1=μκ1=μ and κ2=σ2κ2=σ2.

Then κ3(U)=2κ3(X)κ3(U)=2κ3(X) and (κ2(U))3/2=(2κ2(X))3/2=23/2(κ2(X))3/2(κ2(U))3/2=(2κ2(X))3/2=23/2(κ2(X))3/2. The fraction for γ1γ1 cancels to yield Skew(U)=Skew(X+Y)=Skew(X)/2Skew(U)=Skew(X+Y)=Skew(X)/2. Since the uniform distribution has zero skewness, so does XX, but I can't see how a contradiction arises from this restriction.

So instead, let's try the excess kurtosis, γ2=κ4κ22=E(XμX)4σ4X3γ2=κ4κ22=E(XμX)4σ4X3. By a similar argument (this question is self-study, so try it!), we can show this exists and obeys:

Kurt(U)=Kurt(X+Y)=Kurt(X)/2

Kurt(U)=Kurt(X+Y)=Kurt(X)/2

The uniform distribution has excess kurtosis 1.21.2 so we require XX to have excess kurtosis 2.42.4. But the smallest possible excess kurtosis is 22, which is achieved by the Binomial(1,12)Binomial(1,12) Bernoulli distribution.


2
(+1) This is a quite clever approach, which was new to me. Thanks. Note that some of your analysis could have been streamlined by considering a uniform centered at zero. (The equivalence of the problem is immediate.) That would have immediately told you that considering skew was a dead-end.
cardinal

@cardinal: I knew the skew was a dead-end before I worked on it. The purpose was expository: it's a self-study question so I didn't want to solve it in full! Rather I wanted to leave a hint on how to deal with the next level up...
Silverfish

@cardinal: I was in two minds whether to center or not. I did back-of-envelope calculations more conveniently, but in the final analysis we just need (1) a simple case of the general result that Kurt(X1+...+Xn)=1nKurt(X)Kurt(X1+...+Xn)=1nKurt(X) for iid XiXi, (2) that Kurt(U)=1.2Kurt(U)=1.2 for any uniform distribution, and (3) Kurt(X)Kurt(X) exists since XX is bounded and σX0σX0 (which is trivial, else σU=0σU=0). So none of the key results actually required centering, though bits may have looked less ugly!
Silverfish

Yes, the word "streamlined" was carefully chosen. :-) I did not intend my comment to be read as criticism of your exposition. Cheers.
cardinal

@cardinal Incidentally, variance considerations alone almost worked, but the uniform isn't quite spread out enough. With a bit more probability mass nearer the extremes, e.g. fT(t)=12t2fT(t)=12t2 on [-0.5, 0.5], then Var(T)=.15Var(T)=.15 and if T=X1+X2T=X1+X2 then σX=.15/20.27>0.25 which is impossible as X is bounded by -0.25 and 0.25. Of course, you will see immediately how this relates to the present example! I wonder if the approach generalises, I'm sure other bounded RVs can't be decomposed into sums but require even higher moments investigated to find the contradiction.
Silverfish
弊社のサイトを使用することにより、あなたは弊社のクッキーポリシーおよびプライバシーポリシーを読み、理解したものとみなされます。
Licensed under cc by-sa 3.0 with attribution required.