列車が到着する前に時間をモデル化するために使用する分布は何ですか？

15

電車の到着時間に関するデータをモデル化しようとしています。「長く待つほど、電車が現れる可能性が高くなります」をキャプチャするディストリビューションを使用したいと思います。そのような分布はCDFのように見えるはずなので、P（train show up | waited 60 minutes）は1に近くなります。ここでどの分布を使用するのが適切ですか？

distributions modeling

— foobar
ソース

10

あなたは25時間を待って何の電車がなかったならば、私は次の分に上げて、列車のチャンスが近いかもしれ疑いラインが一時的または恒久的に閉じられたことは非常に可能であるとして

0

$0$

— ヘンリー

@ヘンリー、これはあなたが以前の確率を信じているかどうかに完全に依存します。たとえば、英国で最も使用されていない鉄道駅theguardian.com/uk-news/2016/dec/09/…には、1日以上（日曜日は運行していない）到着のギャップがあります。

— セクストゥス

@MartijnWeterings -ジャーナリストにおそらくおかげで、Shippeaヒルは、使用中に1200パーセントの増加を見ても、しなかった次の年の使用の最低10のいくつかは、このようなティーズサイド空港として、1つの方向の一方の電車週間持っている

— ヘンリー

17

2つの確率の乗算

と（待機時間）の間の時間に最初に到着する確率は、 $t$ $t+dt$

$t$ と $t+dt$ 間の到着の確率（これは、時間で $s(t)$ 到着率関連する可能性があります） $t$
そして、時刻 $t$ 前に到着しない確率（または、そうでなければ最初ではない）。

この後者の用語は次のものに関連しています。

P (n = 0, t + d t) = (1 - s (t) d t) P (n = 0, t)

$P(n=0,t+dt) = (1-s(t)dt) P(n=0,t)$

または

\frac{\partial P (n = 0, t)}{\partial t} = - s (t) P (n = 0, t)

$\frac{\partial P(n=0,t)}{\partial t} = -s(t) P(n=0,t)$

与える：

P (n = 0, t) = e^{\int_{0}^{t} - s (t) d t}

$P(n=0,t) = e^{\int_0^t-s(t) dt}$

待ち時間の確率分布は次のとおりです。

f (t) = s (t) e^{\int_{0}^{t} - s (t) d t}

$f(t) = s(t)e^{\int_0^t-s(t) dt}$

累積分布の導出。

あるいは、時刻があるという条件付きで到着が1回未満になる確率の式を使用することもできます。 $t$

P (n < 1 | t) = F (n = 0; t)

$P(n<1|t) = F(n=0;t)$

そして、時間 $t$ と $t+dt$ 間に到達する確率は導関数に等しい

f_{arrival time} (t) = - \frac{d}{d t} F (n = 0 | t)

$f_{\text{arrival time}}(t) = - \frac{d}{d t} F(n=0 \vert t)$

このアプローチ/方法は、例えば、ポアソンプロセスのn番目の到着の待機時間としてガンマ分布を導出するのに役立ちます。（ポアソンプロセスの待機時間-ガンマ分布）

2つの例

これを待っているパラドックスに関連付けることができます（待っているパラドックスを説明してください）。

$s(t) = \lambda$
$f (t) = λ e^{- λ t}$ $f(t) = \lambda e^{-\lambda t}$
Constant distribution: If the arrivals are occurring at a constant rate (such as trains arriving according to a fixed schedule), then the probability of an arrival, when a person has already been waiting for some time, is increasing. Say a train is supposed to arrive every $T$ minutes then the frequency, after already waiting $t$ minutes is $s(t) = 1/(T-t)$ and the pdf for the waiting time will be:
$f (t) = \frac{e^{\int_{0}^{t} - \frac{1}{T - t} d t}}{T - t} = \frac{1}{T}$ $f(t)= \frac{e^{\int_0^t -\frac{1}{T-t} dt}}{T-t} = \frac{1}{T}$ which makes sense since every time between $0$ and $T$ should have equal probability to be the first arrival.

So it is this second case, with "then the probability of an arrival, when a person has already been waiting for some time is increasing", that relates to your question.

It might need some adjustments depending on your situation. With more information the probability $s(t) dt$ for a train to arrive at a certain moment might be a more complex function.

Written by StackExchangeStrike

— Sextus Empiricus
ソース

7

The classical distribution to model waiting times is the exponential distribution.

The exponential distribution occurs naturally when describing the lengths of the inter-arrival times in a homogeneous Poisson process.

— S. Kolassa - Reinstate Monica
ソース

2

Yes, but I daresay a Poisson process is not a good model for a train network.

— leftaroundabout