Arranging $n$ balls in $k$ bins so that $m$ consecutive bins are empty

This question is inspired by the following problem:

Randomly place seven balls into ten bins, with no bin containing more than one ball. What is the probability that there will be (at least) two consecutive bins that are empty?

Note that this is not a circular arrangement, although that could make for an interesting (if simpler) question. The actual question I have is the same as the one above, but with the numbers written in emphasized text generalized as positive integers $n$, $k$, and $m$, respectively.

Question:

Randomly place $n$ balls into $k$ bins, with no bin containing more than one ball. What is the probability that there will be (at least) $m$ consecutive bins that are empty?

I suspect that this is a known counting problem (as framed, or through some equivalent rephrase), so I include a reference-request tag. An original solution would, of course, be welcome as well!

As in an already included answer: Specific cases (e.g., $m=2$) would be helpful, too.

edited Dec 11 '18 at 16:25

asked Dec 10 '18 at 23:38

Benjamin Dickman

10.3k22968

$begingroup$
This sounds like a pretty straightforward counting problem if you use inclusion-exclusion. However, it doesn't seem to me that there is a nice general closed form expression.
$endgroup$
– platty
Dec 10 '18 at 23:40

1

$begingroup$
@platty I would appreciate a "straightforward counting" answer!
$endgroup$
– Benjamin Dickman
Dec 10 '18 at 23:40

1

$begingroup$
On second thought, it does seem more complicated. The question I was familiar with assumed a bound $m geq n/2$, from which straightforward PIE suffices. But in the general case, it doesn't seem like a nice closed form should be reasonable -- there are too many cases of differently-sized intersections to handle.
$endgroup$
– platty
Dec 10 '18 at 23:58

add a comment |

This question is inspired by the following problem:

Randomly place seven balls into ten bins, with no bin containing more than one ball. What is the probability that there will be (at least) two consecutive bins that are empty?

Question:

Randomly place $n$ balls into $k$ bins, with no bin containing more than one ball. What is the probability that there will be (at least) $m$ consecutive bins that are empty?

I suspect that this is a known counting problem (as framed, or through some equivalent rephrase), so I include a reference-request tag. An original solution would, of course, be welcome as well!

As in an already included answer: Specific cases (e.g., $m=2$) would be helpful, too.

edited Dec 11 '18 at 16:25

asked Dec 10 '18 at 23:38

Benjamin Dickman

10.3k22968

$begingroup$
This sounds like a pretty straightforward counting problem if you use inclusion-exclusion. However, it doesn't seem to me that there is a nice general closed form expression.
$endgroup$
– platty
Dec 10 '18 at 23:40

1

$begingroup$
@platty I would appreciate a "straightforward counting" answer!
$endgroup$
– Benjamin Dickman
Dec 10 '18 at 23:40

1

$begingroup$
On second thought, it does seem more complicated. The question I was familiar with assumed a bound $m geq n/2$, from which straightforward PIE suffices. But in the general case, it doesn't seem like a nice closed form should be reasonable -- there are too many cases of differently-sized intersections to handle.
$endgroup$
– platty
Dec 10 '18 at 23:58

add a comment |

This question is inspired by the following problem:

Randomly place seven balls into ten bins, with no bin containing more than one ball. What is the probability that there will be (at least) two consecutive bins that are empty?

Question:

Randomly place $n$ balls into $k$ bins, with no bin containing more than one ball. What is the probability that there will be (at least) $m$ consecutive bins that are empty?

I suspect that this is a known counting problem (as framed, or through some equivalent rephrase), so I include a reference-request tag. An original solution would, of course, be welcome as well!

As in an already included answer: Specific cases (e.g., $m=2$) would be helpful, too.

edited Dec 11 '18 at 16:25

asked Dec 10 '18 at 23:38

Benjamin Dickman

10.3k22968

This question is inspired by the following problem:

Randomly place seven balls into ten bins, with no bin containing more than one ball. What is the probability that there will be (at least) two consecutive bins that are empty?

Question:

Randomly place $n$ balls into $k$ bins, with no bin containing more than one ball. What is the probability that there will be (at least) $m$ consecutive bins that are empty?

I suspect that this is a known counting problem (as framed, or through some equivalent rephrase), so I include a reference-request tag. An original solution would, of course, be welcome as well!

As in an already included answer: Specific cases (e.g., $m=2$) would be helpful, too.

probability combinatorics discrete-mathematics reference-request balls-in-bins

edited Dec 11 '18 at 16:25

asked Dec 10 '18 at 23:38

Benjamin Dickman

10.3k22968

edited Dec 11 '18 at 16:25

asked Dec 10 '18 at 23:38

Benjamin Dickman

10.3k22968

edited Dec 11 '18 at 16:25

asked Dec 10 '18 at 23:38

Benjamin Dickman

10.3k22968

asked Dec 10 '18 at 23:38

Benjamin Dickman

10.3k22968

asked Dec 10 '18 at 23:38

Benjamin Dickman

10.3k22968

$begingroup$
This sounds like a pretty straightforward counting problem if you use inclusion-exclusion. However, it doesn't seem to me that there is a nice general closed form expression.
$endgroup$
– platty
Dec 10 '18 at 23:40

1

$begingroup$
@platty I would appreciate a "straightforward counting" answer!
$endgroup$
– Benjamin Dickman
Dec 10 '18 at 23:40

1

$begingroup$
On second thought, it does seem more complicated. The question I was familiar with assumed a bound $m geq n/2$, from which straightforward PIE suffices. But in the general case, it doesn't seem like a nice closed form should be reasonable -- there are too many cases of differently-sized intersections to handle.
$endgroup$
– platty
Dec 10 '18 at 23:58

add a comment |

$begingroup$
This sounds like a pretty straightforward counting problem if you use inclusion-exclusion. However, it doesn't seem to me that there is a nice general closed form expression.
$endgroup$
– platty
Dec 10 '18 at 23:40

1

$begingroup$
@platty I would appreciate a "straightforward counting" answer!
$endgroup$
– Benjamin Dickman
Dec 10 '18 at 23:40

1

$begingroup$
On second thought, it does seem more complicated. The question I was familiar with assumed a bound $m geq n/2$, from which straightforward PIE suffices. But in the general case, it doesn't seem like a nice closed form should be reasonable -- there are too many cases of differently-sized intersections to handle.
$endgroup$
– platty
Dec 10 '18 at 23:58

This sounds like a pretty straightforward counting problem if you use inclusion-exclusion. However, it doesn't seem to me that there is a nice general closed form expression.

– platty
Dec 10 '18 at 23:40

@platty I would appreciate a "straightforward counting" answer!

– Benjamin Dickman
Dec 10 '18 at 23:40

On second thought, it does seem more complicated. The question I was familiar with assumed a bound $m geq n/2$, from which straightforward PIE suffices. But in the general case, it doesn't seem like a nice closed form should be reasonable -- there are too many cases of differently-sized intersections to handle.

– platty
Dec 10 '18 at 23:58

add a comment |

5 Answers
5

active

oldest

votes

We show the probability $p_m(k,n)$ to randomly place $n$ balls into $k$ bins with no bin containing more than one ball, so that at least $m$ consecutive bins are empty is
begin{align*}
color{blue}{p_m(k,n)=binom{k}{n}^{-1}sum_{j=1}^{lfloor k/mrfloor}binom{k-mj}{n}binom{n+1}{j}(-1)^{j+1}}tag{0}
end{align*}

In order to do so we consider the binary alphabet $V={0,1}$ and decode empty bins with $0$ and bins where a ball is placed with $1$.

We are looking for the number $g_m(k,n)$ of binary words of length $k$ which contain $n$ $1$'s and runs of $0$ with length at most $m-1$. The number of wanted words is
begin{align*}
f_m(k,n)=binom{k}{n}-g_m(k,n)
end{align*}

Since there is a total of $binom{k}{n}$ binary words of length $k$ with $n$ $1$'s, the probability $p_m(k,n)$ is
begin{align*}
p_m(k,n)=binom{k}{n}^{-1}f_m(k,n)
end{align*}

Words with no consecutive equal characters at all are called Smirnov or Carlitz words. See example III.24 Smirnov words from Analytic Combinatorics by Philippe Flajolet and Robert Sedgewick for more information.

A generating function for the number of Smirnov words over a binary alphabet is given by
begin{align*}
left(1-frac{2z}{1+z}right)^{-1}tag{1}
end{align*}

We replace occurrences of $0$ in a Smirnov word by one up to $m-1$ zeros to generate strings having runs of $0$ with length less than $m$.
begin{align*}
zlongrightarrow z+z^2+cdots+z^{m-1}=frac{zleft(1-z^{m-1}right)}{1-z}tag{2}
end{align*}

Since there are no restrictions to the length of runs of $1$'s we replace occurrences of $1$'s in a Smirnov word by one or more $1$s.
begin{align*}
zlongrightarrow z+z^2+cdots=frac{z}{1-z}tag{3}
end{align*}

We substitue (2) and (3) in (1). Since we want to keep track of the number of zeros we put $tz$ instead of $z$ in (2). We obtain

begin{align*}
left(1- frac{frac{tzleft(1-(tz)^{m-1}right)}{1-tz}}{1+frac{tzleft(1-(tz)^{m-1}right)}{1-tz}}-frac{frac{z}{1-z}}{1+frac{z}{1-z}}right)^{-1}
&=frac{1-(tz)^m}{1-z(1+t-(tz)^{m})}tag{4}
end{align*}

Denoting with $[z^n]$ the coefficient of $z^n$ in a series we obtain from (4) the number of wanted words as
begin{align*}
color{blue}{f_m(k,n)}&=binom{k}{n}-g_m(k,n)\
&color{blue}{=binom{k}{n}-[z^kt^{k-n}]frac{1-(tz)^m}{1-z(1+t-(tz)^{m})})}tag{5}
end{align*}

Example: Let's look at OPs example. We set $k=10,n=7$ and $m=2$ and we obtain with some help of Wolfram Alpha
begin{align*}
color{blue}{f_2(10,7)}&=binom{10}{7}-[z^{10}t^3]frac{1-(tz)^2}{1-z(1+t-(tz)^{2})}\
&=120-[z^{10}t^3]left(1 + (t + 1) z + (2 t + 1) z^2right.\
&qquadleft. + cdots + (6 t^5 + 35 t^4 + color{blue}{56} t^3 + 36 t^2 + 10 t + 1) z^{10} + cdotsright)\
&=120-56\
&,,color{blue}{=64}
end{align*}
with probability

begin{align*}
color{blue}{p_2(10,7)}=binom{10}{7}^{-1}f_2(10,7)=frac{64}{120}color{blue}{=frac{8}{15}=0.5dot{3}}
end{align*}

The blue colored coefficient of $z^{10}$ shows there are $color{blue}{56}$ binary words of length $10$ with $3$ zeros and runs of $0$ with length less than $k=2$. The blue colored result $color{blue}{64}$ is the number of valid words having runs of zeros of length at least $2$.

These $64$ words are
$$
begin{array}{cccc}
color{blue}{00}01111111&1color{blue}{00}0111111&11color{blue}{00}011111&111color{blue}{00}01111\
color{blue}{00}10111111&1color{blue}{00}1011111&11color{blue}{00}101111&111color{blue}{00}10111\
color{blue}{00}11011111&1color{blue}{00}1101111&11color{blue}{00}110111&111color{blue}{00}11011\
color{blue}{00}11101111&1color{blue}{00}1110111&11color{blue}{00}111011&111color{blue}{00}11101\
color{blue}{00}11110111&1color{blue}{00}1111011&11color{blue}{00}111101&111color{blue}{00}11110\
color{blue}{00}11111011&1color{blue}{00}1111101&11color{blue}{00}111110&11101color{blue}{00}111\
color{blue}{00}11111101&1color{blue}{00}1111110&1101color{blue}{00}1111&111011color{blue}{00}11\
color{blue}{00}11111110&101color{blue}{00}11111&11011color{blue}{00}111&1110111color{blue}{00}1\
01color{blue}{00}111111&1011color{blue}{00}1111&110111color{blue}{00}11&11101111color{blue}{00}\
011color{blue}{00}11111&10111color{blue}{00}111&1101111color{blue}{00}1&\
0111color{blue}{00}1111&101111color{blue}{00}11&11011111color{blue}{00}&\
01111color{blue}{00}111&1011111color{blue}{00}1&&\
011111color{blue}{00}11&10111111color{blue}{00}&&\
0111111color{blue}{00}1&&&\
01111111color{blue}{00}&&&\
\
1111color{blue}{00}0111&11111color{blue}{00}011&111111color{blue}{00}01&1111111color{blue}{00}0\
1111color{blue}{00}1011&11111color{blue}{00}101&111111color{blue}{00}10\
1111color{blue}{00}1101&11111color{blue}{00}110&11111101color{blue}{00}\
1111color{blue}{00}1110&1111101color{blue}{00}1&\
111101color{blue}{00}11&11111011color{blue}{00}&\
1111011color{blue}{00}1&&\
11110111color{blue}{00}&&\
&&\
&&\
end{array}
$$

Formula of $f_m(k,n)$:

We derive a formula of $f_m(k,n)$ from (5) manually. It is convenient to use the coefficient of operator $[z^k]$ to denote the coefficient of $z^k$ in a series.

We obtain from (5)
begin{align*}
color{blue}{f_m(k,n)}&=binom{k}{n}-[z^kt^{k-n}]frac{1-(tz)^m}{1-z(1+t-(tz)^m}\
&=binom{k}{n}-[z^kt^{k-n}]sum_{j=0}^infty z^jleft(1+t-(tz)^{m}right)^jleft(1-(tz)^mright)tag{6}\
&=binom{k}{n}-[t^{k-n}]sum_{j=0}^k [z^{k-j}]left(1+t-(tz)^{m}right)^jleft(1-(tz)^mright)tag{7}\
&=binom{k}{n}-[t^{k-n}]sum_{j=0}^k [z^j]left(1+t-(tz)^{m}right)^{k-j}left(1-(tz)^mright)tag{8}\
&=-[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}[z^{mj}]left(1+t-(tz)^{m}right)^{k-mj}left(1-(tz)^mright)tag{9}\
&=-[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}[z^{mj}]sum_{l=0}^{k-mj}binom{k-mj}{l}left(1-(tz)^mright)^{l+1}t^{k-mj-l}tag{10}\
&=-[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}[z^{mj}]sum_{l=0}^{k-mj}binom{k-mj}{l}sum_{u=0}^{l+1}binom{l+1}{u}(-1)^{u}(tz)^{mu}t^{k-mj-l}tag{11}\
&=[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}sum_{l=0}^{k-mj}binom{k-mj}{l}binom{l+1}{j}(-1)^{j+1}t^{k-l}tag{12}\
&,,color{blue}{=sum_{j=1}^{lfloor k/mrfloor}binom{k-mj}{n}binom{n+1}{j}(-1)^{j+1}}tag{13}\
end{align*}
with probability
begin{align*}
color{blue}{p_m(k,n)=binom{k}{n}^{-1}f_m(k,n)}
end{align*}
in accordance with (0).

Comment:

In (6) we do a geometric series expansion.

In (7) we apply the rule $[z^{p-q}]A(z)=[z^p]z^qA(z)$. We also set the upper limit to $k$, since other terms do not contribute.

In (8) we change the order of summation $jto k-j$.

In (9) we sum over multiples of $m$ only by setting $jto mj$ since we have $z^{m}$ in the binomial expansion. We also note that $binom{k}{n}$ is canceled with the first summand $j=0$ since $[z^0t^{k-n}](1+t-(tz)^m)^k(1-(tz)^m)=[t^{k-n}](1+t)^k=binom{k}{k-n}=binom{k}{n}$.

In (10) we expand the binomial.

In (11) we expand the binomial.

In (12) we select the coefficient of $z^{mj}$ by selecting the summand with $u=j$.

In (13) we select the coefficient of $t^{k-n}$ by selecting the summand with $l=n$.

Example revisited:

By calculating OPs example ($k=10,n=7,m=2$) now with formula (13) we obtain
begin{align*}
color{blue}{f_2(10,7)}&=sum_{j=1}^5binom{10-2j}{7}binom{8}{j}(-1)^{j+1}\
&=binom{8}{7}binom{8}{1}\
&,,color{blue}{=64}
end{align*}
in accordance with the result above.

Note that here we only take the summand $j=1$, since other terms have lower index $7$ greater than the upper index $10-2j$, so that the binomial coefficients are zero.

edited Dec 23 '18 at 15:23

answered Dec 22 '18 at 19:13

Markus Scheuer

61.2k456145

$begingroup$
This is great; thank you! Also: Within the first indented sentence I had guessed correctly who the answerer was. :)
$endgroup$
– Benjamin Dickman
Dec 22 '18 at 19:25

$begingroup$
@BenjaminDickman: You're welcome and many thanks for your amusing remark. :-)
$endgroup$
– Markus Scheuer
Dec 22 '18 at 19:48

1

$begingroup$
@BenjaminDickman: I've added a derivation of an explicit formula to complete the job.
$endgroup$
– Markus Scheuer
Dec 23 '18 at 8:19

add a comment |

Denote the sizes of the runs of empty bins as $x_1, ldots, x_{n + 1}$. (Note that there are at most this many runs, so runs may be empty; i.e., $x_j = 0$.) Now, count the number of nonnegative integer solutions to

$$
x_1 + cdots + x_{n + 1} = k - n
$$

that have at least one $x_j ge m$. This is equivalent to solving the problem.

To count this quantity, use the facts that:

The number of nonnegative integer solutions to
$$
x_1 + cdots + x_k = n
$$
is
$$
binom{n + k - 1}{k - 1}text.
$$
This is a simple stars and bars argument.

The number of nonnegative integer solutions to
$$
x_1 + cdots + x_k = n
$$
with each $x_j < c$ is
$$
binom{n + k - 1}{k - 1} - sum_{i = 1}^{lfloor n / {c} rfloor} {(-1)}^{i + 1} binom{k}{i}binom{n - c i + k - 1}{k - 1}text.tag{*}label{*}
$$
To show (ref{*}), recall the complementary form of the inclusion–exclusion principle. Namely, we can count the number of nonnegative integer solutions with each $x_j < c$ as
$$
begin{align}
&& text{the number of all (unrestricted) solutions} \
&-& text{the number of solutions with one } x_j geq c \
&+& text{the number of solutions with two } x_j geq c \
&-& cdots \
&{(-1)}^{lfloor n / {c} rfloor}& text{the number of solutions with }lfloor n / {c} rfloor text{ } x_j geq ctext.
end{align}
$$
Now, how do you count the number of solutions that have $i$-many $x_j geq c$? It's quite simple; write those $x_j$ as $c + y_j$ (since we already know they're greater than or equal to $c$), and you already have a reduction to the stars and bars argument, but with $n' = n - c i$. This tells us that the number of solutions with $i$-many $x_j geq c$ is $binom{n - c i + k - 1}{k - 1}$. The $binom{k}{i}$ factor counts the number of ways to have this scenario.

edited Dec 20 '18 at 15:48

answered Dec 11 '18 at 12:00

d125q

1,6571015

1

$begingroup$
Nice answer! I think the number of variables should be $n+1$, rather than $k-n$. The variables are the gaps between the balls, and there are $n$ balls, so $n+1$ gaps.
$endgroup$
– Mike Earnest
Dec 11 '18 at 14:36

$begingroup$
Thank you; you are right. I corrected the answer and checked that it agrees with the one for $m = 2$ (math.stackexchange.com/a/3035312/112944).
$endgroup$
– d125q
Dec 11 '18 at 14:44

$begingroup$
Could you say something about the second fact? The first one looks to me like stars and bars, but is the second one... stars and bars combined with the inclusion-exclusion principle? $$ $$ If you could indicate how this works in practice for, say, $n = 6$, $k = 10$, and $m = 2$, then that would be very helpful, too. (And could compared with @MikeEarnest's answer, as well as the hand computations I've done.) Thanks!
$endgroup$
– Benjamin Dickman
Dec 11 '18 at 16:21

1

$begingroup$
@BenjaminDickman Give the inclusion exclusion a try! The idea is this; you take all of the solutions, then for each variable subtract the cases where the variable is “bad”, meaning more than $m$, then add back in the cases where two variables are bad, etc.
$endgroup$
– Mike Earnest
Dec 11 '18 at 16:45

1

$begingroup$
@BenjaminDickman I added some "proof sketches."
$endgroup$
– d125q
Dec 20 '18 at 15:48

|
show 2 more comments

+50

This is supposed to be an integration to d125q's answer, as to better explain
how to reach to the formula he gave, but it is too long for a comment.

You are placing $0$ or $1$ ball in every bin.

We are clearly speaking of undistiguishable balls.

While, speaking of consecutive bins, that implies that the bins
be distinguishable (i.e. labelled, i.e. aligned in a row).

Then your problem is equivalent to :

given all the binary strings of length $k$, with $n$ ones (and $n-k$ zeros), how many of them
will contain one or more runs of consecutive zeros whose length is $m$ or greater ?

The number of binary strings of length $k$ and with $n$ ones is $binom{k}{n}$.

So the complement of the above will be the number of strings where the length of each run of consecutive
zeros is less than $m$, and naturally, in a binary string we can exchange the zeros with the ones.

Now, to keep congruence with the precedent posts I am going to cite, let me change
the formulation and the parameters of the problem with the following:

Consider a binary string with $s$ "$1$"'s and $m$ "$0$"'s in total. Let's put an additional (dummy) fixed $0$ at the start and at the end of the string.
We individuate as a run the consecutive $1$'s between two zeros, thereby including runs of null length.
With this scheme we have a fixed number of $m+1$ runs.

(refer also to the similar post).

Bernoulli_runs_1

Then, imposing that each run does not contain more than $r$ ones is equivalent to
compute
$$N_{,b} (s,r,m+1) = text{No}text{. of solutions to};left{ begin{gathered}
0 leqslant text{integer }x_{,j} leqslant r hfill \
x_{,1} + x_{,2} + cdots + x_{,m+1} = s hfill \
end{gathered} right.$$
which as explained in this other post is expressed as
$$
N_b (s,r,m + 1)quad left| {;0 leqslant text{integers }s,m,r} right.quad = sumlimits_{left( {0, leqslant } right),,k,,left( { leqslant ,frac{s}
{r}, leqslant ,m + 1} right)} {left( { - 1} right)^k left( begin{gathered}
m + 1 \
k \
end{gathered} right)left( begin{gathered}
s + m - kleft( {r + 1} right) \
s - kleft( {r + 1} right) \
end{gathered} right)}
$$
Note that by rewriting the binomial in the way shown above render the summation bounds implicit in the binomial,
so that they can be omitted (that's why they are indicated in brackets), which is of great advantage
in performing algebraic manipulations.

I think that an effective way to understand the above is by developing the polynomial
$$
eqalign{
& underbrace {left( {1 + x + x^{,2} + cdots + x^{,r} } right)
cdot left( {1 + cdots + x^{,r} } right) cdot ; cdots ; cdot left( {1 + cdots + x^{,r} } right)}_{m;times} = cr
& = left( {{{1 - x^{,r + 1} } over {1 - x}}} right) = sumlimits_{0, le ,,s,,, le ,m,r} {N_b (s,r,m);x^{,s} } cr}
$$
thereby having the o.g.f. in $s$, and from there easily getting the recursion
$$
N_{,b} (s,r,m + 1) = sumlimits_{i, = ,0}^r {N_{,b} (s - i,r,m)}
$$

Finally note that the $N_b$ are called "r-nomial coefficients" in OEIS: see e.g. Table A008287.

edited Dec 19 '18 at 17:05

answered Dec 19 '18 at 16:31

G Cab

18.9k31238

1

$begingroup$
@BenjaminDickman good that the o.g.f. is explicative. Sorry for the broken links: adjusted.
$endgroup$
– G Cab
Dec 19 '18 at 17:06

add a comment |

There's a nice way to do this if $m = 2.$
I describe it here, although it does not solve the more general problem indicated in the question.

We can get a classic "random" arrangement if the bins are identifiable and the balls are not distinguishable, so I'll make those assumptions.

In each bin we either place one ball or no balls.
We choose $n$ of the bins in which to place the $n$ balls, leaving $n - k$ empty.
The number of ways to do this is
$$binom k{k-n} = binom kn,$$
each way equally probable.

Another way to think about those same arrangements of balls and bins is that we have $k - n$ empty bins, with a run of zero or more full bins before the first empty bin, after the last empty bin, and between each consecutive pair of empty bins.
That is, we are putting $n$ balls into $k - n + 1$ "super bins", where each "super bin" is a run of zero or more filled bins.

As a check on the last interpretation of the problem, the number of ways to put zero or more indistinguishable items into each of $k - n + 1$ containers, placing a total of $n$ items, is
$$ binom{n + (k - n + 1) - 1}{(k - n + 1) - 1} = binom k{k - n}.$$

So that is the number of equally probable events in our sample space.
Now let's count the number of these events that do not have two consecutive empty bins. That means that the $k - n - 1$ runs of full bins that lie between consecutive pairs of empty bins must each include at least one full bin.
Having determined that we need $k - n - 1$ balls placed in this way,
we see that there are no such arrangements if $n < k - n - 1,$
that is, if $2n - k + 1 < 0.$
On the other hand, if $n + 1geq k$ there
are not even two empty bins anywhere and so it is guaranteed
that we will not have two adjacent empty bins.
But if $2n - k + 1 geq 0$ and $n + 1 < k,$ then the number of ways to
place the remaining $2n - k + 1$ full bins into the $k - n + 1$ runs
(including the ones on each end) is
$$
binom{(2n - k + 1) + (k - n + 1) - 1}{(k - n + 1) - 1} = binom{n + 1}{k - n}.
$$

So the probability of two consecutive empty bins is $0$ if $n + 1 geq k$;
if $2n - k + 1 < 0$ the probability is $1$; and in all other cases the probability is
$$
1 - frac{displaystylebinom{n + 1}{k - n}}{displaystylebinom k{k - n}}
= 1 - frac{(n + 1)! ,n!}{(2n - k + 1)!, k!}.
$$

answered Dec 11 '18 at 13:59

David K

53.9k342116

add a comment |

There is the possibility to calculate numerically the result with a rather simple recursive equation.

We will assume $m = 2$ in the following.

To simplify the notation, we will consider that $0$ represents the case "no ball" and $1$ the case "with a ball".

We note $f(k, n)$ the probability associated with $k$ and $n$.

We represent the set of possibilities as a graph.

We consider three cases for the first bins:

1 in the first bin. Probability $frac{n}{k}$. The subgraph corresponds to $f(k-1, n-1)$

01 in the first 2 bins: Probability $frac{k-n}{k} times frac{n}{k-1}$. The subgraph corresponds to $f(k-2, n-1)$

00 in the first 2 bins: Probability $frac{k-n}{k} times frac{k-1-n}{k-1}$ -> we get the $m$ holes.

Then, we obtain, for $m = 2$

$$f(k,n) = frac{n}{k} f(k-1, n-1) + frac{n(k-n)}{k(k-1)} f(k-2, n-1) + frac{(k-n)(k-1-n)}{k(k-1)}$$

We note in addition that $f() = 0$ if $k-n le m-1$ and that $f() = 1$ if $n=0$ and $k ge m$

This recursive formula can be generalised for higher values of $m$, although it becomes more difficult.

This is clearly not as satisfactory as a close form formula. However, it can be represented on a graph rather easily, and it is easy to program.

It may be used for secondary school students.

Note: the program gives $f(10, 7, 2) = frac{8}{15}$

EDIT:

When I arrived at this point, I suspected something ...

Is there a much simpler solution ?

I thought having found a simple formula, it was a mistake ...

EDIT 2: C++ programme for $m = 2$

#include <iostream>

#include <iomanip>



double f2(int k, int n) {       // m = 2

    if ((k - n) <= 1) return 0.0;

    if (n == 0) return 1.0;

    double res = double ((k-n)*(k-1-n)) / (k*(k-1))

        + (double (n*(k-n)) / (k*(k-1))) * f2(k-2, n-1)

        + (double (n)/k) * f2(k-1, n-1);

    return res;

}



int main() {

    int k = 10;

    int n = 7;

    double prob = f2(k, n);

    std::cout << "f = " << std::setprecision(12) << prob << "n";

    return 0;

}

EDIT 3: Display of a part of the graph.

"0:3/10" means that the line above corresponds to "0" (no ball), and that the corresponding probability is equal to 3/10. SB means "subgraph", and the near figure corresponds to the probability associated to this subgraph. This also illustrates the low efficiency of the scheme in terms of computation. However, again, it was not the goal of this proposal.

                      f(10,7)=8/15

                        /      

                       /        

                      /          

                   0:3/10       1:7/10

                  SB:1/8        SB:f(9,6)=7/12                     

                  /                   |     

                 /                    |      

                /                     |       

             0:2/9      1:7/9         0:1/3     1:2/3

            SB:1.0   SB:f(8,6)=1/4   SB:13/28   SB:f(8,5)=9/14

                        /             |           |       

                       /              |           |        

                                       |     

                                     0:1/4    1:3/4

                                     SB:1.0   SB:f(7,5)=2/7

                                                  /     

                                                 /

edited Dec 20 '18 at 13:34

answered Dec 18 '18 at 21:23

Damien

60714

$begingroup$
Could you indicate how the described program outputs 8/15? I find myself not quite able to follow the description and suspect seeing those steps written out could help me!
$endgroup$
– Benjamin Dickman
Dec 19 '18 at 16:47

$begingroup$
Sorry, I was really unclear. I mentioned the easiness to write a programme . .. I wrote such a C++ (very simple) programme for $m=2$ and use it. I might post it. You need to represent what I present in a graph ...Easy to follow with such a graph in front of the eyes. ... Not so easy without
$endgroup$
– Damien
Dec 19 '18 at 17:04

$begingroup$
Including the program and a graph would be great
$endgroup$
– Benjamin Dickman
Dec 19 '18 at 20:39

1

$begingroup$
I have included the C++ programme. I will try now to draw a graph. Much easier by hand than creating a figure, without any drawing software at home ...
$endgroup$
– Damien
Dec 19 '18 at 21:49

1

$begingroup$
I tried to display a part of the graph. Hope it is clear enough
$endgroup$
– Damien
Dec 20 '18 at 13:35

add a comment |

Your Answer

StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\$","\$"]]);
});
});
}, "mathjax-editing");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "69"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
noCode: true, onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});

}
});

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3034654%2farranging-n-balls-in-k-bins-so-that-m-consecutive-bins-are-empty%23new-answer', 'question_page');
}
);

Post as a guest

Name

Required, but never shown

5 Answers
5

active

oldest

votes

5 Answers
5

active

oldest

votes

We show the probability $p_m(k,n)$ to randomly place $n$ balls into $k$ bins with no bin containing more than one ball, so that at least $m$ consecutive bins are empty is
begin{align*}
color{blue}{p_m(k,n)=binom{k}{n}^{-1}sum_{j=1}^{lfloor k/mrfloor}binom{k-mj}{n}binom{n+1}{j}(-1)^{j+1}}tag{0}
end{align*}

In order to do so we consider the binary alphabet $V={0,1}$ and decode empty bins with $0$ and bins where a ball is placed with $1$.

We are looking for the number $g_m(k,n)$ of binary words of length $k$ which contain $n$ $1$'s and runs of $0$ with length at most $m-1$. The number of wanted words is
begin{align*}
f_m(k,n)=binom{k}{n}-g_m(k,n)
end{align*}

Since there is a total of $binom{k}{n}$ binary words of length $k$ with $n$ $1$'s, the probability $p_m(k,n)$ is
begin{align*}
p_m(k,n)=binom{k}{n}^{-1}f_m(k,n)
end{align*}

A generating function for the number of Smirnov words over a binary alphabet is given by
begin{align*}
left(1-frac{2z}{1+z}right)^{-1}tag{1}
end{align*}

We replace occurrences of $0$ in a Smirnov word by one up to $m-1$ zeros to generate strings having runs of $0$ with length less than $m$.
begin{align*}
zlongrightarrow z+z^2+cdots+z^{m-1}=frac{zleft(1-z^{m-1}right)}{1-z}tag{2}
end{align*}

Since there are no restrictions to the length of runs of $1$'s we replace occurrences of $1$'s in a Smirnov word by one or more $1$s.
begin{align*}
zlongrightarrow z+z^2+cdots=frac{z}{1-z}tag{3}
end{align*}

We substitue (2) and (3) in (1). Since we want to keep track of the number of zeros we put $tz$ instead of $z$ in (2). We obtain

begin{align*}
left(1- frac{frac{tzleft(1-(tz)^{m-1}right)}{1-tz}}{1+frac{tzleft(1-(tz)^{m-1}right)}{1-tz}}-frac{frac{z}{1-z}}{1+frac{z}{1-z}}right)^{-1}
&=frac{1-(tz)^m}{1-z(1+t-(tz)^{m})}tag{4}
end{align*}

Denoting with $[z^n]$ the coefficient of $z^n$ in a series we obtain from (4) the number of wanted words as
begin{align*}
color{blue}{f_m(k,n)}&=binom{k}{n}-g_m(k,n)\
&color{blue}{=binom{k}{n}-[z^kt^{k-n}]frac{1-(tz)^m}{1-z(1+t-(tz)^{m})})}tag{5}
end{align*}

begin{align*}
color{blue}{p_2(10,7)}=binom{10}{7}^{-1}f_2(10,7)=frac{64}{120}color{blue}{=frac{8}{15}=0.5dot{3}}
end{align*}

Formula of $f_m(k,n)$:

We derive a formula of $f_m(k,n)$ from (5) manually. It is convenient to use the coefficient of operator $[z^k]$ to denote the coefficient of $z^k$ in a series.

We obtain from (5)
begin{align*}
color{blue}{f_m(k,n)}&=binom{k}{n}-[z^kt^{k-n}]frac{1-(tz)^m}{1-z(1+t-(tz)^m}\
&=binom{k}{n}-[z^kt^{k-n}]sum_{j=0}^infty z^jleft(1+t-(tz)^{m}right)^jleft(1-(tz)^mright)tag{6}\
&=binom{k}{n}-[t^{k-n}]sum_{j=0}^k [z^{k-j}]left(1+t-(tz)^{m}right)^jleft(1-(tz)^mright)tag{7}\
&=binom{k}{n}-[t^{k-n}]sum_{j=0}^k [z^j]left(1+t-(tz)^{m}right)^{k-j}left(1-(tz)^mright)tag{8}\
&=-[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}[z^{mj}]left(1+t-(tz)^{m}right)^{k-mj}left(1-(tz)^mright)tag{9}\
&=-[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}[z^{mj}]sum_{l=0}^{k-mj}binom{k-mj}{l}left(1-(tz)^mright)^{l+1}t^{k-mj-l}tag{10}\
&=-[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}[z^{mj}]sum_{l=0}^{k-mj}binom{k-mj}{l}sum_{u=0}^{l+1}binom{l+1}{u}(-1)^{u}(tz)^{mu}t^{k-mj-l}tag{11}\
&=[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}sum_{l=0}^{k-mj}binom{k-mj}{l}binom{l+1}{j}(-1)^{j+1}t^{k-l}tag{12}\
&,,color{blue}{=sum_{j=1}^{lfloor k/mrfloor}binom{k-mj}{n}binom{n+1}{j}(-1)^{j+1}}tag{13}\
end{align*}
with probability
begin{align*}
color{blue}{p_m(k,n)=binom{k}{n}^{-1}f_m(k,n)}
end{align*}
in accordance with (0).

Comment:

In (6) we do a geometric series expansion.

In (7) we apply the rule $[z^{p-q}]A(z)=[z^p]z^qA(z)$. We also set the upper limit to $k$, since other terms do not contribute.

In (8) we change the order of summation $jto k-j$.

In (9) we sum over multiples of $m$ only by setting $jto mj$ since we have $z^{m}$ in the binomial expansion. We also note that $binom{k}{n}$ is canceled with the first summand $j=0$ since $[z^0t^{k-n}](1+t-(tz)^m)^k(1-(tz)^m)=[t^{k-n}](1+t)^k=binom{k}{k-n}=binom{k}{n}$.

In (10) we expand the binomial.

In (11) we expand the binomial.

In (12) we select the coefficient of $z^{mj}$ by selecting the summand with $u=j$.

In (13) we select the coefficient of $t^{k-n}$ by selecting the summand with $l=n$.

Example revisited:

By calculating OPs example ($k=10,n=7,m=2$) now with formula (13) we obtain
begin{align*}
color{blue}{f_2(10,7)}&=sum_{j=1}^5binom{10-2j}{7}binom{8}{j}(-1)^{j+1}\
&=binom{8}{7}binom{8}{1}\
&,,color{blue}{=64}
end{align*}
in accordance with the result above.

Note that here we only take the summand $j=1$, since other terms have lower index $7$ greater than the upper index $10-2j$, so that the binomial coefficients are zero.

edited Dec 23 '18 at 15:23

answered Dec 22 '18 at 19:13

Markus Scheuer

61.2k456145

$begingroup$
This is great; thank you! Also: Within the first indented sentence I had guessed correctly who the answerer was. :)
$endgroup$
– Benjamin Dickman
Dec 22 '18 at 19:25

$begingroup$
@BenjaminDickman: You're welcome and many thanks for your amusing remark. :-)
$endgroup$
– Markus Scheuer
Dec 22 '18 at 19:48

1

$begingroup$
@BenjaminDickman: I've added a derivation of an explicit formula to complete the job.
$endgroup$
– Markus Scheuer
Dec 23 '18 at 8:19

add a comment |

We show the probability $p_m(k,n)$ to randomly place $n$ balls into $k$ bins with no bin containing more than one ball, so that at least $m$ consecutive bins are empty is
begin{align*}
color{blue}{p_m(k,n)=binom{k}{n}^{-1}sum_{j=1}^{lfloor k/mrfloor}binom{k-mj}{n}binom{n+1}{j}(-1)^{j+1}}tag{0}
end{align*}

In order to do so we consider the binary alphabet $V={0,1}$ and decode empty bins with $0$ and bins where a ball is placed with $1$.

We are looking for the number $g_m(k,n)$ of binary words of length $k$ which contain $n$ $1$'s and runs of $0$ with length at most $m-1$. The number of wanted words is
begin{align*}
f_m(k,n)=binom{k}{n}-g_m(k,n)
end{align*}

Since there is a total of $binom{k}{n}$ binary words of length $k$ with $n$ $1$'s, the probability $p_m(k,n)$ is
begin{align*}
p_m(k,n)=binom{k}{n}^{-1}f_m(k,n)
end{align*}

A generating function for the number of Smirnov words over a binary alphabet is given by
begin{align*}
left(1-frac{2z}{1+z}right)^{-1}tag{1}
end{align*}

We replace occurrences of $0$ in a Smirnov word by one up to $m-1$ zeros to generate strings having runs of $0$ with length less than $m$.
begin{align*}
zlongrightarrow z+z^2+cdots+z^{m-1}=frac{zleft(1-z^{m-1}right)}{1-z}tag{2}
end{align*}

Since there are no restrictions to the length of runs of $1$'s we replace occurrences of $1$'s in a Smirnov word by one or more $1$s.
begin{align*}
zlongrightarrow z+z^2+cdots=frac{z}{1-z}tag{3}
end{align*}

We substitue (2) and (3) in (1). Since we want to keep track of the number of zeros we put $tz$ instead of $z$ in (2). We obtain

begin{align*}
left(1- frac{frac{tzleft(1-(tz)^{m-1}right)}{1-tz}}{1+frac{tzleft(1-(tz)^{m-1}right)}{1-tz}}-frac{frac{z}{1-z}}{1+frac{z}{1-z}}right)^{-1}
&=frac{1-(tz)^m}{1-z(1+t-(tz)^{m})}tag{4}
end{align*}

Denoting with $[z^n]$ the coefficient of $z^n$ in a series we obtain from (4) the number of wanted words as
begin{align*}
color{blue}{f_m(k,n)}&=binom{k}{n}-g_m(k,n)\
&color{blue}{=binom{k}{n}-[z^kt^{k-n}]frac{1-(tz)^m}{1-z(1+t-(tz)^{m})})}tag{5}
end{align*}

begin{align*}
color{blue}{p_2(10,7)}=binom{10}{7}^{-1}f_2(10,7)=frac{64}{120}color{blue}{=frac{8}{15}=0.5dot{3}}
end{align*}

Formula of $f_m(k,n)$:

We derive a formula of $f_m(k,n)$ from (5) manually. It is convenient to use the coefficient of operator $[z^k]$ to denote the coefficient of $z^k$ in a series.

We obtain from (5)
begin{align*}
color{blue}{f_m(k,n)}&=binom{k}{n}-[z^kt^{k-n}]frac{1-(tz)^m}{1-z(1+t-(tz)^m}\
&=binom{k}{n}-[z^kt^{k-n}]sum_{j=0}^infty z^jleft(1+t-(tz)^{m}right)^jleft(1-(tz)^mright)tag{6}\
&=binom{k}{n}-[t^{k-n}]sum_{j=0}^k [z^{k-j}]left(1+t-(tz)^{m}right)^jleft(1-(tz)^mright)tag{7}\
&=binom{k}{n}-[t^{k-n}]sum_{j=0}^k [z^j]left(1+t-(tz)^{m}right)^{k-j}left(1-(tz)^mright)tag{8}\
&=-[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}[z^{mj}]left(1+t-(tz)^{m}right)^{k-mj}left(1-(tz)^mright)tag{9}\
&=-[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}[z^{mj}]sum_{l=0}^{k-mj}binom{k-mj}{l}left(1-(tz)^mright)^{l+1}t^{k-mj-l}tag{10}\
&=-[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}[z^{mj}]sum_{l=0}^{k-mj}binom{k-mj}{l}sum_{u=0}^{l+1}binom{l+1}{u}(-1)^{u}(tz)^{mu}t^{k-mj-l}tag{11}\
&=[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}sum_{l=0}^{k-mj}binom{k-mj}{l}binom{l+1}{j}(-1)^{j+1}t^{k-l}tag{12}\
&,,color{blue}{=sum_{j=1}^{lfloor k/mrfloor}binom{k-mj}{n}binom{n+1}{j}(-1)^{j+1}}tag{13}\
end{align*}
with probability
begin{align*}
color{blue}{p_m(k,n)=binom{k}{n}^{-1}f_m(k,n)}
end{align*}
in accordance with (0).

Comment:

In (6) we do a geometric series expansion.

In (7) we apply the rule $[z^{p-q}]A(z)=[z^p]z^qA(z)$. We also set the upper limit to $k$, since other terms do not contribute.

In (8) we change the order of summation $jto k-j$.

In (9) we sum over multiples of $m$ only by setting $jto mj$ since we have $z^{m}$ in the binomial expansion. We also note that $binom{k}{n}$ is canceled with the first summand $j=0$ since $[z^0t^{k-n}](1+t-(tz)^m)^k(1-(tz)^m)=[t^{k-n}](1+t)^k=binom{k}{k-n}=binom{k}{n}$.

In (10) we expand the binomial.

In (11) we expand the binomial.

In (12) we select the coefficient of $z^{mj}$ by selecting the summand with $u=j$.

In (13) we select the coefficient of $t^{k-n}$ by selecting the summand with $l=n$.

Example revisited:

By calculating OPs example ($k=10,n=7,m=2$) now with formula (13) we obtain
begin{align*}
color{blue}{f_2(10,7)}&=sum_{j=1}^5binom{10-2j}{7}binom{8}{j}(-1)^{j+1}\
&=binom{8}{7}binom{8}{1}\
&,,color{blue}{=64}
end{align*}
in accordance with the result above.

Note that here we only take the summand $j=1$, since other terms have lower index $7$ greater than the upper index $10-2j$, so that the binomial coefficients are zero.

edited Dec 23 '18 at 15:23

answered Dec 22 '18 at 19:13

Markus Scheuer

61.2k456145

$begingroup$
This is great; thank you! Also: Within the first indented sentence I had guessed correctly who the answerer was. :)
$endgroup$
– Benjamin Dickman
Dec 22 '18 at 19:25

$begingroup$
@BenjaminDickman: You're welcome and many thanks for your amusing remark. :-)
$endgroup$
– Markus Scheuer
Dec 22 '18 at 19:48

1

$begingroup$
@BenjaminDickman: I've added a derivation of an explicit formula to complete the job.
$endgroup$
– Markus Scheuer
Dec 23 '18 at 8:19

add a comment |

We show the probability $p_m(k,n)$ to randomly place $n$ balls into $k$ bins with no bin containing more than one ball, so that at least $m$ consecutive bins are empty is
begin{align*}
color{blue}{p_m(k,n)=binom{k}{n}^{-1}sum_{j=1}^{lfloor k/mrfloor}binom{k-mj}{n}binom{n+1}{j}(-1)^{j+1}}tag{0}
end{align*}

In order to do so we consider the binary alphabet $V={0,1}$ and decode empty bins with $0$ and bins where a ball is placed with $1$.

We are looking for the number $g_m(k,n)$ of binary words of length $k$ which contain $n$ $1$'s and runs of $0$ with length at most $m-1$. The number of wanted words is
begin{align*}
f_m(k,n)=binom{k}{n}-g_m(k,n)
end{align*}

Since there is a total of $binom{k}{n}$ binary words of length $k$ with $n$ $1$'s, the probability $p_m(k,n)$ is
begin{align*}
p_m(k,n)=binom{k}{n}^{-1}f_m(k,n)
end{align*}

A generating function for the number of Smirnov words over a binary alphabet is given by
begin{align*}
left(1-frac{2z}{1+z}right)^{-1}tag{1}
end{align*}

We replace occurrences of $0$ in a Smirnov word by one up to $m-1$ zeros to generate strings having runs of $0$ with length less than $m$.
begin{align*}
zlongrightarrow z+z^2+cdots+z^{m-1}=frac{zleft(1-z^{m-1}right)}{1-z}tag{2}
end{align*}

Since there are no restrictions to the length of runs of $1$'s we replace occurrences of $1$'s in a Smirnov word by one or more $1$s.
begin{align*}
zlongrightarrow z+z^2+cdots=frac{z}{1-z}tag{3}
end{align*}

We substitue (2) and (3) in (1). Since we want to keep track of the number of zeros we put $tz$ instead of $z$ in (2). We obtain

begin{align*}
left(1- frac{frac{tzleft(1-(tz)^{m-1}right)}{1-tz}}{1+frac{tzleft(1-(tz)^{m-1}right)}{1-tz}}-frac{frac{z}{1-z}}{1+frac{z}{1-z}}right)^{-1}
&=frac{1-(tz)^m}{1-z(1+t-(tz)^{m})}tag{4}
end{align*}

Denoting with $[z^n]$ the coefficient of $z^n$ in a series we obtain from (4) the number of wanted words as
begin{align*}
color{blue}{f_m(k,n)}&=binom{k}{n}-g_m(k,n)\
&color{blue}{=binom{k}{n}-[z^kt^{k-n}]frac{1-(tz)^m}{1-z(1+t-(tz)^{m})})}tag{5}
end{align*}

begin{align*}
color{blue}{p_2(10,7)}=binom{10}{7}^{-1}f_2(10,7)=frac{64}{120}color{blue}{=frac{8}{15}=0.5dot{3}}
end{align*}

Formula of $f_m(k,n)$:

We derive a formula of $f_m(k,n)$ from (5) manually. It is convenient to use the coefficient of operator $[z^k]$ to denote the coefficient of $z^k$ in a series.

We obtain from (5)
begin{align*}
color{blue}{f_m(k,n)}&=binom{k}{n}-[z^kt^{k-n}]frac{1-(tz)^m}{1-z(1+t-(tz)^m}\
&=binom{k}{n}-[z^kt^{k-n}]sum_{j=0}^infty z^jleft(1+t-(tz)^{m}right)^jleft(1-(tz)^mright)tag{6}\
&=binom{k}{n}-[t^{k-n}]sum_{j=0}^k [z^{k-j}]left(1+t-(tz)^{m}right)^jleft(1-(tz)^mright)tag{7}\
&=binom{k}{n}-[t^{k-n}]sum_{j=0}^k [z^j]left(1+t-(tz)^{m}right)^{k-j}left(1-(tz)^mright)tag{8}\
&=-[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}[z^{mj}]left(1+t-(tz)^{m}right)^{k-mj}left(1-(tz)^mright)tag{9}\
&=-[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}[z^{mj}]sum_{l=0}^{k-mj}binom{k-mj}{l}left(1-(tz)^mright)^{l+1}t^{k-mj-l}tag{10}\
&=-[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}[z^{mj}]sum_{l=0}^{k-mj}binom{k-mj}{l}sum_{u=0}^{l+1}binom{l+1}{u}(-1)^{u}(tz)^{mu}t^{k-mj-l}tag{11}\
&=[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}sum_{l=0}^{k-mj}binom{k-mj}{l}binom{l+1}{j}(-1)^{j+1}t^{k-l}tag{12}\
&,,color{blue}{=sum_{j=1}^{lfloor k/mrfloor}binom{k-mj}{n}binom{n+1}{j}(-1)^{j+1}}tag{13}\
end{align*}
with probability
begin{align*}
color{blue}{p_m(k,n)=binom{k}{n}^{-1}f_m(k,n)}
end{align*}
in accordance with (0).

Comment:

In (6) we do a geometric series expansion.

In (7) we apply the rule $[z^{p-q}]A(z)=[z^p]z^qA(z)$. We also set the upper limit to $k$, since other terms do not contribute.

In (8) we change the order of summation $jto k-j$.

In (9) we sum over multiples of $m$ only by setting $jto mj$ since we have $z^{m}$ in the binomial expansion. We also note that $binom{k}{n}$ is canceled with the first summand $j=0$ since $[z^0t^{k-n}](1+t-(tz)^m)^k(1-(tz)^m)=[t^{k-n}](1+t)^k=binom{k}{k-n}=binom{k}{n}$.

In (10) we expand the binomial.

In (11) we expand the binomial.

In (12) we select the coefficient of $z^{mj}$ by selecting the summand with $u=j$.

In (13) we select the coefficient of $t^{k-n}$ by selecting the summand with $l=n$.

Example revisited:

By calculating OPs example ($k=10,n=7,m=2$) now with formula (13) we obtain
begin{align*}
color{blue}{f_2(10,7)}&=sum_{j=1}^5binom{10-2j}{7}binom{8}{j}(-1)^{j+1}\
&=binom{8}{7}binom{8}{1}\
&,,color{blue}{=64}
end{align*}
in accordance with the result above.

Note that here we only take the summand $j=1$, since other terms have lower index $7$ greater than the upper index $10-2j$, so that the binomial coefficients are zero.

edited Dec 23 '18 at 15:23

answered Dec 22 '18 at 19:13

Markus Scheuer

61.2k456145

We show the probability $p_m(k,n)$ to randomly place $n$ balls into $k$ bins with no bin containing more than one ball, so that at least $m$ consecutive bins are empty is
begin{align*}
color{blue}{p_m(k,n)=binom{k}{n}^{-1}sum_{j=1}^{lfloor k/mrfloor}binom{k-mj}{n}binom{n+1}{j}(-1)^{j+1}}tag{0}
end{align*}

In order to do so we consider the binary alphabet $V={0,1}$ and decode empty bins with $0$ and bins where a ball is placed with $1$.

We are looking for the number $g_m(k,n)$ of binary words of length $k$ which contain $n$ $1$'s and runs of $0$ with length at most $m-1$. The number of wanted words is
begin{align*}
f_m(k,n)=binom{k}{n}-g_m(k,n)
end{align*}

Since there is a total of $binom{k}{n}$ binary words of length $k$ with $n$ $1$'s, the probability $p_m(k,n)$ is
begin{align*}
p_m(k,n)=binom{k}{n}^{-1}f_m(k,n)
end{align*}

A generating function for the number of Smirnov words over a binary alphabet is given by
begin{align*}
left(1-frac{2z}{1+z}right)^{-1}tag{1}
end{align*}

We replace occurrences of $0$ in a Smirnov word by one up to $m-1$ zeros to generate strings having runs of $0$ with length less than $m$.
begin{align*}
zlongrightarrow z+z^2+cdots+z^{m-1}=frac{zleft(1-z^{m-1}right)}{1-z}tag{2}
end{align*}

Since there are no restrictions to the length of runs of $1$'s we replace occurrences of $1$'s in a Smirnov word by one or more $1$s.
begin{align*}
zlongrightarrow z+z^2+cdots=frac{z}{1-z}tag{3}
end{align*}

We substitue (2) and (3) in (1). Since we want to keep track of the number of zeros we put $tz$ instead of $z$ in (2). We obtain

begin{align*}
left(1- frac{frac{tzleft(1-(tz)^{m-1}right)}{1-tz}}{1+frac{tzleft(1-(tz)^{m-1}right)}{1-tz}}-frac{frac{z}{1-z}}{1+frac{z}{1-z}}right)^{-1}
&=frac{1-(tz)^m}{1-z(1+t-(tz)^{m})}tag{4}
end{align*}

Denoting with $[z^n]$ the coefficient of $z^n$ in a series we obtain from (4) the number of wanted words as
begin{align*}
color{blue}{f_m(k,n)}&=binom{k}{n}-g_m(k,n)\
&color{blue}{=binom{k}{n}-[z^kt^{k-n}]frac{1-(tz)^m}{1-z(1+t-(tz)^{m})})}tag{5}
end{align*}

begin{align*}
color{blue}{p_2(10,7)}=binom{10}{7}^{-1}f_2(10,7)=frac{64}{120}color{blue}{=frac{8}{15}=0.5dot{3}}
end{align*}

Formula of $f_m(k,n)$:

We derive a formula of $f_m(k,n)$ from (5) manually. It is convenient to use the coefficient of operator $[z^k]$ to denote the coefficient of $z^k$ in a series.

We obtain from (5)
begin{align*}
color{blue}{f_m(k,n)}&=binom{k}{n}-[z^kt^{k-n}]frac{1-(tz)^m}{1-z(1+t-(tz)^m}\
&=binom{k}{n}-[z^kt^{k-n}]sum_{j=0}^infty z^jleft(1+t-(tz)^{m}right)^jleft(1-(tz)^mright)tag{6}\
&=binom{k}{n}-[t^{k-n}]sum_{j=0}^k [z^{k-j}]left(1+t-(tz)^{m}right)^jleft(1-(tz)^mright)tag{7}\
&=binom{k}{n}-[t^{k-n}]sum_{j=0}^k [z^j]left(1+t-(tz)^{m}right)^{k-j}left(1-(tz)^mright)tag{8}\
&=-[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}[z^{mj}]left(1+t-(tz)^{m}right)^{k-mj}left(1-(tz)^mright)tag{9}\
&=-[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}[z^{mj}]sum_{l=0}^{k-mj}binom{k-mj}{l}left(1-(tz)^mright)^{l+1}t^{k-mj-l}tag{10}\
&=-[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}[z^{mj}]sum_{l=0}^{k-mj}binom{k-mj}{l}sum_{u=0}^{l+1}binom{l+1}{u}(-1)^{u}(tz)^{mu}t^{k-mj-l}tag{11}\
&=[t^{k-n}]sum_{j=1}^{lfloor k/mrfloor}sum_{l=0}^{k-mj}binom{k-mj}{l}binom{l+1}{j}(-1)^{j+1}t^{k-l}tag{12}\
&,,color{blue}{=sum_{j=1}^{lfloor k/mrfloor}binom{k-mj}{n}binom{n+1}{j}(-1)^{j+1}}tag{13}\
end{align*}
with probability
begin{align*}
color{blue}{p_m(k,n)=binom{k}{n}^{-1}f_m(k,n)}
end{align*}
in accordance with (0).

Comment:

In (6) we do a geometric series expansion.

In (7) we apply the rule $[z^{p-q}]A(z)=[z^p]z^qA(z)$. We also set the upper limit to $k$, since other terms do not contribute.

In (8) we change the order of summation $jto k-j$.

In (9) we sum over multiples of $m$ only by setting $jto mj$ since we have $z^{m}$ in the binomial expansion. We also note that $binom{k}{n}$ is canceled with the first summand $j=0$ since $[z^0t^{k-n}](1+t-(tz)^m)^k(1-(tz)^m)=[t^{k-n}](1+t)^k=binom{k}{k-n}=binom{k}{n}$.

In (10) we expand the binomial.

In (11) we expand the binomial.

In (12) we select the coefficient of $z^{mj}$ by selecting the summand with $u=j$.

In (13) we select the coefficient of $t^{k-n}$ by selecting the summand with $l=n$.

Example revisited:

By calculating OPs example ($k=10,n=7,m=2$) now with formula (13) we obtain
begin{align*}
color{blue}{f_2(10,7)}&=sum_{j=1}^5binom{10-2j}{7}binom{8}{j}(-1)^{j+1}\
&=binom{8}{7}binom{8}{1}\
&,,color{blue}{=64}
end{align*}
in accordance with the result above.

Note that here we only take the summand $j=1$, since other terms have lower index $7$ greater than the upper index $10-2j$, so that the binomial coefficients are zero.

edited Dec 23 '18 at 15:23

answered Dec 22 '18 at 19:13

Markus Scheuer

61.2k456145

edited Dec 23 '18 at 15:23

answered Dec 22 '18 at 19:13

Markus Scheuer

61.2k456145

answered Dec 22 '18 at 19:13

Markus Scheuer

61.2k456145

answered Dec 22 '18 at 19:13

Markus Scheuer

61.2k456145

$begingroup$
This is great; thank you! Also: Within the first indented sentence I had guessed correctly who the answerer was. :)
$endgroup$
– Benjamin Dickman
Dec 22 '18 at 19:25

$begingroup$
@BenjaminDickman: You're welcome and many thanks for your amusing remark. :-)
$endgroup$
– Markus Scheuer
Dec 22 '18 at 19:48

1

$begingroup$
@BenjaminDickman: I've added a derivation of an explicit formula to complete the job.
$endgroup$
– Markus Scheuer
Dec 23 '18 at 8:19

add a comment |

$begingroup$
This is great; thank you! Also: Within the first indented sentence I had guessed correctly who the answerer was. :)
$endgroup$
– Benjamin Dickman
Dec 22 '18 at 19:25

$begingroup$
@BenjaminDickman: You're welcome and many thanks for your amusing remark. :-)
$endgroup$
– Markus Scheuer
Dec 22 '18 at 19:48

1

$begingroup$
@BenjaminDickman: I've added a derivation of an explicit formula to complete the job.
$endgroup$
– Markus Scheuer
Dec 23 '18 at 8:19

This is great; thank you! Also: Within the first indented sentence I had guessed correctly who the answerer was. :)

– Benjamin Dickman
Dec 22 '18 at 19:25

@BenjaminDickman: You're welcome and many thanks for your amusing remark. :-)

– Markus Scheuer
Dec 22 '18 at 19:48

@BenjaminDickman: I've added a derivation of an explicit formula to complete the job.

– Markus Scheuer
Dec 23 '18 at 8:19

add a comment |

$$
x_1 + cdots + x_{n + 1} = k - n
$$

that have at least one $x_j ge m$. This is equivalent to solving the problem.

To count this quantity, use the facts that:

The number of nonnegative integer solutions to
$$
x_1 + cdots + x_k = n
$$
is
$$
binom{n + k - 1}{k - 1}text.
$$
This is a simple stars and bars argument.

The number of nonnegative integer solutions to
$$
x_1 + cdots + x_k = n
$$
with each $x_j < c$ is
$$
binom{n + k - 1}{k - 1} - sum_{i = 1}^{lfloor n / {c} rfloor} {(-1)}^{i + 1} binom{k}{i}binom{n - c i + k - 1}{k - 1}text.tag{*}label{*}
$$
To show (ref{*}), recall the complementary form of the inclusion–exclusion principle. Namely, we can count the number of nonnegative integer solutions with each $x_j < c$ as
$$
begin{align}
&& text{the number of all (unrestricted) solutions} \
&-& text{the number of solutions with one } x_j geq c \
&+& text{the number of solutions with two } x_j geq c \
&-& cdots \
&{(-1)}^{lfloor n / {c} rfloor}& text{the number of solutions with }lfloor n / {c} rfloor text{ } x_j geq ctext.
end{align}
$$
Now, how do you count the number of solutions that have $i$-many $x_j geq c$? It's quite simple; write those $x_j$ as $c + y_j$ (since we already know they're greater than or equal to $c$), and you already have a reduction to the stars and bars argument, but with $n' = n - c i$. This tells us that the number of solutions with $i$-many $x_j geq c$ is $binom{n - c i + k - 1}{k - 1}$. The $binom{k}{i}$ factor counts the number of ways to have this scenario.

edited Dec 20 '18 at 15:48

answered Dec 11 '18 at 12:00

d125q

1,6571015

1

$begingroup$
Nice answer! I think the number of variables should be $n+1$, rather than $k-n$. The variables are the gaps between the balls, and there are $n$ balls, so $n+1$ gaps.
$endgroup$
– Mike Earnest
Dec 11 '18 at 14:36

$begingroup$
Thank you; you are right. I corrected the answer and checked that it agrees with the one for $m = 2$ (math.stackexchange.com/a/3035312/112944).
$endgroup$
– d125q
Dec 11 '18 at 14:44

$begingroup$
Could you say something about the second fact? The first one looks to me like stars and bars, but is the second one... stars and bars combined with the inclusion-exclusion principle? $$ $$ If you could indicate how this works in practice for, say, $n = 6$, $k = 10$, and $m = 2$, then that would be very helpful, too. (And could compared with @MikeEarnest's answer, as well as the hand computations I've done.) Thanks!
$endgroup$
– Benjamin Dickman
Dec 11 '18 at 16:21

1

$begingroup$
@BenjaminDickman Give the inclusion exclusion a try! The idea is this; you take all of the solutions, then for each variable subtract the cases where the variable is “bad”, meaning more than $m$, then add back in the cases where two variables are bad, etc.
$endgroup$
– Mike Earnest
Dec 11 '18 at 16:45

1

$begingroup$
@BenjaminDickman I added some "proof sketches."
$endgroup$
– d125q
Dec 20 '18 at 15:48

|
show 2 more comments

$$
x_1 + cdots + x_{n + 1} = k - n
$$

that have at least one $x_j ge m$. This is equivalent to solving the problem.

To count this quantity, use the facts that:

The number of nonnegative integer solutions to
$$
x_1 + cdots + x_k = n
$$
is
$$
binom{n + k - 1}{k - 1}text.
$$
This is a simple stars and bars argument.

The number of nonnegative integer solutions to
$$
x_1 + cdots + x_k = n
$$
with each $x_j < c$ is
$$
binom{n + k - 1}{k - 1} - sum_{i = 1}^{lfloor n / {c} rfloor} {(-1)}^{i + 1} binom{k}{i}binom{n - c i + k - 1}{k - 1}text.tag{*}label{*}
$$
To show (ref{*}), recall the complementary form of the inclusion–exclusion principle. Namely, we can count the number of nonnegative integer solutions with each $x_j < c$ as
$$
begin{align}
&& text{the number of all (unrestricted) solutions} \
&-& text{the number of solutions with one } x_j geq c \
&+& text{the number of solutions with two } x_j geq c \
&-& cdots \
&{(-1)}^{lfloor n / {c} rfloor}& text{the number of solutions with }lfloor n / {c} rfloor text{ } x_j geq ctext.
end{align}
$$
Now, how do you count the number of solutions that have $i$-many $x_j geq c$? It's quite simple; write those $x_j$ as $c + y_j$ (since we already know they're greater than or equal to $c$), and you already have a reduction to the stars and bars argument, but with $n' = n - c i$. This tells us that the number of solutions with $i$-many $x_j geq c$ is $binom{n - c i + k - 1}{k - 1}$. The $binom{k}{i}$ factor counts the number of ways to have this scenario.

edited Dec 20 '18 at 15:48

answered Dec 11 '18 at 12:00

d125q

1,6571015

1

$begingroup$
Nice answer! I think the number of variables should be $n+1$, rather than $k-n$. The variables are the gaps between the balls, and there are $n$ balls, so $n+1$ gaps.
$endgroup$
– Mike Earnest
Dec 11 '18 at 14:36

$begingroup$
Thank you; you are right. I corrected the answer and checked that it agrees with the one for $m = 2$ (math.stackexchange.com/a/3035312/112944).
$endgroup$
– d125q
Dec 11 '18 at 14:44

$begingroup$
Could you say something about the second fact? The first one looks to me like stars and bars, but is the second one... stars and bars combined with the inclusion-exclusion principle? $$ $$ If you could indicate how this works in practice for, say, $n = 6$, $k = 10$, and $m = 2$, then that would be very helpful, too. (And could compared with @MikeEarnest's answer, as well as the hand computations I've done.) Thanks!
$endgroup$
– Benjamin Dickman
Dec 11 '18 at 16:21

1

$begingroup$
@BenjaminDickman Give the inclusion exclusion a try! The idea is this; you take all of the solutions, then for each variable subtract the cases where the variable is “bad”, meaning more than $m$, then add back in the cases where two variables are bad, etc.
$endgroup$
– Mike Earnest
Dec 11 '18 at 16:45

1

$begingroup$
@BenjaminDickman I added some "proof sketches."
$endgroup$
– d125q
Dec 20 '18 at 15:48

|
show 2 more comments

$$
x_1 + cdots + x_{n + 1} = k - n
$$

that have at least one $x_j ge m$. This is equivalent to solving the problem.

To count this quantity, use the facts that:

The number of nonnegative integer solutions to
$$
x_1 + cdots + x_k = n
$$
is
$$
binom{n + k - 1}{k - 1}text.
$$
This is a simple stars and bars argument.

The number of nonnegative integer solutions to
$$
x_1 + cdots + x_k = n
$$
with each $x_j < c$ is
$$
binom{n + k - 1}{k - 1} - sum_{i = 1}^{lfloor n / {c} rfloor} {(-1)}^{i + 1} binom{k}{i}binom{n - c i + k - 1}{k - 1}text.tag{*}label{*}
$$
To show (ref{*}), recall the complementary form of the inclusion–exclusion principle. Namely, we can count the number of nonnegative integer solutions with each $x_j < c$ as
$$
begin{align}
&& text{the number of all (unrestricted) solutions} \
&-& text{the number of solutions with one } x_j geq c \
&+& text{the number of solutions with two } x_j geq c \
&-& cdots \
&{(-1)}^{lfloor n / {c} rfloor}& text{the number of solutions with }lfloor n / {c} rfloor text{ } x_j geq ctext.
end{align}
$$
Now, how do you count the number of solutions that have $i$-many $x_j geq c$? It's quite simple; write those $x_j$ as $c + y_j$ (since we already know they're greater than or equal to $c$), and you already have a reduction to the stars and bars argument, but with $n' = n - c i$. This tells us that the number of solutions with $i$-many $x_j geq c$ is $binom{n - c i + k - 1}{k - 1}$. The $binom{k}{i}$ factor counts the number of ways to have this scenario.

edited Dec 20 '18 at 15:48

answered Dec 11 '18 at 12:00

d125q

1,6571015

$$
x_1 + cdots + x_{n + 1} = k - n
$$

that have at least one $x_j ge m$. This is equivalent to solving the problem.

To count this quantity, use the facts that:

The number of nonnegative integer solutions to
$$
x_1 + cdots + x_k = n
$$
is
$$
binom{n + k - 1}{k - 1}text.
$$
This is a simple stars and bars argument.

The number of nonnegative integer solutions to
$$
x_1 + cdots + x_k = n
$$
with each $x_j < c$ is
$$
binom{n + k - 1}{k - 1} - sum_{i = 1}^{lfloor n / {c} rfloor} {(-1)}^{i + 1} binom{k}{i}binom{n - c i + k - 1}{k - 1}text.tag{*}label{*}
$$
To show (ref{*}), recall the complementary form of the inclusion–exclusion principle. Namely, we can count the number of nonnegative integer solutions with each $x_j < c$ as
$$
begin{align}
&& text{the number of all (unrestricted) solutions} \
&-& text{the number of solutions with one } x_j geq c \
&+& text{the number of solutions with two } x_j geq c \
&-& cdots \
&{(-1)}^{lfloor n / {c} rfloor}& text{the number of solutions with }lfloor n / {c} rfloor text{ } x_j geq ctext.
end{align}
$$
Now, how do you count the number of solutions that have $i$-many $x_j geq c$? It's quite simple; write those $x_j$ as $c + y_j$ (since we already know they're greater than or equal to $c$), and you already have a reduction to the stars and bars argument, but with $n' = n - c i$. This tells us that the number of solutions with $i$-many $x_j geq c$ is $binom{n - c i + k - 1}{k - 1}$. The $binom{k}{i}$ factor counts the number of ways to have this scenario.

edited Dec 20 '18 at 15:48

answered Dec 11 '18 at 12:00

d125q

1,6571015

edited Dec 20 '18 at 15:48

answered Dec 11 '18 at 12:00

d125q

1,6571015

answered Dec 11 '18 at 12:00

d125q

1,6571015

answered Dec 11 '18 at 12:00

d125q

1,6571015

1

$begingroup$
Nice answer! I think the number of variables should be $n+1$, rather than $k-n$. The variables are the gaps between the balls, and there are $n$ balls, so $n+1$ gaps.
$endgroup$
– Mike Earnest
Dec 11 '18 at 14:36

$begingroup$
Thank you; you are right. I corrected the answer and checked that it agrees with the one for $m = 2$ (math.stackexchange.com/a/3035312/112944).
$endgroup$
– d125q
Dec 11 '18 at 14:44

$begingroup$
Could you say something about the second fact? The first one looks to me like stars and bars, but is the second one... stars and bars combined with the inclusion-exclusion principle? $$ $$ If you could indicate how this works in practice for, say, $n = 6$, $k = 10$, and $m = 2$, then that would be very helpful, too. (And could compared with @MikeEarnest's answer, as well as the hand computations I've done.) Thanks!
$endgroup$
– Benjamin Dickman
Dec 11 '18 at 16:21

1

$begingroup$
@BenjaminDickman Give the inclusion exclusion a try! The idea is this; you take all of the solutions, then for each variable subtract the cases where the variable is “bad”, meaning more than $m$, then add back in the cases where two variables are bad, etc.
$endgroup$
– Mike Earnest
Dec 11 '18 at 16:45

1

$begingroup$
@BenjaminDickman I added some "proof sketches."
$endgroup$
– d125q
Dec 20 '18 at 15:48

|
show 2 more comments

1

$begingroup$
Nice answer! I think the number of variables should be $n+1$, rather than $k-n$. The variables are the gaps between the balls, and there are $n$ balls, so $n+1$ gaps.
$endgroup$
– Mike Earnest
Dec 11 '18 at 14:36

$begingroup$
Thank you; you are right. I corrected the answer and checked that it agrees with the one for $m = 2$ (math.stackexchange.com/a/3035312/112944).
$endgroup$
– d125q
Dec 11 '18 at 14:44

$begingroup$
Could you say something about the second fact? The first one looks to me like stars and bars, but is the second one... stars and bars combined with the inclusion-exclusion principle? $$ $$ If you could indicate how this works in practice for, say, $n = 6$, $k = 10$, and $m = 2$, then that would be very helpful, too. (And could compared with @MikeEarnest's answer, as well as the hand computations I've done.) Thanks!
$endgroup$
– Benjamin Dickman
Dec 11 '18 at 16:21

1

$begingroup$
@BenjaminDickman Give the inclusion exclusion a try! The idea is this; you take all of the solutions, then for each variable subtract the cases where the variable is “bad”, meaning more than $m$, then add back in the cases where two variables are bad, etc.
$endgroup$
– Mike Earnest
Dec 11 '18 at 16:45

1

$begingroup$
@BenjaminDickman I added some "proof sketches."
$endgroup$
– d125q
Dec 20 '18 at 15:48

Nice answer! I think the number of variables should be $n+1$, rather than $k-n$. The variables are the gaps between the balls, and there are $n$ balls, so $n+1$ gaps.

– Mike Earnest
Dec 11 '18 at 14:36

Thank you; you are right. I corrected the answer and checked that it agrees with the one for $m = 2$ (math.stackexchange.com/a/3035312/112944).

– d125q
Dec 11 '18 at 14:44

Could you say something about the second fact? The first one looks to me like stars and bars, but is the second one... stars and bars combined with the inclusion-exclusion principle? $$ $$ If you could indicate how this works in practice for, say, $n = 6$, $k = 10$, and $m = 2$, then that would be very helpful, too. (And could compared with @MikeEarnest's answer, as well as the hand computations I've done.) Thanks!

– Benjamin Dickman
Dec 11 '18 at 16:21

@BenjaminDickman Give the inclusion exclusion a try! The idea is this; you take all of the solutions, then for each variable subtract the cases where the variable is “bad”, meaning more than $m$, then add back in the cases where two variables are bad, etc.

– Mike Earnest
Dec 11 '18 at 16:45

@BenjaminDickman I added some "proof sketches."

– d125q
Dec 20 '18 at 15:48

|
show 2 more comments

+50

This is supposed to be an integration to d125q's answer, as to better explain
how to reach to the formula he gave, but it is too long for a comment.

Then your problem is equivalent to :

given all the binary strings of length $k$, with $n$ ones (and $n-k$ zeros), how many of them
will contain one or more runs of consecutive zeros whose length is $m$ or greater ?

Now, to keep congruence with the precedent posts I am going to cite, let me change
the formulation and the parameters of the problem with the following:

Bernoulli_runs_1

Finally note that the $N_b$ are called "r-nomial coefficients" in OEIS: see e.g. Table A008287.

edited Dec 19 '18 at 17:05

answered Dec 19 '18 at 16:31

G Cab

18.9k31238

1

$begingroup$
@BenjaminDickman good that the o.g.f. is explicative. Sorry for the broken links: adjusted.
$endgroup$
– G Cab
Dec 19 '18 at 17:06

add a comment |

+50

This is supposed to be an integration to d125q's answer, as to better explain
how to reach to the formula he gave, but it is too long for a comment.

Then your problem is equivalent to :

given all the binary strings of length $k$, with $n$ ones (and $n-k$ zeros), how many of them
will contain one or more runs of consecutive zeros whose length is $m$ or greater ?

Now, to keep congruence with the precedent posts I am going to cite, let me change
the formulation and the parameters of the problem with the following:

Bernoulli_runs_1

Finally note that the $N_b$ are called "r-nomial coefficients" in OEIS: see e.g. Table A008287.

edited Dec 19 '18 at 17:05

answered Dec 19 '18 at 16:31

G Cab

18.9k31238

1

$begingroup$
@BenjaminDickman good that the o.g.f. is explicative. Sorry for the broken links: adjusted.
$endgroup$
– G Cab
Dec 19 '18 at 17:06

add a comment |

+50

This is supposed to be an integration to d125q's answer, as to better explain
how to reach to the formula he gave, but it is too long for a comment.

Then your problem is equivalent to :

given all the binary strings of length $k$, with $n$ ones (and $n-k$ zeros), how many of them
will contain one or more runs of consecutive zeros whose length is $m$ or greater ?

Now, to keep congruence with the precedent posts I am going to cite, let me change
the formulation and the parameters of the problem with the following:

Bernoulli_runs_1

Finally note that the $N_b$ are called "r-nomial coefficients" in OEIS: see e.g. Table A008287.

edited Dec 19 '18 at 17:05

answered Dec 19 '18 at 16:31

G Cab

18.9k31238

This is supposed to be an integration to d125q's answer, as to better explain
how to reach to the formula he gave, but it is too long for a comment.

Then your problem is equivalent to :

given all the binary strings of length $k$, with $n$ ones (and $n-k$ zeros), how many of them
will contain one or more runs of consecutive zeros whose length is $m$ or greater ?

Now, to keep congruence with the precedent posts I am going to cite, let me change
the formulation and the parameters of the problem with the following:

Bernoulli_runs_1

Finally note that the $N_b$ are called "r-nomial coefficients" in OEIS: see e.g. Table A008287.

edited Dec 19 '18 at 17:05

answered Dec 19 '18 at 16:31

G Cab

18.9k31238

edited Dec 19 '18 at 17:05

answered Dec 19 '18 at 16:31

G Cab

18.9k31238

answered Dec 19 '18 at 16:31

G Cab

18.9k31238

answered Dec 19 '18 at 16:31

G Cab

18.9k31238

1

$begingroup$
@BenjaminDickman good that the o.g.f. is explicative. Sorry for the broken links: adjusted.
$endgroup$
– G Cab
Dec 19 '18 at 17:06

add a comment |

1

$begingroup$
@BenjaminDickman good that the o.g.f. is explicative. Sorry for the broken links: adjusted.
$endgroup$
– G Cab
Dec 19 '18 at 17:06

@BenjaminDickman good that the o.g.f. is explicative. Sorry for the broken links: adjusted.

– G Cab
Dec 19 '18 at 17:06

add a comment |

There's a nice way to do this if $m = 2.$
I describe it here, although it does not solve the more general problem indicated in the question.

We can get a classic "random" arrangement if the bins are identifiable and the balls are not distinguishable, so I'll make those assumptions.

answered Dec 11 '18 at 13:59

David K

53.9k342116

add a comment |

There's a nice way to do this if $m = 2.$
I describe it here, although it does not solve the more general problem indicated in the question.

We can get a classic "random" arrangement if the bins are identifiable and the balls are not distinguishable, so I'll make those assumptions.

answered Dec 11 '18 at 13:59

David K

53.9k342116

add a comment |

There's a nice way to do this if $m = 2.$
I describe it here, although it does not solve the more general problem indicated in the question.

We can get a classic "random" arrangement if the bins are identifiable and the balls are not distinguishable, so I'll make those assumptions.

answered Dec 11 '18 at 13:59

David K

53.9k342116

There's a nice way to do this if $m = 2.$
I describe it here, although it does not solve the more general problem indicated in the question.

We can get a classic "random" arrangement if the bins are identifiable and the balls are not distinguishable, so I'll make those assumptions.

answered Dec 11 '18 at 13:59

David K

53.9k342116

answered Dec 11 '18 at 13:59

David K

53.9k342116

answered Dec 11 '18 at 13:59

David K

53.9k342116

answered Dec 11 '18 at 13:59

David K

53.9k342116

add a comment |

There is the possibility to calculate numerically the result with a rather simple recursive equation.

1 in the first bin. Probability $frac{n}{k}$. The subgraph corresponds to $f(k-1, n-1)$

01 in the first 2 bins: Probability $frac{k-n}{k} times frac{n}{k-1}$. The subgraph corresponds to $f(k-2, n-1)$

00 in the first 2 bins: Probability $frac{k-n}{k} times frac{k-1-n}{k-1}$ -> we get the $m$ holes.

Then, we obtain, for $m = 2$

$$f(k,n) = frac{n}{k} f(k-1, n-1) + frac{n(k-n)}{k(k-1)} f(k-2, n-1) + frac{(k-n)(k-1-n)}{k(k-1)}$$

We note in addition that $f() = 0$ if $k-n le m-1$ and that $f() = 1$ if $n=0$ and $k ge m$

This recursive formula can be generalised for higher values of $m$, although it becomes more difficult.

This is clearly not as satisfactory as a close form formula. However, it can be represented on a graph rather easily, and it is easy to program.

It may be used for secondary school students.

Note: the program gives $f(10, 7, 2) = frac{8}{15}$

EDIT:

When I arrived at this point, I suspected something ...

Is there a much simpler solution ?

I thought having found a simple formula, it was a mistake ...

EDIT 2: C++ programme for $m = 2$

#include <iostream>

#include <iomanip>



double f2(int k, int n) {       // m = 2

    if ((k - n) <= 1) return 0.0;

    if (n == 0) return 1.0;

    double res = double ((k-n)*(k-1-n)) / (k*(k-1))

        + (double (n*(k-n)) / (k*(k-1))) * f2(k-2, n-1)

        + (double (n)/k) * f2(k-1, n-1);

    return res;

}



int main() {

    int k = 10;

    int n = 7;

    double prob = f2(k, n);

    std::cout << "f = " << std::setprecision(12) << prob << "n";

    return 0;

}

                      f(10,7)=8/15

                        /      

                       /        

                      /          

                   0:3/10       1:7/10

                  SB:1/8        SB:f(9,6)=7/12                     

                  /                   |     

                 /                    |      

                /                     |       

             0:2/9      1:7/9         0:1/3     1:2/3

            SB:1.0   SB:f(8,6)=1/4   SB:13/28   SB:f(8,5)=9/14

                        /             |           |       

                       /              |           |        

                                       |     

                                     0:1/4    1:3/4

                                     SB:1.0   SB:f(7,5)=2/7

                                                  /     

                                                 /

edited Dec 20 '18 at 13:34

answered Dec 18 '18 at 21:23

Damien

60714

$begingroup$
Could you indicate how the described program outputs 8/15? I find myself not quite able to follow the description and suspect seeing those steps written out could help me!
$endgroup$
– Benjamin Dickman
Dec 19 '18 at 16:47

$begingroup$
Sorry, I was really unclear. I mentioned the easiness to write a programme . .. I wrote such a C++ (very simple) programme for $m=2$ and use it. I might post it. You need to represent what I present in a graph ...Easy to follow with such a graph in front of the eyes. ... Not so easy without
$endgroup$
– Damien
Dec 19 '18 at 17:04

$begingroup$
Including the program and a graph would be great
$endgroup$
– Benjamin Dickman
Dec 19 '18 at 20:39

1

$begingroup$
I have included the C++ programme. I will try now to draw a graph. Much easier by hand than creating a figure, without any drawing software at home ...
$endgroup$
– Damien
Dec 19 '18 at 21:49

1

$begingroup$
I tried to display a part of the graph. Hope it is clear enough
$endgroup$
– Damien
Dec 20 '18 at 13:35

add a comment |

There is the possibility to calculate numerically the result with a rather simple recursive equation.

1 in the first bin. Probability $frac{n}{k}$. The subgraph corresponds to $f(k-1, n-1)$

01 in the first 2 bins: Probability $frac{k-n}{k} times frac{n}{k-1}$. The subgraph corresponds to $f(k-2, n-1)$

00 in the first 2 bins: Probability $frac{k-n}{k} times frac{k-1-n}{k-1}$ -> we get the $m$ holes.

Then, we obtain, for $m = 2$

$$f(k,n) = frac{n}{k} f(k-1, n-1) + frac{n(k-n)}{k(k-1)} f(k-2, n-1) + frac{(k-n)(k-1-n)}{k(k-1)}$$

We note in addition that $f() = 0$ if $k-n le m-1$ and that $f() = 1$ if $n=0$ and $k ge m$

This recursive formula can be generalised for higher values of $m$, although it becomes more difficult.

This is clearly not as satisfactory as a close form formula. However, it can be represented on a graph rather easily, and it is easy to program.

It may be used for secondary school students.

Note: the program gives $f(10, 7, 2) = frac{8}{15}$

EDIT:

When I arrived at this point, I suspected something ...

Is there a much simpler solution ?

I thought having found a simple formula, it was a mistake ...

EDIT 2: C++ programme for $m = 2$

#include <iostream>

#include <iomanip>



double f2(int k, int n) {       // m = 2

    if ((k - n) <= 1) return 0.0;

    if (n == 0) return 1.0;

    double res = double ((k-n)*(k-1-n)) / (k*(k-1))

        + (double (n*(k-n)) / (k*(k-1))) * f2(k-2, n-1)

        + (double (n)/k) * f2(k-1, n-1);

    return res;

}



int main() {

    int k = 10;

    int n = 7;

    double prob = f2(k, n);

    std::cout << "f = " << std::setprecision(12) << prob << "n";

    return 0;

}

                      f(10,7)=8/15

                        /      

                       /        

                      /          

                   0:3/10       1:7/10

                  SB:1/8        SB:f(9,6)=7/12                     

                  /                   |     

                 /                    |      

                /                     |       

             0:2/9      1:7/9         0:1/3     1:2/3

            SB:1.0   SB:f(8,6)=1/4   SB:13/28   SB:f(8,5)=9/14

                        /             |           |       

                       /              |           |        

                                       |     

                                     0:1/4    1:3/4

                                     SB:1.0   SB:f(7,5)=2/7

                                                  /     

                                                 /

edited Dec 20 '18 at 13:34

answered Dec 18 '18 at 21:23

Damien

60714

$begingroup$
Could you indicate how the described program outputs 8/15? I find myself not quite able to follow the description and suspect seeing those steps written out could help me!
$endgroup$
– Benjamin Dickman
Dec 19 '18 at 16:47

$begingroup$
Sorry, I was really unclear. I mentioned the easiness to write a programme . .. I wrote such a C++ (very simple) programme for $m=2$ and use it. I might post it. You need to represent what I present in a graph ...Easy to follow with such a graph in front of the eyes. ... Not so easy without
$endgroup$
– Damien
Dec 19 '18 at 17:04

$begingroup$
Including the program and a graph would be great
$endgroup$
– Benjamin Dickman
Dec 19 '18 at 20:39

1

$begingroup$
I have included the C++ programme. I will try now to draw a graph. Much easier by hand than creating a figure, without any drawing software at home ...
$endgroup$
– Damien
Dec 19 '18 at 21:49

1

$begingroup$
I tried to display a part of the graph. Hope it is clear enough
$endgroup$
– Damien
Dec 20 '18 at 13:35

add a comment |

There is the possibility to calculate numerically the result with a rather simple recursive equation.

1 in the first bin. Probability $frac{n}{k}$. The subgraph corresponds to $f(k-1, n-1)$

01 in the first 2 bins: Probability $frac{k-n}{k} times frac{n}{k-1}$. The subgraph corresponds to $f(k-2, n-1)$

00 in the first 2 bins: Probability $frac{k-n}{k} times frac{k-1-n}{k-1}$ -> we get the $m$ holes.

Then, we obtain, for $m = 2$

$$f(k,n) = frac{n}{k} f(k-1, n-1) + frac{n(k-n)}{k(k-1)} f(k-2, n-1) + frac{(k-n)(k-1-n)}{k(k-1)}$$

We note in addition that $f() = 0$ if $k-n le m-1$ and that $f() = 1$ if $n=0$ and $k ge m$

This recursive formula can be generalised for higher values of $m$, although it becomes more difficult.

This is clearly not as satisfactory as a close form formula. However, it can be represented on a graph rather easily, and it is easy to program.

It may be used for secondary school students.

Note: the program gives $f(10, 7, 2) = frac{8}{15}$

EDIT:

When I arrived at this point, I suspected something ...

Is there a much simpler solution ?

I thought having found a simple formula, it was a mistake ...

EDIT 2: C++ programme for $m = 2$

#include <iostream>

#include <iomanip>



double f2(int k, int n) {       // m = 2

    if ((k - n) <= 1) return 0.0;

    if (n == 0) return 1.0;

    double res = double ((k-n)*(k-1-n)) / (k*(k-1))

        + (double (n*(k-n)) / (k*(k-1))) * f2(k-2, n-1)

        + (double (n)/k) * f2(k-1, n-1);

    return res;

}



int main() {

    int k = 10;

    int n = 7;

    double prob = f2(k, n);

    std::cout << "f = " << std::setprecision(12) << prob << "n";

    return 0;

}

                      f(10,7)=8/15

                        /      

                       /        

                      /          

                   0:3/10       1:7/10

                  SB:1/8        SB:f(9,6)=7/12                     

                  /                   |     

                 /                    |      

                /                     |       

             0:2/9      1:7/9         0:1/3     1:2/3

            SB:1.0   SB:f(8,6)=1/4   SB:13/28   SB:f(8,5)=9/14

                        /             |           |       

                       /              |           |        

                                       |     

                                     0:1/4    1:3/4

                                     SB:1.0   SB:f(7,5)=2/7

                                                  /     

                                                 /

edited Dec 20 '18 at 13:34

answered Dec 18 '18 at 21:23

Damien

60714

There is the possibility to calculate numerically the result with a rather simple recursive equation.

1 in the first bin. Probability $frac{n}{k}$. The subgraph corresponds to $f(k-1, n-1)$

01 in the first 2 bins: Probability $frac{k-n}{k} times frac{n}{k-1}$. The subgraph corresponds to $f(k-2, n-1)$

00 in the first 2 bins: Probability $frac{k-n}{k} times frac{k-1-n}{k-1}$ -> we get the $m$ holes.

Then, we obtain, for $m = 2$

$$f(k,n) = frac{n}{k} f(k-1, n-1) + frac{n(k-n)}{k(k-1)} f(k-2, n-1) + frac{(k-n)(k-1-n)}{k(k-1)}$$

We note in addition that $f() = 0$ if $k-n le m-1$ and that $f() = 1$ if $n=0$ and $k ge m$

This recursive formula can be generalised for higher values of $m$, although it becomes more difficult.

This is clearly not as satisfactory as a close form formula. However, it can be represented on a graph rather easily, and it is easy to program.

It may be used for secondary school students.

Note: the program gives $f(10, 7, 2) = frac{8}{15}$

EDIT:

When I arrived at this point, I suspected something ...

Is there a much simpler solution ?

I thought having found a simple formula, it was a mistake ...

EDIT 2: C++ programme for $m = 2$

#include <iostream>

#include <iomanip>



double f2(int k, int n) {       // m = 2

    if ((k - n) <= 1) return 0.0;

    if (n == 0) return 1.0;

    double res = double ((k-n)*(k-1-n)) / (k*(k-1))

        + (double (n*(k-n)) / (k*(k-1))) * f2(k-2, n-1)

        + (double (n)/k) * f2(k-1, n-1);

    return res;

}



int main() {

    int k = 10;

    int n = 7;

    double prob = f2(k, n);

    std::cout << "f = " << std::setprecision(12) << prob << "n";

    return 0;

}

                      f(10,7)=8/15

                        /      

                       /        

                      /          

                   0:3/10       1:7/10

                  SB:1/8        SB:f(9,6)=7/12                     

                  /                   |     

                 /                    |      

                /                     |       

             0:2/9      1:7/9         0:1/3     1:2/3

            SB:1.0   SB:f(8,6)=1/4   SB:13/28   SB:f(8,5)=9/14

                        /             |           |       

                       /              |           |        

                                       |     

                                     0:1/4    1:3/4

                                     SB:1.0   SB:f(7,5)=2/7

                                                  /     

                                                 /

edited Dec 20 '18 at 13:34

answered Dec 18 '18 at 21:23

Damien

60714

edited Dec 20 '18 at 13:34

answered Dec 18 '18 at 21:23

Damien

60714

answered Dec 18 '18 at 21:23

Damien

60714

answered Dec 18 '18 at 21:23

Damien

60714

$begingroup$
Could you indicate how the described program outputs 8/15? I find myself not quite able to follow the description and suspect seeing those steps written out could help me!
$endgroup$
– Benjamin Dickman
Dec 19 '18 at 16:47

$begingroup$
Sorry, I was really unclear. I mentioned the easiness to write a programme . .. I wrote such a C++ (very simple) programme for $m=2$ and use it. I might post it. You need to represent what I present in a graph ...Easy to follow with such a graph in front of the eyes. ... Not so easy without
$endgroup$
– Damien
Dec 19 '18 at 17:04

$begingroup$
Including the program and a graph would be great
$endgroup$
– Benjamin Dickman
Dec 19 '18 at 20:39

1

$begingroup$
I have included the C++ programme. I will try now to draw a graph. Much easier by hand than creating a figure, without any drawing software at home ...
$endgroup$
– Damien
Dec 19 '18 at 21:49

1

$begingroup$
I tried to display a part of the graph. Hope it is clear enough
$endgroup$
– Damien
Dec 20 '18 at 13:35

add a comment |

$begingroup$
Could you indicate how the described program outputs 8/15? I find myself not quite able to follow the description and suspect seeing those steps written out could help me!
$endgroup$
– Benjamin Dickman
Dec 19 '18 at 16:47

$begingroup$
Sorry, I was really unclear. I mentioned the easiness to write a programme . .. I wrote such a C++ (very simple) programme for $m=2$ and use it. I might post it. You need to represent what I present in a graph ...Easy to follow with such a graph in front of the eyes. ... Not so easy without
$endgroup$
– Damien
Dec 19 '18 at 17:04

$begingroup$
Including the program and a graph would be great
$endgroup$
– Benjamin Dickman
Dec 19 '18 at 20:39

1

$begingroup$
I have included the C++ programme. I will try now to draw a graph. Much easier by hand than creating a figure, without any drawing software at home ...
$endgroup$
– Damien
Dec 19 '18 at 21:49

1

$begingroup$
I tried to display a part of the graph. Hope it is clear enough
$endgroup$
– Damien
Dec 20 '18 at 13:35

Could you indicate how the described program outputs 8/15? I find myself not quite able to follow the description and suspect seeing those steps written out could help me!

– Benjamin Dickman
Dec 19 '18 at 16:47

Sorry, I was really unclear. I mentioned the easiness to write a programme . .. I wrote such a C++ (very simple) programme for $m=2$ and use it. I might post it. You need to represent what I present in a graph ...Easy to follow with such a graph in front of the eyes. ... Not so easy without

– Damien
Dec 19 '18 at 17:04

Including the program and a graph would be great

– Benjamin Dickman
Dec 19 '18 at 20:39

I have included the C++ programme. I will try now to draw a graph. Much easier by hand than creating a figure, without any drawing software at home ...

– Damien
Dec 19 '18 at 21:49

I tried to display a part of the graph. Hope it is clear enough

– Damien
Dec 20 '18 at 13:35

add a comment |

draft saved

draft discarded

Thanks for contributing an answer to Mathematics Stack Exchange!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

Use MathJax to format equations. MathJax reference.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

搜尋此網誌

Ytukyg