Famous puzzle: Girl/Boy proportion problem (Sum of infinite series)
$begingroup$
Puzzle
In a country in which people only want boys, every family continues to have children until they have a boy. If they have a girl, they have another child. If they have a boy, they stop. What is the proportion of boys to girls in the country?
My solution (not finished)
If we assume that the probability of having a girl is 50%, the set of possible cases are:
Boy (50%)
Girl, Boy (25%)
Girl, Girl, Boy (12.5%)
...
So, if we call G the number of girls that a family had and B the number of boys that a family had, we have:
$B = 1$
$P(G = x) = (1/2)^{x+1}*x$
So
$G = Sigma (1/2)^{x+1}*x$
I feel like the sum of this infinite serie is 1 and that the proportion of girls/boys in this country will be 50%, but I don't know how to prove it!
Thanks!
puzzle sequences-and-series
$endgroup$
add a comment |
$begingroup$
Puzzle
In a country in which people only want boys, every family continues to have children until they have a boy. If they have a girl, they have another child. If they have a boy, they stop. What is the proportion of boys to girls in the country?
My solution (not finished)
If we assume that the probability of having a girl is 50%, the set of possible cases are:
Boy (50%)
Girl, Boy (25%)
Girl, Girl, Boy (12.5%)
...
So, if we call G the number of girls that a family had and B the number of boys that a family had, we have:
$B = 1$
$P(G = x) = (1/2)^{x+1}*x$
So
$G = Sigma (1/2)^{x+1}*x$
I feel like the sum of this infinite serie is 1 and that the proportion of girls/boys in this country will be 50%, but I don't know how to prove it!
Thanks!
puzzle sequences-and-series
$endgroup$
8
$begingroup$
In this example, you don't need to sum the infinite series. For each birth, the probability of a boy is 50% and the probability of a girl is 50% (for the purposes of this exercise; the real world is more complicated). Therefore, the ratio for all births must be 50:50. The number of previous births in each family, and the genders of those children, is irrelevant.
$endgroup$
– Mike Scott
Feb 4 '11 at 17:28
7
$begingroup$
There is a thorough discussion of this question at mathoverflow.net/questions/17960/… for anyone who's interested.
$endgroup$
– Qiaochu Yuan
Feb 4 '11 at 17:29
1
$begingroup$
As the MathOverflow discussion Qiaochu linked to points out, your approach will get the approximately right answer but for the wrong reasons. The expected ratio of boys to girls is not necessarily equal to ratio of the expected number of boys to the expected number of girls.
$endgroup$
– Rahul
Feb 4 '11 at 23:21
$begingroup$
thebigquestions.com/2010/12/22/a-big-answer-2 why is this not the answer?
$endgroup$
– KevinDTimm
Oct 7 '14 at 21:03
$begingroup$
@MikeScott Why does then ratio is not 1 when family is finite?
$endgroup$
– Suraj Jain
Jul 29 '18 at 5:45
add a comment |
$begingroup$
Puzzle
In a country in which people only want boys, every family continues to have children until they have a boy. If they have a girl, they have another child. If they have a boy, they stop. What is the proportion of boys to girls in the country?
My solution (not finished)
If we assume that the probability of having a girl is 50%, the set of possible cases are:
Boy (50%)
Girl, Boy (25%)
Girl, Girl, Boy (12.5%)
...
So, if we call G the number of girls that a family had and B the number of boys that a family had, we have:
$B = 1$
$P(G = x) = (1/2)^{x+1}*x$
So
$G = Sigma (1/2)^{x+1}*x$
I feel like the sum of this infinite serie is 1 and that the proportion of girls/boys in this country will be 50%, but I don't know how to prove it!
Thanks!
puzzle sequences-and-series
$endgroup$
Puzzle
In a country in which people only want boys, every family continues to have children until they have a boy. If they have a girl, they have another child. If they have a boy, they stop. What is the proportion of boys to girls in the country?
My solution (not finished)
If we assume that the probability of having a girl is 50%, the set of possible cases are:
Boy (50%)
Girl, Boy (25%)
Girl, Girl, Boy (12.5%)
...
So, if we call G the number of girls that a family had and B the number of boys that a family had, we have:
$B = 1$
$P(G = x) = (1/2)^{x+1}*x$
So
$G = Sigma (1/2)^{x+1}*x$
I feel like the sum of this infinite serie is 1 and that the proportion of girls/boys in this country will be 50%, but I don't know how to prove it!
Thanks!
puzzle sequences-and-series
puzzle sequences-and-series
edited Jun 21 '15 at 19:09
Peter Woolfitt
18.6k54480
18.6k54480
asked Feb 4 '11 at 17:24
nbarraillenbarraille
163116
163116
8
$begingroup$
In this example, you don't need to sum the infinite series. For each birth, the probability of a boy is 50% and the probability of a girl is 50% (for the purposes of this exercise; the real world is more complicated). Therefore, the ratio for all births must be 50:50. The number of previous births in each family, and the genders of those children, is irrelevant.
$endgroup$
– Mike Scott
Feb 4 '11 at 17:28
7
$begingroup$
There is a thorough discussion of this question at mathoverflow.net/questions/17960/… for anyone who's interested.
$endgroup$
– Qiaochu Yuan
Feb 4 '11 at 17:29
1
$begingroup$
As the MathOverflow discussion Qiaochu linked to points out, your approach will get the approximately right answer but for the wrong reasons. The expected ratio of boys to girls is not necessarily equal to ratio of the expected number of boys to the expected number of girls.
$endgroup$
– Rahul
Feb 4 '11 at 23:21
$begingroup$
thebigquestions.com/2010/12/22/a-big-answer-2 why is this not the answer?
$endgroup$
– KevinDTimm
Oct 7 '14 at 21:03
$begingroup$
@MikeScott Why does then ratio is not 1 when family is finite?
$endgroup$
– Suraj Jain
Jul 29 '18 at 5:45
add a comment |
8
$begingroup$
In this example, you don't need to sum the infinite series. For each birth, the probability of a boy is 50% and the probability of a girl is 50% (for the purposes of this exercise; the real world is more complicated). Therefore, the ratio for all births must be 50:50. The number of previous births in each family, and the genders of those children, is irrelevant.
$endgroup$
– Mike Scott
Feb 4 '11 at 17:28
7
$begingroup$
There is a thorough discussion of this question at mathoverflow.net/questions/17960/… for anyone who's interested.
$endgroup$
– Qiaochu Yuan
Feb 4 '11 at 17:29
1
$begingroup$
As the MathOverflow discussion Qiaochu linked to points out, your approach will get the approximately right answer but for the wrong reasons. The expected ratio of boys to girls is not necessarily equal to ratio of the expected number of boys to the expected number of girls.
$endgroup$
– Rahul
Feb 4 '11 at 23:21
$begingroup$
thebigquestions.com/2010/12/22/a-big-answer-2 why is this not the answer?
$endgroup$
– KevinDTimm
Oct 7 '14 at 21:03
$begingroup$
@MikeScott Why does then ratio is not 1 when family is finite?
$endgroup$
– Suraj Jain
Jul 29 '18 at 5:45
8
8
$begingroup$
In this example, you don't need to sum the infinite series. For each birth, the probability of a boy is 50% and the probability of a girl is 50% (for the purposes of this exercise; the real world is more complicated). Therefore, the ratio for all births must be 50:50. The number of previous births in each family, and the genders of those children, is irrelevant.
$endgroup$
– Mike Scott
Feb 4 '11 at 17:28
$begingroup$
In this example, you don't need to sum the infinite series. For each birth, the probability of a boy is 50% and the probability of a girl is 50% (for the purposes of this exercise; the real world is more complicated). Therefore, the ratio for all births must be 50:50. The number of previous births in each family, and the genders of those children, is irrelevant.
$endgroup$
– Mike Scott
Feb 4 '11 at 17:28
7
7
$begingroup$
There is a thorough discussion of this question at mathoverflow.net/questions/17960/… for anyone who's interested.
$endgroup$
– Qiaochu Yuan
Feb 4 '11 at 17:29
$begingroup$
There is a thorough discussion of this question at mathoverflow.net/questions/17960/… for anyone who's interested.
$endgroup$
– Qiaochu Yuan
Feb 4 '11 at 17:29
1
1
$begingroup$
As the MathOverflow discussion Qiaochu linked to points out, your approach will get the approximately right answer but for the wrong reasons. The expected ratio of boys to girls is not necessarily equal to ratio of the expected number of boys to the expected number of girls.
$endgroup$
– Rahul
Feb 4 '11 at 23:21
$begingroup$
As the MathOverflow discussion Qiaochu linked to points out, your approach will get the approximately right answer but for the wrong reasons. The expected ratio of boys to girls is not necessarily equal to ratio of the expected number of boys to the expected number of girls.
$endgroup$
– Rahul
Feb 4 '11 at 23:21
$begingroup$
thebigquestions.com/2010/12/22/a-big-answer-2 why is this not the answer?
$endgroup$
– KevinDTimm
Oct 7 '14 at 21:03
$begingroup$
thebigquestions.com/2010/12/22/a-big-answer-2 why is this not the answer?
$endgroup$
– KevinDTimm
Oct 7 '14 at 21:03
$begingroup$
@MikeScott Why does then ratio is not 1 when family is finite?
$endgroup$
– Suraj Jain
Jul 29 '18 at 5:45
$begingroup$
@MikeScott Why does then ratio is not 1 when family is finite?
$endgroup$
– Suraj Jain
Jul 29 '18 at 5:45
add a comment |
7 Answers
7
active
oldest
votes
$begingroup$
Mike Scott is correct that you don't need to sum the series, but suppose you want to. Each family has 1 boy-that is easy. Each family has 50% chance of no girls, 25% chance of 1, etc. So the average number of girls is $$sum_{i=0}^infty frac{i}{2^{i+1}}$$ The way to sum this is to remember that $$sum_{i=0}^infty a^{-i} = frac{1}{1-1/a}=frac a{a-1}$$ Now if you take the derivative with respect to a and evaluate it at a=2 So $$frac{d}{da}sum_{i=0}^infty a^{-i}
\=sum_{i=0}^infty{-i}a^{-(i+1)}
\=frac{d}{da} frac{1}{1-1/a}
\=frac d{da}frac a{a-1}
\=frac {a-1-a}{(a-1)^2}
\=frac{-1}{(a-1)^2}=-1$$ for $a=2$. So there is an average of one girl per family as well
$endgroup$
$begingroup$
You've got an even number of sign mistakes that cancel each other out. The derivative is going to have a negative sign for each term because the exponent is negative. Same for the derivative of the sum, it's clearly a descending function so the derivative should be negative. I think your answer would benefit from showing more steps of the calculation, in particular how the derivative relates to the first expression. That would also have made this mistake more obvious.
$endgroup$
– Dici
Nov 24 '18 at 19:39
add a comment |
$begingroup$
This is a trick question!
This question is very simple if you just learn to accept that most of the information given is completely irrelevant...
It doesnt matter how many families continue to have children and how many stop at 1 or 2...its no more relevant than what car they drive...
None of the information provided alters the statistical probability of a child born being male or female...its still 50%
$endgroup$
add a comment |
$begingroup$
Google and other interviewing companies made an important assumption — that there are infinitely many families. Under this assumption, we have their clever and simple answer: In expectation, there are as many boys as girls.
However, it turns out that the answer depends on the number of families there are in the country. If there is a finite number of families, then in expectation, there'll be more boys than girls.
Consider the extreme scenario where there's only one family, then 1/2 the time the fraction of girls is 0 (B in our only family), 1/4 the time it's 1/2 (GB), 1/8 the time it's 2/3 (GGB), 1/16 the time it's 3/4 (GGGB) etc. And so the expected fraction of girls is:
$$frac{1}{2}times 0+frac{1}{4}times frac{1}{2}+frac{1}{8}times frac{2}{3}+frac{1}{16}times frac{3}{4}+dots = 1 - ln 2 approx 0.307.$$
If there are 4 families, the expected fraction of girls in the country is about 0.439; and if there are 10, it's about 0.475.
See this Math.Overflow answer for more details.
(By the way, this puzzle also appeared in Thomas Schelling's Micromotives and Macrobehavior, 1978, p. 72.)
$endgroup$
add a comment |
$begingroup$
If the probability is 50% then we can expect from four families to typically have:
B B,
B G,
G B,
G G
However because families stop when they have a boy, we do not gain the boy from the first row or the girl from the second row:
B B,
B G,
Therefore we actually get three girls and three boys.
From 8 families we would expect:
B B B,
B B G,
B G B,
G B B,
B G G,
G B G,
G G B,
G G G
and we would actually keep:
B,
B,
B,
GB,
B,
GB,
GGB,
GGG,
for a total of 7 G and 7 B.
I expect proportion to remain 50% for each expected outcome from larger numbers of families but haven't proved anything as such. This is a lot simpler though and doesn't require summation notation or any advanced maths.
$endgroup$
add a comment |
$begingroup$
The ratio is 1:1. Your summing of an infinite series provides the result of the chance that there is a boy in the family (theoretically 1 because births are repeated infinitely until a boy is born); as you approach 1 in your answer to this, you approach zero in the chance that there are no girls in the family. This is all this sum actually tells us about the families.
$endgroup$
add a comment |
$begingroup$
When I first heard this puzzle (not here), I was convinced that you would end up with more girls. My intuition was of course wrong. I love the proof that this is not the case, but for those that enjoy a demonstration to enhance the intuition, this is some python code I wrote to convince myself that this is indeed the case.
import random
class family():
boys = 0
girls = 0
def haveKids(self):
while not self.boys:
if random.randint(0, 1):
self.boys += 1
else:
self.girls += 1
population = [family() for f in range(1000000)]
map(lambda f: f.haveKids(), population)
boys = sum(map(lambda f: f.boys, population))
girls = sum(map(lambda f: f.girls, population))
print "boys:", boys, "girls:", girls
print "difference:", abs(boys - girls)
print "percentage difference: %" + str(abs(float(boys - girls)/(boys + girls)))
This simulates a population of 1 million families. The difference between the number of girls and boys is indeed nominal. The percentage difference is typically around 0.001% for 1,000,000 families.
$endgroup$
add a comment |
$begingroup$
Let's model the births as "rounds" of childbirthing. All families have their first child on round 1, their second (if applicable) on round 2, etc.
With 10 families, the first children had may be:
$B, G, G, B, B, G, B, B, B, G$
And will be around 50% boys and 50% girls.
Now, let's look at the second round of births. Of course, only the families who did not have a boy will have a second child.
$
text{First round: }space space space space B, G, G, B, B, G, B, B, B, G
\ text{Second round: } space space space space space B, G,space space space space space space space space space space B, space space space space space space space space space space space space space space space G
$
(Someone who knows LaTeX please format the above in a non-horrendous way)
Note that in the second round, again, 50% of the births are girls and 50% are boys. Since this will continue for each round, 50% of births being boys and 50% being girls (on average), the grand total will also be 50-50.
$endgroup$
add a comment |
protected by Alex M. Dec 31 '16 at 16:22
Thank you for your interest in this question.
Because it has attracted low-quality or spam answers that had to be removed, posting an answer now requires 10 reputation on this site (the association bonus does not count).
Would you like to answer one of these unanswered questions instead?
7 Answers
7
active
oldest
votes
7 Answers
7
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
Mike Scott is correct that you don't need to sum the series, but suppose you want to. Each family has 1 boy-that is easy. Each family has 50% chance of no girls, 25% chance of 1, etc. So the average number of girls is $$sum_{i=0}^infty frac{i}{2^{i+1}}$$ The way to sum this is to remember that $$sum_{i=0}^infty a^{-i} = frac{1}{1-1/a}=frac a{a-1}$$ Now if you take the derivative with respect to a and evaluate it at a=2 So $$frac{d}{da}sum_{i=0}^infty a^{-i}
\=sum_{i=0}^infty{-i}a^{-(i+1)}
\=frac{d}{da} frac{1}{1-1/a}
\=frac d{da}frac a{a-1}
\=frac {a-1-a}{(a-1)^2}
\=frac{-1}{(a-1)^2}=-1$$ for $a=2$. So there is an average of one girl per family as well
$endgroup$
$begingroup$
You've got an even number of sign mistakes that cancel each other out. The derivative is going to have a negative sign for each term because the exponent is negative. Same for the derivative of the sum, it's clearly a descending function so the derivative should be negative. I think your answer would benefit from showing more steps of the calculation, in particular how the derivative relates to the first expression. That would also have made this mistake more obvious.
$endgroup$
– Dici
Nov 24 '18 at 19:39
add a comment |
$begingroup$
Mike Scott is correct that you don't need to sum the series, but suppose you want to. Each family has 1 boy-that is easy. Each family has 50% chance of no girls, 25% chance of 1, etc. So the average number of girls is $$sum_{i=0}^infty frac{i}{2^{i+1}}$$ The way to sum this is to remember that $$sum_{i=0}^infty a^{-i} = frac{1}{1-1/a}=frac a{a-1}$$ Now if you take the derivative with respect to a and evaluate it at a=2 So $$frac{d}{da}sum_{i=0}^infty a^{-i}
\=sum_{i=0}^infty{-i}a^{-(i+1)}
\=frac{d}{da} frac{1}{1-1/a}
\=frac d{da}frac a{a-1}
\=frac {a-1-a}{(a-1)^2}
\=frac{-1}{(a-1)^2}=-1$$ for $a=2$. So there is an average of one girl per family as well
$endgroup$
$begingroup$
You've got an even number of sign mistakes that cancel each other out. The derivative is going to have a negative sign for each term because the exponent is negative. Same for the derivative of the sum, it's clearly a descending function so the derivative should be negative. I think your answer would benefit from showing more steps of the calculation, in particular how the derivative relates to the first expression. That would also have made this mistake more obvious.
$endgroup$
– Dici
Nov 24 '18 at 19:39
add a comment |
$begingroup$
Mike Scott is correct that you don't need to sum the series, but suppose you want to. Each family has 1 boy-that is easy. Each family has 50% chance of no girls, 25% chance of 1, etc. So the average number of girls is $$sum_{i=0}^infty frac{i}{2^{i+1}}$$ The way to sum this is to remember that $$sum_{i=0}^infty a^{-i} = frac{1}{1-1/a}=frac a{a-1}$$ Now if you take the derivative with respect to a and evaluate it at a=2 So $$frac{d}{da}sum_{i=0}^infty a^{-i}
\=sum_{i=0}^infty{-i}a^{-(i+1)}
\=frac{d}{da} frac{1}{1-1/a}
\=frac d{da}frac a{a-1}
\=frac {a-1-a}{(a-1)^2}
\=frac{-1}{(a-1)^2}=-1$$ for $a=2$. So there is an average of one girl per family as well
$endgroup$
Mike Scott is correct that you don't need to sum the series, but suppose you want to. Each family has 1 boy-that is easy. Each family has 50% chance of no girls, 25% chance of 1, etc. So the average number of girls is $$sum_{i=0}^infty frac{i}{2^{i+1}}$$ The way to sum this is to remember that $$sum_{i=0}^infty a^{-i} = frac{1}{1-1/a}=frac a{a-1}$$ Now if you take the derivative with respect to a and evaluate it at a=2 So $$frac{d}{da}sum_{i=0}^infty a^{-i}
\=sum_{i=0}^infty{-i}a^{-(i+1)}
\=frac{d}{da} frac{1}{1-1/a}
\=frac d{da}frac a{a-1}
\=frac {a-1-a}{(a-1)^2}
\=frac{-1}{(a-1)^2}=-1$$ for $a=2$. So there is an average of one girl per family as well
edited Dec 27 '18 at 14:19
answered Feb 4 '11 at 17:39
Ross MillikanRoss Millikan
299k24200374
299k24200374
$begingroup$
You've got an even number of sign mistakes that cancel each other out. The derivative is going to have a negative sign for each term because the exponent is negative. Same for the derivative of the sum, it's clearly a descending function so the derivative should be negative. I think your answer would benefit from showing more steps of the calculation, in particular how the derivative relates to the first expression. That would also have made this mistake more obvious.
$endgroup$
– Dici
Nov 24 '18 at 19:39
add a comment |
$begingroup$
You've got an even number of sign mistakes that cancel each other out. The derivative is going to have a negative sign for each term because the exponent is negative. Same for the derivative of the sum, it's clearly a descending function so the derivative should be negative. I think your answer would benefit from showing more steps of the calculation, in particular how the derivative relates to the first expression. That would also have made this mistake more obvious.
$endgroup$
– Dici
Nov 24 '18 at 19:39
$begingroup$
You've got an even number of sign mistakes that cancel each other out. The derivative is going to have a negative sign for each term because the exponent is negative. Same for the derivative of the sum, it's clearly a descending function so the derivative should be negative. I think your answer would benefit from showing more steps of the calculation, in particular how the derivative relates to the first expression. That would also have made this mistake more obvious.
$endgroup$
– Dici
Nov 24 '18 at 19:39
$begingroup$
You've got an even number of sign mistakes that cancel each other out. The derivative is going to have a negative sign for each term because the exponent is negative. Same for the derivative of the sum, it's clearly a descending function so the derivative should be negative. I think your answer would benefit from showing more steps of the calculation, in particular how the derivative relates to the first expression. That would also have made this mistake more obvious.
$endgroup$
– Dici
Nov 24 '18 at 19:39
add a comment |
$begingroup$
This is a trick question!
This question is very simple if you just learn to accept that most of the information given is completely irrelevant...
It doesnt matter how many families continue to have children and how many stop at 1 or 2...its no more relevant than what car they drive...
None of the information provided alters the statistical probability of a child born being male or female...its still 50%
$endgroup$
add a comment |
$begingroup$
This is a trick question!
This question is very simple if you just learn to accept that most of the information given is completely irrelevant...
It doesnt matter how many families continue to have children and how many stop at 1 or 2...its no more relevant than what car they drive...
None of the information provided alters the statistical probability of a child born being male or female...its still 50%
$endgroup$
add a comment |
$begingroup$
This is a trick question!
This question is very simple if you just learn to accept that most of the information given is completely irrelevant...
It doesnt matter how many families continue to have children and how many stop at 1 or 2...its no more relevant than what car they drive...
None of the information provided alters the statistical probability of a child born being male or female...its still 50%
$endgroup$
This is a trick question!
This question is very simple if you just learn to accept that most of the information given is completely irrelevant...
It doesnt matter how many families continue to have children and how many stop at 1 or 2...its no more relevant than what car they drive...
None of the information provided alters the statistical probability of a child born being male or female...its still 50%
answered Jun 21 '13 at 5:01
tomtom
9111
9111
add a comment |
add a comment |
$begingroup$
Google and other interviewing companies made an important assumption — that there are infinitely many families. Under this assumption, we have their clever and simple answer: In expectation, there are as many boys as girls.
However, it turns out that the answer depends on the number of families there are in the country. If there is a finite number of families, then in expectation, there'll be more boys than girls.
Consider the extreme scenario where there's only one family, then 1/2 the time the fraction of girls is 0 (B in our only family), 1/4 the time it's 1/2 (GB), 1/8 the time it's 2/3 (GGB), 1/16 the time it's 3/4 (GGGB) etc. And so the expected fraction of girls is:
$$frac{1}{2}times 0+frac{1}{4}times frac{1}{2}+frac{1}{8}times frac{2}{3}+frac{1}{16}times frac{3}{4}+dots = 1 - ln 2 approx 0.307.$$
If there are 4 families, the expected fraction of girls in the country is about 0.439; and if there are 10, it's about 0.475.
See this Math.Overflow answer for more details.
(By the way, this puzzle also appeared in Thomas Schelling's Micromotives and Macrobehavior, 1978, p. 72.)
$endgroup$
add a comment |
$begingroup$
Google and other interviewing companies made an important assumption — that there are infinitely many families. Under this assumption, we have their clever and simple answer: In expectation, there are as many boys as girls.
However, it turns out that the answer depends on the number of families there are in the country. If there is a finite number of families, then in expectation, there'll be more boys than girls.
Consider the extreme scenario where there's only one family, then 1/2 the time the fraction of girls is 0 (B in our only family), 1/4 the time it's 1/2 (GB), 1/8 the time it's 2/3 (GGB), 1/16 the time it's 3/4 (GGGB) etc. And so the expected fraction of girls is:
$$frac{1}{2}times 0+frac{1}{4}times frac{1}{2}+frac{1}{8}times frac{2}{3}+frac{1}{16}times frac{3}{4}+dots = 1 - ln 2 approx 0.307.$$
If there are 4 families, the expected fraction of girls in the country is about 0.439; and if there are 10, it's about 0.475.
See this Math.Overflow answer for more details.
(By the way, this puzzle also appeared in Thomas Schelling's Micromotives and Macrobehavior, 1978, p. 72.)
$endgroup$
add a comment |
$begingroup$
Google and other interviewing companies made an important assumption — that there are infinitely many families. Under this assumption, we have their clever and simple answer: In expectation, there are as many boys as girls.
However, it turns out that the answer depends on the number of families there are in the country. If there is a finite number of families, then in expectation, there'll be more boys than girls.
Consider the extreme scenario where there's only one family, then 1/2 the time the fraction of girls is 0 (B in our only family), 1/4 the time it's 1/2 (GB), 1/8 the time it's 2/3 (GGB), 1/16 the time it's 3/4 (GGGB) etc. And so the expected fraction of girls is:
$$frac{1}{2}times 0+frac{1}{4}times frac{1}{2}+frac{1}{8}times frac{2}{3}+frac{1}{16}times frac{3}{4}+dots = 1 - ln 2 approx 0.307.$$
If there are 4 families, the expected fraction of girls in the country is about 0.439; and if there are 10, it's about 0.475.
See this Math.Overflow answer for more details.
(By the way, this puzzle also appeared in Thomas Schelling's Micromotives and Macrobehavior, 1978, p. 72.)
$endgroup$
Google and other interviewing companies made an important assumption — that there are infinitely many families. Under this assumption, we have their clever and simple answer: In expectation, there are as many boys as girls.
However, it turns out that the answer depends on the number of families there are in the country. If there is a finite number of families, then in expectation, there'll be more boys than girls.
Consider the extreme scenario where there's only one family, then 1/2 the time the fraction of girls is 0 (B in our only family), 1/4 the time it's 1/2 (GB), 1/8 the time it's 2/3 (GGB), 1/16 the time it's 3/4 (GGGB) etc. And so the expected fraction of girls is:
$$frac{1}{2}times 0+frac{1}{4}times frac{1}{2}+frac{1}{8}times frac{2}{3}+frac{1}{16}times frac{3}{4}+dots = 1 - ln 2 approx 0.307.$$
If there are 4 families, the expected fraction of girls in the country is about 0.439; and if there are 10, it's about 0.475.
See this Math.Overflow answer for more details.
(By the way, this puzzle also appeared in Thomas Schelling's Micromotives and Macrobehavior, 1978, p. 72.)
edited Feb 5 at 5:55
answered Aug 4 '17 at 3:20
user20311user20311
1805
1805
add a comment |
add a comment |
$begingroup$
If the probability is 50% then we can expect from four families to typically have:
B B,
B G,
G B,
G G
However because families stop when they have a boy, we do not gain the boy from the first row or the girl from the second row:
B B,
B G,
Therefore we actually get three girls and three boys.
From 8 families we would expect:
B B B,
B B G,
B G B,
G B B,
B G G,
G B G,
G G B,
G G G
and we would actually keep:
B,
B,
B,
GB,
B,
GB,
GGB,
GGG,
for a total of 7 G and 7 B.
I expect proportion to remain 50% for each expected outcome from larger numbers of families but haven't proved anything as such. This is a lot simpler though and doesn't require summation notation or any advanced maths.
$endgroup$
add a comment |
$begingroup$
If the probability is 50% then we can expect from four families to typically have:
B B,
B G,
G B,
G G
However because families stop when they have a boy, we do not gain the boy from the first row or the girl from the second row:
B B,
B G,
Therefore we actually get three girls and three boys.
From 8 families we would expect:
B B B,
B B G,
B G B,
G B B,
B G G,
G B G,
G G B,
G G G
and we would actually keep:
B,
B,
B,
GB,
B,
GB,
GGB,
GGG,
for a total of 7 G and 7 B.
I expect proportion to remain 50% for each expected outcome from larger numbers of families but haven't proved anything as such. This is a lot simpler though and doesn't require summation notation or any advanced maths.
$endgroup$
add a comment |
$begingroup$
If the probability is 50% then we can expect from four families to typically have:
B B,
B G,
G B,
G G
However because families stop when they have a boy, we do not gain the boy from the first row or the girl from the second row:
B B,
B G,
Therefore we actually get three girls and three boys.
From 8 families we would expect:
B B B,
B B G,
B G B,
G B B,
B G G,
G B G,
G G B,
G G G
and we would actually keep:
B,
B,
B,
GB,
B,
GB,
GGB,
GGG,
for a total of 7 G and 7 B.
I expect proportion to remain 50% for each expected outcome from larger numbers of families but haven't proved anything as such. This is a lot simpler though and doesn't require summation notation or any advanced maths.
$endgroup$
If the probability is 50% then we can expect from four families to typically have:
B B,
B G,
G B,
G G
However because families stop when they have a boy, we do not gain the boy from the first row or the girl from the second row:
B B,
B G,
Therefore we actually get three girls and three boys.
From 8 families we would expect:
B B B,
B B G,
B G B,
G B B,
B G G,
G B G,
G G B,
G G G
and we would actually keep:
B,
B,
B,
GB,
B,
GB,
GGB,
GGG,
for a total of 7 G and 7 B.
I expect proportion to remain 50% for each expected outcome from larger numbers of families but haven't proved anything as such. This is a lot simpler though and doesn't require summation notation or any advanced maths.
answered Feb 12 '15 at 23:34
Raynard DickersonRaynard Dickerson
211
211
add a comment |
add a comment |
$begingroup$
The ratio is 1:1. Your summing of an infinite series provides the result of the chance that there is a boy in the family (theoretically 1 because births are repeated infinitely until a boy is born); as you approach 1 in your answer to this, you approach zero in the chance that there are no girls in the family. This is all this sum actually tells us about the families.
$endgroup$
add a comment |
$begingroup$
The ratio is 1:1. Your summing of an infinite series provides the result of the chance that there is a boy in the family (theoretically 1 because births are repeated infinitely until a boy is born); as you approach 1 in your answer to this, you approach zero in the chance that there are no girls in the family. This is all this sum actually tells us about the families.
$endgroup$
add a comment |
$begingroup$
The ratio is 1:1. Your summing of an infinite series provides the result of the chance that there is a boy in the family (theoretically 1 because births are repeated infinitely until a boy is born); as you approach 1 in your answer to this, you approach zero in the chance that there are no girls in the family. This is all this sum actually tells us about the families.
$endgroup$
The ratio is 1:1. Your summing of an infinite series provides the result of the chance that there is a boy in the family (theoretically 1 because births are repeated infinitely until a boy is born); as you approach 1 in your answer to this, you approach zero in the chance that there are no girls in the family. This is all this sum actually tells us about the families.
answered Aug 22 '15 at 1:20
DodoDodo
1507
1507
add a comment |
add a comment |
$begingroup$
When I first heard this puzzle (not here), I was convinced that you would end up with more girls. My intuition was of course wrong. I love the proof that this is not the case, but for those that enjoy a demonstration to enhance the intuition, this is some python code I wrote to convince myself that this is indeed the case.
import random
class family():
boys = 0
girls = 0
def haveKids(self):
while not self.boys:
if random.randint(0, 1):
self.boys += 1
else:
self.girls += 1
population = [family() for f in range(1000000)]
map(lambda f: f.haveKids(), population)
boys = sum(map(lambda f: f.boys, population))
girls = sum(map(lambda f: f.girls, population))
print "boys:", boys, "girls:", girls
print "difference:", abs(boys - girls)
print "percentage difference: %" + str(abs(float(boys - girls)/(boys + girls)))
This simulates a population of 1 million families. The difference between the number of girls and boys is indeed nominal. The percentage difference is typically around 0.001% for 1,000,000 families.
$endgroup$
add a comment |
$begingroup$
When I first heard this puzzle (not here), I was convinced that you would end up with more girls. My intuition was of course wrong. I love the proof that this is not the case, but for those that enjoy a demonstration to enhance the intuition, this is some python code I wrote to convince myself that this is indeed the case.
import random
class family():
boys = 0
girls = 0
def haveKids(self):
while not self.boys:
if random.randint(0, 1):
self.boys += 1
else:
self.girls += 1
population = [family() for f in range(1000000)]
map(lambda f: f.haveKids(), population)
boys = sum(map(lambda f: f.boys, population))
girls = sum(map(lambda f: f.girls, population))
print "boys:", boys, "girls:", girls
print "difference:", abs(boys - girls)
print "percentage difference: %" + str(abs(float(boys - girls)/(boys + girls)))
This simulates a population of 1 million families. The difference between the number of girls and boys is indeed nominal. The percentage difference is typically around 0.001% for 1,000,000 families.
$endgroup$
add a comment |
$begingroup$
When I first heard this puzzle (not here), I was convinced that you would end up with more girls. My intuition was of course wrong. I love the proof that this is not the case, but for those that enjoy a demonstration to enhance the intuition, this is some python code I wrote to convince myself that this is indeed the case.
import random
class family():
boys = 0
girls = 0
def haveKids(self):
while not self.boys:
if random.randint(0, 1):
self.boys += 1
else:
self.girls += 1
population = [family() for f in range(1000000)]
map(lambda f: f.haveKids(), population)
boys = sum(map(lambda f: f.boys, population))
girls = sum(map(lambda f: f.girls, population))
print "boys:", boys, "girls:", girls
print "difference:", abs(boys - girls)
print "percentage difference: %" + str(abs(float(boys - girls)/(boys + girls)))
This simulates a population of 1 million families. The difference between the number of girls and boys is indeed nominal. The percentage difference is typically around 0.001% for 1,000,000 families.
$endgroup$
When I first heard this puzzle (not here), I was convinced that you would end up with more girls. My intuition was of course wrong. I love the proof that this is not the case, but for those that enjoy a demonstration to enhance the intuition, this is some python code I wrote to convince myself that this is indeed the case.
import random
class family():
boys = 0
girls = 0
def haveKids(self):
while not self.boys:
if random.randint(0, 1):
self.boys += 1
else:
self.girls += 1
population = [family() for f in range(1000000)]
map(lambda f: f.haveKids(), population)
boys = sum(map(lambda f: f.boys, population))
girls = sum(map(lambda f: f.girls, population))
print "boys:", boys, "girls:", girls
print "difference:", abs(boys - girls)
print "percentage difference: %" + str(abs(float(boys - girls)/(boys + girls)))
This simulates a population of 1 million families. The difference between the number of girls and boys is indeed nominal. The percentage difference is typically around 0.001% for 1,000,000 families.
answered Aug 21 '15 at 22:06
StephenStephen
18937
18937
add a comment |
add a comment |
$begingroup$
Let's model the births as "rounds" of childbirthing. All families have their first child on round 1, their second (if applicable) on round 2, etc.
With 10 families, the first children had may be:
$B, G, G, B, B, G, B, B, B, G$
And will be around 50% boys and 50% girls.
Now, let's look at the second round of births. Of course, only the families who did not have a boy will have a second child.
$
text{First round: }space space space space B, G, G, B, B, G, B, B, B, G
\ text{Second round: } space space space space space B, G,space space space space space space space space space space B, space space space space space space space space space space space space space space space G
$
(Someone who knows LaTeX please format the above in a non-horrendous way)
Note that in the second round, again, 50% of the births are girls and 50% are boys. Since this will continue for each round, 50% of births being boys and 50% being girls (on average), the grand total will also be 50-50.
$endgroup$
add a comment |
$begingroup$
Let's model the births as "rounds" of childbirthing. All families have their first child on round 1, their second (if applicable) on round 2, etc.
With 10 families, the first children had may be:
$B, G, G, B, B, G, B, B, B, G$
And will be around 50% boys and 50% girls.
Now, let's look at the second round of births. Of course, only the families who did not have a boy will have a second child.
$
text{First round: }space space space space B, G, G, B, B, G, B, B, B, G
\ text{Second round: } space space space space space B, G,space space space space space space space space space space B, space space space space space space space space space space space space space space space G
$
(Someone who knows LaTeX please format the above in a non-horrendous way)
Note that in the second round, again, 50% of the births are girls and 50% are boys. Since this will continue for each round, 50% of births being boys and 50% being girls (on average), the grand total will also be 50-50.
$endgroup$
add a comment |
$begingroup$
Let's model the births as "rounds" of childbirthing. All families have their first child on round 1, their second (if applicable) on round 2, etc.
With 10 families, the first children had may be:
$B, G, G, B, B, G, B, B, B, G$
And will be around 50% boys and 50% girls.
Now, let's look at the second round of births. Of course, only the families who did not have a boy will have a second child.
$
text{First round: }space space space space B, G, G, B, B, G, B, B, B, G
\ text{Second round: } space space space space space B, G,space space space space space space space space space space B, space space space space space space space space space space space space space space space G
$
(Someone who knows LaTeX please format the above in a non-horrendous way)
Note that in the second round, again, 50% of the births are girls and 50% are boys. Since this will continue for each round, 50% of births being boys and 50% being girls (on average), the grand total will also be 50-50.
$endgroup$
Let's model the births as "rounds" of childbirthing. All families have their first child on round 1, their second (if applicable) on round 2, etc.
With 10 families, the first children had may be:
$B, G, G, B, B, G, B, B, B, G$
And will be around 50% boys and 50% girls.
Now, let's look at the second round of births. Of course, only the families who did not have a boy will have a second child.
$
text{First round: }space space space space B, G, G, B, B, G, B, B, B, G
\ text{Second round: } space space space space space B, G,space space space space space space space space space space B, space space space space space space space space space space space space space space space G
$
(Someone who knows LaTeX please format the above in a non-horrendous way)
Note that in the second round, again, 50% of the births are girls and 50% are boys. Since this will continue for each round, 50% of births being boys and 50% being girls (on average), the grand total will also be 50-50.
answered May 29 '17 at 23:13
QuelklefQuelklef
435311
435311
add a comment |
add a comment |
protected by Alex M. Dec 31 '16 at 16:22
Thank you for your interest in this question.
Because it has attracted low-quality or spam answers that had to be removed, posting an answer now requires 10 reputation on this site (the association bonus does not count).
Would you like to answer one of these unanswered questions instead?
8
$begingroup$
In this example, you don't need to sum the infinite series. For each birth, the probability of a boy is 50% and the probability of a girl is 50% (for the purposes of this exercise; the real world is more complicated). Therefore, the ratio for all births must be 50:50. The number of previous births in each family, and the genders of those children, is irrelevant.
$endgroup$
– Mike Scott
Feb 4 '11 at 17:28
7
$begingroup$
There is a thorough discussion of this question at mathoverflow.net/questions/17960/… for anyone who's interested.
$endgroup$
– Qiaochu Yuan
Feb 4 '11 at 17:29
1
$begingroup$
As the MathOverflow discussion Qiaochu linked to points out, your approach will get the approximately right answer but for the wrong reasons. The expected ratio of boys to girls is not necessarily equal to ratio of the expected number of boys to the expected number of girls.
$endgroup$
– Rahul
Feb 4 '11 at 23:21
$begingroup$
thebigquestions.com/2010/12/22/a-big-answer-2 why is this not the answer?
$endgroup$
– KevinDTimm
Oct 7 '14 at 21:03
$begingroup$
@MikeScott Why does then ratio is not 1 when family is finite?
$endgroup$
– Suraj Jain
Jul 29 '18 at 5:45