If $P(A) neq 0$ and $P(B) neq 0$, then $P(B|A) geq P(B)$ is equivalent to $P(A|B) geq P(A)$












9












$begingroup$


I am puzzled by the intuition behind the following fact:



If $P(A) neq 0$ and $P(B) neq 0$, then $P(B|A) geq P(B)$ is equivalent to $P(A|B) geq P(A)$.



This is easy enough to show by definition of conditional probability, but I would like to have some sort of geometric intuition behind this.



I can create positive example pictures, but not one that shows me why this statement must hold.



I would greatly appreciate some help. Thanks










share|cite|improve this question











$endgroup$












  • $begingroup$
    @AritroPathak It is equivalent. Multiply $P(B|A) geq P(B)$ with $P(A)/P(B)$ and you get $P(A|B) geq P(A)$.
    $endgroup$
    – Andreas
    Oct 19 '17 at 20:05






  • 3




    $begingroup$
    The fact you quote says that, if knowing $A$ makes $B$ more likely then knowing $B$ makes $A$ more likely (and conversely). Some people might consider that a reasonable intuition: Both inequalities say that $A$ and $B$ are positively correlated (in an intuitive sense). But I wouldn't claim that this intuition fully explains the fact. (That's why this is a comment and not an answer.)
    $endgroup$
    – Andreas Blass
    Oct 19 '17 at 20:06












  • $begingroup$
    Originally the question said something else, and then it was edited to say the correct thing.
    $endgroup$
    – Aritro Pathak
    Oct 19 '17 at 20:06










  • $begingroup$
    @Andreas My bad, I misread your comment and thought you were referring to the statement of the question before the edit.
    $endgroup$
    – jadhachem
    Oct 19 '17 at 20:15
















9












$begingroup$


I am puzzled by the intuition behind the following fact:



If $P(A) neq 0$ and $P(B) neq 0$, then $P(B|A) geq P(B)$ is equivalent to $P(A|B) geq P(A)$.



This is easy enough to show by definition of conditional probability, but I would like to have some sort of geometric intuition behind this.



I can create positive example pictures, but not one that shows me why this statement must hold.



I would greatly appreciate some help. Thanks










share|cite|improve this question











$endgroup$












  • $begingroup$
    @AritroPathak It is equivalent. Multiply $P(B|A) geq P(B)$ with $P(A)/P(B)$ and you get $P(A|B) geq P(A)$.
    $endgroup$
    – Andreas
    Oct 19 '17 at 20:05






  • 3




    $begingroup$
    The fact you quote says that, if knowing $A$ makes $B$ more likely then knowing $B$ makes $A$ more likely (and conversely). Some people might consider that a reasonable intuition: Both inequalities say that $A$ and $B$ are positively correlated (in an intuitive sense). But I wouldn't claim that this intuition fully explains the fact. (That's why this is a comment and not an answer.)
    $endgroup$
    – Andreas Blass
    Oct 19 '17 at 20:06












  • $begingroup$
    Originally the question said something else, and then it was edited to say the correct thing.
    $endgroup$
    – Aritro Pathak
    Oct 19 '17 at 20:06










  • $begingroup$
    @Andreas My bad, I misread your comment and thought you were referring to the statement of the question before the edit.
    $endgroup$
    – jadhachem
    Oct 19 '17 at 20:15














9












9








9


2



$begingroup$


I am puzzled by the intuition behind the following fact:



If $P(A) neq 0$ and $P(B) neq 0$, then $P(B|A) geq P(B)$ is equivalent to $P(A|B) geq P(A)$.



This is easy enough to show by definition of conditional probability, but I would like to have some sort of geometric intuition behind this.



I can create positive example pictures, but not one that shows me why this statement must hold.



I would greatly appreciate some help. Thanks










share|cite|improve this question











$endgroup$




I am puzzled by the intuition behind the following fact:



If $P(A) neq 0$ and $P(B) neq 0$, then $P(B|A) geq P(B)$ is equivalent to $P(A|B) geq P(A)$.



This is easy enough to show by definition of conditional probability, but I would like to have some sort of geometric intuition behind this.



I can create positive example pictures, but not one that shows me why this statement must hold.



I would greatly appreciate some help. Thanks







probability probability-theory






share|cite|improve this question















share|cite|improve this question













share|cite|improve this question




share|cite|improve this question








edited Jan 2 at 13:18









nbro

2,46163474




2,46163474










asked Oct 19 '17 at 19:53









AffaengAffaeng

759




759












  • $begingroup$
    @AritroPathak It is equivalent. Multiply $P(B|A) geq P(B)$ with $P(A)/P(B)$ and you get $P(A|B) geq P(A)$.
    $endgroup$
    – Andreas
    Oct 19 '17 at 20:05






  • 3




    $begingroup$
    The fact you quote says that, if knowing $A$ makes $B$ more likely then knowing $B$ makes $A$ more likely (and conversely). Some people might consider that a reasonable intuition: Both inequalities say that $A$ and $B$ are positively correlated (in an intuitive sense). But I wouldn't claim that this intuition fully explains the fact. (That's why this is a comment and not an answer.)
    $endgroup$
    – Andreas Blass
    Oct 19 '17 at 20:06












  • $begingroup$
    Originally the question said something else, and then it was edited to say the correct thing.
    $endgroup$
    – Aritro Pathak
    Oct 19 '17 at 20:06










  • $begingroup$
    @Andreas My bad, I misread your comment and thought you were referring to the statement of the question before the edit.
    $endgroup$
    – jadhachem
    Oct 19 '17 at 20:15


















  • $begingroup$
    @AritroPathak It is equivalent. Multiply $P(B|A) geq P(B)$ with $P(A)/P(B)$ and you get $P(A|B) geq P(A)$.
    $endgroup$
    – Andreas
    Oct 19 '17 at 20:05






  • 3




    $begingroup$
    The fact you quote says that, if knowing $A$ makes $B$ more likely then knowing $B$ makes $A$ more likely (and conversely). Some people might consider that a reasonable intuition: Both inequalities say that $A$ and $B$ are positively correlated (in an intuitive sense). But I wouldn't claim that this intuition fully explains the fact. (That's why this is a comment and not an answer.)
    $endgroup$
    – Andreas Blass
    Oct 19 '17 at 20:06












  • $begingroup$
    Originally the question said something else, and then it was edited to say the correct thing.
    $endgroup$
    – Aritro Pathak
    Oct 19 '17 at 20:06










  • $begingroup$
    @Andreas My bad, I misread your comment and thought you were referring to the statement of the question before the edit.
    $endgroup$
    – jadhachem
    Oct 19 '17 at 20:15
















$begingroup$
@AritroPathak It is equivalent. Multiply $P(B|A) geq P(B)$ with $P(A)/P(B)$ and you get $P(A|B) geq P(A)$.
$endgroup$
– Andreas
Oct 19 '17 at 20:05




$begingroup$
@AritroPathak It is equivalent. Multiply $P(B|A) geq P(B)$ with $P(A)/P(B)$ and you get $P(A|B) geq P(A)$.
$endgroup$
– Andreas
Oct 19 '17 at 20:05




3




3




$begingroup$
The fact you quote says that, if knowing $A$ makes $B$ more likely then knowing $B$ makes $A$ more likely (and conversely). Some people might consider that a reasonable intuition: Both inequalities say that $A$ and $B$ are positively correlated (in an intuitive sense). But I wouldn't claim that this intuition fully explains the fact. (That's why this is a comment and not an answer.)
$endgroup$
– Andreas Blass
Oct 19 '17 at 20:06






$begingroup$
The fact you quote says that, if knowing $A$ makes $B$ more likely then knowing $B$ makes $A$ more likely (and conversely). Some people might consider that a reasonable intuition: Both inequalities say that $A$ and $B$ are positively correlated (in an intuitive sense). But I wouldn't claim that this intuition fully explains the fact. (That's why this is a comment and not an answer.)
$endgroup$
– Andreas Blass
Oct 19 '17 at 20:06














$begingroup$
Originally the question said something else, and then it was edited to say the correct thing.
$endgroup$
– Aritro Pathak
Oct 19 '17 at 20:06




$begingroup$
Originally the question said something else, and then it was edited to say the correct thing.
$endgroup$
– Aritro Pathak
Oct 19 '17 at 20:06












$begingroup$
@Andreas My bad, I misread your comment and thought you were referring to the statement of the question before the edit.
$endgroup$
– jadhachem
Oct 19 '17 at 20:15




$begingroup$
@Andreas My bad, I misread your comment and thought you were referring to the statement of the question before the edit.
$endgroup$
– jadhachem
Oct 19 '17 at 20:15










4 Answers
4






active

oldest

votes


















16












$begingroup$

Both statements are saying that $P(A cap B) ge P(A)cdot P(B)$. Note that $P(A)cdot P(B)$ corresponds to $P(A cap B)$ if $A$ and $B$ were independent. Thus $P(A cap B) ge P(A)cdot P(B)$ means that there is some positive correlation (in a figurative sense) between these two events.



On the other hand, the phrase that always pops in my mind when I am talking about conditional probability is you are changing your universe. $P(X|Y)$ means that you are looking at $P(X)$ in a different universe, i.e. $Y$. Since there is a positive correlation between these two, it makes sense that $P(A|B) ge P(A)$. This is because we are changing our universe to $B$, and since we know that $B$ has a positive correlation with $A$, it means that in this new universe, we are more likely to see $A$. This is symmetric in $A$ and $B$ of course.



I don't know if this is intuition enough.






share|cite|improve this answer









$endgroup$









  • 4




    $begingroup$
    Good point. Your "universe point of view" is going from prior to posteriors. Here is one example. Men are generally taller than women. Let "A" be sex and "B" be body height. Ask for sex: Picking a person among the taller ones gives a higher probability of finding a men, then just picking any person. Ask for body height: picking a person among all men gives a higher probability of finding a taller person, then just picking any person. So it works from both points of view, from the view of the "tallers" and the view of the "men".
    $endgroup$
    – Andreas
    Oct 19 '17 at 20:22





















9












$begingroup$

General intution answer: Having $P(A | B) > P(A)$ is an indication that $A$ and $B$ are positively correlated -- that is, they occur more frequently together. From this notion, it should be natural that if $A$ has happened, $B$ is more likely.



Geometric intuition answer: If we talk about this in a Venn Diagram context, then the statement $P(B | A) > P(B)$ means that events $A$ and $B$ have heavy overlap. Here are some poorly-drawn MSPaint pictures to explain what I mean:
enter image description here
Here, I'm presuming that (1) Venn Diagrams always have areas correspond to probabilities (which is of course not true), and (2) that my drawings actually are accurate, which is probably false. But the spirit of the drawings is: the top set have independent events, for which the intersection area is "proportional" to the event probabilities. Specifically, the proportion is
$$frac{|A cap B|}{|A|} = frac{|B|}{|Omega| = 1}$$
which is obviously just a thinly-veiled version of the original probabilistic statement. In the top case, the pictures (allegedly) have "proportional intersections" in this way; in the bottom pictures, the overlap is larger than that prescribed proportion. I think this "intersection proportionality" might be the kind of geometric crux of the question that you're asking about.






share|cite|improve this answer









$endgroup$





















    5












    $begingroup$

    First, note that this theorem is equivalent to




    If $P(A) neq 0$ and $P(B) neq 0$ and
    $$P(B|A) geq P(B), tag{1}$$ then $$P(A|B) geq P(A).tag{2}$$




    because you can just swap $A$ and $B$ in the theorem to get the direction (2) $Rightarrow$ (1).





    Real life intuition (aka Bayesian inference):



    Consider a thing $B$ that's more likely when you know $A$ happens than it is when you don't know whether $A$ happens. Mathematically, this is condtion (1). Now if you observe $B$, it is actually evidence for $A$. Which mathematically means (2).



    Example:



    You know that rigged dice tend to give $6$s more often than fair dice. You pick up a die you find somewhere. As you really haven't encountered that many significantly rigged dice in your life, but have seen quite many almost fair dice, you initially think that your die is likely close to fair. You throw the die 20 times and observe 20 sixes. What do you deduce? You start thinking that the die is very probably rigged.



    Mathematically: Let $A$ be the event that the die is significantly rigged, and $B$ be the event that you get 20 sixes in a row. The fact that rigged dice give more sixes than fair dice means $P(B|A)>P(B)$. Before observing the throws, you think the probability that the die is rigged is $P(A)$, which is very small. But once you see the throws, you think that the probability is $P(A|B)$. The new probability is a lot larger, which means that $P(A|B)>P(A)$. The fact that the new probability is a lot larger was based on your knowledge that $P(B|A)>P(B)$.






    share|cite|improve this answer









    $endgroup$





















      2












      $begingroup$

      I'll try to give a "quasi" geometric interpretation - a "set" interpretation. You are meant to extract a card from a 3-cards deck. The cards are signed with $A$, $B$ and $C$. Then I ask you: which is the probability to choose $A$ from the deck?



      The pool of your possibilities is:



      $${A, B, C }.$$



      Then, of course $P(A) = 1/3$. But if I say you that I already selected a card from the deck, and this card is $B$, then the pool of possibilities for you is now:



      $${A, C},$$



      right? In this case, the probability that you get $A$ is:



      $$P(A | B) = 1/2.$$



      As Abhiram Natarajan (+1) said, when you want to calculate a conditional probability, then you are changing - better, you are reducing - the universe of your possibility, and hence the probabilities must be greater (or equal) to the simple one. The universe changed because something already happened before you take a card from the deck.






      share|cite|improve this answer









      $endgroup$














        Your Answer





        StackExchange.ifUsing("editor", function () {
        return StackExchange.using("mathjaxEditing", function () {
        StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
        StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
        });
        });
        }, "mathjax-editing");

        StackExchange.ready(function() {
        var channelOptions = {
        tags: "".split(" "),
        id: "69"
        };
        initTagRenderer("".split(" "), "".split(" "), channelOptions);

        StackExchange.using("externalEditor", function() {
        // Have to fire editor after snippets, if snippets enabled
        if (StackExchange.settings.snippets.snippetsEnabled) {
        StackExchange.using("snippets", function() {
        createEditor();
        });
        }
        else {
        createEditor();
        }
        });

        function createEditor() {
        StackExchange.prepareEditor({
        heartbeatType: 'answer',
        autoActivateHeartbeat: false,
        convertImagesToLinks: true,
        noModals: true,
        showLowRepImageUploadWarning: true,
        reputationToPostImages: 10,
        bindNavPrevention: true,
        postfix: "",
        imageUploader: {
        brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
        contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
        allowUrls: true
        },
        noCode: true, onDemand: true,
        discardSelector: ".discard-answer"
        ,immediatelyShowMarkdownHelp:true
        });


        }
        });














        draft saved

        draft discarded


















        StackExchange.ready(
        function () {
        StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2480490%2fif-pa-neq-0-and-pb-neq-0-then-pba-geq-pb-is-equivalent-to-p%23new-answer', 'question_page');
        }
        );

        Post as a guest















        Required, but never shown

























        4 Answers
        4






        active

        oldest

        votes








        4 Answers
        4






        active

        oldest

        votes









        active

        oldest

        votes






        active

        oldest

        votes









        16












        $begingroup$

        Both statements are saying that $P(A cap B) ge P(A)cdot P(B)$. Note that $P(A)cdot P(B)$ corresponds to $P(A cap B)$ if $A$ and $B$ were independent. Thus $P(A cap B) ge P(A)cdot P(B)$ means that there is some positive correlation (in a figurative sense) between these two events.



        On the other hand, the phrase that always pops in my mind when I am talking about conditional probability is you are changing your universe. $P(X|Y)$ means that you are looking at $P(X)$ in a different universe, i.e. $Y$. Since there is a positive correlation between these two, it makes sense that $P(A|B) ge P(A)$. This is because we are changing our universe to $B$, and since we know that $B$ has a positive correlation with $A$, it means that in this new universe, we are more likely to see $A$. This is symmetric in $A$ and $B$ of course.



        I don't know if this is intuition enough.






        share|cite|improve this answer









        $endgroup$









        • 4




          $begingroup$
          Good point. Your "universe point of view" is going from prior to posteriors. Here is one example. Men are generally taller than women. Let "A" be sex and "B" be body height. Ask for sex: Picking a person among the taller ones gives a higher probability of finding a men, then just picking any person. Ask for body height: picking a person among all men gives a higher probability of finding a taller person, then just picking any person. So it works from both points of view, from the view of the "tallers" and the view of the "men".
          $endgroup$
          – Andreas
          Oct 19 '17 at 20:22


















        16












        $begingroup$

        Both statements are saying that $P(A cap B) ge P(A)cdot P(B)$. Note that $P(A)cdot P(B)$ corresponds to $P(A cap B)$ if $A$ and $B$ were independent. Thus $P(A cap B) ge P(A)cdot P(B)$ means that there is some positive correlation (in a figurative sense) between these two events.



        On the other hand, the phrase that always pops in my mind when I am talking about conditional probability is you are changing your universe. $P(X|Y)$ means that you are looking at $P(X)$ in a different universe, i.e. $Y$. Since there is a positive correlation between these two, it makes sense that $P(A|B) ge P(A)$. This is because we are changing our universe to $B$, and since we know that $B$ has a positive correlation with $A$, it means that in this new universe, we are more likely to see $A$. This is symmetric in $A$ and $B$ of course.



        I don't know if this is intuition enough.






        share|cite|improve this answer









        $endgroup$









        • 4




          $begingroup$
          Good point. Your "universe point of view" is going from prior to posteriors. Here is one example. Men are generally taller than women. Let "A" be sex and "B" be body height. Ask for sex: Picking a person among the taller ones gives a higher probability of finding a men, then just picking any person. Ask for body height: picking a person among all men gives a higher probability of finding a taller person, then just picking any person. So it works from both points of view, from the view of the "tallers" and the view of the "men".
          $endgroup$
          – Andreas
          Oct 19 '17 at 20:22
















        16












        16








        16





        $begingroup$

        Both statements are saying that $P(A cap B) ge P(A)cdot P(B)$. Note that $P(A)cdot P(B)$ corresponds to $P(A cap B)$ if $A$ and $B$ were independent. Thus $P(A cap B) ge P(A)cdot P(B)$ means that there is some positive correlation (in a figurative sense) between these two events.



        On the other hand, the phrase that always pops in my mind when I am talking about conditional probability is you are changing your universe. $P(X|Y)$ means that you are looking at $P(X)$ in a different universe, i.e. $Y$. Since there is a positive correlation between these two, it makes sense that $P(A|B) ge P(A)$. This is because we are changing our universe to $B$, and since we know that $B$ has a positive correlation with $A$, it means that in this new universe, we are more likely to see $A$. This is symmetric in $A$ and $B$ of course.



        I don't know if this is intuition enough.






        share|cite|improve this answer









        $endgroup$



        Both statements are saying that $P(A cap B) ge P(A)cdot P(B)$. Note that $P(A)cdot P(B)$ corresponds to $P(A cap B)$ if $A$ and $B$ were independent. Thus $P(A cap B) ge P(A)cdot P(B)$ means that there is some positive correlation (in a figurative sense) between these two events.



        On the other hand, the phrase that always pops in my mind when I am talking about conditional probability is you are changing your universe. $P(X|Y)$ means that you are looking at $P(X)$ in a different universe, i.e. $Y$. Since there is a positive correlation between these two, it makes sense that $P(A|B) ge P(A)$. This is because we are changing our universe to $B$, and since we know that $B$ has a positive correlation with $A$, it means that in this new universe, we are more likely to see $A$. This is symmetric in $A$ and $B$ of course.



        I don't know if this is intuition enough.







        share|cite|improve this answer












        share|cite|improve this answer



        share|cite|improve this answer










        answered Oct 19 '17 at 20:14









        Abhiram NatarajanAbhiram Natarajan

        1,119215




        1,119215








        • 4




          $begingroup$
          Good point. Your "universe point of view" is going from prior to posteriors. Here is one example. Men are generally taller than women. Let "A" be sex and "B" be body height. Ask for sex: Picking a person among the taller ones gives a higher probability of finding a men, then just picking any person. Ask for body height: picking a person among all men gives a higher probability of finding a taller person, then just picking any person. So it works from both points of view, from the view of the "tallers" and the view of the "men".
          $endgroup$
          – Andreas
          Oct 19 '17 at 20:22
















        • 4




          $begingroup$
          Good point. Your "universe point of view" is going from prior to posteriors. Here is one example. Men are generally taller than women. Let "A" be sex and "B" be body height. Ask for sex: Picking a person among the taller ones gives a higher probability of finding a men, then just picking any person. Ask for body height: picking a person among all men gives a higher probability of finding a taller person, then just picking any person. So it works from both points of view, from the view of the "tallers" and the view of the "men".
          $endgroup$
          – Andreas
          Oct 19 '17 at 20:22










        4




        4




        $begingroup$
        Good point. Your "universe point of view" is going from prior to posteriors. Here is one example. Men are generally taller than women. Let "A" be sex and "B" be body height. Ask for sex: Picking a person among the taller ones gives a higher probability of finding a men, then just picking any person. Ask for body height: picking a person among all men gives a higher probability of finding a taller person, then just picking any person. So it works from both points of view, from the view of the "tallers" and the view of the "men".
        $endgroup$
        – Andreas
        Oct 19 '17 at 20:22






        $begingroup$
        Good point. Your "universe point of view" is going from prior to posteriors. Here is one example. Men are generally taller than women. Let "A" be sex and "B" be body height. Ask for sex: Picking a person among the taller ones gives a higher probability of finding a men, then just picking any person. Ask for body height: picking a person among all men gives a higher probability of finding a taller person, then just picking any person. So it works from both points of view, from the view of the "tallers" and the view of the "men".
        $endgroup$
        – Andreas
        Oct 19 '17 at 20:22













        9












        $begingroup$

        General intution answer: Having $P(A | B) > P(A)$ is an indication that $A$ and $B$ are positively correlated -- that is, they occur more frequently together. From this notion, it should be natural that if $A$ has happened, $B$ is more likely.



        Geometric intuition answer: If we talk about this in a Venn Diagram context, then the statement $P(B | A) > P(B)$ means that events $A$ and $B$ have heavy overlap. Here are some poorly-drawn MSPaint pictures to explain what I mean:
        enter image description here
        Here, I'm presuming that (1) Venn Diagrams always have areas correspond to probabilities (which is of course not true), and (2) that my drawings actually are accurate, which is probably false. But the spirit of the drawings is: the top set have independent events, for which the intersection area is "proportional" to the event probabilities. Specifically, the proportion is
        $$frac{|A cap B|}{|A|} = frac{|B|}{|Omega| = 1}$$
        which is obviously just a thinly-veiled version of the original probabilistic statement. In the top case, the pictures (allegedly) have "proportional intersections" in this way; in the bottom pictures, the overlap is larger than that prescribed proportion. I think this "intersection proportionality" might be the kind of geometric crux of the question that you're asking about.






        share|cite|improve this answer









        $endgroup$


















          9












          $begingroup$

          General intution answer: Having $P(A | B) > P(A)$ is an indication that $A$ and $B$ are positively correlated -- that is, they occur more frequently together. From this notion, it should be natural that if $A$ has happened, $B$ is more likely.



          Geometric intuition answer: If we talk about this in a Venn Diagram context, then the statement $P(B | A) > P(B)$ means that events $A$ and $B$ have heavy overlap. Here are some poorly-drawn MSPaint pictures to explain what I mean:
          enter image description here
          Here, I'm presuming that (1) Venn Diagrams always have areas correspond to probabilities (which is of course not true), and (2) that my drawings actually are accurate, which is probably false. But the spirit of the drawings is: the top set have independent events, for which the intersection area is "proportional" to the event probabilities. Specifically, the proportion is
          $$frac{|A cap B|}{|A|} = frac{|B|}{|Omega| = 1}$$
          which is obviously just a thinly-veiled version of the original probabilistic statement. In the top case, the pictures (allegedly) have "proportional intersections" in this way; in the bottom pictures, the overlap is larger than that prescribed proportion. I think this "intersection proportionality" might be the kind of geometric crux of the question that you're asking about.






          share|cite|improve this answer









          $endgroup$
















            9












            9








            9





            $begingroup$

            General intution answer: Having $P(A | B) > P(A)$ is an indication that $A$ and $B$ are positively correlated -- that is, they occur more frequently together. From this notion, it should be natural that if $A$ has happened, $B$ is more likely.



            Geometric intuition answer: If we talk about this in a Venn Diagram context, then the statement $P(B | A) > P(B)$ means that events $A$ and $B$ have heavy overlap. Here are some poorly-drawn MSPaint pictures to explain what I mean:
            enter image description here
            Here, I'm presuming that (1) Venn Diagrams always have areas correspond to probabilities (which is of course not true), and (2) that my drawings actually are accurate, which is probably false. But the spirit of the drawings is: the top set have independent events, for which the intersection area is "proportional" to the event probabilities. Specifically, the proportion is
            $$frac{|A cap B|}{|A|} = frac{|B|}{|Omega| = 1}$$
            which is obviously just a thinly-veiled version of the original probabilistic statement. In the top case, the pictures (allegedly) have "proportional intersections" in this way; in the bottom pictures, the overlap is larger than that prescribed proportion. I think this "intersection proportionality" might be the kind of geometric crux of the question that you're asking about.






            share|cite|improve this answer









            $endgroup$



            General intution answer: Having $P(A | B) > P(A)$ is an indication that $A$ and $B$ are positively correlated -- that is, they occur more frequently together. From this notion, it should be natural that if $A$ has happened, $B$ is more likely.



            Geometric intuition answer: If we talk about this in a Venn Diagram context, then the statement $P(B | A) > P(B)$ means that events $A$ and $B$ have heavy overlap. Here are some poorly-drawn MSPaint pictures to explain what I mean:
            enter image description here
            Here, I'm presuming that (1) Venn Diagrams always have areas correspond to probabilities (which is of course not true), and (2) that my drawings actually are accurate, which is probably false. But the spirit of the drawings is: the top set have independent events, for which the intersection area is "proportional" to the event probabilities. Specifically, the proportion is
            $$frac{|A cap B|}{|A|} = frac{|B|}{|Omega| = 1}$$
            which is obviously just a thinly-veiled version of the original probabilistic statement. In the top case, the pictures (allegedly) have "proportional intersections" in this way; in the bottom pictures, the overlap is larger than that prescribed proportion. I think this "intersection proportionality" might be the kind of geometric crux of the question that you're asking about.







            share|cite|improve this answer












            share|cite|improve this answer



            share|cite|improve this answer










            answered Oct 19 '17 at 20:38









            Aaron MontgomeryAaron Montgomery

            4,837523




            4,837523























                5












                $begingroup$

                First, note that this theorem is equivalent to




                If $P(A) neq 0$ and $P(B) neq 0$ and
                $$P(B|A) geq P(B), tag{1}$$ then $$P(A|B) geq P(A).tag{2}$$




                because you can just swap $A$ and $B$ in the theorem to get the direction (2) $Rightarrow$ (1).





                Real life intuition (aka Bayesian inference):



                Consider a thing $B$ that's more likely when you know $A$ happens than it is when you don't know whether $A$ happens. Mathematically, this is condtion (1). Now if you observe $B$, it is actually evidence for $A$. Which mathematically means (2).



                Example:



                You know that rigged dice tend to give $6$s more often than fair dice. You pick up a die you find somewhere. As you really haven't encountered that many significantly rigged dice in your life, but have seen quite many almost fair dice, you initially think that your die is likely close to fair. You throw the die 20 times and observe 20 sixes. What do you deduce? You start thinking that the die is very probably rigged.



                Mathematically: Let $A$ be the event that the die is significantly rigged, and $B$ be the event that you get 20 sixes in a row. The fact that rigged dice give more sixes than fair dice means $P(B|A)>P(B)$. Before observing the throws, you think the probability that the die is rigged is $P(A)$, which is very small. But once you see the throws, you think that the probability is $P(A|B)$. The new probability is a lot larger, which means that $P(A|B)>P(A)$. The fact that the new probability is a lot larger was based on your knowledge that $P(B|A)>P(B)$.






                share|cite|improve this answer









                $endgroup$


















                  5












                  $begingroup$

                  First, note that this theorem is equivalent to




                  If $P(A) neq 0$ and $P(B) neq 0$ and
                  $$P(B|A) geq P(B), tag{1}$$ then $$P(A|B) geq P(A).tag{2}$$




                  because you can just swap $A$ and $B$ in the theorem to get the direction (2) $Rightarrow$ (1).





                  Real life intuition (aka Bayesian inference):



                  Consider a thing $B$ that's more likely when you know $A$ happens than it is when you don't know whether $A$ happens. Mathematically, this is condtion (1). Now if you observe $B$, it is actually evidence for $A$. Which mathematically means (2).



                  Example:



                  You know that rigged dice tend to give $6$s more often than fair dice. You pick up a die you find somewhere. As you really haven't encountered that many significantly rigged dice in your life, but have seen quite many almost fair dice, you initially think that your die is likely close to fair. You throw the die 20 times and observe 20 sixes. What do you deduce? You start thinking that the die is very probably rigged.



                  Mathematically: Let $A$ be the event that the die is significantly rigged, and $B$ be the event that you get 20 sixes in a row. The fact that rigged dice give more sixes than fair dice means $P(B|A)>P(B)$. Before observing the throws, you think the probability that the die is rigged is $P(A)$, which is very small. But once you see the throws, you think that the probability is $P(A|B)$. The new probability is a lot larger, which means that $P(A|B)>P(A)$. The fact that the new probability is a lot larger was based on your knowledge that $P(B|A)>P(B)$.






                  share|cite|improve this answer









                  $endgroup$
















                    5












                    5








                    5





                    $begingroup$

                    First, note that this theorem is equivalent to




                    If $P(A) neq 0$ and $P(B) neq 0$ and
                    $$P(B|A) geq P(B), tag{1}$$ then $$P(A|B) geq P(A).tag{2}$$




                    because you can just swap $A$ and $B$ in the theorem to get the direction (2) $Rightarrow$ (1).





                    Real life intuition (aka Bayesian inference):



                    Consider a thing $B$ that's more likely when you know $A$ happens than it is when you don't know whether $A$ happens. Mathematically, this is condtion (1). Now if you observe $B$, it is actually evidence for $A$. Which mathematically means (2).



                    Example:



                    You know that rigged dice tend to give $6$s more often than fair dice. You pick up a die you find somewhere. As you really haven't encountered that many significantly rigged dice in your life, but have seen quite many almost fair dice, you initially think that your die is likely close to fair. You throw the die 20 times and observe 20 sixes. What do you deduce? You start thinking that the die is very probably rigged.



                    Mathematically: Let $A$ be the event that the die is significantly rigged, and $B$ be the event that you get 20 sixes in a row. The fact that rigged dice give more sixes than fair dice means $P(B|A)>P(B)$. Before observing the throws, you think the probability that the die is rigged is $P(A)$, which is very small. But once you see the throws, you think that the probability is $P(A|B)$. The new probability is a lot larger, which means that $P(A|B)>P(A)$. The fact that the new probability is a lot larger was based on your knowledge that $P(B|A)>P(B)$.






                    share|cite|improve this answer









                    $endgroup$



                    First, note that this theorem is equivalent to




                    If $P(A) neq 0$ and $P(B) neq 0$ and
                    $$P(B|A) geq P(B), tag{1}$$ then $$P(A|B) geq P(A).tag{2}$$




                    because you can just swap $A$ and $B$ in the theorem to get the direction (2) $Rightarrow$ (1).





                    Real life intuition (aka Bayesian inference):



                    Consider a thing $B$ that's more likely when you know $A$ happens than it is when you don't know whether $A$ happens. Mathematically, this is condtion (1). Now if you observe $B$, it is actually evidence for $A$. Which mathematically means (2).



                    Example:



                    You know that rigged dice tend to give $6$s more often than fair dice. You pick up a die you find somewhere. As you really haven't encountered that many significantly rigged dice in your life, but have seen quite many almost fair dice, you initially think that your die is likely close to fair. You throw the die 20 times and observe 20 sixes. What do you deduce? You start thinking that the die is very probably rigged.



                    Mathematically: Let $A$ be the event that the die is significantly rigged, and $B$ be the event that you get 20 sixes in a row. The fact that rigged dice give more sixes than fair dice means $P(B|A)>P(B)$. Before observing the throws, you think the probability that the die is rigged is $P(A)$, which is very small. But once you see the throws, you think that the probability is $P(A|B)$. The new probability is a lot larger, which means that $P(A|B)>P(A)$. The fact that the new probability is a lot larger was based on your knowledge that $P(B|A)>P(B)$.







                    share|cite|improve this answer












                    share|cite|improve this answer



                    share|cite|improve this answer










                    answered Oct 19 '17 at 22:06









                    JiKJiK

                    4,8831232




                    4,8831232























                        2












                        $begingroup$

                        I'll try to give a "quasi" geometric interpretation - a "set" interpretation. You are meant to extract a card from a 3-cards deck. The cards are signed with $A$, $B$ and $C$. Then I ask you: which is the probability to choose $A$ from the deck?



                        The pool of your possibilities is:



                        $${A, B, C }.$$



                        Then, of course $P(A) = 1/3$. But if I say you that I already selected a card from the deck, and this card is $B$, then the pool of possibilities for you is now:



                        $${A, C},$$



                        right? In this case, the probability that you get $A$ is:



                        $$P(A | B) = 1/2.$$



                        As Abhiram Natarajan (+1) said, when you want to calculate a conditional probability, then you are changing - better, you are reducing - the universe of your possibility, and hence the probabilities must be greater (or equal) to the simple one. The universe changed because something already happened before you take a card from the deck.






                        share|cite|improve this answer









                        $endgroup$


















                          2












                          $begingroup$

                          I'll try to give a "quasi" geometric interpretation - a "set" interpretation. You are meant to extract a card from a 3-cards deck. The cards are signed with $A$, $B$ and $C$. Then I ask you: which is the probability to choose $A$ from the deck?



                          The pool of your possibilities is:



                          $${A, B, C }.$$



                          Then, of course $P(A) = 1/3$. But if I say you that I already selected a card from the deck, and this card is $B$, then the pool of possibilities for you is now:



                          $${A, C},$$



                          right? In this case, the probability that you get $A$ is:



                          $$P(A | B) = 1/2.$$



                          As Abhiram Natarajan (+1) said, when you want to calculate a conditional probability, then you are changing - better, you are reducing - the universe of your possibility, and hence the probabilities must be greater (or equal) to the simple one. The universe changed because something already happened before you take a card from the deck.






                          share|cite|improve this answer









                          $endgroup$
















                            2












                            2








                            2





                            $begingroup$

                            I'll try to give a "quasi" geometric interpretation - a "set" interpretation. You are meant to extract a card from a 3-cards deck. The cards are signed with $A$, $B$ and $C$. Then I ask you: which is the probability to choose $A$ from the deck?



                            The pool of your possibilities is:



                            $${A, B, C }.$$



                            Then, of course $P(A) = 1/3$. But if I say you that I already selected a card from the deck, and this card is $B$, then the pool of possibilities for you is now:



                            $${A, C},$$



                            right? In this case, the probability that you get $A$ is:



                            $$P(A | B) = 1/2.$$



                            As Abhiram Natarajan (+1) said, when you want to calculate a conditional probability, then you are changing - better, you are reducing - the universe of your possibility, and hence the probabilities must be greater (or equal) to the simple one. The universe changed because something already happened before you take a card from the deck.






                            share|cite|improve this answer









                            $endgroup$



                            I'll try to give a "quasi" geometric interpretation - a "set" interpretation. You are meant to extract a card from a 3-cards deck. The cards are signed with $A$, $B$ and $C$. Then I ask you: which is the probability to choose $A$ from the deck?



                            The pool of your possibilities is:



                            $${A, B, C }.$$



                            Then, of course $P(A) = 1/3$. But if I say you that I already selected a card from the deck, and this card is $B$, then the pool of possibilities for you is now:



                            $${A, C},$$



                            right? In this case, the probability that you get $A$ is:



                            $$P(A | B) = 1/2.$$



                            As Abhiram Natarajan (+1) said, when you want to calculate a conditional probability, then you are changing - better, you are reducing - the universe of your possibility, and hence the probabilities must be greater (or equal) to the simple one. The universe changed because something already happened before you take a card from the deck.







                            share|cite|improve this answer












                            share|cite|improve this answer



                            share|cite|improve this answer










                            answered Oct 19 '17 at 20:30









                            the_candymanthe_candyman

                            9,12532145




                            9,12532145






























                                draft saved

                                draft discarded




















































                                Thanks for contributing an answer to Mathematics Stack Exchange!


                                • Please be sure to answer the question. Provide details and share your research!

                                But avoid



                                • Asking for help, clarification, or responding to other answers.

                                • Making statements based on opinion; back them up with references or personal experience.


                                Use MathJax to format equations. MathJax reference.


                                To learn more, see our tips on writing great answers.




                                draft saved


                                draft discarded














                                StackExchange.ready(
                                function () {
                                StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2480490%2fif-pa-neq-0-and-pb-neq-0-then-pba-geq-pb-is-equivalent-to-p%23new-answer', 'question_page');
                                }
                                );

                                Post as a guest















                                Required, but never shown





















































                                Required, but never shown














                                Required, but never shown












                                Required, but never shown







                                Required, but never shown

































                                Required, but never shown














                                Required, but never shown












                                Required, but never shown







                                Required, but never shown







                                Popular posts from this blog

                                Wiesbaden

                                Marschland

                                Dieringhausen