Why isn' t high order polynomial a good fit?












3












$begingroup$


Let's say I have a set of data points $(x_i,y_i), i=1,2,...,N$, and I want to approximate it using a polynomial $p(x)=sum_{i=0}^n a_i x^i$ with a least squares fit (so $n<N$).



I know that the coefficient $R^2$ is a measure for goodness of fit. But as I increase the order $n$ of the polynomial $p$, $R^2$ approaches $1$. (In the extreme case when $n=N$ the fit is an interpolation with $R^2=1$.)



But a high order polynomial fit is likely an unphysical result and doesn't describe the population well. So is there a mathematical characteristic or coefficient or model that describes and explains this?










share|cite|improve this question











$endgroup$








  • 4




    $begingroup$
    en.wikipedia.org/wiki/Overfitting
    $endgroup$
    – M.B.
    Feb 10 '15 at 11:53










  • $begingroup$
    "a high order polynomial fit is likely an unphysical result and doesn't describe the population well": I don't agree, every smooth function is well approximated by its Taylor polynomial. For instance, it works very well for an exponential growth.
    $endgroup$
    – Yves Daoust
    Feb 16 '15 at 10:37
















3












$begingroup$


Let's say I have a set of data points $(x_i,y_i), i=1,2,...,N$, and I want to approximate it using a polynomial $p(x)=sum_{i=0}^n a_i x^i$ with a least squares fit (so $n<N$).



I know that the coefficient $R^2$ is a measure for goodness of fit. But as I increase the order $n$ of the polynomial $p$, $R^2$ approaches $1$. (In the extreme case when $n=N$ the fit is an interpolation with $R^2=1$.)



But a high order polynomial fit is likely an unphysical result and doesn't describe the population well. So is there a mathematical characteristic or coefficient or model that describes and explains this?










share|cite|improve this question











$endgroup$








  • 4




    $begingroup$
    en.wikipedia.org/wiki/Overfitting
    $endgroup$
    – M.B.
    Feb 10 '15 at 11:53










  • $begingroup$
    "a high order polynomial fit is likely an unphysical result and doesn't describe the population well": I don't agree, every smooth function is well approximated by its Taylor polynomial. For instance, it works very well for an exponential growth.
    $endgroup$
    – Yves Daoust
    Feb 16 '15 at 10:37














3












3








3





$begingroup$


Let's say I have a set of data points $(x_i,y_i), i=1,2,...,N$, and I want to approximate it using a polynomial $p(x)=sum_{i=0}^n a_i x^i$ with a least squares fit (so $n<N$).



I know that the coefficient $R^2$ is a measure for goodness of fit. But as I increase the order $n$ of the polynomial $p$, $R^2$ approaches $1$. (In the extreme case when $n=N$ the fit is an interpolation with $R^2=1$.)



But a high order polynomial fit is likely an unphysical result and doesn't describe the population well. So is there a mathematical characteristic or coefficient or model that describes and explains this?










share|cite|improve this question











$endgroup$




Let's say I have a set of data points $(x_i,y_i), i=1,2,...,N$, and I want to approximate it using a polynomial $p(x)=sum_{i=0}^n a_i x^i$ with a least squares fit (so $n<N$).



I know that the coefficient $R^2$ is a measure for goodness of fit. But as I increase the order $n$ of the polynomial $p$, $R^2$ approaches $1$. (In the extreme case when $n=N$ the fit is an interpolation with $R^2=1$.)



But a high order polynomial fit is likely an unphysical result and doesn't describe the population well. So is there a mathematical characteristic or coefficient or model that describes and explains this?







statistics statistical-inference






share|cite|improve this question















share|cite|improve this question













share|cite|improve this question




share|cite|improve this question








edited Feb 16 '15 at 10:26







Minethlos

















asked Feb 10 '15 at 11:49









MinethlosMinethlos

9561127




9561127








  • 4




    $begingroup$
    en.wikipedia.org/wiki/Overfitting
    $endgroup$
    – M.B.
    Feb 10 '15 at 11:53










  • $begingroup$
    "a high order polynomial fit is likely an unphysical result and doesn't describe the population well": I don't agree, every smooth function is well approximated by its Taylor polynomial. For instance, it works very well for an exponential growth.
    $endgroup$
    – Yves Daoust
    Feb 16 '15 at 10:37














  • 4




    $begingroup$
    en.wikipedia.org/wiki/Overfitting
    $endgroup$
    – M.B.
    Feb 10 '15 at 11:53










  • $begingroup$
    "a high order polynomial fit is likely an unphysical result and doesn't describe the population well": I don't agree, every smooth function is well approximated by its Taylor polynomial. For instance, it works very well for an exponential growth.
    $endgroup$
    – Yves Daoust
    Feb 16 '15 at 10:37








4




4




$begingroup$
en.wikipedia.org/wiki/Overfitting
$endgroup$
– M.B.
Feb 10 '15 at 11:53




$begingroup$
en.wikipedia.org/wiki/Overfitting
$endgroup$
– M.B.
Feb 10 '15 at 11:53












$begingroup$
"a high order polynomial fit is likely an unphysical result and doesn't describe the population well": I don't agree, every smooth function is well approximated by its Taylor polynomial. For instance, it works very well for an exponential growth.
$endgroup$
– Yves Daoust
Feb 16 '15 at 10:37




$begingroup$
"a high order polynomial fit is likely an unphysical result and doesn't describe the population well": I don't agree, every smooth function is well approximated by its Taylor polynomial. For instance, it works very well for an exponential growth.
$endgroup$
– Yves Daoust
Feb 16 '15 at 10:37










2 Answers
2






active

oldest

votes


















2












$begingroup$

I think what you might be looking for is the Akaike information criterion. It takes the number of parameters and the maximised value of the liklihood function to produce a number; the AIC says the model which produces the minimum value is preferred. Essentially it rewards well-fitting but penalises use of extra parameters.



http://en.m.wikipedia.org/wiki/Akaike_information_criterion






share|cite|improve this answer









$endgroup$













  • $begingroup$
    Yes, the information theory background of AIC is enlightening. Thank you.
    $endgroup$
    – Minethlos
    Feb 16 '15 at 11:32



















1












$begingroup$

Why you use R^2 for the goodness of fit, why not just use Mean Square Error or Root Mean Square Error? Higher order polynomials are prone to local fluctuations, what you can do is to use regularization to reduce the artifacts, e.g., you can add a penalty term to address the second order derivation of the high order polynomial.






share|cite|improve this answer









$endgroup$













    Your Answer





    StackExchange.ifUsing("editor", function () {
    return StackExchange.using("mathjaxEditing", function () {
    StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
    StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
    });
    });
    }, "mathjax-editing");

    StackExchange.ready(function() {
    var channelOptions = {
    tags: "".split(" "),
    id: "69"
    };
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function() {
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled) {
    StackExchange.using("snippets", function() {
    createEditor();
    });
    }
    else {
    createEditor();
    }
    });

    function createEditor() {
    StackExchange.prepareEditor({
    heartbeatType: 'answer',
    autoActivateHeartbeat: false,
    convertImagesToLinks: true,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: 10,
    bindNavPrevention: true,
    postfix: "",
    imageUploader: {
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    },
    noCode: true, onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    });


    }
    });














    draft saved

    draft discarded


















    StackExchange.ready(
    function () {
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f1141956%2fwhy-isn-t-high-order-polynomial-a-good-fit%23new-answer', 'question_page');
    }
    );

    Post as a guest















    Required, but never shown

























    2 Answers
    2






    active

    oldest

    votes








    2 Answers
    2






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    2












    $begingroup$

    I think what you might be looking for is the Akaike information criterion. It takes the number of parameters and the maximised value of the liklihood function to produce a number; the AIC says the model which produces the minimum value is preferred. Essentially it rewards well-fitting but penalises use of extra parameters.



    http://en.m.wikipedia.org/wiki/Akaike_information_criterion






    share|cite|improve this answer









    $endgroup$













    • $begingroup$
      Yes, the information theory background of AIC is enlightening. Thank you.
      $endgroup$
      – Minethlos
      Feb 16 '15 at 11:32
















    2












    $begingroup$

    I think what you might be looking for is the Akaike information criterion. It takes the number of parameters and the maximised value of the liklihood function to produce a number; the AIC says the model which produces the minimum value is preferred. Essentially it rewards well-fitting but penalises use of extra parameters.



    http://en.m.wikipedia.org/wiki/Akaike_information_criterion






    share|cite|improve this answer









    $endgroup$













    • $begingroup$
      Yes, the information theory background of AIC is enlightening. Thank you.
      $endgroup$
      – Minethlos
      Feb 16 '15 at 11:32














    2












    2








    2





    $begingroup$

    I think what you might be looking for is the Akaike information criterion. It takes the number of parameters and the maximised value of the liklihood function to produce a number; the AIC says the model which produces the minimum value is preferred. Essentially it rewards well-fitting but penalises use of extra parameters.



    http://en.m.wikipedia.org/wiki/Akaike_information_criterion






    share|cite|improve this answer









    $endgroup$



    I think what you might be looking for is the Akaike information criterion. It takes the number of parameters and the maximised value of the liklihood function to produce a number; the AIC says the model which produces the minimum value is preferred. Essentially it rewards well-fitting but penalises use of extra parameters.



    http://en.m.wikipedia.org/wiki/Akaike_information_criterion







    share|cite|improve this answer












    share|cite|improve this answer



    share|cite|improve this answer










    answered Feb 16 '15 at 10:45









    FireGardenFireGarden

    2,84421525




    2,84421525












    • $begingroup$
      Yes, the information theory background of AIC is enlightening. Thank you.
      $endgroup$
      – Minethlos
      Feb 16 '15 at 11:32


















    • $begingroup$
      Yes, the information theory background of AIC is enlightening. Thank you.
      $endgroup$
      – Minethlos
      Feb 16 '15 at 11:32
















    $begingroup$
    Yes, the information theory background of AIC is enlightening. Thank you.
    $endgroup$
    – Minethlos
    Feb 16 '15 at 11:32




    $begingroup$
    Yes, the information theory background of AIC is enlightening. Thank you.
    $endgroup$
    – Minethlos
    Feb 16 '15 at 11:32











    1












    $begingroup$

    Why you use R^2 for the goodness of fit, why not just use Mean Square Error or Root Mean Square Error? Higher order polynomials are prone to local fluctuations, what you can do is to use regularization to reduce the artifacts, e.g., you can add a penalty term to address the second order derivation of the high order polynomial.






    share|cite|improve this answer









    $endgroup$


















      1












      $begingroup$

      Why you use R^2 for the goodness of fit, why not just use Mean Square Error or Root Mean Square Error? Higher order polynomials are prone to local fluctuations, what you can do is to use regularization to reduce the artifacts, e.g., you can add a penalty term to address the second order derivation of the high order polynomial.






      share|cite|improve this answer









      $endgroup$
















        1












        1








        1





        $begingroup$

        Why you use R^2 for the goodness of fit, why not just use Mean Square Error or Root Mean Square Error? Higher order polynomials are prone to local fluctuations, what you can do is to use regularization to reduce the artifacts, e.g., you can add a penalty term to address the second order derivation of the high order polynomial.






        share|cite|improve this answer









        $endgroup$



        Why you use R^2 for the goodness of fit, why not just use Mean Square Error or Root Mean Square Error? Higher order polynomials are prone to local fluctuations, what you can do is to use regularization to reduce the artifacts, e.g., you can add a penalty term to address the second order derivation of the high order polynomial.







        share|cite|improve this answer












        share|cite|improve this answer



        share|cite|improve this answer










        answered Dec 18 '18 at 2:47









        QZHuaQZHua

        112




        112






























            draft saved

            draft discarded




















































            Thanks for contributing an answer to Mathematics Stack Exchange!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid



            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.


            Use MathJax to format equations. MathJax reference.


            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f1141956%2fwhy-isn-t-high-order-polynomial-a-good-fit%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            To store a contact into the json file from server.js file using a class in NodeJS

            Redirect URL with Chrome Remote Debugging Android Devices

            Dieringhausen