handlee and convert multiple data types in the same column to numeric





.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty{ height:90px;width:728px;box-sizing:border-box;
}







-1















I am trying to clean my Dataframe. The problem is that I want to convert more than 300 different columns to numeric, but in the same column I have multiple data types.
As an example of DataFrame:



ID  MONOCITOS   EOSIN    EOSINOFILOS NORMOBLASTOS   
0 5 0.21 2 0 0.04 31.0
2 9 <0.22 False 0 0.04 33.0
5 12.8 0.16 0 0 0.02 sdfdr
6 No 0 fh 0 0.02 60.0
9 0 0.28 3 - 0.06 Nan
14 3 - 3 - - 59.0


What is the best way to convert a column with different data types to numeric? Is there any module to perform this task automatically?
Thanks










share|improve this question























  • I dont know a tool but you can go trough the columns and write a function which do this programmaticly. In Easy cases you can use: pd.to_numeric pandas.pydata.org/pandas-docs/stable/generated/…

    – MisterMonk
    Nov 26 '18 at 22:46








  • 1





    Use df = df.apply(pd.to_numeric, errors='coerce')

    – Abhi
    Nov 26 '18 at 22:47











  • I have tried this solution, but it doesn´t work. As some values of the column are string data type the whole column type remains as an object type

    – Ley
    Nov 27 '18 at 9:39


















-1















I am trying to clean my Dataframe. The problem is that I want to convert more than 300 different columns to numeric, but in the same column I have multiple data types.
As an example of DataFrame:



ID  MONOCITOS   EOSIN    EOSINOFILOS NORMOBLASTOS   
0 5 0.21 2 0 0.04 31.0
2 9 <0.22 False 0 0.04 33.0
5 12.8 0.16 0 0 0.02 sdfdr
6 No 0 fh 0 0.02 60.0
9 0 0.28 3 - 0.06 Nan
14 3 - 3 - - 59.0


What is the best way to convert a column with different data types to numeric? Is there any module to perform this task automatically?
Thanks










share|improve this question























  • I dont know a tool but you can go trough the columns and write a function which do this programmaticly. In Easy cases you can use: pd.to_numeric pandas.pydata.org/pandas-docs/stable/generated/…

    – MisterMonk
    Nov 26 '18 at 22:46








  • 1





    Use df = df.apply(pd.to_numeric, errors='coerce')

    – Abhi
    Nov 26 '18 at 22:47











  • I have tried this solution, but it doesn´t work. As some values of the column are string data type the whole column type remains as an object type

    – Ley
    Nov 27 '18 at 9:39














-1












-1








-1








I am trying to clean my Dataframe. The problem is that I want to convert more than 300 different columns to numeric, but in the same column I have multiple data types.
As an example of DataFrame:



ID  MONOCITOS   EOSIN    EOSINOFILOS NORMOBLASTOS   
0 5 0.21 2 0 0.04 31.0
2 9 <0.22 False 0 0.04 33.0
5 12.8 0.16 0 0 0.02 sdfdr
6 No 0 fh 0 0.02 60.0
9 0 0.28 3 - 0.06 Nan
14 3 - 3 - - 59.0


What is the best way to convert a column with different data types to numeric? Is there any module to perform this task automatically?
Thanks










share|improve this question














I am trying to clean my Dataframe. The problem is that I want to convert more than 300 different columns to numeric, but in the same column I have multiple data types.
As an example of DataFrame:



ID  MONOCITOS   EOSIN    EOSINOFILOS NORMOBLASTOS   
0 5 0.21 2 0 0.04 31.0
2 9 <0.22 False 0 0.04 33.0
5 12.8 0.16 0 0 0.02 sdfdr
6 No 0 fh 0 0.02 60.0
9 0 0.28 3 - 0.06 Nan
14 3 - 3 - - 59.0


What is the best way to convert a column with different data types to numeric? Is there any module to perform this task automatically?
Thanks







python pandas types numeric






share|improve this question













share|improve this question











share|improve this question




share|improve this question










asked Nov 26 '18 at 22:43









LeyLey

193




193













  • I dont know a tool but you can go trough the columns and write a function which do this programmaticly. In Easy cases you can use: pd.to_numeric pandas.pydata.org/pandas-docs/stable/generated/…

    – MisterMonk
    Nov 26 '18 at 22:46








  • 1





    Use df = df.apply(pd.to_numeric, errors='coerce')

    – Abhi
    Nov 26 '18 at 22:47











  • I have tried this solution, but it doesn´t work. As some values of the column are string data type the whole column type remains as an object type

    – Ley
    Nov 27 '18 at 9:39



















  • I dont know a tool but you can go trough the columns and write a function which do this programmaticly. In Easy cases you can use: pd.to_numeric pandas.pydata.org/pandas-docs/stable/generated/…

    – MisterMonk
    Nov 26 '18 at 22:46








  • 1





    Use df = df.apply(pd.to_numeric, errors='coerce')

    – Abhi
    Nov 26 '18 at 22:47











  • I have tried this solution, but it doesn´t work. As some values of the column are string data type the whole column type remains as an object type

    – Ley
    Nov 27 '18 at 9:39

















I dont know a tool but you can go trough the columns and write a function which do this programmaticly. In Easy cases you can use: pd.to_numeric pandas.pydata.org/pandas-docs/stable/generated/…

– MisterMonk
Nov 26 '18 at 22:46







I dont know a tool but you can go trough the columns and write a function which do this programmaticly. In Easy cases you can use: pd.to_numeric pandas.pydata.org/pandas-docs/stable/generated/…

– MisterMonk
Nov 26 '18 at 22:46






1




1





Use df = df.apply(pd.to_numeric, errors='coerce')

– Abhi
Nov 26 '18 at 22:47





Use df = df.apply(pd.to_numeric, errors='coerce')

– Abhi
Nov 26 '18 at 22:47













I have tried this solution, but it doesn´t work. As some values of the column are string data type the whole column type remains as an object type

– Ley
Nov 27 '18 at 9:39





I have tried this solution, but it doesn´t work. As some values of the column are string data type the whole column type remains as an object type

– Ley
Nov 27 '18 at 9:39












1 Answer
1






active

oldest

votes


















0














You can convert multiple columns like this:



df[["col1", "col2"]] = df[["col1", "col2"]].apply(pd.to_numeric, errors='coerce')


or change the entire dataframe:



df = df.apply(pd.to_numeric, errors='coerce')


coerce will cause the non-convertible values to be NaN






share|improve this answer


























  • This is not suitable for "300 different columns"

    – MisterMonk
    Nov 26 '18 at 22:48











  • You could go over a set of columns at a time. For instance, use a loop that would convert 10 cols at a time

    – Pedro Torres
    Nov 26 '18 at 22:49











  • Yeah like:: for c in df.columns: column = df[c]

    – MisterMonk
    Nov 26 '18 at 22:53













  • You can also just pass the whole df at once: df=df.apply(pd.to_numeric, errors='coerce')

    – G. Anderson
    Nov 26 '18 at 23:03











  • True. Passing the column names was just in the case trying to change it all at the same time could be too much. I will change to make it more explicit

    – Pedro Torres
    Nov 26 '18 at 23:05












Your Answer






StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});


}
});














draft saved

draft discarded


















StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53490234%2fhandlee-and-convert-multiple-data-types-in-the-same-column-to-numeric%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown

























1 Answer
1






active

oldest

votes








1 Answer
1






active

oldest

votes









active

oldest

votes






active

oldest

votes









0














You can convert multiple columns like this:



df[["col1", "col2"]] = df[["col1", "col2"]].apply(pd.to_numeric, errors='coerce')


or change the entire dataframe:



df = df.apply(pd.to_numeric, errors='coerce')


coerce will cause the non-convertible values to be NaN






share|improve this answer


























  • This is not suitable for "300 different columns"

    – MisterMonk
    Nov 26 '18 at 22:48











  • You could go over a set of columns at a time. For instance, use a loop that would convert 10 cols at a time

    – Pedro Torres
    Nov 26 '18 at 22:49











  • Yeah like:: for c in df.columns: column = df[c]

    – MisterMonk
    Nov 26 '18 at 22:53













  • You can also just pass the whole df at once: df=df.apply(pd.to_numeric, errors='coerce')

    – G. Anderson
    Nov 26 '18 at 23:03











  • True. Passing the column names was just in the case trying to change it all at the same time could be too much. I will change to make it more explicit

    – Pedro Torres
    Nov 26 '18 at 23:05
















0














You can convert multiple columns like this:



df[["col1", "col2"]] = df[["col1", "col2"]].apply(pd.to_numeric, errors='coerce')


or change the entire dataframe:



df = df.apply(pd.to_numeric, errors='coerce')


coerce will cause the non-convertible values to be NaN






share|improve this answer


























  • This is not suitable for "300 different columns"

    – MisterMonk
    Nov 26 '18 at 22:48











  • You could go over a set of columns at a time. For instance, use a loop that would convert 10 cols at a time

    – Pedro Torres
    Nov 26 '18 at 22:49











  • Yeah like:: for c in df.columns: column = df[c]

    – MisterMonk
    Nov 26 '18 at 22:53













  • You can also just pass the whole df at once: df=df.apply(pd.to_numeric, errors='coerce')

    – G. Anderson
    Nov 26 '18 at 23:03











  • True. Passing the column names was just in the case trying to change it all at the same time could be too much. I will change to make it more explicit

    – Pedro Torres
    Nov 26 '18 at 23:05














0












0








0







You can convert multiple columns like this:



df[["col1", "col2"]] = df[["col1", "col2"]].apply(pd.to_numeric, errors='coerce')


or change the entire dataframe:



df = df.apply(pd.to_numeric, errors='coerce')


coerce will cause the non-convertible values to be NaN






share|improve this answer















You can convert multiple columns like this:



df[["col1", "col2"]] = df[["col1", "col2"]].apply(pd.to_numeric, errors='coerce')


or change the entire dataframe:



df = df.apply(pd.to_numeric, errors='coerce')


coerce will cause the non-convertible values to be NaN







share|improve this answer














share|improve this answer



share|improve this answer








edited Nov 26 '18 at 23:06

























answered Nov 26 '18 at 22:47









Pedro TorresPedro Torres

718413




718413













  • This is not suitable for "300 different columns"

    – MisterMonk
    Nov 26 '18 at 22:48











  • You could go over a set of columns at a time. For instance, use a loop that would convert 10 cols at a time

    – Pedro Torres
    Nov 26 '18 at 22:49











  • Yeah like:: for c in df.columns: column = df[c]

    – MisterMonk
    Nov 26 '18 at 22:53













  • You can also just pass the whole df at once: df=df.apply(pd.to_numeric, errors='coerce')

    – G. Anderson
    Nov 26 '18 at 23:03











  • True. Passing the column names was just in the case trying to change it all at the same time could be too much. I will change to make it more explicit

    – Pedro Torres
    Nov 26 '18 at 23:05



















  • This is not suitable for "300 different columns"

    – MisterMonk
    Nov 26 '18 at 22:48











  • You could go over a set of columns at a time. For instance, use a loop that would convert 10 cols at a time

    – Pedro Torres
    Nov 26 '18 at 22:49











  • Yeah like:: for c in df.columns: column = df[c]

    – MisterMonk
    Nov 26 '18 at 22:53













  • You can also just pass the whole df at once: df=df.apply(pd.to_numeric, errors='coerce')

    – G. Anderson
    Nov 26 '18 at 23:03











  • True. Passing the column names was just in the case trying to change it all at the same time could be too much. I will change to make it more explicit

    – Pedro Torres
    Nov 26 '18 at 23:05

















This is not suitable for "300 different columns"

– MisterMonk
Nov 26 '18 at 22:48





This is not suitable for "300 different columns"

– MisterMonk
Nov 26 '18 at 22:48













You could go over a set of columns at a time. For instance, use a loop that would convert 10 cols at a time

– Pedro Torres
Nov 26 '18 at 22:49





You could go over a set of columns at a time. For instance, use a loop that would convert 10 cols at a time

– Pedro Torres
Nov 26 '18 at 22:49













Yeah like:: for c in df.columns: column = df[c]

– MisterMonk
Nov 26 '18 at 22:53







Yeah like:: for c in df.columns: column = df[c]

– MisterMonk
Nov 26 '18 at 22:53















You can also just pass the whole df at once: df=df.apply(pd.to_numeric, errors='coerce')

– G. Anderson
Nov 26 '18 at 23:03





You can also just pass the whole df at once: df=df.apply(pd.to_numeric, errors='coerce')

– G. Anderson
Nov 26 '18 at 23:03













True. Passing the column names was just in the case trying to change it all at the same time could be too much. I will change to make it more explicit

– Pedro Torres
Nov 26 '18 at 23:05





True. Passing the column names was just in the case trying to change it all at the same time could be too much. I will change to make it more explicit

– Pedro Torres
Nov 26 '18 at 23:05




















draft saved

draft discarded




















































Thanks for contributing an answer to Stack Overflow!


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53490234%2fhandlee-and-convert-multiple-data-types-in-the-same-column-to-numeric%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Wiesbaden

Marschland

Dieringhausen