So I’m trying to open and read this csv file into Spyder IDE and then the IDLE Shell as well. Both environments produce an error when I attempt this, and they both tell me:
‘utf-8’ codec can’t decode byte 0xf1 in position 8982: invalid continuation byte
Reading around online, it appears to be something relating to a “codec registry”, which apparently relates to how the machine or interpreter is meant to interpret the code characters for various human languages. My data set is a Honduran household survey, and while most the answers are numeric, some are in Spanish, and the overall language used here is the Spanish language – although it does look like all the data was actually entered by an English language user because there are no distinctly Spanish characters (such as ñ) anywhere throughout a rather large text.
I’ve researched codecs some online hoping there is an easy code patch or import or module to make some type of conversion. But all I find are dense reading, maybe 10-20 pages, most of which seems unrelated to what I’m trying to do. But I can’t get my bearings within all of that to find some direction.
I have actually loaded, opened, read, selected, printed etc etc this exact dataset with SQL in Big Querry. And also with R language in Posit there was no problem to work with this data set.
Does anyone have any suggestions to get over this hurdle here? Thanks
submitted by /u/MiracleNamedHope
[link] [comments]
r/learnpython So I’m trying to open and read this csv file into Spyder IDE and then the IDLE Shell as well. Both environments produce an error when I attempt this, and they both tell me: ‘utf-8’ codec can’t decode byte 0xf1 in position 8982: invalid continuation byte Reading around online, it appears to be something relating to a “codec registry”, which apparently relates to how the machine or interpreter is meant to interpret the code characters for various human languages. My data set is a Honduran household survey, and while most the answers are numeric, some are in Spanish, and the overall language used here is the Spanish language – although it does look like all the data was actually entered by an English language user because there are no distinctly Spanish characters (such as ñ) anywhere throughout a rather large text. I’ve researched codecs some online hoping there is an easy code patch or import or module to make some type of conversion. But all I find are dense reading, maybe 10-20 pages, most of which seems unrelated to what I’m trying to do. But I can’t get my bearings within all of that to find some direction. I have actually loaded, opened, read, selected, printed etc etc this exact dataset with SQL in Big Querry. And also with R language in Posit there was no problem to work with this data set. Does anyone have any suggestions to get over this hurdle here? Thanks submitted by /u/MiracleNamedHope [link] [comments]
So I’m trying to open and read this csv file into Spyder IDE and then the IDLE Shell as well. Both environments produce an error when I attempt this, and they both tell me:
‘utf-8’ codec can’t decode byte 0xf1 in position 8982: invalid continuation byte
Reading around online, it appears to be something relating to a “codec registry”, which apparently relates to how the machine or interpreter is meant to interpret the code characters for various human languages. My data set is a Honduran household survey, and while most the answers are numeric, some are in Spanish, and the overall language used here is the Spanish language – although it does look like all the data was actually entered by an English language user because there are no distinctly Spanish characters (such as ñ) anywhere throughout a rather large text.
I’ve researched codecs some online hoping there is an easy code patch or import or module to make some type of conversion. But all I find are dense reading, maybe 10-20 pages, most of which seems unrelated to what I’m trying to do. But I can’t get my bearings within all of that to find some direction.
I have actually loaded, opened, read, selected, printed etc etc this exact dataset with SQL in Big Querry. And also with R language in Posit there was no problem to work with this data set.
Does anyone have any suggestions to get over this hurdle here? Thanks
submitted by /u/MiracleNamedHope
[link] [comments]