[Solved] How to load CSV file in Jupyter Notebook?

I’m new and studying machine learning. I stumble upon a tutorial I found online and I’d like to make the program work so I’ll get a better understanding. However, I’m getting problems about loading the CSV File into the Jupyter Notebook.

I get this error:

File "<ipython-input-2-70e07fb5b537>", line 2
    student_data = pd.read_csv("C:UsersxxxxDesktopstudent-intervention- 
SyntaxError: (unicode error) 'unicodeescape' codec can't decode bytes in 
position 2-3: truncated UXXXXXXXX escape

and here is the code:
enter image description here

I followed tutorials online regarding this error but none worked. Does anyone know how to fix it?

3rd attempt with r”path”
enter image description here

I’ve tried also “” and utf-8 but none worked.

I’m using the latest version of Anaconda
Windows 7
Python 3.7

Enquirer: stella


Solution #1:

Use raw string notation for your Windows path. In python ” have meaning in python. Try instead do string like this r”path”:

student_data = pd.read_csv(r"C:UsersxxxxDesktopstudent-intervention-

If it doesnt work try this way:

import os

path = os.path.join('c:' + os.sep, 'Users', 'xxxx', 'Desktop', 'student-intervention-system', 'student-data.csv')
student_data = pd.read_csv(path)
Respondent: stella

Solution #2:

I had the same problem. I tried to encode it with ‘Latin-1’ and it worked for me.

    autos = pd.read_csv('filename',encoding = "Latin-1")
Respondent: Bohun Mielecki

Solution #3:

Either replace all backslashes with frontslashes / or place a r before your filepath string to avoid this error. It is not a matter of your folder name being too long.

As Bohun Mielecki mentioned, the character which is typically used to denote file structure in Windows has a different function when written within a string.

From Python3 Documentation: The backslash character is used to escape characters that otherwise have a special meaning, such as newline, backslash itself, or the quote character.

How this particularly affects your statement is that in the line

student_data = pd.read_csv("C:UsersxxxxDesktopstudent-intervention- 

Users matches the escape sequence Uxxxxxxxx whereby xxxxxxxx refers to a Character with 32-bit hex value xxxxxxxx. Because of this, Python tries to find a 32-bit hex value. However as the -sers from Users doesn’t match the xxxxxxxx format, you get the error:

SyntaxError: (unicode error) ‘unicodeescape’ codec can’t decode bytes in
position 2-3: truncated UXXXXXXXX escape

The reason why your code works now is that you have placed a r in front of 'C:UsersxxxxDesktopprojectstudent-data.csv'. This tells python not to process the backslash character / as it usually does and read the whole string as-is.

I hope this helps you better understand your problem. If you need any more clarification, do let me know.

Source: Python 3 Documentation

Respondent: Yuwe711

Solution #4:

Try this student_data = pd.read_csv("C:/Users/xxxx/Desktop/student-intervention-

Replacing backslashes in that code it will work for you.

Respondent: MoltenMuffins

Solution #5:


pd.read_csv('file_name',encoding = "utf-8")
Respondent: Qasim ali shah

Solution #6:

I found the problem. The problem is my folder name that is really long. I changed my folder name into “project” and the data is now finally loaded! Silly!

enter image description here

Respondent: balderman

Solution #7:

Please open notepad, write csv format data into the file and opt ‘Save As’ to save the file with format .csv.
E.g. Train.csv

Use this file, ensure you mention the same path correctly for the above saved CSV file during python coding.

Import pandas as pd

I’ve seen people using existing .txt/other format files to covert their format to .csv by just renaming. Which actually does nothing than changing the name of the file. It doesn’t become a CSV file at all.
Hope this helps. ??

Respondent: stella

Solution #8:

import pandas as pd
data=pd.read_csv(“C:UsersssDesktopfile or csv file name.csv”)
just place the csv file on the desktop

Respondent: Kiran Vaichalkar

The answers/resolutions are collected from stackoverflow, are licensed under cc by-sa 2.5 , cc by-sa 3.0 and cc by-sa 4.0 .

Most Popular

To Top
India and Pakistan’s steroid-soaked rhetoric over Kashmir will come back to haunt them both clenbuterol australia bossier man pleads guilty for leadership role in anabolic steriod distribution conspiracy