search
Search
Login
Unlock 100+ guides
menu
menu
web
search toc
close
Comments
Log in or sign up
Cancel
Post
account_circle
Profile
exit_to_app
Sign out
What does this mean?
Why is this true?
Give me some examples!
search
keyboard_voice
close
Searching Tips
Search for a recipe:
"Creating a table in MySQL"
Search for an API documentation: "@append"
Search for code: "!dataframe"
Apply a tag filter: "#python"
Useful Shortcuts
/ to open search panel
Esc to close search panel
to navigate between search results
d to clear all current filters
Enter to expand content preview
icon_star
Doc Search
icon_star
Code Search Beta
SORRY NOTHING FOUND!
mic
Start speaking...
Voice search is only supported in Safari and Chrome.
Navigate to
chevron_leftCreating DataFrames Cookbook
Combining multiple Series into a DataFrameCombining multiple Series to form a DataFrameConverting a Series to a DataFrameConverting list of lists into DataFrameConverting list to DataFrameConverting percent string into a numeric for read_csvConverting scikit-learn dataset to Pandas DataFrameConverting string data into a DataFrameCreating a DataFrame from a stringCreating a DataFrame using listsCreating a DataFrame with different type for each columnCreating a DataFrame with empty valuesCreating a DataFrame with missing valuesCreating a DataFrame with random numbersCreating a DataFrame with zerosCreating a MultiIndex DataFrameCreating a Pandas DataFrameCreating a single DataFrame from multiple filesCreating empty DataFrame with only column labelsFilling missing values when using read_csvImporting DatasetImporting tables from PostgreSQL as Pandas DataFramesInitialising a DataFrame using a constantInitialising a DataFrame using a dictionaryInitialising a DataFrame using a list of dictionariesInserting lists into a DataFrame cellKeeping leading zeroes when using read_csvParsing dates when using read_csvPreventing strings from getting parsed as NaN for read_csvReading data from GitHubReading file without headerReading large CSV files in chunksReading n random lines using read_csvReading space-delimited filesReading specific columns from fileReading tab-delimited filesReading the first few lines of a file to create DataFrameReading the last n lines of a fileReading URL using read_csvReading zipped csv file as a DataFrameRemoving Unnamed:0 columnResolving ParserError: Error tokenizing dataSaving DataFrame as zipped csvSkipping rows without skipping header for read_csvSpecifying data type for read_csvTreating missing values as empty strings rather than NaN for read_csv
check_circle
Mark as learned
thumb_up
0
thumb_down
0
chat_bubble_outline
0
Comment
auto_stories Bi-column layout
settings

Importing Dataset in Pandas

schedule Aug 10, 2023
Last updated
local_offer
PandasPython
Tags
mode_heat
Master the mathematics behind data science with 100+ top-tier guides
Start your free 7-days trial now!

To import and read a CSV file as a DataFrame, use Pandas read_csv(~) method.

NOTE

read_csv(~) is a misnomer. Despite having csv in its name, the method can be used to import datasets that use delimiters (separators) other than comma.

Datasets with Column Labels and Indices

Suppose we wanted to import the following file called my_file.csv:

a,b,c
A,1,2,3
B,4,5,6

Note the following:

  • the column labels are a, b and c.

  • the row labels are A and B.

To read this file as a DataFrame:

import pandas as pd

df = pd.read_csv("my_file.csv")
print(df)
a b c
A 1 2 3
B 4 5 6

Note that this code assumes the file to be located in the same directory.

Datasets with Only Values

Suppose our CSV dataset only consisted of values:

1,2,3
4,5,6

To import this file, we need to add the header=None option:

df = pd.read_csv("my_file.csv", header=None)
print(df)
0 1 2
0 1 2 3
1 4 5 6

As we can see, Pandas uses the default integer indices for its column and row labels.

Datasets with Custom Delimiters

The read_csv() method can also be used to import datasets that use delimiters other than comma. We can specify the delimiter by supplying the delimiters parameter.

As an example, suppose our dataset uses a single blank space as the delimiter:

a b c
A 1 2 3
B 4 5 6

To read this file:

df = pd.read_csv("my_file.csv", delimiter=" ")
print(df)
a b c
A 1 2 3
B 4 5 6
robocat
Published by Isshin Inada
Edited by 0 others
Did you find this page useful?
thumb_up
thumb_down
Comment
Citation
Ask a question or leave a feedback...
thumb_up
0
thumb_down
0
chat_bubble_outline
0
settings
Enjoy our search
Hit / to insta-search docs and recipes!