chevron_left
Creating DataFrames Cookbook
Combining multiple Series into a DataFrameCombining multiple Series to form a DataFrameConverting a Series to a DataFrameConverting list of lists into DataFrameConverting list to DataFrameConverting percent string into a numeric for read_csvConverting scikit-learn dataset to Pandas DataFrameConverting string data into a DataFrameCreating a DataFrame from a stringCreating a DataFrame using listsCreating a DataFrame with different type for each columnCreating a DataFrame with empty valuesCreating a DataFrame with missing valuesCreating a DataFrame with random numbersCreating a DataFrame with zerosCreating a MultiIndex DataFrameCreating a Pandas DataFrameCreating a single DataFrame from multiple filesCreating empty DataFrame with only column labelsFilling missing values when using read_csvImporting DatasetInitialising a DataFrame using a constantInitialising a DataFrame using a dictionaryInitialising a DataFrame using a list of dictionariesInserting lists into a DataFrame cellKeeping leading zeroes when using read_csvParsing dates when using read_csvPreventing strings from getting parsed as NaN for read_csvReading data from GitHubReading file without headerReading large CSV files in chunksReading n random lines using read_csvReading space-delimited filesReading specific columns from fileReading tab-delimited filesReading the first few lines of a file to create DataFrameReading the last n lines of a fileReading URL using read_csvReading zipped csv file as a DataFrameRemoving Unnamed:0 columnResolving ParserError: Error tokenizing dataSaving DataFrame as zipped csvSkipping rows without skipping header for read_csvSpecifying data type for read_csvTreating missing values as empty strings rather than NaN for read_csv
0
0
0
new
Creating a single DataFrame from multiple files in Pandas
Programming
chevron_rightPython
chevron_rightPandas
chevron_rightCookbooks
chevron_rightDataFrame Cookbooks
chevron_rightCreating DataFrames Cookbook
schedule Mar 9, 2022
Last updated Python●Pandas
Tags tocTable of Contents
expand_more When files contain columns
Consider the following my_data_one.txt
file:
A,B3,45,6
And my_data_two.txt
file:
C,D10,1112,13
To read and combine the two files into a single DataFrame:
df_one = pd.read_csv("my_data_one.txt")df_two = pd.read_csv("my_data_two.txt")pd.concat([df_one, df_two], axis=1)
A B C D0 3 4 10 111 5 6 12 13
Here, axis=1
indicates that we want to concatenate the DataFrames horizontally, as opposed to vertically.
When files contain rows
Consider the following my_data_one.txt
file:
A,B3,45,6
And my_data_two.txt
:
A,B10,1112,13
To construct a single DataFrame from the two files:
df_one = pd.read_csv("my_data_one.txt")df_two = pd.read_csv("my_data_two.txt")df = pd.concat([df_one, df_two], axis=0, ignore_index=True)df
A B0 3 41 5 62 10 113 12 13
Note the following:
axis=0
forconcat(~)
means that we want to stack the DataFrames vertically, as opposed to horizontally.ignore_index=True
forconcat(~)
resets the index of the resulting DataFrame to the default integer indices ([0,1,2,3]
). Without this parameter, we would end up with duplicate index values ([0,1,0,1]
),
Related
Pandas | read_csv method
Reads a file, and parses its content into a DataFrame.
Pandas | concat method
Concatenates a list of Series or DataFrame, either horizontally or vertically.
Published by Isshin Inada
Edited by 0 others
Did you find this page useful?
Ask a question or leave a feedback...