A data frame object for handling structured data in Python. You can easily read files and perform subsequent SQL operations, and it is necessary for work such as machine learning to process, calculate, and visualize data. A memo list of commonly used syntaxes for data manipulation. This section is an overview of the data.
Import pandas with the name pd
python
import pandas as pd
Check the number of "dataflame".
python
print(len(dataflame))
python
dataflame.dtypes
python
dataflame.describe
Aggregate count (number of data), mean (mean), std (standard deviation), min (minimum), 25% etc. (quartile), max (maximum).
python
dataflame.describe(include='O')
It's o, not zero. Aggregate count (number of data), unique (number of unique data), top (value of the most frequently occurring element), freq (number of elements). If you want to display it together with the numerical value, use "describe (include ='all')".
Check if the value is taken correctly after reading the initial data and after joining.
python
dataflame.isnull().sum()
Recommended Posts