What This is an article that summarizes what I noticed and researched when learning machine learning using Chainer. This time, I will study scikit-learn.
It is written based on my understanding, so it may be incorrect. I will correct any mistakes, please forgive me.
Content Pandas
Pandas is a package often used for data manipulation, and it handles data handled by machine learning methods such as reading data saved in a general data format such as CSV and extracting some data by specifying conditions. Useful for organizing.
Well, it's like a calculation tool. It is a tool-like standing position that makes calculations easier, such as an abacus or a calculator. By the way, it seems that the following operations can be performed.
-Reading and writing CSV files ・ Calculation of statistics ·Sorting ・ Data selection ・ Selection by specifying conditions -Removal of missing values / interpolation -Convert ndarray and data frame to each other ・ Graph drawing
Matplotlib A library where you can draw graphs. Lines, scatter plots, box plots, etc. There is nothing special to mention.
In addition, I will add it when there is a point to suppress while writing the code
Recommended Posts