Sort by pandas

Introduction

The contents related to sorting of DataFrame were only simple in Japanese, so I summarized them. I will touch on places where there is not much demand.

Confirmed to work with pandas 0.17.1.

This is the data used this time.

sort.py



import numpy as np
import pandas as pd

if __name__ == "__main__":

	df = pd.DataFrame([[1, 3, "Hokkaido"], [4, 5, "Tokyo"], [3, 5, "Saitama"], [6, 9, "Osaka"], [1, 1, "Aomori Prefecture"]])
	df.index = ["Suzuki", "Tanaka", "Kimura", "Endo", "Yoshida"]
	df.columns = ["Item 1", "Item 2", "Item 3"]

Item 1 Item 2 Item 3
Suzuki 1 3 Hokkaido
Tanaka 4 5 Tokyo
Kimura 3 5 Saitama
Endo 6 9 Osaka
Yoshida 1 1 Aomori Prefecture

Sort by number

If you try to arrange this table by item 1, df.sort_values (by = ["item 1 "], ascending = True) By

Item 1 Item 2 Item 3
Suzuki 1 3 Hokkaido
Yoshida 1 1 Aomori Prefecture
Kimura 3 5 Saitama
Tanaka 4 5 Tokyo
Endo 6 9 Osaka

They are arranged in ascending order of item 1 (in ascending order of value). If the value of item 1 is the same, it will be arranged depending on the order in the original table. At this time, if you change ʻascending = True to ʻascending = False, the items will be sorted in descending order (largest value).

If you want to depend on item 2 instead of the original order df.sort_values (by = ["item 1 "," item 2 "], ascending = True) By

Item 1 Item 2 Item 3
Yoshida 1 1 Aomori Prefecture
Suzuki 1 3 Hokkaido
Kimura 3 5 Saitama
Tanaka 4 5 Tokyo
Endo 6 9 Osaka

It will be. It's a little anomalous, but if you want to sort item 1 in ascending order and item 2 in descending order df.sort_values (by = ["item 1 "," item 2 "], ascending = [True, False]) You can sort by.

Sort by character

Next, sort item 3. As before df.sort_values (by = ["item 3 "], ascending = True) Then

Item 1 Item 2 Item 3
Suzuki 1 3 Hokkaido
Kimura 3 5 Saitama
Endo 6 9 Osaka
Tanaka 4 5 Tokyo
Yoshida 1 1 Aomori Prefecture

It will be sorted, but probably not what you want. Here is a list in the order you want to arrange, for example tdhk = ["Hokkaido", "Aomori", "Saitama", "Tokyo", "Osaka"] As df ["item 3"] = pd.Categorical (df ["item 3"], tdhk) When you insert, it will be sorted in the order of the list.

Item 1 Item 2 Item 3
Suzuki 1 3 Hokkaido
Yoshida 1 1 Aomori Prefecture
Kimura 3 5 Saitama
Tanaka 4 5 Tokyo
Endo 6 9 Osaka

Recommended Posts

Sort by pandas
[Python] Sort the table by sort_values (pandas DataFrame)
Sort by dict type value value
Pandas
Memorandum (pseudo Vlookup by pandas)
Standardize by group with pandas
Visualization memo by pandas, seaborn
Sort by file modification date
sort
Sort by date in python
[Python] Sort iterable by multiple conditions
Manipulating strings with pandas group by
Sort by specifying conditions in CASTable
Feature generation with pandas group by
Insertion sort
Pandas memo
Update Pandas DataFrame elements by column name
Pandas: groupby () to complete value by group
Pandas basics
Speed comparison when shifting by group by pandas
Pandas notes
Natural sort
Python # sort
Pandas of the beginner, by the beginner, for the beginner [Python]
Pandas basics
Bubble sort
Bubble sort
pandas memorandum
pandas memo
pandas SettingWithCopyWarning
When to_csv with Pandas, it became line by line
I tried to program bubble sort by language
Draw a graph by processing with Pandas groupby
Sort the elements of the array by specifying the conditions
100 language processing knock-99 (using pandas): visualization by t-SNE
[Python] How to sort instances by instance variables
Visualize how it is sorted by bubble sort
Python> Sort by number and sort by alphabet> Use sorted ()