Brave Search

stackoverflow.com › questions › 23482668 › sorting-by-a-custom-list-in-pandas

The answer below is an old one, but it still works. Another very elegant solution, using the key argument, has also been posted.

I just discovered that with pandas 0.15.1 it is possible to use a categorical Series (https://pandas.pydata.org/docs/user_guide/categorical.html).

As for your example, let's define the same DataFrame and sorter:

import pandas as pd

data = {
    'id': [2967, 5335, 13950, 6141, 6169],
    'Player': ['Cedric Hunter', 'Maurice Baker',
               'Ratko Varda', 'Ryan Bowen', 'Adrian Caldwell'],
    'Year': [1991, 2004, 2001, 2009, 1997],
    'Age': [27, 25, 22, 34, 31],
    'Tm': ['CHH', 'VAN', 'TOT', 'OKC', 'DAL'],
    'G': [6, 7, 60, 52, 81]
}

# Create DataFrame
df = pd.DataFrame(data)

# Define the sorter
sorter = ['TOT', 'ATL', 'BOS', 'BRK', 'CHA', 'CHH', 'CHI', 'CLE', 'DAL', 'DEN',
          'DET', 'GSW', 'HOU', 'IND', 'LAC', 'LAL', 'MEM', 'MIA', 'MIL',
          'MIN', 'NJN', 'NOH', 'NOK', 'NOP', 'NYK', 'OKC', 'ORL', 'PHI',
          'PHO', 'POR', 'SAC', 'SAS', 'SEA', 'TOR', 'UTA', 'VAN', 'WAS', 'WSB']

With the DataFrame and the sorter, which defines the category order, we can do the following in pandas 0.15.1:

# Convert the Tm column to a categorical and set the sorter as the category order
# You could also do both steps in one line by appending .cat.set_categories(sorter)
df.Tm = df.Tm.astype("category")
df.Tm = df.Tm.cat.set_categories(sorter)

print(df.Tm)
Out[48]: 
0    CHH
1    VAN
2    TOT
3    OKC
4    DAL
Name: Tm, dtype: category
Categories (38, object): [TOT < ATL < BOS < BRK ... UTA < VAN < WAS < WSB]
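
The two steps above can also be collapsed into a single cast. The following is a minimal sketch, not part of the original answer: it assumes a shortened, hypothetical sorter and frame, and it uses pd.CategoricalDtype with ordered=True, which is an addition made here so that the categories are explicitly ordered.

```python
import pandas as pd

# Hypothetical, shortened sorter and frame used only for illustration
sorter = ['TOT', 'CHH', 'DAL', 'OKC', 'VAN']
df = pd.DataFrame({
    'Tm': ['CHH', 'VAN', 'TOT', 'OKC', 'DAL'],
    'G': [6, 7, 60, 52, 81],
})

# Build an ordered categorical dtype from the sorter and cast in one step
tm_dtype = pd.CategoricalDtype(categories=sorter, ordered=True)
df['Tm'] = df['Tm'].astype(tm_dtype)

# Sorting now follows the order of `sorter`, not alphabetical order
result = df.sort_values('Tm')
print(result)
```

With an explicitly ordered dtype, comparisons such as df['Tm'] < 'OKC' also follow the custom order.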

df.sort_values(["Tm"])  # older pandas versions used 'sort'; it is now 'sort_values'
Out[49]: 
   Age   G           Player   Tm  Year     id
2   22  60      Ratko Varda  TOT  2001  13950
0   27   6    Cedric Hunter  CHH  1991   2967
4   31  81  Adrian Caldwell  DAL  1997   6169
3   34  52       Ryan Bowen  OKC  2009   6141
1   25   7    Maurice Baker  VAN  2004   5335
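
The key argument mentioned at the top of this answer offers an alternative that leaves the column's dtype untouched. The following is a rough sketch of that idea, assuming pandas 1.1 or later (where sort_values accepts key); the rank dictionary and the shortened sorter are hypothetical helpers introduced here, not taken from the other answer.

```python
import pandas as pd

# Hypothetical, shortened data and sorter for illustration
df = pd.DataFrame({
    'Player': ['Cedric Hunter', 'Maurice Baker', 'Ratko Varda',
               'Ryan Bowen', 'Adrian Caldwell'],
    'Tm': ['CHH', 'VAN', 'TOT', 'OKC', 'DAL'],
})
sorter = ['TOT', 'CHH', 'DAL', 'OKC', 'VAN']

# Map each team to its position in the custom order
rank = {team: position for position, team in enumerate(sorter)}

# `key` receives the column as a Series and must return a Series of sort keys
result = df.sort_values('Tm', key=lambda column: column.map(rank))
print(result)
```

Teams missing from rank would map to NaN and be placed according to na_position, so this approach also expects the list to cover the values of interest.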

Answer from dmeu on Stack Overflow

Pandas

pandas.pydata.org › docs › reference › api › pandas.DataFrame.sort_values.html

pandas.DataFrame.sort_values — pandas 3.0.2 documentation

For DataFrames, this option is only applied when sorting on a single column or label. na_position{‘first’, ‘last’}, default ‘last’ · Puts NaNs at the beginning if first; last puts NaNs at the end. ... If True, the resulting axis will be labeled 0, 1, …, n - 1. ... Apply the key function to the values before sorting.
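
The options named in this snippet can be tried on a small frame. This is a minimal sketch with made-up data; it assumes pandas 1.1 or later so that na_position, ignore_index, and key are all available.

```python
import numpy as np
import pandas as pd

# Made-up data for illustration
scores = pd.DataFrame({
    'name': ['b', 'A', 'c', 'D'],
    'score': [2.0, np.nan, 1.0, 3.0],
})

# NaNs first, and relabel the resulting index 0..n-1
nan_first = scores.sort_values('score', na_position='first', ignore_index=True)

# Apply a key function to the values before sorting (case-insensitive here)
by_name = scores.sort_values('name', key=lambda s: s.str.lower())

print(nan_first)
print(by_name)
```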

GeeksforGeeks

geeksforgeeks.org › pandas › python-pandas-dataframe-sort_values-set-1

Pandas Dataframe.sort_values() - GeeksforGeeks

November 29, 2024 - In Pandas, sort_values() function sorts a DataFrame by one or more columns in ascending or descending order.

Discussions

python - sorting by a custom list in pandas - Stack Overflow

Naturally, since you must construct your custom list with all possible values in your sort field, this is good mostly for categorical sorting and would not be suitable for continuous variables (unless the possible values are known up front) and columns with a very high cardinality. import pandas as ... More on stackoverflow.com
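
The requirement that the custom list cover every value can be seen directly with the categorical approach: a value that is missing from the list becomes NaN. A small sketch with hypothetical values:

```python
import pandas as pd

# 'XYZ' is deliberately absent from the sorter (hypothetical values)
teams = pd.Series(['CHH', 'VAN', 'XYZ'])
sorter = ['VAN', 'CHH']

as_category = teams.astype('category').cat.set_categories(sorter)

# Values not listed in the new categories are replaced by NaN
print(as_category)

# NaN entries are placed last by default when sorting
print(as_category.sort_values())
```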

python - Why does my Pandas DataFrame not display new order using `sort_values`? - Stack Overflow

I have a Pandas DataFrame of register transactions with shape like (500,4): Time datetime64[ns] Net Total float64 Tax float64 Total Due float64 · I'm working through my code in a Python3 Jupyter notebook. I can't get past sorting any column. Working through the different code examples for sort, I'm not seeing the output reorder when I inspect the df. So, I've reduced the problem to trying to order just one column: df.sort_values... More on stackoverflow.com
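
The snippet does not show the resolution, but one common cause of this symptom is that sort_values returns a new DataFrame and leaves the original unchanged unless the result is assigned back. A minimal sketch of that behavior, using a made-up column of values:

```python
import pandas as pd

# Made-up values; the column name mirrors the question's 'Net Total'
df = pd.DataFrame({'Net Total': [30.0, 10.0, 20.0]})

# The sorted result is discarded here, so df keeps its original order
df.sort_values('Net Total')
unchanged = df['Net Total'].tolist()

# Assigning the result back (or passing inplace=True) makes the order stick
df = df.sort_values('Net Total')
print(unchanged, df['Net Total'].tolist())
```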

Sort_values() doesn't work

I have calculated the “earn >50K countries”, and used “.sort_values(ascending=False)” to sort the highest value (Iran). However, it becomes Haiti, which is obviously not the highest value in the DataFrame. Can anyone please help me figure out why “sort_values()” doesn’t work in ... More on forum.freecodecamp.org

October 1, 2022

[Pandas] What does "sort_values.index" do in this code?

It might be that seeing the value of some parts of the dataframe and groupby object will be clearer than any description. I would get on an interactive REPL or Jupyter, examine the subexpressions used, and see how the data is rearranged at each step:

d.head()
d.groupby('class')['survived']
d.groupby('class')['survived'].mean()
d.groupby('class')['survived'].mean().sort_values()

A lot of pandas operations can best be learned by trial and error in an interactive session: try everything out with variations, make errors to find the limits and to get used to the terminology used in tracebacks, and experiment to solidify your understanding. More on reddit.com
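
The step-by-step inspection suggested here can be sketched on a tiny, made-up frame. The names d, class, and survived follow the snippet; the data is hypothetical.

```python
import pandas as pd

# Hypothetical data standing in for the dataset discussed in the thread
d = pd.DataFrame({
    'class': ['First', 'First', 'Second', 'Second', 'Third', 'Third', 'Third'],
    'survived': [1, 1, 1, 0, 0, 0, 1],
})

# Mean of 'survived' per class, indexed by class label
mean_by_class = d.groupby('class')['survived'].mean()

# Sort the means in ascending order; the class labels travel with the values
ordered = mean_by_class.sort_values()

# .index on the sorted result gives the class labels in sorted order
labels = ordered.index.tolist()
print(ordered)
print(labels)
```

So an expression ending in .sort_values().index yields the group labels ranked by the aggregated value, which is often used to order categories in a plot.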

r/learnpython

July 26, 2022
