If I understand you correctly, you are describing the following:
Setup
import pandas as pd

d = {'id1': ['x', 'x'], 'id2': ['z', 'w'], 'metric': [100, 10]}
df = pd.DataFrame(data=d)
df
Solution
# Manually choose the value by which to scale the column 'metric'
scaler = df.loc[(df['id1'] == 'x') & (df['id2'] == 'z'), 'metric'].values
# Divide all 'metric' values by the above scaler value
df['result'] = df['metric'] / scaler
df
id1 id2 metric result
0 x z 100 1.0
1 x w 10 0.1
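Note that `scaler` above is a one-element NumPy array, since `.values` was used. If you would rather have a plain scalar - for instance so that an accidental multi-row match raises an error instead of silently broadcasting - one option is `Series.item()`. A minimal sketch with the same data:

```python
import pandas as pd

d = {'id1': ['x', 'x'], 'id2': ['z', 'w'], 'metric': [100, 10]}
df = pd.DataFrame(data=d)

# .item() returns the single matched value as a Python scalar,
# and raises ValueError if the filter matches zero or multiple rows
scaler = df.loc[(df['id1'] == 'x') & (df['id2'] == 'z'), 'metric'].item()
df['result'] = df['metric'] / scaler
```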
Answer from Peter Leimbigler on Stack Overflow
Hi guys,
Fairly new to Python - I'll try to be as clear as I can.
I have a dummy DF with some columns of appointments in healthcare - Surgeon, Conversion (1/0 or Yes/No).
I want to create a new column with "Conversion Rate", i.e. what % of the surgeon's appointments resulted in a Conversion (1/True).
In plain English that looks like: if the surgeon matches the one highlighted, return sum(Conversion) / count(Surgeon) - but I'm struggling to get that delivered and applied via a function.
Here's my rough stab below, but any thoughts/advice appreciated!
I am using pandas to do this, and I do have other columns, so I need to create a new column with just this ratio rather than any over-arching DF operations. So if a surgeon's name appears 17 times, I want the conversion rate column to show the same rate for that surgeon each time.
def convrate(surgeon):
    conv_pts = 0
    non_conv = 0
    for _, row in df.iterrows():
        if row['Surgeon'] == surgeon and row['Preop'] == 1:
            if row['Conversion'] == 1:
                conv_pts += 1
            else:
                non_conv += 1
    return conv_pts / (conv_pts + non_conv)
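A sketch of one common way to get what the question asks for: `groupby().transform('mean')` computes each surgeon's rate and broadcasts it back onto every row for that surgeon, so repeated appearances all show the same value. The column names mirror the question; the sample data here is made up:

```python
import pandas as pd

# Hypothetical appointment data in the shape described above
df = pd.DataFrame({
    'Surgeon': ['Smith', 'Smith', 'Jones', 'Smith', 'Jones'],
    'Conversion': [1, 0, 1, 1, 0],
})

# transform('mean') is sum(Conversion) / count(Surgeon) per surgeon,
# aligned back to the original rows
df['Conversion Rate'] = df.groupby('Surgeon')['Conversion'].transform('mean')
```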
Create all pair combinations of column names, then loop and divide into new columns:
from itertools import combinations
for a, b in combinations(df.columns, 2):
df[f'{a}/{b}'] = df[a].div(df[b])
Or use a list comprehension, join the ratios together with concat, and add the original columns back with join:
df = df.join(pd.concat([df[a].div(df[b]).rename(f'{a}/{b}')
                        for a, b in combinations(df.columns, 2)], axis=1))
print(df)
x1 x2 x3 x4 x1/x2 x1/x3 x1/x4 x2/x3 x2/x4 x3/x4
0 4 7 1 5 0.571429 4.000000 0.800000 7.000000 1.400000 0.200000
1 5 8 3 3 0.625000 1.666667 1.666667 2.666667 2.666667 1.000000
2 4 9 5 6 0.444444 0.800000 0.666667 1.800000 1.500000 0.833333
3 5 4 7 9 1.250000 0.714286 0.555556 0.571429 0.444444 0.777778
4 5 2 1 2 2.500000 5.000000 2.500000 2.000000 1.000000 0.500000
5 4 3 0 4 1.333333 inf 1.000000 inf 0.750000 0.000000
You can try:
df = pd.DataFrame({'x1': [1, 2, 3, 4, 5], 'x2': [10, 10, 10, 10, 10], 'x3': [100, 100, 100, 100, 100], 'x4': [10, 10, 10, 10, 10]})
columns = df.columns
def pattern(c = columns):
yield from ((v1, v2) for i, v1 in enumerate(c) for v2 in c[i + 1:])
for name1, name2 in pattern():
df[f'{name1}/{name2}'] = df[name1].div(df[name2])
Also, you can concatenate all the desired columns:
pd.concat([df[n1].div(df[n2]).rename(f'{n1}/{n2}') for n1, n2 in pattern()], axis=1)
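For what it's worth, the `pattern()` generator above yields exactly the same pairs as `itertools.combinations(columns, 2)` from the earlier approach; a quick check:

```python
from itertools import combinations

columns = ['x1', 'x2', 'x3', 'x4']

def pattern(c=columns):
    # all pairs (v1, v2) with v1 appearing earlier in the list than v2
    yield from ((v1, v2) for i, v1 in enumerate(c) for v2 in c[i + 1:])

pairs = list(pattern())
```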
We can take advantage of the way Boolean values are handled mathematically (True being 1 and False being 0) and use three aggregation functions - sum, count, and mean - per group (groupby aggregate). We can also take advantage of Named Aggregation to both create and rename the columns in one step:
df = (
df.groupby('playerId', as_index=False)
.agg(wins=('winner', 'sum'),
totalCount=('winner', 'count'),
winPct=('winner', 'mean'))
)
# Scale up winPct
df['winPct'] *= 100
df:
playerId wins totalCount winPct
0 1848 1 2 50.0
1 1988 0 2 0.0
2 3543 1 1 100.0
DataFrame and imports:
import pandas as pd
df = pd.DataFrame({
'playerId': [1848, 1988, 3543, 1848, 1988],
'winner': [True, False, True, False, False]
})
In your case, just taking the mean yields the percentage:
out = df.groupby('playerId')['winner'].agg(['sum','count','mean'])
Out[22]:
sum count mean
playerId
1848 1 2 0.5
1988 0 2 0.0
3543 1 1 1.0
How about:
user_count=df3.groupby('user_state')['user_count'].mean()
#(or however you think a value for each state should be calculated)
engaged_unique=df3.groupby('user_state')['engaged_count'].nunique()
engaged_pct=engaged_unique/user_count
(you could also do this in one line in a bunch of different ways)
Your original solution was almost fine except that you were dividing a value by the entire user count series. So you were getting a Series instead of a value. You could try this slight variation:
def f(x):
engaged_percent = x['engaged_count'].nunique()/x['user_count'].mean()
return engaged_percent
by = df3.groupby(['user_state']).apply(f)
by
I would just use groupby and apply directly:
df3['engaged_percent'] = (df3.groupby('user_state')
                             .apply(lambda s: s.engaged_count.nunique() / s.user_count).values)
Demo
>>> df3
engaged_count user_count user_state
0 3 21 California
1 3 21 California
2 3 21 California
...
19 4 7 Florida
20 4 7 Florida
21 4 7 Florida
>>> df3['engaged_percent'] = df3.groupby('user_state').apply(lambda s: s.engaged_count.nunique()/s.user_count).values
>>> df3
engaged_count user_count user_state engaged_percent
0 3 21 California 0.095238
1 3 21 California 0.095238
2 3 21 California 0.095238
...
19 4 7 Florida 0.285714
20 4 7 Florida 0.285714
21 4 7 Florida 0.285714
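A cautionary sketch: assigning `.values` from a grouped `apply` relies on the frame being sorted by `user_state` so that positions line up. `groupby().transform` aligns on the index instead, which avoids that assumption. Assuming the same `df3` columns (the data below is made up to match the demo's rates):

```python
import pandas as pd

# Small hypothetical frame in the shape of df3 above
df3 = pd.DataFrame({
    'user_state': ['California', 'California', 'California', 'Florida', 'Florida'],
    'engaged_count': [3, 5, 3, 4, 6],
    'user_count': [21, 21, 21, 7, 7],
})

# per-state count of distinct engaged_count values, aligned to each row
nunique = df3.groupby('user_state')['engaged_count'].transform('nunique')
df3['engaged_percent'] = nunique / df3['user_count']
```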
After performing your groupby, use pct_change:
# Sort the DataFrame, if necessary.
df = df.sort_values(['name', 'order'])
# Use groupby and pct_change on the 'quantity' column.
df['quantity'] = df.groupby('name')['quantity'].pct_change()
The resulting output:
name order quantity
0 A 1 NaN
1 A 2 0.500000
2 A 3 -0.666667
3 B 1 NaN
4 B 2 2.000000
You could take your result and divide it by the shifted 'quantity' column in df:
diff_df.quantity = diff_df.quantity / df.quantity.shift(1)
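Note the relationship between the two approaches: within each group, `pct_change` is the diff divided by the shifted value, i.e. `quantity / shift(quantity) - 1`. A quick check on made-up data matching the output shown above:

```python
import pandas as pd

df = pd.DataFrame({'name': ['A', 'A', 'A', 'B', 'B'],
                   'order': [1, 2, 3, 1, 2],
                   'quantity': [10, 15, 5, 1, 3]})

# built-in per-group percentage change
pct = df.groupby('name')['quantity'].pct_change()

# manual equivalent via a per-group shift
shifted = df.groupby('name')['quantity'].shift(1)
manual = df['quantity'] / shifted - 1
```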