I am going to present four different methods for computing such a kernel, followed by a comparison of their run-times.
Using pure numpy
Here, I use the fact that ||x-y||^2 = ||x||^2 + ||y||^2 - 2 * x^T * y, which follows from expanding (x-y)^T * (x-y).
import numpy as np

# squared L2 norm of each row of X
X_norm = np.sum(X ** 2, axis = -1)
# broadcast the norms into pairwise squared distances, then apply the kernel
K = var * np.exp(-gamma * (X_norm[:,None] + X_norm[None,:] - 2 * np.dot(X, X.T)))
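As a quick sanity check (not part of the benchmark), the broadcasting trick can be compared against a naive double loop on a small random input; rbf_naive and X_small below are hypothetical helper names for this check only:

import numpy as np

def rbf_naive(X, gamma, var):
    # reference implementation: explicit loop over all sample pairs
    n = X.shape[0]
    K = np.empty((n, n))
    for i in range(n):
        for j in range(n):
            K[i, j] = var * np.exp(-gamma * np.sum((X[i] - X[j]) ** 2))
    return K

X_small = np.random.randn(50, 8)
gamma, var = 0.01, 5.0
X_norm = np.sum(X_small ** 2, axis = -1)
K = var * np.exp(-gamma * (X_norm[:,None] + X_norm[None,:] - 2 * np.dot(X_small, X_small.T)))
assert np.allclose(K, rbf_naive(X_small, gamma, var))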
Using numexpr
numexpr is a Python package that allows for efficient and parallelized array operations on numpy arrays. We can use it as follows to perform the same computation as above:
import numpy as np
import numexpr as ne
X_norm = np.sum(X ** 2, axis = -1)
K = ne.evaluate('v * exp(-g * (A + B - 2 * C))', {
'A' : X_norm[:,None],
'B' : X_norm[None,:],
'C' : np.dot(X, X.T),
'g' : gamma,
'v' : var
})
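Since numexpr parallelizes across threads, its speed-up depends on how many worker threads it uses. A small sketch of how one could control that (set_num_threads is part of the numexpr API; the count 4 here is just an example):

import numexpr as ne

# limit numexpr to 4 worker threads; returns the previous setting.
# Alternatively, set the NUMEXPR_NUM_THREADS environment variable.
prev = ne.set_num_threads(4)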
Using scipy.spatial.distance.pdist
We could also use scipy.spatial.distance.pdist to compute a condensed (non-redundant) array of pairwise squared Euclidean distances, apply the kernel to that array, and then expand it to a square matrix:
import numpy as np
from scipy.spatial.distance import pdist, squareform
K = squareform(var * np.exp(-gamma * pdist(X, 'sqeuclidean')))
# squareform leaves zeros on the diagonal, but the kernel value there is var * exp(0) = var
np.fill_diagonal(K, var)
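If the detour through the condensed distance array is undesirable, scipy.spatial.distance.cdist can produce the full square matrix directly, which avoids both squareform and the diagonal fix; a sketch:

import numpy as np
from scipy.spatial.distance import cdist

# cdist returns the full n x n distance matrix, including the zero diagonal
K = var * np.exp(-gamma * cdist(X, X, 'sqeuclidean'))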
Using sklearn.metrics.pairwise.rbf_kernel
sklearn provides a built-in function that computes an RBF kernel directly:
from sklearn.metrics.pairwise import rbf_kernel

# rbf_kernel has no variance parameter, so scale its result by var
K = var * rbf_kernel(X, gamma = gamma)
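Note that rbf_kernel computes exp(-gamma * ||x-y||^2) without a variance factor, which is why the result is scaled by var. A quick equivalence check against the pure numpy variant (on a small, purely illustrative input):

import numpy as np
from sklearn.metrics.pairwise import rbf_kernel

X_small = np.random.randn(100, 16)
gamma, var = 0.01, 5.0
X_norm = np.sum(X_small ** 2, axis = -1)
K_np = var * np.exp(-gamma * (X_norm[:,None] + X_norm[None,:] - 2 * np.dot(X_small, X_small.T)))
assert np.allclose(K_np, var * rbf_kernel(X_small, gamma = gamma))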
Run-time comparison
I use 25,000 random samples of 512 dimensions for testing and perform experiments on an Intel Core i7-7700HQ (4 cores @ 2.8 GHz). More precisely:
X = np.random.randn(25000, 512)
gamma = 0.01
var = 5.0
Each method is run 7 times, and the mean and standard deviation of the time per execution are reported.
| Method                              | Time (mean ± std over 7 runs) |
|-------------------------------------|-------------------------------|
| numpy | 24.2 s ± 1.06 s |
| numexpr | 8.89 s ± 314 ms |
| scipy.spatial.distance.pdist | 2min 59s ± 312 ms |
| sklearn.metrics.pairwise.rbf_kernel | 13.9 s ± 757 ms |
First of all, scipy.spatial.distance.pdist is surprisingly slow, presumably because it computes the distances pair by pair in a single-threaded loop instead of delegating the bulk of the work to a BLAS matrix product.
numexpr is almost 3 times faster than the pure numpy method, but this speed-up factor will vary with the number of available CPUs.
sklearn.metrics.pairwise.rbf_kernel is not the fastest option: it is roughly 1.5 times slower than numexpr, though still noticeably faster than pure numpy.
Well, you are doing a lot of optimizations in your answer post. I would like to add a few more (mostly tweaks). I will build upon the winner from the answer post, which seems to be the numexpr-based method.
Tweak #1
First off, np.sum(X ** 2, axis = -1) could be optimized with np.einsum. This part isn't the biggest overhead, but optimization of any sort won't hurt. So, that summation could be expressed as -
X_norm = np.einsum('ij,ij->i',X,X)
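A quick check that the einsum form agrees with the original row-wise sum (random data, illustrative only):

import numpy as np

X = np.random.randn(1000, 512)
assert np.allclose(np.einsum('ij,ij->i', X, X), np.sum(X ** 2, axis = -1))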
Tweak #2
Secondly, we could leverage the BLAS functions exposed by SciPy and, if permissible, use a single-precision dtype for a noticeable performance improvement over double precision. Hence, np.dot(X, X.T) could be computed with SciPy's sgemm like so -
sgemm(alpha=1.0, a=X, b=X, trans_b=True)
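One caveat worth noting: sgemm works in single precision, so it casts float64 input down to float32 and returns a float32 result, which is where the speed-up (and a small accuracy loss) comes from. A minimal check, with a loosened tolerance to account for the lower precision:

import numpy as np
from scipy.linalg.blas import sgemm

X = np.random.randn(1000, 512)
C = sgemm(alpha=1.0, a=X, b=X, trans_b=True)
print(C.dtype)  # float32
assert np.allclose(C, np.dot(X, X.T), rtol=1e-4, atol=1e-3)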
A few more tweaks: rearranging the negative sign with gamma lets us feed more of the work into sgemm, and we can also fold gamma into the alpha term.
Tweaked implementations
Thus, with these two optimizations, we would have two more variants (if I could put it that way) of the numexpr method, listed below -
import numpy as np
import numexpr as ne
from scipy.linalg.blas import sgemm

def app1(X, gamma, var):
    # negate the norms so the whole exponent can be written with '+' signs
    X_norm = -np.einsum('ij,ij->i',X,X)
    return ne.evaluate('v * exp(g * (A + B + 2 * C))', {
        'A' : X_norm[:,None],
        'B' : X_norm[None,:],
        'C' : np.dot(X, X.T),
        'g' : gamma,
        'v' : var
    })

def app2(X, gamma, var):
    # fold gamma into the norms and into sgemm's alpha term
    X_norm = -gamma*np.einsum('ij,ij->i',X,X)
    return ne.evaluate('v * exp(A + B + C)', {
        'A' : X_norm[:,None],
        'B' : X_norm[None,:],
        'C' : sgemm(alpha=2.0*gamma, a=X, b=X, trans_b=True),
        'v' : var
    })
Runtime test
Numexpr based one from your answer post -
def app0(X, gamma, var):
    X_norm = np.sum(X ** 2, axis = -1)
    return ne.evaluate('v * exp(-g * (A + B - 2 * C))', {
        'A' : X_norm[:,None],
        'B' : X_norm[None,:],
        'C' : np.dot(X, X.T),
        'g' : gamma,
        'v' : var
    })
Timings and verification -
In [165]: # Setup
...: X = np.random.randn(10000, 512)
...: gamma = 0.01
...: var = 5.0
In [166]: %timeit app0(X, gamma, var)
...: %timeit app1(X, gamma, var)
...: %timeit app2(X, gamma, var)
1 loop, best of 3: 1.25 s per loop
1 loop, best of 3: 1.24 s per loop
1 loop, best of 3: 973 ms per loop
In [167]: np.allclose(app0(X, gamma, var), app1(X, gamma, var))
Out[167]: True
In [168]: np.allclose(app0(X, gamma, var), app2(X, gamma, var))
Out[168]: True
