Brave Search

Is pytorch faster than numpy on a single CPU?

reddit.com › r › Python › comments › 1e1cfpt › is_pytorch_faster_than_numpy_on_a_single_cpu

If you really enforce a single thread, then it's all up to the BLAS library which does the heavy lifting. Both numpy and pytorch are compatible with the most common libs, so it's a matter of which lib they are linked against respectively. Numpy with MKL will most likely beat pytorch with OpenBLAS for most workloads. Answer from Frankelstner on reddit.com

reddit.com › r/python › is pytorch faster than numpy on a single cpu?

r/Python on Reddit: Is pytorch faster than numpy on a single CPU?

July 12, 2024 -

A while ago I had benchmarked pytorch against numpy for fairly basic matrix operations (broadcast, multiplication, inversion). I didn't run the benchmark for a variety of sizes though. It seemed that pytorch was markedly faster than numpy, possibly it was using more than one core (the hardware had a dozen of cores). Is that a general rule even if constraining pytorch to a single core?

Top answer

1 of 4

2 of 4

PyTorch and numpy use similar backends for the CPU. So if it’s way faster I would guess you’re using multiple threads. You can see the number of threads it’s using with torch.get_num_threads() and you can similarly force torch to use a certain number of threads with torch.set_num_threads()

reddit.com › r/learnmachinelearning › tensors vs numpy arrays

r/learnmachinelearning on Reddit: Tensors vs Numpy Arrays

April 21, 2023 -

As I'm trying to learn pytorch, I notice this heavy focus on this tensors as a datatype, but I'm not really clear what it's advantages are over numpy arrays. After all, numpy arrays can be 0,1, 2, and even 3 dimensional, so I'm just unclear on the advantage of tensors, and I wanted to ask you guys, "When and why do we use tensors instead of numpy arrays?"

Top answer

1 of 5

reach disarm sophisticated strong six coherent cooperative important reminiscent telephone This post was mass deleted and anonymized with Redact

2 of 5

I don't know pytorch's backend but Tensors are their implementation of then data type and numpy arrays is numpy's implementation of the data type. I imagine in pytorch's they convert numpy arrays to tensors anyway as I would think pytorch's internally typing would require tensors and the tensor implementation may have optimizations. You should use the built-in data type when possible.to.avoid any unnecessary overhead.

Videos

09:09

YouTube

L.1.6.98: Pytorch Tensors and Numpy - YouTube

May 9, 2024

02:30

YouTube

Numpy Array vs PyTorch Tensor - YouTube

January 22, 2024

12:55

YouTube

JAX vs PyTorch vs Numpy - Basic Matrix Operations on M2 CPU - YouTube

April 26, 2024

View all

reddit.com › r/pytorch › learn numpy before pytorch?

r/pytorch on Reddit: Learn numpy before pytorch?

February 14, 2021 -

Hello, I’m about to start learning pytorch soon but I’ve read it’s a lot like numpy. Do you need a deep understanding of numpy before using it? I already know how to do general purpose python and have used it for data science so I’m comfortable with the language, but just haven’t used numpy a lot. Is knowing how to build a neural network in numpy a prerequisite for learning pytorch? Or in general a deep understanding of numpy?

Top answer

1 of 2

You should pretty comfortable with numpys basic functions as it is very helpful when working with pytorch. As for knowing how make a nueral net from scratch with numpy it is not entirely necessary however I would highly recommend that you. It will help you really get a better understanding how nueral nets work and help with your numpy. Best of luck on your ml journey!! Hope this helps.

2 of 2

The primary differences between numpy and pytorch are gpu computing and autograd. You may not always need gpu computing or autograd, in those cases numpy provides a simpler lightweight solution. You can learn numpy alongside pytorch as well!

reddit.com › r/python › i understand machine learning with numpy and pytorch better since i started focusing on the basics

r/Python on Reddit: I Understand Machine Learning with Numpy and PyTorch Better Since I Started Focusing on the Basics

October 12, 2024 -

I've recently started appreciating ML in Python more since I began looking at the concepts from the ground up.

For example, I took a closer look at the basics of classification neural networks, and now I have a better understanding of how more complex networks work. The foundation here is logistic regression, and understanding that has really helped me grasp the overall concepts better. It also helped me implementing the code in Numpy and in PyTorch.

If you're also interested in Machine Learning with Python and sometimes feel overwhelmed by all the complicated topics, I really recommend going back to the basics. I've made a video where I explain logistic regression step by step using a simple example.

The video will be attached here: https://youtu.be/EB4pqThgats?si=Z-lXOjuNKEP5Yehn

I'd be happy if you could take a look and give me some feedback! I'm curious to hear what you think of my approach and if you have any tips on how to make it even clearer.

Top answer

1 of 4

I took a ML elective last year and one of the problem sets was building a neural network from scratch in numpy. The TAs provided a framework but we had to fill in all the code, including the backprop. Was quite challenging but quite informative.

2 of 4

for me memes and rush editing kill all the educational value

reddit.com › r/pytorch › ask for help: what is the best way to have code both support torch and numpy?

r/pytorch on Reddit: Ask for help: what is the best way to have code both support torch and numpy?

February 22, 2023 -

I want to implement the code with the same functionality ( by numpy and torch). I don't know how to support both numpy and torch with only once implemention.

For example, I want to implement these:

def fun_torch(a):
    return torch.sin(a) + torch.cos(a)

def fun_np(a):
    return np.sin(a) + np.cos(b)

I want to implement them in only one function, but I dont want this:

def func(a):
    if isintance(a, torch.Tensor):
        return torch.sin(a) + torch.cos(a)
   elif isinatance(a, np.ndarray):
        return np.sin(a) + np.cos(b)

Top answer

1 of 2

Sorry for this observation, why don't you just convert the input and all the tensors to pytorch?

2 of 2

Check Ivy.

reddit.com › r/machinelearning › [p] using pytorch + numpy? a bug that plagues thousands of open-source ml projects.

r/MachineLearning on Reddit: [P] Using PyTorch + NumPy? A bug that plagues thousands of open-source ML projects.

April 10, 2021 -

Using NumPy’s random number generator with multi-process data loading in PyTorch causes identical augmentations unless you specifically set seeds using the worker_init_fn option in the DataLoader. I didn’t and this bug silently regressed my model’s accuracy.

How many others has this bug done damage to? Curious, I downloaded over a hundred thousand repositories from GitHub that import PyTorch, and analysed their source code. I kept projects that define a custom dataset, use NumPy’s random number generator with multi-process data loading, and are more-or-less straightforward to analyse using abstract syntax trees. Out of these, over 95% of the repositories are plagued by this problem. It’s inside PyTorch's official tutorial, OpenAI’s code, and NVIDIA’s projects. Even Karpathy admitted falling prey to it.

For example, the following image shows the duplicated random crop augmentations you get when you blindly follow the official PyTorch tutorial on custom datasets:

You can read more details here.

Top answer

1 of 5

100

I faced this problem last week while doing some experiments in RL. Fortunately, I found the solution and corrected my experiments. But it took me the better part of the day. As others have pointed out, it is not bug, but a feature which is probably not well-known and can cause hard to debug problems when overlooked.

2 of 5

Perhaps people are misunderstanding the issue - the problem isn't that setting a specific random seed results in the same sequence of random numbers being generated during each training run (which would obviously be "working as intended"). I think the problem is that each of the multiple dataloading processes (set by num_workers in pytorch right?) will output the same sequence of random numbers during one particular training run. This could certainly mess up projects depending on how you are doing your dataloading and augmentations (speaking from experience). Even if it is "working as intended", it is good that you've pointed this out to a wider audience! Also the easy fix would be to use torch random numbers instead I think (as mentioned in OP's link) Edit: Relevant github issue here: https://github.com/pytorch/pytorch/issues/5059

reddit.com › r/machinelearning › [d] pytorch implementation best practices

r/MachineLearning on Reddit: [D] PyTorch implementation best practices

April 12, 2019 -

Hi r/MachineLearning! Let's discuss PyTorch best practices.

I recently finished a PyTorch re-implementation (with help from various sources) for the paper Zero-shot User Intent Detection via Capsule Neural Networks, which originally had Python 2 code for TensorFlow.

I'd like to request perhaps a critique on the code I've written so far (it's not perfect, yet!) and any suggestions if there are best practices specifically in PyTorch, for implementing directly from research papers as well as converting them from other frameworks.

Some thoughts I had while programming (feel free to raise more!):

I've been implementing a Dataset class and custom batch functions for every dataset I've been working with. Is this the PyTorch best practice?
Where is the optimal place to shift Tensors to .cuda()? I've been doing this in the training loop, just before feeding it into the model.
How to manage the use of both numpy and torch, seeing as PyTorch aims to reinvent many of the basic operations in numpy?

If you're a fellow PyTorch user/contributor please share a little!

Top answer

1 of 5

I've been implementing a Dataset class and custom batch functions for every dataset I've been working with. Is this the PyTorch best practice? Not sure whether people would consider it best practice, but I do it as well because it's most convenient. Where is the optimal place to shift Tensors to .cuda()? You can do it immediately after loading the data in your training loop. Could be the first line after your for loop over the dataset, except that there is some computation you want to perform on the CPU before you pass it to the model. Btw, since ~1/2 year ago, .cuda() has been deprecated I think. The recommendation is to use .to(torch.device('cuda:0')) for example. How to manage the use of both numpy and torch, seeing as PyTorch aims to reinvent many of the basic operations in numpy? I don't use NumPy anymore when I write PyTorch code because, like you said, you can do most of it in PyTorch. Very rarely, there is something I need to do in NumPy/SciPy (a recent example was the cdf of the beta distribution, which is not implemented in PyTorch, yet). Btw. you have to be careful when using NumPy in your model because its operations are not tracked by autograd.

2 of 5

As a new adopter of pytorch I have been running into deadlock scenarios trying to use hdf5 or opencv during the dataloader. I have to reopen the hdf5 file in every batch versus keeping an open instance on instantiation of the dataset class. Using pillow rather than opencv is pretty annoying as well. Other than that so far I love pytorch, it behaves alot like numpy and I'm enjoying learning it.

PyTorch Forums

discuss.pytorch.org › t › torch-is-slow-compared-to-numpy › 117502

Torch is slow compared to numpy - PyTorch Forums

April 15, 2021 - In this benchmark I implemented the same algorithm in numpy/cupy, pytorch and native cpp/cuda. The benchmark is attached below. In all tests numpy was significantly faster than pytorch.

Find elsewhere

Google Bing Mojeek

Kaggle

kaggle.com › code › amirmotefaker › pytorch-vs-numpy

PyTorch vs NumPy

July 14, 2024 - Explore and run AI code with Kaggle Notebooks | Using data from No attached data sources

Quora

quora.com › What-are-the-benefits-of-PyTorch-over-NumPy

What are the benefits of PyTorch over NumPy? - Quora

Answer: PyTorch is a deep learning focused library while NumPy is for scientific computing. Both of these libraries are made with different goals in mind: * If you want to just do basic matrix operations/transformation and array operations then ...

reddit.com › r/machinelearning › [d] shocking confusing speed / timing results of algorithms (sklearn, numpy, scipy, pytorch, numba) | prelim results | hyperlearn

r/MachineLearning on Reddit: [D] Shocking Confusing Speed / Timing results of Algorithms (Sklearn, Numpy, Scipy, Pytorch, Numba) | Prelim results | HyperLearn

September 1, 2018 -

So you might or might not know, I was working on HyperLearn --> a faster optimized ML package designed to make everything at least 50% (I hope) faster.

Thanks so much for all the support Redditors for HyperLearn! https://github.com/danielhanchen/hyperlearn [Made it to the Trending Github list for Jup Notebooks!! yayy!]

Anyways, I didn't update the code a lot, but that's because I was busily testing and finding out which algos were the most stable and best.

Key findings for N = 5,000 P = 6,000 [more features than N near square matrix]

For pseudoinverse, (used in Linear Reg, Ridge Reg, lots of other algos), JIT, Scipy MKL, PinvH, Pinv2 and HyperLearn's Pinv are very similar. PyTorch's is clearly problematic, having close to over x4 slower than Scipy MKL.
For Eigh (used in PCA, LDA, QDA, other algos), Sklearn's PCA utilises SVD. Clearly, not a good idea, since it is much better to compute the eigenvec / eigenval on XTX. JIT Eigh is the clear winner at 14.5 seconds on XTX, whilst Numpy is 2x slower. Torch likewise is slower once again...
So, for PCA, a speedup of 3 times is seem if using JIT compiled Eigh when compared to Sklearn's PCA
To solve X*theta = y, Torch GELS is super unstable. Like really. If you use Torch GELS, don't forget to call theta_hat[np.isnan(theta_hat) | np.isinf(theta_hat)] = 0, or else results are problematic. All other algos have very similar MSEs, and HyperLearn's Regularized Cholesky Solve takes a mere 0.635 seconds when compared to say using Sklearn's next fastest Ridge Solve (via cholesky) by over 100% (after considering matrix multiplication time) --> HyperLearn 2.89s vs 4.53s Sklearn.

So to conclude:

HyperLearn's Pseudoinverse has no speed improvement
HyperLearn's PCA will have over 2 times speed boost. (200% improvement)
HyperLearn's Linear Solvers will be over 1 times faster. (100% improvement)

Help make HyperLearn better! All contributors are welcome, as this is truly an overwhelming project... https://github.com/danielhanchen/hyperlearn

Lower Time == better

Top answer

1 of 5

This looks really cool! You said that for pca, it is better to do eigendecomposition on XT X (covariance) instead of SVD. Why is that? I thought that SVD was more numerically stable than eig. Moreover, you can do econ SVD if there are more features than samples since you know the number of nonzero singular values is min(p,n). Thanks!

2 of 5

If you're finding issues in PyTorch that haven't already been noted, please go ahead and file an issue . This includes speed if there's orders of magnitude differences against other libraries.

PyTorch Forums

discuss.pytorch.org › t › pytorch-tensor-constructor-speed-vs-numpy › 191487

Pytorch tensor constructor speed vs numpy - PyTorch Forums

November 8, 2023 - So I was comparing the performance of the tensor constructor to the numpy array constructor For pytorch torch.inference_mode() total_time = 0.0 iterations = 10000 for _ in range(iterations): data = np.random.normal(0, 1, (1000, 10)).tolist() # Use numpy for RNG # but convert back to python ...

Stack Overflow

stackoverflow.com › questions › 52526082 › pytorch-cuda-vs-numpy-for-arithmetic-operations-fastest

python 3.x - PyTorch CUDA vs Numpy for arithmetic operations? Fastest? - Stack Overflow

Top answer

1 of 1

GPU operations have to additionally get memory to/from the GPU

The problem is that your GPU operation always has to put the input on the GPU memory, and then retrieve the results from there, which is a quite costly operation.

NumPy, on the other hand, directly processes the data from the CPU/main memory, so there is almost no delay here. Additionally, your matrices are extremely small, so even in the best-case scenario, there should only be a minute difference.

This is also partially the reason why you use mini-batches when training on a GPU in neural networks: Instead of having several extremely small operations, you now have "one big bulk" of numbers that you can process in parallel.
Also note that GPU clock speeds are generally way lower than CPU clocks, so the GPU only really shines because it has way more cores. If your matrix does not utilize all of them fully, you are also likely to see a faster result on your CPU.

TL;DR: If your matrix is big enough, you will eventually see a speed-up in CUDA than Numpy, even with the additional cost of the GPU transfer.

reddit.com › r/programming › i don’t like numpy

r/programming on Reddit: I don’t like NumPy

August 31, 2025 - Many uses of numpy have moved over to pytorch. There's tons of investment in it. > I think the strongest complaint is the lack of composibility, that if you write a custom function you can't treat it as a black-box for the purpose of vectorizing it. pytorch doesn't fix this, but there is a large and impressive backend with torch...

reddit.com › r/learnmachinelearning › tensorflow or pytorch?

r/learnmachinelearning on Reddit: tensorflow or pytorch?

December 16, 2025 -

i read the hands on machine learning book (the tensorflow one) and i am a first year student. i came to know a little later that the pytorch one is a better option. is it possible that on completing this book and getting to know about pytorch the skills are transferrable.

sorry if this might sound stupid or obvious but i dont really know

Pytorch has more discussions and more papers using it to implement, so it will be easier to find examples and tutorials. I think that alone is enough for students. Performance related stuff is not that relevant until you start to work on real projects

Medium

medium.com › @yunjiangster › comparing-speed-of-torch-and-numpy-f22b11aabfcf

Comparing Speed of Torch and Numpy | by Yunjiang Jiang | Medium

June 19, 2023 - I knew from experience working with FAISS library that if implemented well, inner product on gpu can be wicked fast. However I wasn’t so sure about torch. As a counter datapoint, I knew that training a 3-layer MLP (multi-layer perceptron, aka feed-forward neural net) on gpu isn’t much faster than on cpu, back in the tensorflow days.

linkedin.com › pulse › numpy-vs-pytorch-nadir-riyani-z6cdf

NumPy vs PyTorch

We cannot provide a description for this page right now

Medium

medium.com › @ashish.iitr2015 › comparison-between-pytorch-tensor-and-numpy-array-de41e389c213

Comparison between Pytorch Tensor and Numpy Array | by Ashish Singh | Medium

August 11, 2020 - Numpy arrays are mainly used in typical machine learning algorithms (such as k-means or Decision Tree in scikit-learn) whereas pytorch tensors are mainly used in deep learning which requires heavy matrix computation.

PyTorch Forums

discuss.pytorch.org › t › numerical-differences-between-numpy-and-pytorch › 89607

Numerical differences between numpy and pytorch? - PyTorch Forums

July 17, 2020 - Hello all, When computing mean and std on numpy or pytorch tensors, they yield different results. How come ? rng = np.random.RandomState(1) dataset1 = rng.uniform(low=-0.01, high=0.01, size=(1000, 20)) dataset2 = torch.from_numpy(data_numpy) print(dataset1.mean(), dataset1.std()) ...

reddit.com › r/cpp › c++ desperately needs something like numpy

r/cpp on Reddit: C++ desperately needs something like numpy

September 6, 2023 -

Anybody else agree? At this point, I don’t even care if it doesn’t support expression templates for performance. A library like that allows you to be SO MUCH more productive when doing neural network stuff, computer vision, pre-processing and post-processing data. It takes years to standardise something like mdspan and that’s miles off numpy. We are literally going to have to wait 100 years.

Top answer

1 of 5

https://eigen.tuxfamily.org/

2 of 5

what? there are plenty of numerical libraries out there. That's really not the thing I would be critical of C++