`numpy.astype` is slower than for loops for large arrays

stackoverflow.com › questions › 78530840 › numpy-astype-is-slower-than-for-loops-for-large-arrays

You are actually comparing two vectorized implementations, and the latter is more expensive because of memory-hierarchy overheads (as pointed out by @hpaulj in the comments). The overhead of CPython loops is small in both cases.

More specifically, the former code operates on arrays of size 240*320 = 76800, which fit in the L2 cache on all mainstream CPUs, while the latter operates on arrays of size 3_840_000, which do not fit in the L2 cache: they fit in the L3 cache on some CPUs and only in RAM on others. The L3 cache is slower than the L2, and RAM is much slower than the L3 cache. CPU caches closer to the cores are much faster but also much smaller. Because of that, doing the computation chunk by chunk, as you do in the first code, is good practice: it improves temporal memory locality and thus performance.
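A minimal sketch of the chunk-by-chunk idea, under stated assumptions: the question's code is not reproduced here, so the array name `frames`, the 50 x 240 x 320 shape (chosen because 50 * 76800 = 3_840_000) and the comparison value 3 are illustrative only. Both functions compute the same result; the first creates temporaries of one frame at a time, the second creates them for all items at once.

```python
import numpy as np

# Hypothetical data: 50 frames of 240x320 one-byte items (3_840_000 items in total).
rng = np.random.default_rng(0)
frames = rng.integers(0, 4, size=(50, 240, 320), dtype=np.uint8)

def per_frame(frames):
    """Process one 76800-item frame at a time, so temporaries stay small."""
    out = np.empty(frames.shape, dtype=np.uint8)
    for i, frame in enumerate(frames):
        out[i] = (frame == 3).astype(np.uint8)
    return out

def whole_array(frames):
    """Process everything at once; temporaries are as large as the input."""
    return (frames == 3).astype(np.uint8)

same = np.array_equal(per_frame(frames), whole_array(frames))
```

Which variant is faster depends on the cache sizes of the machine, so it is worth timing both on the target hardware.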

One reason temporal memory locality is improved here is that the internal allocator generally tends to recycle arrays (that is, it hands back the address of a recently deleted array of similar size). This often happens with temporary arrays created by NumPy. Another reason is that temporary arrays are written and then often read back immediately, and reading an array back is faster if it is still in a fast CPU cache. For example, FRAMES == 3 creates and fills a boolean array, which is then read back directly by the astype function.

On my machine, with an i5-9600KF CPU, the L3 cache is large enough (9 MiB) to (mostly) hold the arrays, so the difference in performance is not that large. I expect this not to be the case on your machine.

However, note that .astype(np.uint8) is rather expensive here and not even needed in this case: you can use .view(np.uint8) instead. The former creates a new array and converts the items, while the latter just reinterprets the items in memory without any copy. The latter is only safe if you know exactly what you are doing. Here it is safe because the input (np.bool_) and output (np.uint8) types have the same size (1 byte), both are unsigned, and they are of compatible kinds (boolean versus integer). Once this modification is made, the code is much faster.
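The difference between the two calls can be checked on a small boolean array. This is only an illustrative sketch; the array `mask` is made up and stands in for the result of a comparison such as FRAMES == 3.

```python
import numpy as np

# Hypothetical boolean array, standing in for the result of a comparison.
mask = np.array([True, False, True, True])

converted = mask.astype(np.uint8)      # allocates a new array and converts each item
reinterpreted = mask.view(np.uint8)    # same buffer, items reinterpreted as uint8

view_shares_buffer = np.shares_memory(mask, reinterpreted)
astype_shares_buffer = np.shares_memory(mask, converted)
same_values = np.array_equal(converted, reinterpreted)
```

Because the view shares the buffer with `mask`, writing to one is visible through the other, which is part of why the view is only safe when that aliasing is acceptable.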

Here are performance results on my machine (with CPython 3.8.1 and NumPy 1.24.4 on Windows):

Initial implementation:
    - Native Python:  5.990550100000291
    - Vectorized:     9.843130900000233

Optimized implementation (using .view(np.uint8)):
    - Native Python:  0.9843445999999858
    - Vectorized:     2.8540657999997165

In the optimized version of the "Native Python" implementation, the big input array fits completely in the L3 cache, and the small temporary arrays do not affect it much, so data is not evicted. In the second implementation, data is evicted from the L3 cache more often because the temporary array is relatively big. The minimum space required is 3840000*2 = 7680000 bytes, assuming an optimal allocator and cache, but they are not optimal, and my L3 cache does not have enough space for 3 times the array size, that is, 3840000*3 = 11520000 bytes. Hence a significantly higher execution time, not to mention that the "Native Python" version mostly operates in the L1 cache for temporary arrays.
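The arithmetic above can be checked directly. The sketch below assumes 1-byte items (np.bool_ or np.uint8) and the 9 MiB L3 cache mentioned earlier; the variable names are illustrative.

```python
n_items = 3_840_000            # items in the big array, 1 byte each
l3_bytes = 9 * 1024**2         # 9 MiB L3 cache, as stated for the i5-9600KF

two_arrays = 2 * n_items       # 7_680_000 bytes: the stated minimum
three_arrays = 3 * n_items     # 11_520_000 bytes: input plus two temporaries

fits_two = two_arrays <= l3_bytes
fits_three = three_arrays <= l3_bytes
```

Two arrays fit in 9 MiB while three do not, which is consistent with the eviction described above.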

Answer from Jérôme Richard on Stack Overflow

NumPy

numpy.org › devdocs › reference › generated › numpy.ndarray.astype.html

numpy.ndarray.astype — NumPy v2.5.dev0 Manual

    >>> x[:2].astype(np.int_, casting="same_value")
    array([1, 2])

NumPy

numpy.org › doc › 2.2 › reference › generated › numpy.ndarray.astype.html

numpy.ndarray.astype — NumPy v2.2 Manual

By default, astype always returns a newly allocated array.
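A small sketch of what this default means in practice, using the documented `copy` parameter of `ndarray.astype`; the array `x` is just an example.

```python
import numpy as np

x = np.array([1, 2, 3], dtype=np.int64)

# Default copy=True: a new array is allocated even though the dtype already matches.
copied = x.astype(np.int64)

# copy=False: the input array itself is returned when no conversion is needed.
not_copied = x.astype(np.int64, copy=False)

copied_is_new = not np.shares_memory(x, copied)
returned_input = not_copied is x
```

With copy=False a copy is still made whenever the requested dtype differs, so it only avoids work in the no-op case.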

NumPy

numpy.org › doc › 2.3 › reference › generated › numpy.ndarray.astype.html

numpy.ndarray.astype — NumPy v2.3 Manual

When casting from complex to float or int. To avoid this, one should use a.real.astype(t). ... Try it in your browser! >>> import numpy as np >>> x = np.array([1, 2, 2.5]) >>> x array([1.

Programiz

programiz.com › python-programming › numpy › methods › astype

NumPy astype()

The astype() method converts an array to a specified data type. import numpy as np # original array of integers integerArray = np.array([1, 2, 3, 4, 5])

NumPy

numpy.org › devdocs › reference › generated › numpy.astype.html

numpy.astype — NumPy v2.5.dev0 Manual

    >>> import numpy as np
    >>> arr = np.array([1, 2, 3]); arr
    array([1, 2, 3])
    >>> np.astype(arr, np.float64)
    array([1., 2., 3.])

NumPy

numpy.org › doc › 2.2 › reference › generated › numpy.astype.html

numpy.astype — NumPy v2.2 Manual

    >>> import numpy as np
    >>> arr = np.array([1, 2, 3]); arr
    array([1, 2, 3])
    >>> np.astype(arr, np.float64)
    array([1., 2., 3.])

GeeksforGeeks

geeksforgeeks.org › numpy › change-numpy-array-data-type

Change the Data Type of the Given NumPy Array - GeeksforGeeks

The numpy.astype() method is used to change the data type NumPy array from one data type to another.

Published July 11, 2025

W3Schools

w3schools.com › python › numpy › numpy_data_types.asp

NumPy Data Types

The astype() function creates a copy of the array, and allows you to specify the data type as a parameter. The data type can be specified using a string, like 'f' for float, 'i' for integer etc. or you can use the data type directly like float ...

Stack Overflow

stackoverflow.com › questions › 78530840 › numpy-astype-is-slower-than-for-loops-for-large-arrays

performance - `numpy.astype` is slower than for loops for large arrays - Stack Overflow

OUTPUT for above

[25 56 12 85 34 75]
<class 'numpy.int64'>
[25.+0.j 56.+0.j 12.+0.j 85.+0.j 34.+0.j 75.+0.j]
[ 42.+0.j   3.+0.j  86.+0.j  32.+0.j 856.+0.j  46.+0.j]
<class 'numpy.complex128'>

It can be seen that astype changes the type temporarily, as in normal type casting, whereas the generic method changes the type permanently.

Codecademy

codecademy.com › docs › python:numpy › ndarray › .astype()

Python:NumPy | ndarray | .astype() | Codecademy

July 18, 2024 - The .astype() function in NumPy allows changing the data type of the elements in an array.

NumPy

numpy.org › doc › stable › user › basics.types.html

Data types — NumPy v2.4 Manual

To convert the type of an array, use the .astype() method. For example: ... Note that, above, we could have used the Python float object as a dtype instead of numpy.float64. NumPy knows that int refers to numpy.int_, bool means numpy.bool, that float is numpy.float64 and complex is numpy.complex128.
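As a quick illustration of passing Python types as dtypes, the sketch below converts a small example array (the array `a` is made up) and records the resulting dtypes.

```python
import numpy as np

a = np.array([1, 2, 3])

as_float = a.astype(float)       # Python float maps to numpy.float64
as_complex = a.astype(complex)   # Python complex maps to numpy.complex128
as_bool = a.astype(bool)         # Python bool maps to NumPy's boolean type
```

Python int maps to numpy.int_, whose width depends on the platform and NumPy version, so an explicit type such as np.int64 is the safer choice when the width matters.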

NumPy

numpy.org › devdocs › reference › generated › numpy.record.astype.html

numpy.record.astype — NumPy v2.4.dev0 Manual

Please see ndarray.astype.

JAX Documentation

docs.jax.dev › en › latest › _autosummary › jax.numpy.astype.html

jax.numpy.astype — JAX documentation

    >>> x = jnp.array([0, 1, 2, 3])
    >>> x
    Array([0, 1, 2, 3], dtype=int32)
    >>> x.astype('float32')
    Array([0.0, 1.0, 2.0, 3.0], dtype=float32)

Vultr Docs

docs.vultr.com › python › third-party › numpy › array › astype

Python Numpy array astype() - Convert Data Type | Vultr Docs

November 8, 2024 - Create a Numpy array. Use the astype() method to convert the array to another data type.

Medium

mr-amit.medium.com › changing-data-type-dtype-in-numpy-arrays-a-step-by-step-guide-ea7e323e1959

Changing Data Type (dtype) in NumPy Arrays: A Step-by-Step Guide | by It's Amit | Medium

March 6, 2025 - .astype() Creates a New Array When you use .astype(), it doesn’t modify your original array. Instead, it creates a new one. So if you need the updated array, make sure to save it, like this: ... 2. Data Loss During Conversion Some conversions ...
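A short sketch of both points, with a made-up array: the original is left untouched, and a float-to-integer conversion drops the fractional part. It is only one example of a lossy conversion, not a reconstruction of the guide's own.

```python
import numpy as np

original = np.array([1.9, -1.9, 2.5])

# astype returns a new array; the result must be assigned to be kept.
as_int = original.astype(np.int64)   # fractional parts are truncated toward zero

original_unchanged = original.dtype == np.float64
```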

NumPy

numpy.org › doc › 2.1 › reference › generated › numpy.astype.html

numpy.astype — NumPy v2.1 Manual

    >>> import numpy as np
    >>> arr = np.array([1, 2, 3]); arr
    array([1, 2, 3])
    >>> np.astype(arr, np.float64)
    array([1., 2., 3.])