The space usage is at most $O(n^2)$ for all Strassen-like algorithms (i.e. those based on upper bounding the rank of matrix multiplication algebraically). See Space complexity of Coppersmith–Winograd algorithm
However, I realized in my previous answer that I did not explain why the space usage is $O(n^2)$... so here goes something hand-wavy. Consider what a Strassen-like algorithm does. It starts from a fixed algorithm for $n_0 \times n_0$ matrix multiplication that uses $r = n_0^c$ multiplications for some constant $c < 3$. In particular, this algorithm (whatever it is) can WLOG be written so that:

1. It computes $r$ different matrices $A_1, \dots, A_r$ which multiply entries of the first matrix $A$ by various scalars, and $r$ matrices $B_1, \dots, B_r$ from the second matrix $B$ of a similar form,
2. It multiplies those linear combinations, $C_i = A_i \cdot B_i$, then
3. It multiplies the entries of each $C_i$ by various scalars, then adds all these matrices up entrywise to obtain $C = A \cdot B$.

(This is a so-called "bilinear" algorithm, but it turns out that every "algebraic" matrix multiplication algorithm can be written in this way.) For each $i$, this algorithm only has to store the current product $C_i$ and the current value of $C$ (initially set to all-zeroes) in memory at any given point, so the space usage is $O(n_0^2)$.
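To make the bilinear form concrete, here is a minimal Python sketch (the function name `bilinear_mm` and the coefficient arrays `U`, `V`, `W` are my own illustrative encoding of the scalars from steps 1 and 3, not taken from any specific paper) that evaluates such an algorithm while keeping only the current product and the accumulator in memory:

```python
import numpy as np

def bilinear_mm(U, V, W, A, B):
    """Evaluate a bilinear matrix-multiplication algorithm.

    U, V have shape (r, n0*n0) and W has shape (n0*n0, r), so that
    vec(C) = sum_i W[:, i] * (U[i] . vec(A)) * (V[i] . vec(B)).
    Besides the inputs, only one product p and the accumulator c are
    alive at any point, matching the O(n0^2) space bound above.
    """
    a, b = A.ravel(), B.ravel()
    c = np.zeros(W.shape[0])
    for i in range(U.shape[0]):        # the r multiplications
        p = (U[i] @ a) * (V[i] @ b)    # product of two linear combinations
        c += W[:, i] * p               # scatter the i-th product into C
    return c.reshape(A.shape[0], B.shape[1])
```

With $r = 7$ and Strassen's coefficients for $n_0 = 2$ this reproduces his algorithm; the recursive sketch below writes those seven products out explicitly.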
Given this finite algorithm, it is then extended to arbitrary $n \times n$ matrices, by breaking the large matrices into $n_0 \times n_0$ blocks of dimensions $(n/n_0) \times (n/n_0)$, applying the finite $n_0 \times n_0$ algorithm to the block matrices, and recursively calling the algorithm whenever it needs to multiply two blocks. At each level of recursion, we need to keep only $O(n^2)$ field elements in memory (storing $O(1)$ different $n \times n$ matrices). Assuming the space usage for $(n/n_0) \times (n/n_0)$ matrix multiplication is $S(n/n_0)$, the space usage of this recursive algorithm satisfies $S(n) \le S(n/n_0) + O(n^2)$, which for $n_0 \ge 2$ solves to $S(n) = O(n^2)$.
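To make the recursion concrete, here is a runnable Python sketch with Strassen's base case ($n_0 = 2$, seven multiplications), assuming for simplicity that the dimension is a power of two; the function name is mine:

```python
import numpy as np

def strassen(A, B):
    """Multiply two n x n matrices, n a power of two, with 7 recursive
    block products. Each level allocates O(1) matrices of the current
    size, so the extra space follows S(n) <= S(n/2) + O(n^2) = O(n^2)."""
    n = A.shape[0]
    if n == 1:
        return A * B
    h = n // 2
    A11, A12, A21, A22 = A[:h, :h], A[:h, h:], A[h:, :h], A[h:, h:]
    B11, B12, B21, B22 = B[:h, :h], B[:h, h:], B[h:, :h], B[h:, h:]
    M1 = strassen(A11 + A22, B11 + B22)   # the 7 products of
    M2 = strassen(A21 + A22, B11)         # linear combinations of blocks
    M3 = strassen(A11, B12 - B22)
    M4 = strassen(A22, B21 - B11)
    M5 = strassen(A11 + A12, B22)
    M6 = strassen(A21 - A11, B11 + B12)
    M7 = strassen(A12 - A22, B21 + B22)
    C = np.empty((n, n))                  # reassemble C entrywise
    C[:h, :h] = M1 + M4 - M5 + M7
    C[:h, h:] = M3 + M5
    C[h:, :h] = M2 + M4
    C[h:, h:] = M1 - M2 + M3 + M6
    return C
```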
More generally, fast matrix multiplication can be done on $p$ processors in $O(n^2/p)$ memory per processor. However, the communication between processors is then suboptimal. Optimal communication can be achieved by using more memory. As far as I know, it is not known whether optimal communication and optimal memory can be achieved simultaneously. Details are in http://dx.doi.org/10.1007/PL00008264
Using linear algebra, there exist algorithms that achieve better complexity than the naive O(n^3). Strassen's algorithm achieves a complexity of O(n^2.807) by reducing the number of multiplications required for each 2x2 sub-matrix from 8 to 7.
The fastest known matrix multiplication algorithm is the Coppersmith-Winograd algorithm, with a complexity of O(n^2.3737). Unless the matrices are huge, these algorithms do not result in a vast difference in computation time. In practice, it is easier and faster to use parallel algorithms for matrix multiplication.
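Those exponents fall out of the recursion tree: seven block multiplications instead of eight turn the cost recurrence T(n) = 8T(n/2) + O(n^2) into T(n) = 7T(n/2) + O(n^2), i.e. n^(log2 8) = n^3 becomes n^(log2 7). A quick sanity check:

```python
import math

print(math.log2(8))  # 3.0      -> naive O(n^3)
print(math.log2(7))  # 2.807... -> Strassen's O(n^2.807)
```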
The naive algorithm, which is what you've got once you correct it as noted in comments, is O(n^3).
There do exist algorithms that reduce this somewhat, but you're not likely to find an O(n^2) implementation. I believe the question of the most efficient implementation is still open.
See this Wikipedia article on Matrix Multiplication for more information.
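For reference, a minimal sketch of the naive algorithm (three nested loops, hence n^3 scalar multiplications for square inputs):

```python
def naive_matmul(A, B):
    """Textbook multiplication of an n x m matrix by an m x p matrix."""
    n, m, p = len(A), len(B), len(B[0])
    C = [[0] * p for _ in range(n)]
    for i in range(n):
        for j in range(p):
            for k in range(m):  # n * p * m multiply-adds in total
                C[i][j] += A[i][k] * B[k][j]
    return C
```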
I can't imagine an argument that the space required for an algorithm is less than what is required to store the result; that should be the lower bound of the space required.
But apparently my imagination is not up to the task at hand: by convention, neither the space for the input parameters nor the space for the output/result is counted against the algorithm.
So (as the comments below have convinced me): no.
As other responses say, you must differentiate between the space taken by the matrix itself and the space taken by the multiplication algorithm.
As for a classic NxM matrix data structure, the space taken is O(NM).
As for the algorithm per se, it depends: the basic sequential multiplication algorithm takes O(1) extra space, since it multiplies and sums one element at a time.
In a parallel algorithm multiplying NxM and MxP matrices, each processor takes O(1) space, since each process calculates one multiplication value at a time, but the total is O(X) in space, where X is the number of parallel processes working on the solution.
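A rough Python sketch of that parallel scheme, with one task per output entry (the names `parallel_matmul` and `_entry`, and the process-pool split, are mine, not from the answer):

```python
from concurrent.futures import ProcessPoolExecutor

def _entry(args):
    # One output entry: the running sum is the only working state,
    # O(1) space beyond the row and column the task was handed.
    a_row, b_col = args
    return sum(x * y for x, y in zip(a_row, b_col))

def parallel_matmul(A, B, workers=4):
    # Run under `if __name__ == "__main__":` on platforms that spawn
    # worker processes (e.g. Windows, macOS).
    n, p = len(A), len(B[0])
    cols = list(zip(*B))                      # columns of B
    tasks = [(A[i], cols[j]) for i in range(n) for j in range(p)]
    with ProcessPoolExecutor(max_workers=workers) as pool:
        flat = list(pool.map(_entry, tasks))  # X = n*p independent tasks
    return [flat[i * p:(i + 1) * p] for i in range(n)]
```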