java stream parallel foreach

Why is list.parallelStream().forEach() not processing all the elements in the list in Java?

stackoverflow.com › questions › 60095513 › why-is-list-parallelstream-foreach-not-processing-all-the-elements-in-the-li

Because ArrayList is not a thread-safe collection. Using a thread-safe collection like CopyOnWriteArrayList would make it correct but not necessarily efficient.

Using a Collector instead would be much simpler and correct. e.g.

source.parallelStream().collect(Collectors.toList())

Answer from Sleiman Jneidi on Stack Overflow

Oracle

docs.oracle.com › javase › tutorial › collections › streams › parallelism.html

Parallelism (The Java™ Tutorials > Collections > Aggregate Operations)

Consequently, when you execute a stream in parallel, the Java compiler and runtime determine the order in which to process the stream's elements to maximize the benefits of parallel computing unless otherwise specified by the stream operation. The fifth pipeline uses the method forEachOrdered, which processes the elements of the stream in the order specified by its source, regardless of whether you executed the stream in serial or parallel.

Stack Overflow

stackoverflow.com › questions › 60095513 › why-is-list-parallelstream-foreach-not-processing-all-the-elements-in-the-li

Why is list.parallelStream().forEach() not processing all the elements in the list in Java? - Stack Overflow

Top answer

1 of 4

Because ArrayList is not a thread-safe collection. Using a thread-safe collection like CopyOnWriteArrayList would make it correct but not necessarily efficient.

Using a Collector instead would be much simpler and correct. e.g.

source.parallelStream().collect(Collectors.toList())

2 of 4

The forEach operation of the parallel stream is adding elements to an un-synchronized Collection (an ArrayList) from multiple threads. Therefore, the operation is not thread safe, and has unexpected results.

Using forEachOrdered() instead of forEach() will ensure all the elements of the source List are added to the destination List.

However, as mentioned in the other answer, using collect(Collectors.toList()) is the correct way to produce an output List from a Stream.

Discussions

java - Does Stream.forEach() always work in parallel? - Stack Overflow

To my understanding, multiple threads would be working in the forEach() case only if the stream is parallel. More on stackoverflow.com

stackoverflow.com

A surprising pain point regarding Parallel Java Streams (featuring mailing list discussion with Viktor Klang).

I did want to follow up about one point Viktor made later on in the conversation. https://mail.openjdk.org/pipermail/core-libs-dev/2024-November/134542.html And here is the quote. In a potential future where all intermediate operations are Gatherer-based, and all terminal operations are Collector-based, it would just work as expected. But with that said, I'm not sure it is practically achievable because some operations might not have the same performance-characteristics as before. Me personally, I would GLADLY accept a flag on stream (similar to parallel() or unordered()) that would allow me to guarantee that my stream never pre-fetches, even if I take a massive performance hit. If that can be accomplished by making all intermediate operations be implemented by a Gatherer under the hood, that is A-OK with me. The reality is, not all streams are compute bound. Some are IO bound, but are otherwise, a great fit for streams. Having a method that allows us to optimize for that fact is a new type of performance enhancement that I would greatly appreciate, even if it degrades performance in other ways. More on reddit.com

r/java

223

November 20, 2024

java - Should I always use a parallel stream when possible? - Stack Overflow

With Java 8 and lambdas, it's easy to iterate over collections as streams, and just as easy to use a parallel stream. Two examples from the documentation, the second one using parallelStream: More on stackoverflow.com

stackoverflow.com

java 8 parallelStream().forEach Result data loss - Stack Overflow

When using multiple threads you ... 2 possible solutions). Even better would be to use the collect method on the stream to put everything into a list. ... ParallelStream with forEach is a deadly combo if not used carefully.... More on stackoverflow.com

stackoverflow.com

Videos