Class IntermediateCacheStream<Original,R>
- All Implemented Interfaces:
AutoCloseable
,BaseStream<R,
,Stream<R>> Stream<R>
,BaseCacheStream<R,
,Stream<R>> CacheStream<R>
-
Nested Class Summary
Nested classes/interfaces inherited from interface org.infinispan.BaseCacheStream
BaseCacheStream.SegmentCompletionListener
Nested classes/interfaces inherited from interface java.util.stream.Stream
Stream.Builder<T extends Object>
-
Constructor Summary
ConstructorDescriptionIntermediateCacheStream
(BaseCacheStream remoteStream, IntermediateType type, org.infinispan.stream.impl.local.LocalCacheStream<R> localStream, org.infinispan.stream.impl.IntermediateCacheStreamSupplier supplier) IntermediateCacheStream
(DistributedCacheStream<Original, R> remoteStream) -
Method Summary
Modifier and TypeMethodDescriptionboolean
boolean
void
close()
<R1> R1
Performs a mutable reduction operation on the elements of this stream using aCollector
that is lazily created from theSupplier
provided.<R1> R1
collect
(Supplier<R1> supplier, BiConsumer<R1, ? super R> accumulator, BiConsumer<R1, R1> combiner) <R1,
A> R1 <R1> R1
collect
(SerializableSupplier<Collector<? super R, ?, R1>> supplier) Performs a mutable reduction operation on the elements of this stream using aCollector
that is lazily created from theSerializableSupplier
provided.long
count()
Disables tracking of rehash events that could occur to the underlying cache.distinct()
distributedBatchSize
(int batchSize) Controls how many keys are returned from a remote node when using a stream terminal operation with a distributed cache to back this stream.filterKeys
(Set<?> keys) Filters which entries are returned by only returning ones that map to the given key.filterKeySegments
(Set<Integer> segments) Filters which entries are returned by what segment they are present in.filterKeySegments
(IntSet segments) Filters which entries are returned by what segment they are present in.findAny()
<R1> CacheStream<R1>
flatMapToDouble
(Function<? super R, ? extends DoubleStream> mapper) flatMapToInt
(Function<? super R, ? extends IntStream> mapper) flatMapToLong
(Function<? super R, ? extends LongStream> mapper) <K,
V> void forEach
(BiConsumer<Cache<K, V>, ? super R> action) Same asCacheStream.forEach(Consumer)
except that it takes aBiConsumer
that provides access to the underlyingCache
that is backing this stream.void
void
forEachOrdered
(Consumer<? super R> action) boolean
iterator()
limit
(long maxSize) <R1> CacheStream<R1>
mapToDouble
(ToDoubleFunction<? super R> mapper) mapToInt
(ToIntFunction<? super R> mapper) mapToLong
(ToLongFunction<? super R> mapper) max
(Comparator<? super R> comparator) min
(Comparator<? super R> comparator) boolean
parallel()
This would enable sending requests to all other remote nodes when a terminal operator is performed.reduce
(BinaryOperator<R> accumulator) reduce
(R identity, BinaryOperator<R> accumulator) <U> U
reduce
(U identity, BiFunction<U, ? super R, U> accumulator, BinaryOperator<U> combiner) Allows registration of a segment completion listener that is notified when a segment has completed processing.This would disable sending requests to all other remote nodes compared to one at a time.skip
(long n) sorted()
sorted
(Comparator<? super R> comparator) Sets a given time to wait for a remote operation to respond by.Object[]
toArray()
<A> A[]
toArray
(IntFunction<A[]> generator) Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface org.infinispan.CacheStream
allMatch, anyMatch, collect, filter, flatMap, flatMapToDouble, flatMapToInt, flatMapToLong, forEach, forEach, map, mapToDouble, mapToInt, mapToLong, max, min, noneMatch, peek, reduce, reduce, reduce, sorted, toArray
-
Constructor Details
-
IntermediateCacheStream
-
IntermediateCacheStream
public IntermediateCacheStream(BaseCacheStream remoteStream, IntermediateType type, org.infinispan.stream.impl.local.LocalCacheStream<R> localStream, org.infinispan.stream.impl.IntermediateCacheStreamSupplier supplier)
-
-
Method Details
-
sequentialDistribution
Description copied from interface:CacheStream
This would disable sending requests to all other remote nodes compared to one at a time. This can reduce memory pressure on the originator node at the cost of performance.Parallel distribution is enabled by default except for
CacheStream.iterator()
andCacheStream.spliterator()
- Specified by:
sequentialDistribution
in interfaceBaseCacheStream<Original,
R> - Specified by:
sequentialDistribution
in interfaceCacheStream<Original>
- Returns:
- a stream with parallel distribution disabled.
-
parallelDistribution
Description copied from interface:BaseCacheStream
This would enable sending requests to all other remote nodes when a terminal operator is performed. This requires additional overhead as it must process results concurrently from various nodes, but should perform faster in the majority of cases.Parallel distribution is enabled by default except for
CacheStream.iterator()
andCacheStream.spliterator()
- Specified by:
parallelDistribution
in interfaceBaseCacheStream<Original,
R> - Specified by:
parallelDistribution
in interfaceCacheStream<Original>
- Returns:
- a stream with parallel distribution enabled.
-
filterKeySegments
Description copied from interface:CacheStream
Filters which entries are returned by what segment they are present in. This method can be substantially more efficient than using a regularCacheStream.filter(Predicate)
method as this can control what nodes are asked for data and what entries are read from the underlying CacheStore if present.- Specified by:
filterKeySegments
in interfaceBaseCacheStream<Original,
R> - Specified by:
filterKeySegments
in interfaceCacheStream<Original>
- Parameters:
segments
- The segments to use for this stream operation. Any segments not in this set will be ignored.- Returns:
- a stream with the segments filtered.
-
filterKeySegments
Description copied from interface:CacheStream
Filters which entries are returned by what segment they are present in. This method can be substantially more efficient than using a regularCacheStream.filter(Predicate)
method as this can control what nodes are asked for data and what entries are read from the underlying CacheStore if present.- Specified by:
filterKeySegments
in interfaceBaseCacheStream<Original,
R> - Specified by:
filterKeySegments
in interfaceCacheStream<Original>
- Parameters:
segments
- The segments to use for this stream operation. Any segments not in this set will be ignored.- Returns:
- a stream with the segments filtered.
-
filterKeys
Description copied from interface:CacheStream
Filters which entries are returned by only returning ones that map to the given key. This method will be faster than a regularCacheStream.filter(Predicate)
if the filter is holding references to the same keys.- Specified by:
filterKeys
in interfaceBaseCacheStream<Original,
R> - Specified by:
filterKeys
in interfaceCacheStream<Original>
- Parameters:
keys
- The keys that this stream will only operate on.- Returns:
- a stream with the keys filtered.
-
distributedBatchSize
Description copied from interface:CacheStream
Controls how many keys are returned from a remote node when using a stream terminal operation with a distributed cache to back this stream. This value is ignored when terminal operators that don't track keys are used. Key tracking terminal operators areCacheStream.iterator()
,CacheStream.spliterator()
,CacheStream.forEach(Consumer)
. Please see those methods for additional information on how this value may affect them.This value may be used in the case of a a terminal operator that doesn't track keys if an intermediate operation is performed that requires bringing keys locally to do computations. Examples of such intermediate operations are
CacheStream.sorted()
,CacheStream.sorted(Comparator)
,CacheStream.distinct()
,CacheStream.limit(long)
,CacheStream.skip(long)
This value is always ignored when this stream is backed by a cache that is not distributed as all values are already local.
- Specified by:
distributedBatchSize
in interfaceBaseCacheStream<Original,
R> - Specified by:
distributedBatchSize
in interfaceCacheStream<Original>
- Parameters:
batchSize
- The size of each batch. This defaults to the state transfer chunk size.- Returns:
- a stream with the batch size updated
-
segmentCompletionListener
Description copied from interface:CacheStream
Allows registration of a segment completion listener that is notified when a segment has completed processing. If the terminal operator has a short circuit this listener may never be called.This method is designed for the sole purpose of use with the
CacheStream.iterator()
to allow for a user to track completion of segments as they are returned from the iterator. Behavior of other methods is not specified. Please seeCacheStream.iterator()
for more information.Multiple listeners may be registered upon multiple invocations of this method. The ordering of notified listeners is not specified.
This is only used if this stream did not invoke
BaseCacheStream.disableRehashAware()
and has no flat map based operations. If this is done no segments will be notified.- Specified by:
segmentCompletionListener
in interfaceBaseCacheStream<Original,
R> - Specified by:
segmentCompletionListener
in interfaceCacheStream<Original>
- Parameters:
listener
- The listener that will be called back as segments are completed.- Returns:
- a stream with the listener registered.
-
disableRehashAware
Description copied from interface:CacheStream
Disables tracking of rehash events that could occur to the underlying cache. If a rehash event occurs while a terminal operation is being performed it is possible for some values that are in the cache to not be found. Note that you will never have an entry duplicated when rehash awareness is disabled, only lost values.Most terminal operations will run faster with rehash awareness disabled even without a rehash occuring. However if a rehash occurs with this disabled be prepared to possibly receive only a subset of values.
- Specified by:
disableRehashAware
in interfaceBaseCacheStream<Original,
R> - Specified by:
disableRehashAware
in interfaceCacheStream<Original>
- Returns:
- a stream with rehash awareness disabled.
-
timeout
Description copied from interface:CacheStream
Sets a given time to wait for a remote operation to respond by. This timeout does nothing if the terminal operation does not go remote.If a timeout does occur then a
TimeoutException
is thrown from the terminal operation invoking thread or on the next call to theIterator
orSpliterator
.Note that if a rehash occurs this timeout value is reset for the subsequent retry if rehash aware is enabled.
- Specified by:
timeout
in interfaceBaseCacheStream<Original,
R> - Specified by:
timeout
in interfaceCacheStream<Original>
- Parameters:
timeout
- the maximum time to waitunit
- the time unit of the timeout argument- Returns:
- a stream with the timeout set
-
isParallel
public boolean isParallel()- Specified by:
isParallel
in interfaceBaseStream<Original,
R>
-
sorted
Description copied from interface:CacheStream
This operation is performed entirely on the local node irrespective of the backing cache. This operation will act as an intermediate iterator operation requiring data be brought locally for proper behavior. Beware this means it will require having all entries of this cache into memory at one time. This is described in more detail at
CacheStream
Any subsequent intermediate operations and the terminal operation are also performed locally.
-
sorted
Description copied from interface:CacheStream
This operation is performed entirely on the local node irrespective of the backing cache. This operation will act as an intermediate iterator operation requiring data be brought locally for proper behavior. Beware this means it will require having all entries of this cache into memory at one time. This is described in more detail at
CacheStream
Any subsequent intermediate operations and the terminal operation are then performed locally.
-
limit
Description copied from interface:CacheStream
This intermediate operation will be performed both remotely and locally to reduce how many elements are sent back from each node. More specifically this operation is applied remotely on each node to only return up to the maxSize value and then the aggregated results are limited once again on the local node.
This operation will act as an intermediate iterator operation requiring data be brought locally for proper behavior. This is described in more detail in the
CacheStream
documentationAny subsequent intermediate operations and the terminal operation are then performed locally.
-
skip
Description copied from interface:CacheStream
This operation is performed entirely on the local node irrespective of the backing cache. This operation will act as an intermediate iterator operation requiring data be brought locally for proper behavior. This is described in more detail in the
CacheStream
documentationDepending on the terminal operator this may or may not require all entries or a subset after skip is applied to be in memory all at once.
Any subsequent intermediate operations and the terminal operation are then performed locally.
-
peek
Description copied from interface:CacheStream
-
distinct
Description copied from interface:CacheStream
This operation will be invoked both remotely and locally when used with a distributed cache backing this stream. This operation will act as an intermediate iterator operation requiring data be brought locally for proper behavior. This is described in more detail in the
CacheStream
documentationThis intermediate iterator operation will be performed locally and remotely requiring possibly a subset of all elements to be in memory
Any subsequent intermediate operations and the terminal operation are then performed locally.
-
filter
Description copied from interface:CacheStream
-
map
Description copied from interface:CacheStream
Just like in the cache,
null
values are not supported. -
mapToDouble
Description copied from interface:CacheStream
- Specified by:
mapToDouble
in interfaceCacheStream<Original>
- Specified by:
mapToDouble
in interfaceStream<Original>
- Parameters:
mapper
- a non-interfering, stateless function to apply to each element- Returns:
- the new double cache stream
-
mapToInt
Description copied from interface:CacheStream
-
mapToLong
Description copied from interface:CacheStream
-
flatMap
Description copied from interface:CacheStream
-
flatMapToDouble
Description copied from interface:CacheStream
- Specified by:
flatMapToDouble
in interfaceCacheStream<Original>
- Specified by:
flatMapToDouble
in interfaceStream<Original>
- Returns:
- the new cache stream
-
flatMapToInt
Description copied from interface:CacheStream
- Specified by:
flatMapToInt
in interfaceCacheStream<Original>
- Specified by:
flatMapToInt
in interfaceStream<Original>
- Returns:
- the new cache stream
-
flatMapToLong
Description copied from interface:CacheStream
- Specified by:
flatMapToLong
in interfaceCacheStream<Original>
- Specified by:
flatMapToLong
in interfaceStream<Original>
- Returns:
- the new cache stream
-
parallel
Description copied from interface:CacheStream
- Specified by:
parallel
in interfaceBaseStream<Original,
R> - Specified by:
parallel
in interfaceCacheStream<Original>
- Returns:
- a parallel cache stream
-
sequential
Description copied from interface:CacheStream
- Specified by:
sequential
in interfaceBaseStream<Original,
R> - Specified by:
sequential
in interfaceCacheStream<Original>
- Returns:
- a sequential cache stream
-
unordered
Description copied from interface:CacheStream
- Specified by:
unordered
in interfaceBaseStream<Original,
R> - Specified by:
unordered
in interfaceCacheStream<Original>
- Returns:
- an unordered cache stream
-
forEach
Description copied from interface:CacheStream
This operation is performed remotely on the node that is the primary owner for the key tied to the entry(s) in this stream.
NOTE: This method while being rehash aware has the lowest consistency of all of the operators. This operation will be performed on every entry at least once in the cluster, as long as the originator doesn't go down while it is being performed. This is due to how the distributed action is performed. Essentially the
CacheStream.distributedBatchSize(int)
value controls how many elements are processed per node at a time when rehash is enabled. After those are complete the keys are sent to the originator to confirm that those were processed. If that node goes down during/before the response those keys will be processed a second time.It is possible to have the cache local to each node injected into this instance if the provided Consumer also implements the
CacheAware
interface. This method will be invoked before the consumeraccept()
method is invoked.This method is ran distributed by default with a distributed backing cache. However if you wish for this operation to run locally you can use the
stream().iterator().forEachRemaining(action)
for a single threaded variant. If you wish to have a parallel variant you can useStreamSupport.stream(Spliterator, boolean)
passing in the spliterator from the stream. In either case remember you must close the stream after you are done processing the iterator or spliterator.. -
forEachOrdered
- Specified by:
forEachOrdered
in interfaceStream<Original>
-
forEach
Description copied from interface:CacheStream
Same asCacheStream.forEach(Consumer)
except that it takes aBiConsumer
that provides access to the underlyingCache
that is backing this stream.Note that the
CacheAware
interface is not supported for injection using this method as the cache is provided in the consumer directly.- Specified by:
forEach
in interfaceCacheStream<Original>
- Type Parameters:
K
- key type of the cacheV
- value type of the cache- Parameters:
action
- consumer to be ran for each element in the stream
-
reduce
-
reduce
-
reduce
-
collect
Description copied from interface:CacheStream
Note when using a distributed backing cache for this stream the collector must be marshallable. This prevents the usage of
Collectors
class. However you can use theCacheCollectors
static factory methods to create a serializable wrapper, which then creates the actual collector lazily after being deserialized. This is useful to use any method from theCollectors
class as you would normally. Alternatively, you can callCacheStream.collect(SerializableSupplier)
too.Note: The collector is applied on each node until all the local stream's values are reduced into a single object. Because of marshalling limitations, the final result of the collector on remote nodes is limited to a size of 2GB. If you need to process more than 2GB of data, you must force the collector to run on the originator with
CacheStream.spliterator()
:StreamSupport.stream(stream.filter(entry -> ...) .map(entry -> ...) .spliterator(), false) .collect(Collectors.toList());
-
collect
Description copied from interface:CacheStream
Performs a mutable reduction operation on the elements of this stream using aCollector
that is lazily created from theSerializableSupplier
provided. This method behaves exactly the same asCacheStream.collect(Collector)
with the enhanced capability of working even when the mutable reduction operation has to run in a remote node and the operation is notSerializable
or otherwise marshallable. So, this method is specially designed for situations when the user wants to use aCollector
instance that has been created byCollectors
static factory methods. In this particular case, the function that instantiates theCollector
will be marshalled according to theSerializable
rules.Note: The collector is applied on each node until all the local stream's values are reduced into a single object. Because of marshalling limitations, the final result of the collector on remote nodes is limited to a size of 2GB. If you need to process more than 2GB of data, you must force the collector to run on the originator with
CacheStream.spliterator()
:StreamSupport.stream(stream.filter(entry -> ...) .map(entry -> ...) .spliterator(), false) .collect(Collectors.toList());
- Specified by:
collect
in interfaceCacheStream<Original>
- Type Parameters:
R1
- The resulting type of the collector- Parameters:
supplier
- The supplier to create the collector that is specifically serializable- Returns:
- the collected value
-
collect
Description copied from interface:CacheStream
Performs a mutable reduction operation on the elements of this stream using aCollector
that is lazily created from theSupplier
provided. This method behaves exactly the same asCacheStream.collect(Collector)
with the enhanced capability of working even when the mutable reduction operation has to run in a remote node and the operation is notSerializable
or otherwise marshallable. So, this method is specially designed for situations when the user wants to use aCollector
instance that has been created byCollectors
static factory methods. In this particular case, the function that instantiates theCollector
will be marshalled using InfinispanExternalizer
class or one of its subtypes.Note: The collector is applied on each node until all the local stream's values are reduced into a single object. Because of marshalling limitations, the final result of the collector on remote nodes is limited to a size of 2GB. If you need to process more than 2GB of data, you must force the collector to run on the originator with
CacheStream.spliterator()
:StreamSupport.stream(stream.filter(entry -> ...) .map(entry -> ...) .spliterator(), false) .collect(Collectors.toList());
- Specified by:
collect
in interfaceCacheStream<Original>
- Type Parameters:
R1
- The resulting type of the collector- Parameters:
supplier
- The supplier to create the collector- Returns:
- the collected value
-
collect
public <R1> R1 collect(Supplier<R1> supplier, BiConsumer<R1, ? super R> accumulator, BiConsumer<R1, R1> combiner) Description copied from interface:CacheStream
Note: The accumulator and combiner are applied on each node until all the local stream's values are reduced into a single object. Because of marshalling limitations, the final result of the collector on remote nodes is limited to a size of 2GB. If you need to process more than 2GB of data, you must force the collector to run on the originator with
CacheStream.spliterator()
:StreamSupport.stream(stream.filter(entry -> ...) .map(entry -> ...) .spliterator(), false) .collect(Collectors.toList());
-
max
-
min
-
count
public long count() -
anyMatch
-
allMatch
-
noneMatch
-
findFirst
-
findAny
-
iterator
Description copied from interface:CacheStream
Usage of this operator requires closing this stream after you are done with the iterator. The preferred usage is to use a try with resource block on the stream.
This method has special usage with the
BaseCacheStream.SegmentCompletionListener
in that as entries are retrieved from the next method it will complete segments.This method obeys the
CacheStream.distributedBatchSize(int)
. Note that when using methods such asCacheStream.flatMap(Function)
that you will have possibly more than 1 element mapped to a given key so this doesn't guarantee that many number of entries are returned per batch.Note that the
Iterator.remove()
method is only supported if no intermediate operations have been applied to the stream and this is not a stream created from aCache.values()
collection.- Specified by:
iterator
in interfaceBaseStream<Original,
R> - Specified by:
iterator
in interfaceCacheStream<Original>
- Returns:
- the element iterator for this stream
-
spliterator
Description copied from interface:CacheStream
Usage of this operator requires closing this stream after you are done with the spliterator. The preferred usage is to use a try with resource block on the stream.
- Specified by:
spliterator
in interfaceBaseStream<Original,
R> - Specified by:
spliterator
in interfaceCacheStream<Original>
- Returns:
- the element spliterator for this stream
-
toArray
-
toArray
-
onClose
Description copied from interface:CacheStream
- Specified by:
onClose
in interfaceBaseStream<Original,
R> - Specified by:
onClose
in interfaceCacheStream<Original>
- Returns:
- a cache stream with the handler applied
-
close
public void close()- Specified by:
close
in interfaceAutoCloseable
- Specified by:
close
in interfaceBaseStream<Original,
R>
-