Apache Flink

*What data types can I use in incremental window aggregation for safe checkpointing?*

Hi! I want to do incremental window aggregation using `.aggregate(new MyAggregateFunction, new MyProcessWindowFunction)` as described in <https://nightlies.apache.org/flink/flink-docs-release-1.14/docs/dev/datastream/operators/windows/#incremental-window-aggregation-with-aggregatefunction|documentation>.

I'm wondering what Java/Scala types I can safely use as part of the incremental state ("accumulator"). I would like to, for example, keep a set of seen IDs in <https://docs.scala-lang.org/overviews/collections/sets.html|Scala Set>. Everything seems to work fine, but I'm wondering if also checkpoint and creating savepoints will work as expected. Will Flink know how to serialize such a data structure for checkpointing and savepointing? Thanks for any insights :pray:

EDIT: The comment by David Anderson in this <https://stackoverflow.com/questions/71673071/flink-falling-back-to-default-kryo-serializer-because-chill-serializer-couldnt|SO question> says that "_an AggregateFunction where the accumulator is a Set tends to be problematic_". Why is it problematic? If I use Java HashMap, is that problematic?