I am using akka-stream-alpakka-cassandra to read from one table, apply some transformations, and save the results in another table. I modeled the reading stage with a CassandraSource and the saving stage with CassandraFlow.createWithPassThrough.
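A minimal sketch of such a recipe, with made-up keyspace, table, and column names and a made-up Person type, might look roughly like this (Alpakka Cassandra 3.x; CassandraFlow.create is used here only as a stand-in for whichever flow factory you pick):

    import akka.actor.ActorSystem
    import akka.stream.alpakka.cassandra.{ CassandraSessionSettings, CassandraWriteSettings }
    import akka.stream.alpakka.cassandra.scaladsl.{ CassandraFlow, CassandraSession, CassandraSessionRegistry, CassandraSource }
    import akka.stream.scaladsl.Sink

    object CopyPeople extends App {
      implicit val system: ActorSystem = ActorSystem("example")
      implicit val session: CassandraSession =
        CassandraSessionRegistry.get(system).sessionFor(CassandraSessionSettings("alpakka.cassandra"))

      final case class Person(id: String, name: String)

      // Read rows from the source table, transform them into domain objects and
      // insert each one into the target table; each element is passed through.
      val done =
        CassandraSource("SELECT id, name FROM source_ks.people")
          .map(row => Person(row.getString("id"), row.getString("name")))
          .via(
            CassandraFlow.create[Person](
              CassandraWriteSettings.defaults,
              "INSERT INTO target_ks.people(id, name) VALUES (?, ?)",
              (person, prepared) => prepared.bind(person.id, person.name)
            )
          )
          .runWith(Sink.ignore)
    }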
Implementing the happy path is straightforward; it is the unhappy path that poses some difficulties for me.
For example, it can happen that something goes wrong on the Cassandra side during the save and the stream ends up failing (I have not defined any SupervisionStrategy yet). What I would like is full control over the error that occurred for each emitted element, so the implementation of createWithPassThrough is not particularly useful here: it is based on mapAsync, and if the returned Future fails, the stream fails and stops. Putting a recover at the end of it does not solve the problem either, because the stream then completes successfully and stops consuming from upstream.
Wouldn't it give the developer more control to guarantee that the Future in mapAsync always succeeds and to emit an Either[Throwable, T] instead, or, even better, a pair (T, Either[Throwable, Done]) (we don't want to expose the ResultSet)? That way one would know the input the statement was bound with, together with the outcome of the (write) query.
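For illustration, here is a minimal sketch of the shape I have in mind, built directly on CassandraSession rather than on the Alpakka flow factories (the helper name and parameters are made up):

    import akka.{ Done, NotUsed }
    import akka.stream.alpakka.cassandra.scaladsl.CassandraSession
    import akka.stream.scaladsl.Flow
    import com.datastax.oss.driver.api.core.cql.{ BoundStatement, PreparedStatement }
    import scala.concurrent.ExecutionContext
    import scala.util.control.NonFatal

    def writeWithOutcome[T](
        cql: String,
        bind: (T, PreparedStatement) => BoundStatement,
        parallelism: Int = 1
    )(implicit session: CassandraSession,
      ec: ExecutionContext): Flow[T, (T, Either[Throwable, Done]), NotUsed] =
      Flow
        .lazyFutureFlow { () =>
          // Prepare the statement once, then run one write per element and turn a
          // failed write into a Left instead of failing the whole stream.
          session.prepare(cql).map { prepared =>
            Flow[T].mapAsync(parallelism) { element =>
              session
                .executeWrite(bind(element, prepared))
                .map(done => element -> (Right(done): Either[Throwable, Done]))
                .recover { case NonFatal(t) => element -> Left(t) }
            }
          }
        }
        .mapMaterializedValue(_ => NotUsed)

Downstream one could then, for instance, divert the Left cases to a dead-letter sink while the stream keeps consuming from upstream.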
If there are other strategies to handle this, can someone give some suggestions?
For the record: in our scenario this happens when trying to write to Cassandra while the keyspace or the table does not exist. As of today we are using CassandraFlow.createBatch from Alpakka 3.0.4 with Scala 2.13.8.
A workaround is to check whether the keyspace and table already exist before even creating a stream recipe. To do the check, one can execute a statement directly on the CassandraSession, as in
cs.select(s"SELECT ${column_name} from ${keyspace_name}.${table_name} limit 1").run()
Basically, you can assume that the keyspace and the table both exist if the Future does not fail, regardless of whether you then get 0 or 1 rows back. This workaround is a bit tedious, but it works in my scenario.
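Wrapped as a small helper, the check might look like this (a sketch; the helper name is made up, and an implicit CassandraSession and Materializer are assumed to be in scope):

    import akka.Done
    import akka.stream.Materializer
    import akka.stream.alpakka.cassandra.scaladsl.CassandraSession
    import scala.concurrent.Future

    // Fails the returned Future if the keyspace or table is missing; succeeds
    // whether the probe query returns 0 or 1 rows.
    def assertKeyspaceAndTableExist(keyspace: String, table: String, anyColumn: String)(
        implicit cs: CassandraSession,
        mat: Materializer): Future[Done] =
      cs.select(s"SELECT $anyColumn FROM $keyspace.$table LIMIT 1").run()

One can then flatMap the construction of the actual stream on that Future, so the pipeline only starts once the check has passed.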
It is relatively easy to recreate such a situation:
1. Implement a simple stream recipe for reading from or writing to Cassandra.
2. Run the stream and ensure one element is processed while you have not yet created the relevant Cassandra keyspace.
3. Alternatively to 2., run the stream and ensure one element is processed while you have not yet created the relevant Cassandra table.
You should notice that the stage reading from or writing to Cassandra basically stops working, but without debugging you will not see any exception about it. The whole thing just halts rather silently (even if you have implemented, and set via attributes, a supervision strategy that logs any exception that reaches it).
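For reference, such an attribute-based supervision strategy would typically be attached roughly as in the following sketch (names are made up; in the scenario above the decider is never invoked because no exception reaches it):

    import akka.{ Done, NotUsed }
    import akka.actor.ActorSystem
    import akka.stream.{ ActorAttributes, Supervision }
    import akka.stream.scaladsl.{ Flow, Sink, Source }
    import scala.concurrent.Future

    // `source` and `writeFlow` stand for the Cassandra reading and writing stages.
    def runWithLoggingSupervision[A, B](source: Source[A, NotUsed], writeFlow: Flow[A, B, NotUsed])(
        implicit system: ActorSystem): Future[Done] = {
      val loggingDecider: Supervision.Decider = { t =>
        system.log.error(t, "Cassandra stage failed")
        Supervision.Stop
      }
      source
        .via(writeFlow)
        .withAttributes(ActorAttributes.supervisionStrategy(loggingDecider))
        .runWith(Sink.ignore)
    }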