Best streaming engine compared to Kinesis
What do you like best?
Spark docs are saying, it is not atomic but it is near-atomic in the way of committing data but it loses in consistency. In our case, we are using a spark engine to read JSON from Apache kafka and write it to Data lake properties. Best part is streaming application on batch based.
What do you dislike?
It is not good in processing in-memory data when it auto-commit is disabled.
What problems are you solving with the product? What benefits have you realized?
We are doing event processing and analysis using Spark by reading leads from Kafka to streaming to Datalake ecosystem to segregate value data for analytics