Helper library for Spark to deal with large data sets
What do you like best?
- Easy integration to spark.
- Dealing with arrays made easier.
- Good tool for data analysis.
What do you dislike?
- locally it is no good than numpy, instead using numpy is better if dealing locally with the data.
- It only supports spark, should also be implemented for other frameworks.
Recommendations to others considering the product:
If you are dealing with data locally instead use numpy: https://docs.scipy.org/doc/ it is far better than bolt.
What problems are you solving with the product? What benefits have you realized?
- Dealing with complicated data sets.
- Analyzing the data efficiently.