Organizations now have to deal with data volumes that run into terabytes and petabytes. For these organizations, big data is no longer an option but a necessity. And big data is not just about acquiring large amounts of data. What counts is how efficiently you use the data and how well you interpret its meaning.
SORAL tackles a few specific big data problems. One is the need to convert large amounts of data from one format to another. Apache Spark can read and write a columnar data format called Parquet. We have built a tool that converts large JSON files to Parquet files, and vice versa, so the data can be used by Spark.
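
To make the conversion idea concrete, here is a minimal sketch of the layout change behind it: JSON stores data record by record, while Parquet stores it column by column. This is not SORAL's implementation and it does not write real Parquet files; it uses only the Python standard library, assumes newline-delimited JSON input with flat records, and the function names `rows_to_columns` and `columns_to_rows` are hypothetical, chosen for illustration.

```python
import json


def rows_to_columns(json_lines):
    """Pivot newline-delimited JSON records into a dict of column lists.

    Assumption: each non-empty line holds one flat JSON object.
    Fields missing from a record are stored as None.
    """
    columns = {}
    row_count = 0
    for line in json_lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        # A field seen for the first time gets None for all earlier rows.
        for key in record:
            if key not in columns:
                columns[key] = [None] * row_count
        # Append this record's value (or None) to every known column.
        for key, values in columns.items():
            values.append(record.get(key))
        row_count += 1
    return columns


def columns_to_rows(columns):
    """Reverse direction: rebuild one dict per row from the column lists."""
    names = list(columns)
    return [dict(zip(names, values)) for values in zip(*columns.values())]


if __name__ == "__main__":
    sample = [
        '{"id": 1, "name": "a"}',
        '{"id": 2}',
        '{"id": 3, "name": "c", "score": 1.5}',
    ]
    cols = rows_to_columns(sample)
    print(cols)
    print(columns_to_rows(cols))
```

Reading the input one line at a time keeps only one JSON record's text in memory at once, which matters for large files, although this sketch still accumulates every column in memory. A real converter would flush columns to disk in batches and would typically rely on an existing Parquet library or on Spark itself for the file encoding.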