What is Spark SQL?

Spark SQL is a Spark module for structured data processing. Unlike the basic Spark RDD API, the interfaces provided by Spark SQL give Spark more information about the structure of both the data and the computation being performed. Internally, Spark SQL uses this additional information to perform further optimizations.

One use of Spark SQL is to execute SQL queries.

When running SQL from within another programming language, the results are returned as a Dataset/DataFrame.
