In a recent paper, researchers introduced Flare, a back-end for Spark that improves the framework’s performance closer to that of the top SQL query engines for relational and machine learning ...
Nodes can run as SQL compute nodes, SQL storage nodes or HDFS data nodes. In the HDFS case, SQL Server and Apache Spark run co-located, in the same container. All of this interoperability is enabled ...