Which user requirements are best suited for using HDInsight Interactive Query?
When you want to use MapReduce on unstructured data with role-based access controls.
When you want to use SQL-like queries on structured data with row-level and column-level controls.
When you want to use SQL-like queries on high-concurrency data for long-running computations.
What file formats are supported with Interactive Query?
.xml, .doc, .log
.json, .csv, .txt
.PDF, .DBK, .MD
Which scenario is best for HDInsight Interactive Query?
Batch processing
Streaming data
Ad hoc queries
Why is the Hive Warehouse Connector needed?
Hive and Spark are different cluster types.
Hive and Spark have two different metastores. They require a connector to bridge between the two.
Hive is for static data and Spark is for streaming data.
Why is using the Hive Warehouse Connector more efficient and scalable than using a standard JDBC connection from Spark to Hive?
Because the library loads data from the HiveServer into the Spark driver in parallel.
Because the Hive Warehouse Connector is optimized for streaming data.
Because the library loads data from LLAP daemons into Spark executors in parallel.