Microsoft Perform Data Engineering on Microsoft Azure HDInsight (70-775) Free Practice Test
Question 1
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this sections, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You have an Apache Pig table named Sales in Apache HCatalog.
You need to make the data in the table accessible from Apache Pig.
Solution: You use the following script.

Does this meet the goal?
After you answer a question in this sections, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You have an Apache Pig table named Sales in Apache HCatalog.
You need to make the data in the table accessible from Apache Pig.
Solution: You use the following script.

Does this meet the goal?
Correct Answer: A
Explanation: Only visible for TestSimulate members. You can sign-up / login (it's free).
Question 2
Note: This question is part of a series of questions that present the same Scenario.
Each question I the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution while others might not have correct solution.
You are implementing a batch processing solution by using Azure HDInsight.
You plan to import 300 TB of data.
You plan to use one job that has many concurrent tasks to import the data in memory.
You need to maximize the amount of concurrent tanks for the job.
What should you do?
Each question I the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution while others might not have correct solution.
You are implementing a batch processing solution by using Azure HDInsight.
You plan to import 300 TB of data.
You plan to use one job that has many concurrent tasks to import the data in memory.
You need to maximize the amount of concurrent tanks for the job.
What should you do?
Correct Answer: G
Question 3
You have an array of integers in Apache Spark.
You need to save the data to an Apache Parquet file.
Which methods should you use?
You need to save the data to an Apache Parquet file.
Which methods should you use?
Correct Answer: A
Explanation: Only visible for TestSimulate members. You can sign-up / login (it's free).
Question 4
Note: This question is part of a series of questions that use the same or similar answer choices. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.
You are implementing a batch processing solution by using Azure HDInsight.
You plan to import 300 TB of dat
a.
You plan to use one job that has many concurrent tasks to import the data in memory.
You need to maximize the amount of concurrent tasks for the job.
What should you do?
You are implementing a batch processing solution by using Azure HDInsight.
You plan to import 300 TB of dat
a.
You plan to use one job that has many concurrent tasks to import the data in memory.
You need to maximize the amount of concurrent tasks for the job.
What should you do?
Correct Answer: D
Explanation: Only visible for TestSimulate members. You can sign-up / login (it's free).
Question 5
You have an Azure HDlnsight cluster.
You need to build a solution to ingest real-time streaming data into nonrelational distributed database.
What should you use to build the solution?
You need to build a solution to ingest real-time streaming data into nonrelational distributed database.
What should you use to build the solution?
Correct Answer: A
Question 6
DRAG DROP
You have a domain-joined Azure HDInsight cluster.
You plan to assign permissions to several support staff.
You need to assign roles to the staff so that they can perform specific tasks. The solution must use the principle of least privilege.
Which role should you assign for each task? To answer, drag the appropriate roles to the correct targets.
Each role may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.

You have a domain-joined Azure HDInsight cluster.
You plan to assign permissions to several support staff.
You need to assign roles to the staff so that they can perform specific tasks. The solution must use the principle of least privilege.
Which role should you assign for each task? To answer, drag the appropriate roles to the correct targets.
Each role may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.

Correct Answer:

Question 7
You have an Azure HDInsight cluster.
You need a build a solution to ingest real-time streaming data into a nonrelational distributed database.
What should you use to build the solution?
You need a build a solution to ingest real-time streaming data into a nonrelational distributed database.
What should you use to build the solution?
Correct Answer: A
Explanation: Only visible for TestSimulate members. You can sign-up / login (it's free).