EMC Data Science and Big Data Analytics (E20-007) Free Practice Test
Question 1
Which graphical representation shows the distribution and multiple summary statistics of a continuous variable for each value of a corresponding discrete variable?
Correct Answer: C
Question 2
Your colleague, who is new to Hadoop, approaches you with a question. They want to know how best to access their data. This colleague has previously worked extensively with SQL and databases.
Which query interface would you recommend?
Which query interface would you recommend?
Correct Answer: D
Question 3
A data scientist wants to predict the probability of death from heart disease based on three risk factors: age, gender, and blood cholesterol level.
What is the most appropriate method for this project?
What is the most appropriate method for this project?
Correct Answer: B
Question 4
Which SQL OLAP extension provides all possible grouping combinations?
Correct Answer: B
Question 5
A disk drive manufacturer has a defect rate of less than 1.0% with 98% confidence. A quality assurance team samples 1000 disk drives and finds 14 defective units. Which action should the team recommend?
Correct Answer: B
Question 6
Review the following code:
SELECT pn, vn, sum(prc*qty)
FROM sale
GROUP BY CUBE(pn, vn)
ORDER BY 1, 2, 3;
Which combination of subtotals do you expect to be returned by the query?
SELECT pn, vn, sum(prc*qty)
FROM sale
GROUP BY CUBE(pn, vn)
ORDER BY 1, 2, 3;
Which combination of subtotals do you expect to be returned by the query?
Correct Answer: B
Question 7
In which lifecycle stage are appropriate analytical techniques determined?
Correct Answer: D
Question 8
Refer to the exhibit.

In the exhibit, a correlogram is provided based on an autocorrelation analysis of a sample dataset.
What can you conclude from only this exhibit?

In the exhibit, a correlogram is provided based on an autocorrelation analysis of a sample dataset.
What can you conclude from only this exhibit?
Correct Answer: B