Free DP-100 Exam Dumps

Question 16

- (Exam Topic 3)
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You are creating a new experiment in Azure Machine Learning Studio.
One class has a much smaller number of observations than the other classes in the training set. You need to select an appropriate data sampling strategy to compensate for the class imbalance.
Solution: You use the Stratified split for the sampling mode.
Does the solution meet the goal?

Correct Answer: B
Instead, use the Synthetic Minority Oversampling Technique (SMOTE) sampling mode.
Note: SMOTE is used to increase the number of underrepresented cases in a dataset used for machine learning. SMOTE is a better way of increasing the number of rare cases than simply duplicating existing cases.
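To make the idea behind SMOTE concrete, here is a minimal sketch in plain Python. It is not the Studio module's implementation: the function name `smote`, its parameters, and the sample points are all hypothetical, chosen only for illustration. It assumes numeric features and Euclidean distance, and it creates each new case by interpolating between a minority sample and one of its nearest minority-class neighbours rather than copying an existing case.

```python
import math
import random


def smote(minority, n_synthetic, k=5, seed=0):
    """Simplified SMOTE sketch (hypothetical helper, not the Studio module).

    Each synthetic point lies on the line segment between a randomly chosen
    minority sample and one of its k nearest minority-class neighbours.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_synthetic):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base point.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        # Place the new point at a random position between base and neighbour.
        gap = rng.random()
        synthetic.append(tuple(b + gap * (n - b) for b, n in zip(base, neighbour)))
    return synthetic


# Illustrative data: four minority-class observations with two features each.
minority = [(1.0, 1.0), (1.2, 0.9), (0.8, 1.1), (1.1, 1.3)]
new_points = smote(minority, n_synthetic=6, k=2)
```

Because every synthetic point is a blend of two real minority cases, the new cases fall inside the region the minority class already occupies but are not exact duplicates, which is the advantage the note above describes.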
References:
https://docs.microsoft.com/en-us/azure/machine-learning/studio-module-reference/smote

Question 17

- (Exam Topic 3)
You use Azure Machine Learning Studio to build a machine learning experiment.
You need to divide data into two distinct datasets. Which module should you use?

Correct Answer: A
Partition and Sample with the Stratified split option outputs multiple datasets, partitioned using the rules you specified.
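As a rough illustration of what a stratified split does, the sketch below divides a labelled dataset into two parts while keeping the class proportions the same in each. It is plain Python, not the Partition and Sample module itself; the function name `stratified_split`, its parameters, and the toy data are assumptions made for the example.

```python
import random
from collections import Counter, defaultdict


def stratified_split(rows, label_of, fraction=0.5, seed=0):
    """Split rows into two lists, preserving each class's share in both.

    Hypothetical helper for illustration; `label_of` extracts the class
    label from a row, and `fraction` is the share sent to the first list.
    """
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for row in rows:
        by_class[label_of(row)].append(row)
    first, second = [], []
    for members in by_class.values():
        # Shuffle within the class, then cut that class at the same fraction.
        rng.shuffle(members)
        cut = round(len(members) * fraction)
        first.extend(members[:cut])
        second.extend(members[cut:])
    return first, second


# Toy dataset: 90 rows of class "a" and 10 rows of class "b".
rows = [("a", i) for i in range(90)] + [("b", i) for i in range(10)]
part_one, part_two = stratified_split(rows, label_of=lambda r: r[0], fraction=0.5)
counts_one = Counter(label for label, _ in part_one)
counts_two = Counter(label for label, _ in part_two)
```

In this toy run each partition keeps the original 9:1 ratio between the two classes. That is useful for producing representative partitions, but it also shows why a stratified split on its own does not compensate for class imbalance: the rare class stays just as rare in both outputs.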
References:
https://docs.microsoft.com/en-us/azure/machine-learning/studio-module-reference/partition-and-sample

Question 18

- (Exam Topic 3)
Your team is building a data engineering and data science development environment. The environment must support the following requirements:
- Support Python and Scala.
- Compose data storage, movement, and processing services into automated data pipelines.
- Use the same tool for the orchestration of both data engineering and data science.
- Support workload isolation and interactive workloads.
- Enable scaling across a cluster of machines.
You need to create the environment.
What should you do?

Correct Answer: B
In Azure Databricks, we can create two different types of clusters:
- Standard: these are the default clusters and can be used with Python, R, Scala, and SQL.
- High-concurrency
Azure Databricks is fully integrated with Azure Data Factory.