Which module should you use?

You are building an Azure Machine Learning Experiment.
You need to transform a string column that has 47 distinct values into a binary indicator column. The solution must use the One-vs-All Multiclass model.
Which module should you use?
A. Select Columns Transform
B. Convert to Indicator Values
C. Group Categorical Values
D. Edit Metadata

Answer: C

How should you complete the R code?

DRAG DROP
You have an Execute R Script module that has one input from either a Partition and Sample module or a Web Service input module.
You need to preprocess tweets by using R. The Solution must meet the following requirements:
* Remove digit
* Remove punctuation
* Convert to lowercase
How should you complete the R code? To answer drag the appropriate value to correct Target.

image004 - How should you complete the R code?

Answer:

image006 - How should you complete the R code?

Which type of Azure HDInsight cluster should you use to produce results as quickly as possible?

You are analyzing taxi trips in New York City. You leverage the Azure Data Factory to create data pipelines and to orchestrate data movement.
You plan to develop a predictive model for 170 million rows (37 GB) of raw data in Apache Hive by using Microsoft R Serve to identify which factors contributes to the passenger tipping behavior.
All of the platforms that are used for the analysis are the same. Each worker node has eight processor cores and 28 GB Of memory.
Which type of Azure HDInsight cluster should you use to produce results as quickly as possible?
A. Hadoop
B. HBase
C. Interactive Hive
D. Spark

Answer: A

What capability does the module provide?

You plan to use Azure Machine Learning to develop a predictive model. You plan to include an Execute Python Script module.
What capability does the module provide?
A. outputting a file to a network location
B. performing interactive debugging of a Python script
C. saving the result of a Python script run in a Machine Learning environment to a local file
D. visualizing univariate and multivariate summaries by using Python code

Answer: C

Which method should you use?

Note: This question is part of a series of questions that present the same Scenario. Each question I the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution while others might not have correct solution.
Start of repeated Scenario:
A Travel agency named Margie’s Travel sells airline tickets to customers in the United States.
Margie’s Travel wants you to provide insights and predictions on flight delays. The agency is considering implementing a system that will communicate to its customers as the flight departure near about possible delays due to weather conditions.
The flight data contains the following attributes:
* DepartureDate: The departure date aggregated at a per hour granularity.
* Carrier: The code assigned by the IATA and commonly used to identify a carrier.
* OriginAirportID: An identification number assigned by the USDOT to identify a unique airport (the flight’s Origin)
* DestAirportID: The departure delay in minutes.
*DepDet30: A Boolean value indicating whether the departure was delayed by 30 minutes or more ( a value of 1 indicates that the departure was delayed by 30 minutes or more)
The weather data contains the following Attributes: AirportID, ReadingDate (YYYY/MM/DD HH), SKYConditionVisibility, WeatherType, Windspeed, StationPressure, PressureChange and HourlyPrecip.
End of repeated Scenario:
You need to use historical data about on-time flight performance and the weather data to predict whether the departure of a scheduled flight will be delayed by more than 30 minutes.
Which method should you use?
A. Clustering
B. Linear regression
C. Classification
D. anomaly detection

Answer: C
Explanation:
https://gallery.cortanaintelligence.com/Experiment/837e2095ce784f1ba5ac623a60232027

Which model should you use?

You are building an Azure Machine Learning Solution for an Online retailer.
When a customer selects a product, you need to recommend products that the customer might like to purchase at the same time. The recommendation should be based on what other customers purchased the same product.
Which model should you use?
A. Collaborative Filtering
B. Boosted Decision Tree Regression Model
C. Two-Class boosted decision tree
D. K-Means Clustering

Answer: A

Which three actions can you perform by using both R code and Python in the Machine Learning environment?

You have an Azure Machine Learning Environment.
You are evaluating whether to use R code or Python.
Which three actions can you perform by using both R code and Python in the Machine Learning environment? Each correct answer presents a complete solution.
NOTE: Each correct selection is worth one point.
A. Preprocess cleanse, and group data.
B. Score a training model.
C. Create visualizations.
D. Create an untrained model that can be used with the Train Model module.
E. Implement feature ranking.

Answer: BCD

Does this meet the goal?

Note: This question is part of a series of questions that present the same Scenario. Each question I the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution while others might not have correct solution.
You are designing an Azure Machine Learning workflow.
You have a dataset that contains two million large digital photographs.
You plan to detect the presence of trees in the photographs.
You need to ensure that your model supports the following:
* Hidden Layers that support a directed graph structure.
* User-defined core components on the GPU
Solution: You create an Azure notebook that supports the Microsoft Cognitive Toolkit.
Does this meet the goal?
A. YES
B. NO

Answer: B