Microsoft Analyzing Big Data with Microsoft R

1.

You have a Microsoft SQL Server instance that has R Services (In-Database) installed. The server has a comma-separated values (CSV) file stored in the local file system.
For analytic purposes, you need to read the CSV file into a database table in the SQL Server instance.
You connect to the SQL Server instance by using SQL Server Management Studio.
What should you use from sp_execute_external_script?

RxSqlServerData and specify the CSV file path in the connection string rxDataStep and specify the CSV file path as the inFile argument rxImportToXdf and specify specify the CSV file as the input read.csv and specify the CSV file path as the parameter

2.

You plan to read data from an Oracle database table and to store the data in the file system for later processing by dplyrXdf. The size of the data is larger than the memory on the server to be used for modelling.
You need to ensure that the data can be processed by dplyrXdf in the least amount of time possible.
How should you transfer the data from the Oracle database?

Define a data source to the Oracle database server by using RxOdbcData. Use rxImport to save the data to a comma-separated values(CSV) file. Use the RODBC library, connect to the Oracle database server by using odbcConnect, and then use rxDataStep to export the data to a comma-separatedvalues (CSV) file. Define a data source to the Oracle database server by using RxOdbcData,and then use rxImport to save the data to an XDF file. Use the RODBC library, connect to the Oracle database server by using odbcConnect, and then use rxSplit to save the data to multiple comma-separatedvalues (CSV) file.

3.

You are running a parallel function that uses the following R code segment. (Line numbers are included for reference only.)
01 cp <- 0.01 xval <- 0 maxdepth <- 5
02
03 (form, data = "segmentationDataBig", maxDepth = maxdepth, cp = cp, xval = xval, blocksPerRead = 250
You need to complete the R code. The solution must support chunking.
Which function should insert at line 02?

rxBTrees rxExec rxDForest rxDTree

4.

You have one-class support vector machines (SVMs).
You have a large dataset, but you do not have enough training time to fully test the model.
What is an alternative method to validate the model?

Use Principal Components Analysis (PCA)-Based Anomaly Detection. Replace the SVMs with two-class SVMs. Perform feature selection. Use outlier detection.

5.

You need to build a model that looks at the probability of an outcome. You must regulate between L1 and L2.
Which classification method should you use?

Two-Class Neutral Network Two-Class Support Vector Machine Two-Class Decision Forest Two-Class Logistic Regression

6.

You have a dataset.
You need to repeatedly split randomly the dataset so that 80 percent of the data is used as a training set and the remaining 20 percent is used as a test set.
Which method should you use?

threshold binary classification imputation cross validation pruning

7.

You need to run a large data tree model by using rxDForest. The model must use cross validation.
Which rxDForest option should you use?

maxSurrogate maxNumBins maxDepth maxCompete xVal

8.

You have the following regression forest.

Which variable contributes the most to the dependent variable?

stack.loss Water.Temp Air.Flow Acid.Conc

Microsoft Analyzing Big Data with Microsoft R

Answer & Solution

Answer: Option B

Solution:

Answer & Solution

Answer: Option B

Solution:

Answer & Solution

Answer: Option B

Solution:

Answer & Solution

Answer: Option B

Solution:

Answer & Solution

Answer: Option B

Solution:

Answer & Solution

Answer: Option B

Solution:

Answer & Solution

Answer: Option B

Solution:

Answer & Solution

Answer: Option B

Solution:

Congratulations

COMPANY

Products

OTHERS

Partner

Microsoft Analyzing Big Data with Microsoft R

Answer & Solution

Answer: Option B

Solution:

Answer & Solution

Answer: Option B

Solution:

Answer & Solution

Answer: Option B

Solution:

Answer & Solution

Answer: Option B

Solution:

Answer & Solution

Answer: Option B

Solution:

Answer & Solution

Answer: Option B

Solution:

Answer & Solution

Answer: Option B

Solution:

Answer & Solution

Answer: Option B

Solution:

Congratulations

Detail Form

COMPANY

Products

OTHERS

Partner