Practice Free D-DS-FN-23 Exam Online Questions
Question #91
What is the primary function of the NameNode in Hadoop?
- A . Keeps track of which MapReduce jobs have been assigned to each TaskTracker
- B . Monitors the state of each JobTracker node and signals an event if unavailable
- C . Runs some number of mapping tasks against its assigned data
- D . Acts as a regulator/resolver among clients and DataNodes
Question #92
Which word or phrase completes the statement? Mahout is to Hadoop as MADlib is to _______.
- A . PostgreSQL
- B . R
- C . Excel
- D . SAS
Question #93
What provides the means for matching and manipulating text strings in SQL?
- A . PACF
- B . Association rules
- C . Regular expressions
- D . TF-IDF
Question #94
After which phase of the data analytics lifecycle should you determine if the model is robust enough?
- A . Discovery
- B . Operationalize
- C . Data preparation
- D . Model building
Question #95
You have plotted the distribution of savings account sizes for a bank.
Based on the distribution shown in the exhibit, how would you proceed?
- A . Data is extremely skewed. Replot the data on a logarithmic scale to get a better understanding of it.
- B . Data is extremely skewed but looks bimodal. Replot the data in the range 2,500 – 10,000 to be certain.
- C . Accounts of sizes greater than 2,500 are rare and are most likely outliers. Eliminate them from future analysis.
- D . Data is extremely skewed. Split the analysis into two cohorts; accounts less than 2,500 and accounts greater than 2,500.
Question #96
You are provided with the following list.
Which window function is missing?
cume_dist()
dense_rank()
rank()
percent_rank()
first_value()
last_value()
lag()
lead()
ntile()
- A . row_preceding()
- B . row_number()
- C . median()
- D . cumulative_sum()