Practice Free D-DS-FN-23 Exam Online Questions
Which analytic technique would be appropriate to estimate blood pressure based on age and weight?
- A . Naïve Bayesian classification
- B . Linear regression
- C . K-means clustering
- D . Time series analysis
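As a rough illustration of estimating a continuous outcome such as blood pressure from age and weight, the sketch below fits an ordinary least squares model with NumPy. The data are synthetic, and the coefficients used to generate them (80, 0.5, 0.3) are assumptions chosen for the example, not clinical values.

```python
import numpy as np

# Synthetic, made-up data purely for illustration (not from any real study).
rng = np.random.default_rng(0)
age = rng.uniform(20, 70, size=200)       # years
weight = rng.uniform(50, 110, size=200)   # kg
# Assumed relationship, used only to generate the toy data.
bp = 80 + 0.5 * age + 0.3 * weight + rng.normal(0, 5, size=200)

# Design matrix with an intercept column, then ordinary least squares.
X = np.column_stack([np.ones_like(age), age, weight])
coef, *_ = np.linalg.lstsq(X, bp, rcond=None)

def predict_bp(age_years, weight_kg):
    """Predict blood pressure from the fitted linear model."""
    return coef[0] + coef[1] * age_years + coef[2] * weight_kg

print("intercept, age slope, weight slope:", coef)
print("prediction for age 45, weight 80 kg:", predict_bp(45, 80))
```

The fitted slopes should land near the values used to generate the data, which is the sense in which the model "estimates" a numeric response from numeric inputs.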
Refer to the exhibit.
You are using K-means clustering to classify customer behavior for a large retailer. You need to determine the optimum number of customer groups. You plot the within-sum-of-squares (WSS) data as shown in the exhibit.
How many customer groups should you specify?
- A . 2
- B . 3
- C . 4
- D . 8
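The exhibit is not reproduced here, but the way a WSS plot is typically produced can be sketched as follows. This is a minimal NumPy implementation of Lloyd's algorithm on synthetic data with three well-separated groups; the data, the `kmeans_wss` helper, and the farthest-first initialization are all assumptions made for the example. The usual reading is to pick the k where the curve bends (the "elbow"), after which adding clusters reduces WSS only slightly.

```python
import numpy as np

def kmeans_wss(X, k, n_iter=50):
    """Run a basic K-means and return the within-cluster sum of squares."""
    # Farthest-first initialization (deterministic; a simplifying assumption).
    centers = [X[0]]
    for _ in range(1, k):
        d2 = np.min([((X - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(X[d2.argmax()])
    centers = np.array(centers)

    for _ in range(n_iter):
        # Assign each point to its nearest center.
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        # Recompute centers; keep the old center if a cluster is empty.
        centers = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])

    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d2.min(axis=1).sum()

# Synthetic "customers": three groups in two made-up features.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(c, 1.0, size=(50, 2)) for c in [(0, 0), (10, 0), (5, 9)]])

wss = [kmeans_wss(X, k) for k in range(1, 9)]
for k, value in enumerate(wss, start=1):
    print(k, round(value, 1))
```

On data like this, WSS falls steeply up to k = 3 and then flattens, which is the pattern an elbow plot is meant to reveal.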
Which participant in a data analytics project is typically responsible for assessing the validity of the model?
- A . Data scientist
- B . Business user
- C . Project sponsor
- D . Project manager
When is the GROUP BY ROLLUP clause used in an OLAP query?
- A . All subtotals and grand totals are to be included in the output
- B . Subtotals are only to be included in the output
- C . Grand totals are only to be included in the output
- D . Specific subtotals and grand totals for a combination of variables are only to be included in the output
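To make the behavior of ROLLUP concrete, here is a small pure-Python emulation of `GROUP BY ROLLUP(region, product)` with a `SUM`. The rows and the `rollup` helper are hypothetical; the point is only that ROLLUP aggregates over each prefix of the grouping column list, which yields the detail rows, the subtotals per leading column, and the grand total.

```python
from collections import defaultdict

# Hypothetical sales rows: (region, product, amount).
rows = [
    ("East", "A", 10),
    ("East", "B", 20),
    ("West", "A", 5),
    ("West", "B", 15),
]

def rollup(rows, n_keys):
    """Emulate GROUP BY ROLLUP(key1, ..., keyN) with SUM over the last column.

    ROLLUP aggregates over each prefix of the key list:
    (k1..kN), (k1..kN-1), ..., ().
    None marks a rolled-up column, like NULL in SQL output.
    """
    totals = defaultdict(int)
    for row in rows:
        keys, amount = row[:n_keys], row[n_keys]
        for depth in range(n_keys, -1, -1):
            group = keys[:depth] + (None,) * (n_keys - depth)
            totals[group] += amount
    return dict(totals)

result = rollup(rows, 2)
for group, total in result.items():
    print(group, total)
# ("East", None) -> 30, ("West", None) -> 20, (None, None) -> 50
```

Note that there is no subtotal per product alone: ROLLUP follows the column hierarchy from left to right rather than producing every combination.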
The web analytics team uses Hadoop to process access logs. They now want to correlate this data with structured user data residing in a production single-instance JDBC database. They collaborate with the production team to import the data into Hadoop.
Which tool should they use?
- A . Sqoop
- B . Pig
- C . Chukwa
- D . Scribe
Refer to the exhibit.
In the exhibit, a correlogram is provided based on an autocorrelation analysis of a sample dataset.
What can you conclude based only on this exhibit?
- A . There appears to be no structure left to model in the data
- B . There appears to be a seasonal component in the data
- C . Lag 1 has a significant autocorrelation
- D . There appears to be a cyclical component in the data
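The exhibit is not reproduced here, but the quantities a correlogram displays can be sketched as follows. The example computes the sample autocorrelation of a simulated AR(1) series and flags lags that fall outside the approximate 95% bounds of ±1.96/√n. The series, its coefficient of 0.7, and the `acf` helper are assumptions made for illustration.

```python
import numpy as np

def acf(x, max_lag):
    """Sample autocorrelation for lags 0..max_lag."""
    x = np.asarray(x, dtype=float)
    x = x - x.mean()
    denom = np.dot(x, x)
    return np.array([np.dot(x[:len(x) - k], x[k:]) / denom
                     for k in range(max_lag + 1)])

# Simulated AR(1) series (coefficient 0.7 is an assumption for the demo).
rng = np.random.default_rng(42)
n = 500
noise = rng.normal(size=n)
ar1 = np.zeros(n)
for t in range(1, n):
    ar1[t] = 0.7 * ar1[t - 1] + noise[t]

r = acf(ar1, max_lag=20)
bound = 1.96 / np.sqrt(n)  # approximate 95% band under a white-noise null
significant_lags = [k for k in range(1, 21) if abs(r[k]) > bound]

print("bound:", bound)
print("significant lags:", significant_lags)
```

A correlogram in which every bar beyond lag 0 stays inside the band suggests little remaining structure, whereas bars that exceed it at regular intervals hint at a seasonal pattern.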
What describes the data repository represented by the ‘A’ in MAD?
- A . Attracts all data sources that occur within an organization, regardless of data quality
- B . Allows analysts to easily ingest, digest, produce, and adapt data at a rapid pace
- C . Centrally managed and based on long-range, careful design, planning and governance
- D . Enables analysts to study very large datasets without being limited to samples and extracts
You are analyzing a time series and want to determine its stationarity. You also want to determine the order of autoregressive models.
How are the autocorrelation functions used?
- A . ACF as an indication of stationarity, and PACF for the correlation between X_t and X_{t-k} not explained by their mutual correlation with X_1 through X_{k-1}.
- B . PACF as an indication of stationarity, and ACF for the correlation between X_t and X_{t-k} not explained by their mutual correlation with X_1 through X_{k-1}.
- C . ACF as an indication of stationarity, and PACF to determine the correlation of X_1 through X_{k-1}.
- D . PACF as an indication of stationarity, and ACF to determine the correlation of X_1 through X_{k-1}.
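The distinction between the two functions can be illustrated with a short sketch. It takes the theoretical ACF of an AR(1) process (ρ_k = 0.6^k, a value assumed for the example) and derives the PACF from it with the Durbin-Levinson recursion. The ACF decays gradually, while the PACF drops to zero after lag 1, which is why the PACF is commonly used to suggest the order of an autoregressive model. The `pacf_from_acf` helper is hypothetical.

```python
import numpy as np

def pacf_from_acf(rho):
    """Partial autocorrelations for lags 0..K via the Durbin-Levinson recursion.

    rho[k] is the autocorrelation at lag k, with rho[0] == 1.
    """
    K = len(rho) - 1
    pacf = np.zeros(K + 1)
    pacf[0] = 1.0
    phi_prev = np.zeros(K + 1)  # phi_prev[j] holds phi_{k-1, j}
    for k in range(1, K + 1):
        j = np.arange(1, k)
        num = rho[k] - np.sum(phi_prev[j] * rho[k - j])
        den = 1.0 - np.sum(phi_prev[j] * rho[j])
        phi_kk = num / den
        phi = phi_prev.copy()
        phi[j] = phi_prev[j] - phi_kk * phi_prev[k - j]
        phi[k] = phi_kk
        pacf[k] = phi_kk
        phi_prev = phi
    return pacf

# Theoretical ACF of an AR(1) process with coefficient 0.6 (assumed value).
lags = np.arange(0, 9)
rho = 0.6 ** lags
pacf = pacf_from_acf(rho)

print("ACF :", np.round(rho, 3))
print("PACF:", np.round(pacf, 3))
```

Each PACF value is the correlation at lag k that remains after accounting for the intermediate lags, so for an AR(p) process it is expected to cut off after lag p.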
Which SQL OLAP extension provides all possible grouping combinations?
- A . CUBE
- B . ROLLUP
- C . UNION ALL
- D . CROSS JOIN
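The difference between the grouping extensions can be seen by listing the grouping sets each one implies. The sketch below is plain Python, not SQL, and the column names and helper functions are hypothetical. CUBE over n columns implies every subset of those columns (2^n grouping sets), while ROLLUP implies only the n + 1 prefixes.

```python
from itertools import combinations

def cube_grouping_sets(columns):
    """All subsets of the grouping columns: 2**n grouping sets."""
    return [combo
            for r in range(len(columns), -1, -1)
            for combo in combinations(columns, r)]

def rollup_grouping_sets(columns):
    """Only the prefixes of the column list: n + 1 grouping sets."""
    return [tuple(columns[:r]) for r in range(len(columns), -1, -1)]

cols = ["region", "product"]
print(cube_grouping_sets(cols))
# [('region', 'product'), ('region',), ('product',), ()]
print(rollup_grouping_sets(cols))
# [('region', 'product'), ('region',), ()]
```

The empty tuple stands for the grand total row, and the `('product',)` set is the one that CUBE produces and ROLLUP omits.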