Practice Free C1000-144 Exam Online Questions
When choosing a plot to show the relationship between two ranked variables, which type of visualization is least appropriate?
- A . Scatterplot
- B . Heatmap
- C . Line plot
- D . Pie chart
What is one way that IBM AutoAI helps make it easier for data scientists to determine what key fields to use to join data tables?
- A . AutoAI automatically suggests joining on fields in different tables that have matching names.
- B . AutoAI automatically suggests joining on fields in different tables that have matching values.
- C . AutoAI automatically suggests joining on fields in different tables when those tables have the same number of rows.
- D . AutoAI automatically suggests joining on fields in different tables when those tables have the same number of fields.
Why is it important to have a rollback plan when updating models in production?
- A . To increase the model’s training time
- B . To comply with international data regulations
- C . To handle potential issues with new model versions
- D . To improve the graphical interface of the application
What is a primary application of clustering algorithms?
- A . Predicting class labels for a given set of predictors
- B . Identifying abnormal data patterns in the dataset
- C . Grouping similar items without predefined labels
- D . Recommending products based on user preferences
Which data points are essential to understand when analyzing available data for a business problem?
- A . Data volume and variety
- B . The age of the data
- C . Sources of the data
- D . All of the above
What is the reason oversampling on the minority class should be performed after the train test is split and not before?
- A . To not risk reducing the value of the recall
- B . To not risk reducing the value of the accuracy
- C . To not increase the number of records in the testing dataset
- D . So there is not duplicate records in both the training and testing datasets
What is the core principle of the IBM Garage Methodology?
- A . Utilizing high-volume data storage effectively
- B . Maximizing remote working capabilities
- C . Creating minimal viable products (MVPs) through iterative development
- D . Implementing strict project management protocols
In the context of anomaly detection, what is the algorithm primarily searching for?
- A . Patterns that do not conform to expected behavior
- B . The best way to group similar data points
- C . The optimal number of clusters in the data
- D . The strongest predictors of a target variable
When selecting tools for deploying machine learning models, what factors should be considered? (Choose Two)
- A . Compatibility with existing data systems
- B . The programming language familiarity of the team
- C . The visual appeal of the tool’s GUI
- D . The cost of licenses and support
What is the main idea behind the Linear Support Vector Machine (SVM)?
- A . To maximize the margin between the decision boundary and the nearest data points
- B . To minimize the distance to the nearest data points
- C . To maximize the decision boundary curvature
- D . To classify data points by a random boundary