When verifying that an autonomous AI-based system is acting appropriately, which of the following are MOST important to include?
Correct Answer: C
The syllabus highlights that testing for unnecessary human intervention is a key focus for autonomous AI- based systems: "For autonomous AI-based systems, testers must ensure that the system does not prompt for unnecessary human intervention, as this contradicts the autonomy concept." (Reference: ISTQB CT-AI Syllabus v1.0, Section 8.2, page 59 of 99)
Question 2
An e-commerce developer built an application for automatic classification of online products in order to allow customers to select products faster. The goal is to provide more relevant products to the user based on prior purchases. Which of the following factors is necessary for a supervised machine learning algorithm to be successful?
Correct Answer: A
Supervised machine learning requires correctly labeled data to train an effective model. The learning process relies on input-output mappings where each training example consists of an input (features) and a correctly labeled output (target variable). Incorrect labeling can significantly degrade model performance. * Supervised Learning Process * The algorithm learns from labeled data, mapping inputs to correct outputs during training. * If labels are incorrect, the model will learn incorrect relationships and produce unreliable predictions. * Quality of Training Data * The accuracy of any supervised ML model ishighly dependent on the quality of labels. * Poorly labeled data leads to mislabeled training sets, resulting inbiased or underperforming models. * Error Minimization and Model Accuracy * Incorrectly labeled data affects theconfusion matrix, reducing precision, recall, and accuracy. * It leads to overfitting or underfitting, which decreases the model's ability to generalize. * Industry Standard Practices * Many AI development teams spend a significant amount of time ondata annotation and quality controlto ensure high-quality labeled datasets. * (B) Minimizing the amount of time spent training the algorithm#(Incorrect) * While reducing training time is important for efficiency, the quality of training is more critical. A well-trained model takes time to process large datasets and optimize its parameters. * (C) Selecting the correct data pipeline for the ML training#(Incorrect) * A good data pipeline helps, butit does not directly impact learning successas much as labeling does.Even a well-optimized pipeline cannot fix incorrect labels. * (D) Grouping similar products together before feeding them into the algorithm#(Incorrect) * This describesclustering, which is anunsupervised learning technique. Supervised learningrequires labeled examples, not just grouping of data. * Labeled data is necessary for supervised learning."For supervised learning, it is necessary to have properly labeled data." * Data labeling errors can impact performance."Supervised learning assumes that the data is correctly labeled by the data annotators.However, it is rare in practice for all items in a dataset to be labeled correctly." Why Labeling is Critical?Why Other Options are Incorrect?References from ISTQB Certified Tester AI Testing Study GuideThus,option A is the correct answer, ascorrectly labeled data is essential for supervised machine learning success.
Question 3
An e-commerce developer built an application for automatic classification of online products in order to allow customers to select products faster. The goal is to provide more relevant products to the user based on prior purchases. Which of the following factors is necessary for a supervised machine learning algorithm to be successful?
Correct Answer: A
The syllabus explains that supervised learning requires correctly labeled data so the algorithm can learn the relationship between input features and output labels: "In supervised learning, the algorithm creates the ML model from labeled data during the training phase. The labeled data is used to infer the relationship between the input data and output labels." (Reference: ISTQB CT-AI Syllabus v1.0, Section 3.1.1)
Question 4
Which ONE of the following options does NOT describe a challenge for acquiring test data in ML systems? SELECT ONE OPTION
Correct Answer: C
* Challenges for Acquiring Test Data in ML Systems: Compliance needs, the changing nature of data over time, and sourcing data from public sources are significant challenges. Data being generated quickly is generally not a challenge; it can actually be beneficial as it provides more data for training and testing. * Reference: ISTQB_CT-AI_Syllabus_v1.0, Sections on Data Preparation and Data Quality Issues.
Question 5
Which of the following is an example of a clustering problem that can be resolved by unsupervised learning?
Correct Answer: A
Clustering is a form ofunsupervised learning, which groups data points based onsimilarities without predefined labels. According toISTQB CT-AI Syllabus, clustering is used in scenarios where: * The objective is to find natural groupings in data. * The dataset does not have labeled outputs. * Patterns and structures need to be identified automatically. Analyzing the answer choices: * A. Associating shoppers with their shopping tendencies # Correct * Shoppers can be grouped based on purchasing behaviors(e.g., luxury shoppers vs. budget- conscious shoppers), which is a typical clustering application in market segmentation. * B. Grouping individual fish together based on their types of fins # Incorrect * If thetypes of fins are labeled, it becomes aclassification problem, which requires supervised learning. * C. Classifying muffin purchases based on packaging attractiveness # Incorrect * Classification, not clustering, because attractiveness scores or labels must be predefined. * D. Estimating the expected purchase of cat food after an ad campaign # Incorrect * This is a prediction task, best suited forregression models, which are part of supervised learning. Thus,Option A is the best answer, asclusteringis used togroup shoppers based on tendencies without predefined labels. Certified Tester AI Testing Study Guide References: * ISTQB CT-AI Syllabus v1.0, Section 3.1.2 (Unsupervised Learning - Clustering and Association) * ISTQB CT-AI Syllabus v1.0, Section 3.3 (Selecting a Form of ML - Clustering).