What is a common characteristic of data bias in machine learning?

Prepare for the CompTIA Security+ SY0-601 exam. Explore comprehensive flashcards and diverse multiple-choice questions with hints and explanations. Get exam-ready now!

A common characteristic of data bias in machine learning is that it can result from unequal representation in training data. When the training data is not representative of the actual population or is skewed towards certain groups or categories, this imbalance can lead to biased predictions. For instance, if a facial recognition system is trained predominantly on images of one demographic, it may perform poorly on individuals from other demographics, leading to inaccuracies and reinforcing existing biases.

This issue highlights the importance of ensuring that datasets are diverse and representative to minimize bias. It underscores why careful data collection and preprocessing are crucial in the development of machine learning models, as the quality and composition of the data have a significant impact on the model’s performance and fairness.

Subscribe

Get the latest from Examzify

You can unsubscribe at any time. Read our privacy policy