Synthetic data is not a solution for data scarcity. When generated from a limited source dataset, it amplifies the statistical patterns already present in that dataset, including its biases and errors. The result is a false sense of data abundance that degrades model performance.
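The point can be illustrated with a minimal sketch. It assumes a simple Gaussian fit as a stand-in for whatever generative model is used in practice. The names `fit_gaussian` and `generate_synthetic`, the sample sizes, and the "true" population parameters are all hypothetical choices made for illustration. The sketch fits a generator to a small source sample, draws a much larger synthetic set from it, and compares how far each set's mean sits from the true population mean.

```python
import random
import statistics

def fit_gaussian(sample):
    """Estimate mean and standard deviation from the source sample."""
    return statistics.mean(sample), statistics.stdev(sample)

def generate_synthetic(mean, stdev, n, rng):
    """Draw n synthetic points from the fitted Gaussian."""
    return [rng.gauss(mean, stdev) for _ in range(n)]

rng = random.Random(0)  # fixed seed so the run is repeatable

# Hypothetical "true" population, unknown to the modeler in practice.
TRUE_MEAN, TRUE_STDEV = 0.0, 1.0

# A limited source dataset: only 10 real observations.
source = [rng.gauss(TRUE_MEAN, TRUE_STDEV) for _ in range(10)]
src_mean, src_stdev = fit_gaussian(source)

# "Abundant" synthetic data generated from the fitted model.
synthetic = generate_synthetic(src_mean, src_stdev, 100_000, rng)
syn_mean = statistics.mean(synthetic)

# Distance of each estimate from the true population mean.
source_error = abs(src_mean - TRUE_MEAN)
synthetic_error = abs(syn_mean - TRUE_MEAN)

print(f"source size:     {len(source)}")
print(f"synthetic size:  {len(synthetic)}")
print(f"source error:    {source_error:.4f}")
print(f"synthetic error: {synthetic_error:.4f}")
```

Under these assumptions, the synthetic set is 10,000 times larger than the source, yet its mean tracks the source sample's mean rather than the population's. Whatever sampling error the 10 real observations carry is reproduced, not reduced, by the extra volume.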














