The Hidden Challenge in Medical AI: Domain Shift
Deep learning has made remarkable progress in medical image analysis. Models can now detect diseases and highlight abnormalities in radiographs with impressive accuracy. However, one major obstacle remains: models often fail when deployed in environments that differ from their training data.
This issue—known as domain shift is particularly common in medical imaging. Differences in imaging devices, acquisition protocols, or even patient populations can lead to substantial performance degradation. In dental radiology, this challenge is evident in the detection of periapical lesions. While panoramic X-rays are widely used for their low radiation and accessibility, their two-dimensional projection makes small lesions difficult to identify, leading to lower clinical sensitivity compared to advanced modalities like cone beam CT.
Why Data Scarcity Makes the Problem Harder
Training robust models requires massive, diverse datasets. In medical imaging, acquiring such data is expensive and time-consuming due to ethical guidelines and the need for expert annotations. Our study utilized 196 panoramic radiographs, of which 145 contained annotated lesions. While valuable, this size rarely covers full clinical variability, causing detectors to struggle with unusual lesion shapes or artifacts. To overcome this, we explored synthetic data augmentation.
Two Approaches to Synthetic Lesion Generation
Our research compared two controllable pipelines for generating synthetic lesions:
1. Expert-Guided Synthesis (EGS)
In this approach, clinicians manually draw lesion masks on healthy radiographs. A procedural rendering pipeline then applies noise-based texture patterns, intensity modulation (to simulate radiolucency), and Gaussian edge blending to integrate the lesion realistically with the surrounding bone.
2. Prompt-Guided Lesion Synthesis (PGLS)
We introduce a novel idea: using natural language prompts to guide generation. Instead of manual masks, clinicians describe attributes—size, location, margin sharpness, and contrast. A vision-language foundation model interprets these prompts to perform image-to-image edits. This approach requires no task-specific generative model training and is highly scalable through prompt libraries.
Training Robust Lesion Detectors
We evaluated these strategies using a YOLOv10 object detection model trained on three regimes: Real-only data, Real + EGS, and Real + PGLS. The results were clear: both augmentation strategies improved performance, but prompt-guided synthesis consistently produced the strongest gains—achieving approximately 8–18% performance improvement across major metrics like mean Average Precision (mAP) and recall.
Why Prompt-Guided Augmentation Matters
Traditional generative approaches like GANs require massive datasets and complex tuning. Prompt-guided synthesis, however, uses existing foundation models for targeted edits. This allows for an iterative workflow: identify model failure cases (e.g., low-contrast lesions), describe them through prompts, generate targeted data, and retrain.
Conclusion: Toward More Robust Clinical AI
Beyond dental imaging, prompt-guided synthesis has broader implications for radiology, pathology, and dermatology. The goal is not simply to build accurate models in the lab, but to ensure they remain reliable across diverse clinical environments. Sometimes, improving AI performance is not about collecting more data—but about generating the right data.
References
- Özüdoğru, S., Coşkun, F. Ö., Uysal, F., Küçüktaş, Ü. T., Hardalaç, F., & Kaya, D. I. (2026). Prompt-guided synthetic data generation to mitigate domain shift in periapical lesion detection. PeerJ Computer Science, 12, e3577. DOI: 10.7717/peerj-cs.3577