Deploying a general-purpose LLM for clinical tasks introduces critical risks:
- High hallucination rates generating false medical information.
- Poor comprehension of clinical jargon, ontologies (SNOMED CT, ICD-10), and nuanced patient narratives.
- Inadequate safety guardrails for high-stakes decision support.
Generic models are trained on public internet data, not curated medical evidence, making them fundamentally unfit for clinical use.




