Risk-Based Test Framework for LLM Features in Regulated Software

Authors

Zhiyin Zhou, New York, USA

Abstract

Large language models are increasingly embedded in regulated and safety-critical software, including clinical research platforms and healthcare information systems. While these features enable natural language search, summarization, and configuration assistance, they introduce risks such as hallucinations, harmful or out-of-scope advice, privacy and security issues, bias, instability under change, and adversarial misuse. Prior work on machine learning testing and AI assurance offers useful concepts but limited guidance for interactive, product-embedded assistants. This paper proposes a risk-based testing framework for LLM features in regulated software. The framework comprises a six-category risk taxonomy, a layered test strategy that maps risks to concrete tests across guardrail, orchestration, and system layers, and a case study applying the approach to a Knowledgebase assistant in a clinical research platform.

Keywords

Large language models, software testing, regulated software, healthcare, risk-based testing, safety assurance, red teaming, regression testing

Volume 16, Number 2