Automating Test Design Using LLM

Results from an Empirical Study on the Public Sector

Authors

  • Artur Raffael Cavalcanti Computer Science Department, Federal Rural University of Pernambuco, Brazil https://orcid.org/0009-0005-9079-766X
  • Luan Accioly Computer Science Department, Federal Rural University of Pernambuco, Brazil
  • George Valença Computer Science Department, Federal Rural University of Pernambuco, Brazil https://orcid.org/0000-0001-9375-5354
  • Sidney C. Nogueira Computer Science Department, Federal Rural University of Pernambuco, Brazil https://orcid.org/0000-0002-8817-5029
  • Ana Carolina Morais Tribunal de Contas do Estado de Pernambuco, Brazil
  • Antônio Oliveira Tribunal de Contas do Estado de Pernambuco, Brazil
  • Sérgio Gomes Tribunal de Contas do Estado de Pernambuco, Brazil

DOI:

https://doi.org/10.59490/dgo.2025.1025

Keywords:

automated test design, LLM, user story, system test cases

Abstract

An efficient test process can detect failures earlier in software development, contributing to the quality of software produced by governmental entities and bringing the potential to improve government service delivery. Due to the high cost and the reduced resources available, the automation of the test activities plays a strategic role in improving testing efficiency. Designing test cases from user stories is a common approach to assessing the quality of a system under testing. This paper reports on the research, implementation, and evaluation of a tool that automatically generates system test designs from user stories with the support of Generative Pre-trained Transformer 4 (GPT-4) in the context of a public sector organization. The tool has been conceived to match the needs and particularities of the organization’s test process. Such a tool reads user stories from the Redmine tool, interacts with GPT-4 using a prompt that outputs test cases, and stores the automatically designed tests in the Squash TM test management tool. Organization test analysts stated that the tool produces good quality tests and reduces the effort to create tests. As a consequence, analysts can put more energy into other activities related to testing. Moreover, comparing the tests designed manually by test analysts with the tests designed by the tool shows that both have the same functional coverage. The paper discusses the impacts of the approach in the process, limitations, and related and future work.

Downloads

Download data is not yet available.

Downloads

Additional Files

Published

2025-05-23

How to Cite

Cavalcanti, A. R., Accioly, L., Valença, G., Nogueira, S. C., Morais, A. C., Oliveira, A., & Gomes, S. (2025). Automating Test Design Using LLM: Results from an Empirical Study on the Public Sector. Conference on Digital Government Research, 1. https://doi.org/10.59490/dgo.2025.1025