A Generative AI approach for creating and validating simplified versions of government documents

Karine Alves; Matheus Silva; Edney Santos; George Valença; Kellyton Brito

doi:10.59490/dgo.2025.968

Authors

Karine Alves Universidade Federal Rural de Pernambuco (UFRPE), Brasil https://orcid.org/0009-0004-9429-4317
Matheus Silva Universidade Federal Rural de Pernambuco (UFRPE), Brasil https://orcid.org/0009-0003-8972-0636
Edney Santos Universidade Federal Rural de Pernambuco (UFRPE), Brasil https://orcid.org/0009-0002-6784-8789
George Valença Universidade Federal Rural de Pernambuco (UFRPE), Brasil https://orcid.org/0000-0001-9375-5354
Kellyton Brito Universidade Federal Rural de Pernambuco (UFRPE), Brasil https://orcid.org/0000-0002-6883-8657

DOI:

https://doi.org/10.59490/dgo.2025.968

Keywords:

Generative AI, Large Language Model, Plain Language, E-Government

Abstract

Abstract. In the context of Brazil’s re-democratization and the need for greater transparency in public administration, the 1988 Constitution established the right to access public information. However, the complexity of legal language, particularly in court documents, poses a significant barrier to understanding for the general public, especially given that only about 25% of Brazilians aged 25 or older have completed or are pursuing higher education. This study addresses this issue by leveraging generative AI models to simplify legal texts from the Court of Accounts of Pernambuco into plain language, making them more accessible to individuals with a high school education level. The research evaluates the effectiveness of two Large Language Models (GPT and Gemini) and five prompt techniques (Tree of Thought, COSTAR, Zero Shot, One Shot, and Meta Prompting) in producing simplified versions of 14 preliminary decisions. A total of 140 simplified texts were generated and evaluated using an 18-question questionnaire based on plain language principles, with scores generated by AI models and validated through human review. The results show that Gemini with the Tree of Thought technique achieved the highest average score (67.64), based on responses to the plain language questionnaire, while GPT with the COSTAR technique performed best in preserving essential information and achieving the highest readability scores (Flesch Reading Ease: 55.26). However, omissions of critical information were a common issue across all models, highlighting the need for human oversight. The study also found that GPT outperformed Gemini in evaluation accuracy, with lower Mean Absolute Error (MAE) and Mean Absolute Percentage Error (MAPE) compared to human evaluations. Despite this, AI models tended to overestimate readability and comprehension, underscoring the importance of a hybrid approach that combines AI-generated assessments with human review. The findings demonstrate the potential of generative AI to reduce costs and improve accessibility to legal and governmental documents, while also emphasizing the need for further research to address limitations such as omissions, biases, and ethical considerations. This study contributes to the growing body of literature on AI-assisted text simplification and provides a foundation for future work in this area.

Downloads

Download data is not yet available.

References

Alves, A., Miranda, P., Mello, R., & Nascimento, A. (2023). Automatic Simplification of Legal Texts in Portuguese Using Machine Learning. https://doi.org/10.3233/FAIA230975

Alves, K., Santos, E., Silva, M. F., Chaves, A. C., Fernandes, J. A., Valenca, G., & Brito, K. (2024). Towards the regulation of Large Language Models (LLMs) and Generative AI use in the Brazilian Government: the case of a State Court of Accounts. Proceedings of the 17th International Conference on Theory and Practice of Electronic Governance, 28–35. https://doi.org/10.1145/3680127.3680219

Andersen, T. B. (2009). E-Government as an anti-corruption strategy. Information Economics and Policy, 21(3), 201–210. https://doi.org/10.1016/j.infoecopol.2008.11.003

APSC (Australian Public Service Comission). (2023). Australian Government Style Manual. [link]

Araújo, R. (2024, November 18). JuLIA Explica: novo módulo da IA do TJ-PI simplifica o acesso a informações processuais | Tribunal de Justiça do Piauí. [link]

Atricon recomenda que Tribunais de Contas adotem linguagem simples e direito visual – Atricon. (n.d.). Retrieved April 5, 2025, from [link]

Bandeira, G. (2024, March 22). 54,5% dos brasileiros têm formação básica completa, diz IBGE. [link]

Belém, M. (2013). A simplificação da linguagem jur’idica como meio de aproximação do cidadão à justiça. Revista Jur’idica Da Seção Judiciária de Pernambuco, 6, 313–320.

Chen, B., Zhang, Z., Langrené, N., & Zhu, S. (2024). Unleashing the potential of prompt engineering in Large Language Models: a comprehensive review. [link]

Constituição. (n.d.). Retrieved April 15, 2025, from [link]

DeepSeek-AI, Guo, D., Yang, D., Zhang, H., Song, J., Zhang, R., Xu, R., Zhu, Q., Ma, S., Wang, P., Bi, X., Zhang, X., Yu, X., Wu, Y., Wu, Z. F., Gou, Z., Shao, Z., Li, Z., Gao, Z., … Zhang, Z. (2025). DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning. [link]

Devlin, J., Chang, M.-W., Lee, K., Google, K. T., & Language, A. I. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of the 2019 Conference of the North, 4171–4186. https://doi.org/10.18653/V1/N19-1423

Dyer, C., Fairbanks, J., Greiner, M., Barron, K., Skreen, J., Cerrillo-Ramirez, J., Lee, A., & Hinsee, B. (2013). Improving Access to Justice: Plain Language Family Law Court Forms in Washington State. Seattle Journal for Social Justice, 11(3). [link]

Haman, M., & Školník, M. (2024). Using ChatGPT to conduct a literature review. Accountability in Research, 31(8), 1244–1246. https://doi.org/10.1080/08989621.2023.2185514

Heerlijk Helder | Vlaanderen.be. (n.d.). Retrieved January 30, 2025, from [link]

Klarspråk - Språkrådet. (n.d.). Retrieved January 30, 2025, from [link]

Long, J. (2023). Large Language Model Guided Tree-of-Thought. [link]

Marques, N., Silva, R. R., & Bernardino, J. (2024). Using ChatGPT in Software Requirements Engineering: A Comprehensive Review. Future Internet, 16(6), 180. https://doi.org/10.3390/fi16060180

Martínez, P., Ramos, A., & Moreno, L. (2024). Exploring Large Language Models to generate Easy to Read content. Frontiers in Computer Science, 6. https://doi.org/10.3389/fcomp.2024.1394705

Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient Estimation of Word Representations in Vector Space. 1st International Conference on Learning Representations, ICLR 2013 - Workshop Track Proceedings. [link]

OpenAI. (n.d.). GPT-4 Technical Report.

Openai, A. R., Openai, K. N., Openai, T. S., & Openai, I. S. (n.d.). Improving Language Understanding by Generative Pre-Training. Retrieved April 15, 2025, from [link]

Ospina-Henao, V., Flórez, S. L., Núñez, V. J. M., Lamas, Ó. L., & De la Prieta, F. (2024). Generative AI: Simplifying Text for Cognitive Impairments and Non-native Speakers (pp. 33–44). https://doi.org/10.1007/978-3-031-73538-7_4

Papastratis, I., Konstantinidis, D., Daras, P., & Dimitropoulos, K. (2024). AI nutrition recommendation using a deep generative model and ChatGPT. Scientific Reports, 14(1), 14620. https://doi.org/10.1038/s41598-024-65438-x

Petelin, R. (2010). Considering plain language: issues and initiatives. Corporate Communications: An International Journal, 15(2), 205–216. https://doi.org/10.1108/13563281011037964

Peters, M., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., & Zettlemoyer, L. (2018). Deep Contextualized Word Representations. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), 2227–2237. https://doi.org/10.18653/v1/N18-1202

Plain language, accessibility, and inclusive communications - Privy Council Office - Canada.ca. (n.d.). Retrieved January 30, 2025, from [link]

Ray, P. P. (2023). ChatGPT: A comprehensive review on background, applications, key challenges, bias, ethics, limitations and future scope. Internet of Things and Cyber-Physical Systems, 3, 121–154. https://doi.org/10.1016/j.iotcps.2023.04.003

Roedel, P. (2024). Manual de Linguagem Simples (Edições Câmara, Ed.).

Sahoo, P., Singh, A. K., Saha, S., Jain, V., Mondal, S., & Chadha, A. (2024). A Systematic Survey of Prompt Engineering in Large Language Models: Techniques and Applications. [link]

Silva, M., Santos, E., Alves, K., Silva, H., Pedrosa, F., Valença, G., & Brito, K. (2024). Using Generative AI for Simplifying Official Documents in the Public Accounts Domain. Anais Do XII Workshop de Computação Aplicada Em Governo Eletrônico (WCGE 2024), 246–253. https://doi.org/10.5753/wcge.2024.2915

Uso de inteligência artificial aprimora processos internos no Tribunal de Contas da União – Notícias | Portal TCU. (n.d.). Retrieved April 16, 2025, from [link]

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention Is All You Need. Advances in Neural Information Processing Systems, 2017-December, 5999–6009. [link]

Zhang, Y., Yuan, Y., & Yao, A. C.-C. (2024). Meta Prompting for AI Systems. [link]

A Generative AI approach for creating and validating simplified versions of government documents

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

How to Cite

Conference Proceedings Volume

Section

License

Programme dg.o

Browse per article type:

Search:

Questions?