A Generative AI approach for creating and validating simplified versions of government documents
DOI:
https://doi.org/10.59490/dgo.2025.968Keywords:
Generative AI, Large Language Model, Plain Language, E-GovernmentAbstract
Abstract. In the context of Brazil’s re-democratization and the need for greater transparency in public administration, the 1988 Constitution established the right to access public information. However, the complexity of legal language, particularly in court documents, poses a significant barrier to understanding for the general public, especially given that only about 25% of Brazilians aged 25 or older have completed or are pursuing higher education. This study addresses this issue by leveraging generative AI models to simplify legal texts from the Court of Accounts of Pernambuco into plain language, making them more accessible to individuals with a high school education level. The research evaluates the effectiveness of two Large Language Models (GPT and Gemini) and five prompt techniques (Tree of Thought, COSTAR, Zero Shot, One Shot, and Meta Prompting) in producing simplified versions of 14 preliminary decisions. A total of 140 simplified texts were generated and evaluated using an 18-question questionnaire based on plain language principles, with scores generated by AI models and validated through human review. The results show that Gemini with the Tree of Thought technique achieved the highest average score (67.64), based on responses to the plain language questionnaire, while GPT with the COSTAR technique performed best in preserving essential information and achieving the highest readability scores (Flesch Reading Ease: 55.26). However, omissions of critical information were a common issue across all models, highlighting the need for human oversight. The study also found that GPT outperformed Gemini in evaluation accuracy, with lower Mean Absolute Error (MAE) and Mean Absolute Percentage Error (MAPE) compared to human evaluations. Despite this, AI models tended to overestimate readability and comprehension, underscoring the importance of a hybrid approach that combines AI-generated assessments with human review. The findings demonstrate the potential of generative AI to reduce costs and improve accessibility to legal and governmental documents, while also emphasizing the need for further research to address limitations such as omissions, biases, and ethical considerations. This study contributes to the growing body of literature on AI-assisted text simplification and provides a foundation for future work in this area.
Downloads
References
Alves, A., Miranda, P., Mello, R., & Nascimento, A. (2023). Automatic Simplification of Legal Texts in Portuguese Using Machine Learning. https://doi.org/10.3233/FAIA230975
Alves, K., Santos, E., Silva, M. F., Chaves, A. C., Fernandes, J. A., Valenca, G., & Brito, K. (2024). Towards the regulation of Large Language Models (LLMs) and Generative AI use in the Brazilian Government: the case of a State Court of Accounts. Proceedings of the 17th International Conference on Theory and Practice of Electronic Governance, 28–35. https://doi.org/10.1145/3680127.3680219
Andersen, T. B. (2009). E-Government as an anti-corruption strategy. Information Economics and Policy, 21(3), 201–210. https://doi.org/10.1016/j.infoecopol.2008.11.003
APSC (Australian Public Service Comission). (2023). Australian Government Style Manual. [link]
Araújo, R. (2024, November 18). JuLIA Explica: novo módulo da IA do TJ-PI simplifica o acesso a informações processuais | Tribunal de Justiça do Piauí. [link]
Atricon recomenda que Tribunais de Contas adotem linguagem simples e direito visual – Atricon. (n.d.). Retrieved April 5, 2025, from [link]
Bandeira, G. (2024, March 22). 54,5% dos brasileiros têm formação básica completa, diz IBGE. [link]
Belém, M. (2013). A simplificação da linguagem jur’idica como meio de aproximação do cidadão à justiça. Revista Jur’idica Da Seção Judiciária de Pernambuco, 6, 313–320.
Chen, B., Zhang, Z., Langrené, N., & Zhu, S. (2024). Unleashing the potential of prompt engineering in Large Language Models: a comprehensive review. [link]
Constituição. (n.d.). Retrieved April 15, 2025, from [link]
DeepSeek-AI, Guo, D., Yang, D., Zhang, H., Song, J., Zhang, R., Xu, R., Zhu, Q., Ma, S., Wang, P., Bi, X., Zhang, X., Yu, X., Wu, Y., Wu, Z. F., Gou, Z., Shao, Z., Li, Z., Gao, Z., … Zhang, Z. (2025). DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning. [link]
Devlin, J., Chang, M.-W., Lee, K., Google, K. T., & Language, A. I. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of the 2019 Conference of the North, 4171–4186. https://doi.org/10.18653/V1/N19-1423
Dyer, C., Fairbanks, J., Greiner, M., Barron, K., Skreen, J., Cerrillo-Ramirez, J., Lee, A., & Hinsee, B. (2013). Improving Access to Justice: Plain Language Family Law Court Forms in Washington State. Seattle Journal for Social Justice, 11(3). [link]
Haman, M., & Školník, M. (2024). Using ChatGPT to conduct a literature review. Accountability in Research, 31(8), 1244–1246. https://doi.org/10.1080/08989621.2023.2185514
Heerlijk Helder | Vlaanderen.be. (n.d.). Retrieved January 30, 2025, from [link]
Klarspråk - Språkrådet. (n.d.). Retrieved January 30, 2025, from [link]
Long, J. (2023). Large Language Model Guided Tree-of-Thought. [link]
Marques, N., Silva, R. R., & Bernardino, J. (2024). Using ChatGPT in Software Requirements Engineering: A Comprehensive Review. Future Internet, 16(6), 180. https://doi.org/10.3390/fi16060180
Martínez, P., Ramos, A., & Moreno, L. (2024). Exploring Large Language Models to generate Easy to Read content. Frontiers in Computer Science, 6. https://doi.org/10.3389/fcomp.2024.1394705
Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient Estimation of Word Representations in Vector Space. 1st International Conference on Learning Representations, ICLR 2013 - Workshop Track Proceedings. [link]
OpenAI. (n.d.). GPT-4 Technical Report.
Openai, A. R., Openai, K. N., Openai, T. S., & Openai, I. S. (n.d.). Improving Language Understanding by Generative Pre-Training. Retrieved April 15, 2025, from [link]
Ospina-Henao, V., Flórez, S. L., Núñez, V. J. M., Lamas, Ó. L., & De la Prieta, F. (2024). Generative AI: Simplifying Text for Cognitive Impairments and Non-native Speakers (pp. 33–44). https://doi.org/10.1007/978-3-031-73538-7_4
Papastratis, I., Konstantinidis, D., Daras, P., & Dimitropoulos, K. (2024). AI nutrition recommendation using a deep generative model and ChatGPT. Scientific Reports, 14(1), 14620. https://doi.org/10.1038/s41598-024-65438-x
Petelin, R. (2010). Considering plain language: issues and initiatives. Corporate Communications: An International Journal, 15(2), 205–216. https://doi.org/10.1108/13563281011037964
Peters, M., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., & Zettlemoyer, L. (2018). Deep Contextualized Word Representations. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), 2227–2237. https://doi.org/10.18653/v1/N18-1202
Plain language, accessibility, and inclusive communications - Privy Council Office - Canada.ca. (n.d.). Retrieved January 30, 2025, from [link]
Ray, P. P. (2023). ChatGPT: A comprehensive review on background, applications, key challenges, bias, ethics, limitations and future scope. Internet of Things and Cyber-Physical Systems, 3, 121–154. https://doi.org/10.1016/j.iotcps.2023.04.003
Roedel, P. (2024). Manual de Linguagem Simples (Edições Câmara, Ed.).
Sahoo, P., Singh, A. K., Saha, S., Jain, V., Mondal, S., & Chadha, A. (2024). A Systematic Survey of Prompt Engineering in Large Language Models: Techniques and Applications. [link]
Silva, M., Santos, E., Alves, K., Silva, H., Pedrosa, F., Valença, G., & Brito, K. (2024). Using Generative AI for Simplifying Official Documents in the Public Accounts Domain. Anais Do XII Workshop de Computação Aplicada Em Governo Eletrônico (WCGE 2024), 246–253. https://doi.org/10.5753/wcge.2024.2915
Uso de inteligência artificial aprimora processos internos no Tribunal de Contas da União – Notícias | Portal TCU. (n.d.). Retrieved April 16, 2025, from [link]
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention Is All You Need. Advances in Neural Information Processing Systems, 2017-December, 5999–6009. [link]
Zhang, Y., Yuan, Y., & Yao, A. C.-C. (2024). Meta Prompting for AI Systems. [link]
Downloads
Published
How to Cite
Conference Proceedings Volume
Section
License
Copyright (c) 2025 Karine Alves, Matheus Silva, Edney Santos, George Valença, Kellyton Brito

This work is licensed under a Creative Commons Attribution 4.0 International License.
