Automation testing framework for reliable autonomous agentic AI

Martin Grant, Joanna Isabelle Olszewska*

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

    Abstract

    With the rise of autonomous systems and agentic AI, greater automation of testing processes is required to build, deploy, or repair reliable intelligent systems. For this purpose, we developed a framework that allows any intelligent tester, whether human or agent, to automate test scripts for the applications they develop and that provides detailed results of the execution of the users’ automated test cases. This framework enables testers to create and replay automated test scripts quickly, so automation can be used in short development projects by unskilled (manual or agentic) testers. Furthermore, the framework aims to solve the problem of performing secure regression testing on applications before each new release candidate and every time an intelligent developer, human or artificial, changes the program’s code, a task that can take up large amounts of testing time in a software development project and over the life cycle of the product. Thus, this automated testing framework reduces the time needed to carry out testing, and hence its cost, while increasing testing quality and system reliability. The first prototype of this framework was developed successfully and tested thoroughly, and its deployment in a real-world context showed promising results, paving the way for reliable agentic AI systems.
    Original language: English
    Title of host publication: IEEE International Conference on Engineering Reliable Autonomous Systems (ERAS 2025)
    Publisher: IEEE
    Number of pages: 8
    Publication status: Accepted/In press - 31 Mar 2025
