

1 Rudra Dhar, Karthik Vaidhyanathan, Vasudeva Varma, Can LLMs Generate Architectural Design Decisions? – An Exploratory Empirical study, IEEE ICSA 2024 2 Shrikara Arun, Meghana Tedla, Karthik Vaidhyanathan, LLMs for Generation of Architectural Components: An Exploratory Empirical Study in the Serverless World, IEEE ICSA 2025 3 Bassam Adnan, Sathvika Miryala, Aneesh Sambu, Karthik Vaidhyanathan, Martina De Sanctis, Romina Spalazzese, Leveraging LLMs for Dynamic IoT Systems Generation through Mixed-Initiative Interaction, IEEE ICSA 2025 Companion 4 Raghav Donakanti, Prakhar Jain, Shubham Kulkarni, Karthik Vaidhyanathan, Reimagining self-adaptation in the age of large language models, IEEE ICSA 2024 Companion
To the Land: Engineering an Agentic AI Framework The challenges for engineering AI systems became more evident for us during my postdoc research (2020) on a project which resulted in a work where we listed the challenges in architecting ML-enabled systems5. It then also became a starting point for a workshop with some of our collaborators on software architecture and machine learning (SAML)6. Further, we also did a Dagstuhl seminar on this topic7. ML-enabled systems face different types of challenges which is even higher in the case of LLM systems. This is where the land part comes in. About 2 years ago, we began collaborating with MontyCloud, a CloudOps startup that leverages autonomous bots to manage cloud compliance, security, and continuous operations. When they created an autonomous copilot named Marvin8, it functioned as a conversational AI agent allowing users to interact with the platform while simultaneously simplifying everyday cloud operations. In addition to this, embodying its inherent intelligence, the copilot also performed automatic checks, generating actionable reports for cloud users. Very soon we faced some important challenges related to maintainability, extensibility, and data management. It was then that we realised the need for an agentic approach. We thought if multiple agents, each with its own knowledge can collaborate to achieve a specific functionality, we could overcome a number of limitations of our existing system primarily stemming from the complexity of managing systems on the cloud such as diverse data sources, orchestration of multiple processes and handling complex workflows to automate routine tasks. We took inspiration from the principles of Domain Driven Design (DDD) by starting to think of agents around various domains within the larger domain of CloudOps. The next step was about realizing the multi-agent system. There were a lot of existing frameworks like Langraph, crew.ai, autogen, etc. However, it did not work for us since they were all monolithic in nature. Moreover, it did not allow us the flexibility of using particular platforms to power certain agents. We wanted to go a step further where we could decide which agent could be built using which framework. That’s how along with the MontyCloud team, we created Meta Orchestrator of Your Agents (MOYA)9 – where we orchestrate how different agents can be built with different technologies. This also resulted in a research publication at CAIN 2025 (co-located with ICSE 2025) and was a candidate for the best paper award. Although MOYA came out of our efforts in developing an agentic approach to autonomous CloudOps, the framework itself is a generic framework that can be applied to any use case. To further validate this, we also conducted a hackathon at IIIT-H, Hack-IIIT in collaboration with the Open Source Developers Group (OSDG). More than 100 students participated in the hackathon with teams building agentic AI applications for a wide variety of use cases ranging from meme generators to framework enhancements to games. This further enforced and validated the capability of MOYA. One of the major feedbacks we received was on the simplicity of the framework which was in line with our goals of building the framework in the first place. We also received a lot of constructive feedback which has allowed us to further enhance MOYA.5 Henry Muccini, Karthik Vaidhyanathan, Software Architecture for ML-based Systems: What Exists and What Lies Ahead, WAIN 2021@ICSE 2021 6 https://sa-ml.github.io/saml2025/ 7 https://www.dagstuhl.de/seminars/seminar-calendar/seminar-details/23302 8 https://blogs.iiit.ac.in/montycloud 9 https://github.com/montycloud/moya
Onward and Forward Agentic AI is shifting the way we think about building software systems or services. As Agentic AI continues to gain traction, a few guiding principles are emerging: The big thing that the community needs to think about is what is an agent and when an agent is required. It’s not about converting all existing services to Agents. Some of the existing services (APIs) may eventually become tools for agents to leverage. Thinking in agents is a skill that needs to emerge. Not every time we need agents. For use cases like chatbot on document, all we need are LLMs that leverage RAG on the documents. Sometimes what we need is well orchestrated flows where one agent calls a tool and so on. The real power of agents comes in scenarios where we need dynamic behavior, in scenarios where agents have to communicate back and forth to achieve a task. Rather than building centralized models or rule-based systems, developers may now begin to view software as a collaboration of intelligent, goal-driven agents – each equipped with tools, memory, and autonomy. This mindset echoes the evolution that microservices brought to system architecture but elevates it by incorporating proactive behavior, situational awareness, and dynamic learning. This transition is especially significant for SaaS platforms. Traditional SaaS architectures are being challenged by increasing demands for autonomy, personalization, and continuous adaptation. Its not that SaaS is dead but the way we do SaaS will probably change. Agentic AI offers a new model where SaaS can be seen more as a composition of intelligent agents rather than static services. These agentic systems can better manage complexity, provide dynamic responses, and integrate seamlessly across workflows. Looking ahead, the software engineering research and practitioner community will need to adopt/adapt new/existing software engineering practices, including memory management for agents, enhancing reliability and robustness, sustainability, standardized interaction protocols, observability for autonomous workflows, developing responsible AI guidelines and evaluation frameworks for agentic behavior. While this is the case, we also need to rethink whether we need to have the “large” LLMs to power agents, instead can we think of domain specific Small Language Models (SLMs). While this is the case on the other hand, there is also a lot of emerging support for improving existing SE practices and processes like software design, deployment, maintenance or even potentially for migrating legacy to modern software systems. Having said that, shifting legacy APIs to make it more agent or LLM-friendly is a task by itself. But we also need to be aware so that we don’t reinvent the wheel. Many of the best practices in building service-oriented systems can be reused/adapted to the agentic AI systems. These are not just technical challenges, but cultural ones as well – requiring software architects, developers, and AI practitioners to collaborate more deeply than ever before.
July 2025

