Human-Friendly Observability with Generative AI
Generative AI Enhancing Observability
Preface
Imagine building a complex web application composed of many microservices working together. It is a SaaS-native application with many components that can fail. To deliver 99.99% uptime, every component and service must be up and running at any given point in time.
Sounds Challenging?
It is indeed. Now let’s put ourselves in the shoes of a user of this web application who tries to edit some data in a table shown in a widget on the screen, and the edit fails. There are two questions to answer here.
- How does the application operations team learn that something is going wrong, so that it can take a preventive or proactive step to correct the state of the system?
- Now that the error has happened, how is the user told what went wrong and why, and what they can do next to recover from it?
This problem is not new, and we already use many sophisticated mechanisms to answer both of these questions. In this article, let us see how generative AI can help us answer them better.