Blog Layout

How genAI can aid Kubernetes troubleshooting

Bill Doerrfeld | December 23, 2024

"If you use a generic AI model to investigate these issues, it will simply fail you," says Itiel Schwartz, co-founder and CTO at Komodor.

Before everyone’s really checked out for the holidays, here’s a quick read for you: I just published a short piece for InfoWorld exploring how generative AI could help troubleshoot Kubernetes issues. Spoiler: it’s all about the training datasets.


I caught up with CTO Itiel Shwartz about leveraging finely tuned models like Komodor’s KlaudiaAI agent for ultra-specific DevOps challenges, like Kubernetes error diagnosis and remediation challenges. Schwartz didn’t hold back: "If you use a generic AI model to investigate these issues, it will simply fail you. We tried it time after time." According to him, the more specific the model is, the less likely it is to hallucinate for these particular use cases.


The article also touches on how major cloud players and industry leaders are stepping into this space — so whether you're in DevOps or just curious about AI in Kubernetes management, it's worth a look.


✨ Also, I’m working on a bigger feature about Kubernetes usability, sparked by some fantastic conversations at KubeCon NA. That’s coming in the new year, so stay tuned!


Featured image credit: Growtika on Unsplash.


Read: How generative AI could aid Kubernetes operations
Study reveals growing technical debt in AI age
By Bill Doerrfeld February 19, 2025
The 2nd annual code quality report from GitClear found 10x more duplicated code than two years ago and fewer signs of code reuse than ever before.
Kubernetes usability InfoWorld pilot cockpit Doerrfeld
By Bill Doerrfeld February 10, 2025
My feature on InfoWorld explores the state of Kubernetes usability, highlighting various advancements across workload types, support for edge and AI, and new features like observability and security.
Carving out time for large-scale engineering chores
By Bill Doerrfeld January 31, 2025
It's up to leadership to help prioritize large-scale engineering updates that keep software running smoothly. Kent Wills, Director of Engineering at Yelp, provides insight on the latest DirectorPlus.
5 potential use cases for Arazzo
By Bill Doerrfeld January 30, 2025
Italian for “tapestry,” Arazzo is aptly named since it can be used to weave together sequences of API calls to illustrate a specific business pattern.
Framing AI in the right light
By Bill Doerrfeld December 20, 2024
Rolling out AI is all about framing it in a positive light, says GitLab's CTO, Sabrina Farmer. I interviewed her for the latest edition of DirectorPlus for LeadDev, which is out today.
What's on the top of CIOs' minds lately? Resilience.
By Bill Doerrfeld December 19, 2024
Surmounting risks are encouraging CIOs to future-proof and update their resilience strategies. My latest feature for CIO.com explores resilience head-on.
DX Core 4 unifies developer productivity frameworks
By Bill Doerrfeld December 10, 2024
Today, DX debuted a new developer productivity framework, DX Core 4. Excited to break the news with LeadDev, interviewing one of the designers of the framework.
Migrating to microservices at MACH-speed
By Bill Doerrfeld December 2, 2024
In this issue of DirectorPlus, Gus Fune shares the story behind reverse engineering a monolithic e-commerce platform following MACH principles, which stands for microservices, APIs, cloud-native, and headless.
How spec-first API documentation aids partner integration
By Bill Doerrfeld November 29, 2024
Having good API documentation is one thing. Being specification-first is next level. Here are the benefits a specification-driven, git-based approach to documentation can bring to partner API integrations.
Understanding the root causes of api drift
By Bill Doerrfeld November 20, 2024
75% of APIs stray from their OpenAPI specifications. My feature explores why API drift is so common and how engineering leaders should mitigate this issue.
More Posts
Share by: