Welcome to the OpsMx blog
How to Ship Better Software Faster using OpsMx and AppDynamics
Introduction Most software organizations run CI/CD pipelines to deploy updates from development to production. This CI/CD process is a complex set of interconnected steps that deploy software to the target platforms. Each application typically generates a large amount of performance data through log entries during testing. On a human level, this data is complex, unstructured, […]
Introduction to Site Reliability Engineering
Site Reliability Engineering (SRE) is becoming a common term in software and Internet industries. Google Vice President of Engineering Ben Treynor Sloss, who coined the term, explained that “SRE is what happens when you ask a software engineer to design an operations team.” Since then, SRE has become a core component of the IT strategies […]
3 Options for Managing and Maintaining Spinnaker
Open source software is mainstream. Most businesses today have either already adopted open-source software or are thinking to adopt one. In fact, a survey found that “89% of businesses run open-source software internally, while 65% contribute to open-source software projects.“ Spinnaker is a widely deployed open-source Continuous Delivery (CD) platform adopted by thousands of enterprises. It was […]
Automating Verification of Spinnaker Deployments with OpsMx
Introduction DevOps methodologies have evolved over the years. And so have the deployments. Today, as businesses are increasingly transitioning to multi-cloud or cloud-first strategies, deployments are far more automated as well as complex than they used to be a decade ago. Additionally, the increasing demand for microservices requires businesses to use tools like Kubernetes. This […]
What is Chaos Engineering?
Introduction The world of server infrastructure has become complex. Microservices and distributed cloud architectures have contributed to this complexity. With the rise in complexity, so has risen the number of failures. And operating in such complex environments is challenging. Production failures impact both businesses and customers. Modern organizations have implemented SRE to ensure these outages […]
Self Tuning – Part 3. Using Prometheus metrics to patch the Kubernetes resource limits/requests
Problem Introduction: In Part 1, We have seen how to automate the process of increasing the limits/requests for the pods memory/cpu using the webhook alerts feature of alert manager and webhook triggered pipelines of spinnaker. In Part 2, , it was shown how to use verticalpodautoscaler (VPA) to get the recommendations for pod memory and cpu requests as […]
Self Tuning – Part 2. Using Vertical Pod Autoscaler to patch the Kubernetes resource limits/requests
Problem Introduction: In Part 1, We have seen how to automate the process of increasing the limits/requests for the pods memory/cpu using the webhook alerts feature of alert manager and webhook triggered pipelines of spinnaker. But what value should we patch? How to decide on the optimal values for cpu/memory as these values varies from application […]
What Is Infrastructure as Code (IaC)?
Introduction The last decade has seen a radical shift in managing IT infrastructure. As a junior engineer, I worked as a UNIX and LINUX administrator. We had servers ranging from Large Solaris Frames to smaller outdated HPUX servers. At the end of my junior engineer tenure, we migrated to VMs. Managing VMs was easy, but […]
How to deploy applications safely with Argo and Spinnaker
Businesses have rapidly embraced GitOps as a standard Kubernetes deployment strategy as it helps them deploy applications in the shortest possible time. This is enabled by Argo, the most popular GitOps tool for rapid deployments. Argo Rollout, a reliable Progressive Delivery tool for Kubernetes, provides deployment strategies like blue-green and canary deployments. Thus, with GitOps, […]
Self Tuning – Part 1. Using Spinnaker and Prometheus alert manager webhooks to adjust pod memory usage
Problem Introduction: Kubernetes does a good job in self healing and application recovery from failure. New pods come up in the place of pods that crash. One reason for pods failing in kubernetes cluster is the memory consumed exceeding the limit set. In this case, kubernetes pods are OOM killed ( out of memory) and […]
Is Jenkins enough for Continuous Delivery?
Why do you need to modernize Jenkins pipelines with CD Automation? If there’s one Continuous Integration (CI) tool that software delivery teams swear by, it is undoubtedly Jenkins. Jenkins has been an indispensable CI tool for DevOps for over a decade. Jenkins is a flexible, open-source CI tool that software delivery teams use to build, […]