Senior DevOps Engineer

23 Oct 2025
Apply

Ingram Barge is seeking a Senior DevOps Engineer to join our dynamic DevSecOps team.  This person will work alongside our Systems Architect, Application Development Architect, and Security Engineer and focuses on operationalizing our cloud-native infrastructure, enhancing CI/CD pipelines, ensuring system reliability and resilience, and providing 24x7 operational support.What you will be doing:Pipeline & AutomationDesigning and implementing advanced CI/CD pipeline features using GitLabDeveloping and maintaining Terraform modules for infrastructure provisioningCreating and optimizing Ansible playbooks for configuration management and deployment automationIntegrating security scanning and compliance checks into deployment pipelinesContainer & Kubernetes OperationsBuilding, configuring, and maintaining Azure Kubernetes Service (AKS) clustersDeveloping and optimizing Helm charts for application deploymentsImplementing and managing GitOps workflowsMonitoring and troubleshooting containerized applications and cluster performanceInfrastructure & ReliabilityImplementing Infrastructure as Code best practices using Terraform and AnsibleDesigning and executing disaster recovery procedures and business continuity plansPerforming system patching, upgrades, and maintenance activitiesEstablishing and maintaining comprehensive monitoring, alerting, and observability solutions using Prometheus and GrafanaCost Optimization & Resource ManagementMonitoring and analyzing Azure cloud spending patterns and resource utilizationImplementing cost optimization strategies including right-sizing, reserved instances, and auto-scaling policiesDeveloping dashboards and reports for cost tracking and forecastingCollaborating with teams to optimize resource allocation and eliminating wasteMonitoring & ObservabilityDesigning and implementing comprehensive monitoring solutions using Prometheus for metrics collectionBuilding and maintaining Grafana dashboards for infrastructure, application, and business metricsConfiguring intelligent alerting rules and escalation proceduresEstablishing SLIs, SLOs, and error budgets for critical services24x7 Support & Incident ResponseParticipating in on-call rotation for 24x7 production supportLeading Tier 3 incident response efforts for production outages and system issuesPerforming root cause analysis and implementing preventive measuresCollaborating with development teams on performance optimization and troubleshootingMaintaining runbooks and documentation for operational procedures

  • ID: #54709258
  • State: Kentucky Paducah 42001 Paducah USA
  • City: Paducah
  • Salary: USD TBD TBD
  • Job type: Full-time
  • Showed: 2025-10-23
  • Deadline: 2025-12-22
  • Category: Et cetera
Apply