DevOps Engineer
<div class="show-more-less-html__markup show-more-less-html__markup--clamp-after-5 relative overflow-hidden"> <p><strong>Role : Devops Engineer </strong></p><p><br/></p><p><strong>KEY RESPONSIBILITIES</strong></p><p><br/></p><ul><li>Design, deploy, and manage production-grade AWS cloud infrastructure including EKS, EC2, RDS, Lambda, and Step Functions</li><li>Manage multi-tenant Kubernetes clusters with a focus on autoscaling, workload isolation, network policies (Calico), and policy governance (Kyverno)</li><li>Implement and maintain Infrastructure as Code using Terraform with reusable, version-controlled modules across multiple environments</li><li>Build, own, and continuously improve CI/CD pipelines using GitLab CI and ArgoCD, following GitOps declarative deployment principles</li><li>Establish and maintain end-to-end observability using Prometheus, Grafana, Loki, Thanos, Datadog, and Dynatrace for real-time alerting and performance insights</li><li>Configure and enforce secure AWS networking (VPC, VPN, NAT, Transit Gateway) and implement IAM, WAF, and KMS security governance</li><li>Manage Linux-based environments (Amazon Linux, RHEL) including system configuration, networking, and automated patching via Ansible</li><li>Lead cost optimization initiatives through right-sizing, autoscaling policy design, and resource utilisation analysis</li><li>Perform troubleshooting and root cause analysis for production incidents, ensuring rapid resolution with minimal service impact</li><li>Contribute to security observability initiatives including SIEM integration (Wazuh or equivalent) and LLM-enabled operational tooling where applicable</li><li>Collaborate closely with program delivery teams, solution architects, and SITA stakeholders to align infrastructure with program objectives</li></ul><p><br/></p><p><strong>TECHNICAL SKILLS & REQUIREMENTS</strong></p><p><br/></p><p><strong>Skill Area</strong></p><p><strong>Required Proficiency</strong></p><p><strong>Kubernetes / EKS : </strong>Production-grade multi-tenant cluster management; autoscaling, Calico network policies, Kyverno policy enforcement, Karpenter node provisioning</p><p><strong>AWS Cloud Services: </strong>EKS, EC2, RDS, Lambda, Step Functions, Route53, CloudFront, WAF, KMS, IAM — hands-on deployment and governance</p><p><strong>Terraform (IaC): </strong>Reusable module design, multi-environment state management, automated provisioning pipelines</p><p><strong>CI/CD & GitOps: </strong>GitLab CI, ArgoCD declarative deployments, AWS CodeBuild/CodeDeploy; GitOps workflow ownership end-to-end</p><p><strong>Monitoring & Observability: </strong>Prometheus, Grafana, Loki, Thanos, CloudWatch, Datadog, Dynatrace — full-stack observability and alerting</p><p><strong>Networking & Security: </strong>VPC, VPN, NAT Gateway, Transit Gateway configuration; IAM, WAF, KMS governance; Wazuh SIEM or equivalent</p><p><strong>Linux Administration: </strong>Amazon Linux, RHEL/CentOS — system configuration, networking (ufw/firewalld), scripting</p><p><strong>Scripting & Automation: </strong>Bash scripting for system administration; Ansible for configuration management and patching</p><p><strong>Storage: </strong>Rook CEPH or equivalent distributed storage; persistent volume management for cloud-native workloads</p><p><strong>AI/LLM Integration: </strong>Desirable: on-prem or cloud LLM deployment (OpenShift/Kubernetes), prompt engineering, AI-enabled workflows</p><p><br/></p><p><strong>ESSENTIAL REQUIREMENTS</strong></p><ul><li>Minimum 4 years of hands-on DevOps / Cloud Infrastructure engineering experience in production environments</li><li>Demonstrated experience managing production EKS clusters with Terraform IaC — portfolio evidence or verifiable project history required</li><li>Proficiency in GitOps methodology with ArgoCD and GitLab CI as primary toolchain</li><li>Strong AWS services knowledge across compute, networking, security, and serverless (EKS, EC2, RDS, Lambda, IAM, WAF, KMS, Route53)</li><li>Full observability stack experience: Prometheus + Grafana + Loki minimum; Datadog or Dynatrace highly preferred</li><li>Linux administration proficiency (Amazon Linux / RHEL) with Bash scripting and Ansible automation</li></ul><p><br/></p><p><strong>DESIRABLE / NICE-TO-HAVE</strong></p><ul><li>Experience with LLM/AI platform deployment on Kubernetes or OpenShift (on-prem or cloud)</li><li>Familiarity with Wazuh SIEM, Sysdig, or equivalent security event monitoring tooling</li><li>Experience with Rook CEPH or distributed storage management for persistent workloads</li><li>Serverless migration experience (on-prem to AWS Lambda / Aurora / Step Functions)</li><li>Karpenter node provisioner experience for cost-efficient EKS autoscaling</li><li>Aviation or travel industry domain exposure (not mandatory)</li></ul><p><br/></p><p><strong>EDUCATION & QUALIFICATIONS</strong></p><p>A Bachelor's or Master's degree in Computer Science, Software Engineering, or a related technical discipline is preferred.</p><p>Relevant certifications will strengthen an application:</p><ul><li>AWS Certified DevOps Engineer – Professional or AWS Certified Solutions Architect</li><li>Certified Kubernetes Administrator (CKA) or Certified Kubernetes Application Developer (CKAD)</li><li>HashiCorp Terraform Associate or Professional</li></ul><p></p> </div>