Site Reliability Engineer – Python/Kubernetes/Openshift-REMOTE

  • Type Régie
  • BUDGET Tarif selon profil
  • Durée (mois) 6
  • Pays Royaume-Uni
  • Remote NON
  • Offres0
  • Moyenne Tarif selon profil
Réalisez votre mission en étant porté chez
Gagnez 940,43 net / mois En savoir plus

Publiée le 5 septembre 2023

Active

Description de la mission

Site Reliability Engineer – Python/Java/Kubernetes/Openshift sought by leading investment bank based 100% remote.

**Inside IR35**

The Role

We are seeking a skilled Site Reliability Engineer (SRE) to join our growing product team. The ideal candidate is passionate about ensure the stability, reliability and performance of our web applications and services. This role requires someone with a background in software development and engineering, experience working in product teams and a deep understanding of site reliability engineering principles. You will be responsible for maintaining and improving the reliability of our systems while collaborating with our product, development, and operations teams to optimize the user experience.

Responsibilities
Implement, maintain and improve monitoring, logging and alerting systems to ensure high levels of system reliability, availability, and performance in accordance with our Service Level Objectives (SLOs)
Collaborate with product and engineering teams to design, build and maintain scalable and reliable web applications and services
Analyse system performance and proactively identify areas for improvement, optimisation, and capacity planning
Develop and maintain automation tools and frameworks for deploying, managing, and monitoring infrastructure and applications
Participate in incident management, root cause analysis and resolution processes, and implement corrective measure to prevent recurrence
Advocate for and incorporate the best practices in site reliability engineering, including CI/CD, infrastructure as code and automated testing
Stay up to date with underlying infrastructure services and products offered by partner teams and represent our team in the improvement and adoption of those services

Experience:
Demonstrated experience as a Site Reliability Engineer, DevOps Engineer, or similar role
Strong coding skills in at least one programming language (We use Python, JavaScript, and Go within our team, and lots of Java throughout the bank)
Experience working in product teams and collaborating with developers, product managers and other stakeholders
Solid understanding of site reliability engineering principles, including monitoring, alerting, incident management and root cause analysis
Hands-on experience with logging and alerting systems such as ELK Stack (Elasticsearch, Logstash, Kibana), Grafana, Prometheus or Splunk
Experience with Red Hat OpenShift, Kubernetes, and container orchestration, and their as-a-code management tools such as Helm, kubectl etc.
Familiarity with database and data technologies (We use MongoDB and Kafka)
Proficiency with CI/CD processes, tools, and best practices

Please apply within for further details or call on 07393149627
Alex Reeder
Harvey Nash Finance & Banking

Compétences Techniques Requises

AnalyseJavakibana

Compétences Fonctionnelles Requises

DevOpsfinancePrometheus

À propos du Donneur d'ordres

Frédérique
14829 mission(s) publiée(s) 0 deal(s) gangné(s)
FREELANCER BIDDING (0)

Il n'y a pas d'offres.