Λέξεις-κλειδιά

*Ψάχνει στους τίτλους των αγγελιών εργασίας και στους πρώτους 150 χαρακτήρες.

Περιοχή

Εταιρία:
Sportion D.S.
Ημερομηνία Δημοσίευσης:
13-10-21
Τύπος απασχόλησης:
Πλήρης
Κωδικός θέσης εργασίας:
Κατηγορία εργασίας:
Πληροφορική
Περιοχή εργασίας:
Δήμος Αγίας Παρασκευής

Principal Site Reliability Engineer

Sportion Develoment Services, a B2B online software development services provider offering leading-edge solutions in various industries with a particular focus in the betting industry, is currently looking for a Principal SRE to join our team in Athens, Greece.

Role description

  • Analyze, design, implement and run new system and application improvement initiatives within SRE and the wider application- and system-configuration areas
  • Manage service stability in a highly transactional environment where loss of service in busy periods has substantial commercial impact
  • Stay on top of capacity control and planning activities ahead of high visibility sporting events
  • Mentoring other squads in infrastructure capabilities, operational readiness and SRE-ready services
  • Collaborate with other group leaders, product owners and software architects to plan, design, develop and maintain company-wide processes, to improve and maintain stable environment
  • Commitment to high-quality solutions on-time
  • Continuously improve Internal SRE process management
  • Technical leadership of complex, cross-company projects to deliver operational improvements
  • Provide guidance and teaching to other operational staff
  • Motivate the team members and peers to drive service and product quality improvements by implementing company and business best practices
  • Form part of the on-call team, supporting a broad range of systems
  • Participate in incident analysis processes

Skills expected

  • Experience in a subset of the key technologies used: java, php, node.js, ruby, oracle, mysql, mssql, redis, rabbitMQ, nginx, haproxy, varnish, grafana, appdynamics, influx, ELK
  • Ability to understand and tune system components’ most critical configuration parameters
  • Ability to understand code and propose candidate fixes or performance improvements
  • Ability to define and monitor meaningful application performance KPIs
  • Experience in the definition and implementation of realistic application health statuses
  • Worked extensively with advanced cloud architectures (GCP, AWS, Oracle Cloud)
  • Worked extensively with containerisation technologies (Kubernetes, Docker)
  • Extensive experience in application and infrastructure security
  • Years of Linux/*nix engineering and administration
  • Involvement in incident handling, troubleshooting and pragmatic operational decisions
  • Monitoring and alerting expertise (Elk, Graphite, Icinga, AppDynamics)
  • Strong capability in a few programming languages (Nextjs, Java, Typescript are useful)
  • Strong capability in a few IaC languages (Terraform, Ansible are useful)

[Για να στείλετε το βιογραφικό σας, κάντε κλικ εδώ.]

Copyright ©2016-2021 skywalker.gr