Lead Site Reliability Engineering

Rodeo FX

Views: 182

Updated: 12-06-2024

Location: Montreal, Québec

Category: Architecture/Interior Design

Industry: Media Communication

Job Details

Company Description


As a company founded and run by an artist, it’s our mission to provide talented artists with the freedom and resources they need to deliver quality work and to thrive in a fun and creative environment.


Job Description


Under the supervision of the Global Head of IT, the Site Reliability Engineering Lead is responsible for improving the reliability of systems in production and for helping IT, support and development teams spend less time on support escalations, giving them more time to build new features and services.

KEY RESPONSIBILITIES:

  • Building software to help operations and support teams;
  • Proactively building and implementing services to help IT and support;
  • Fixing support escalation cases;
  • Being a source of knowledge and helping route issues to the IT and support teams;
  • Optimizing on-call rotations and processes;
  • Building a body of knowledge by documenting software development, support, IT operations and on-call duties;
  • Building a strong, productive team, and inspiring and motivating performance;
  • Actively monitoring the work climate in their team, taking steps to maintain a positive work environment and adherence to company values.

Qualifications


QUALIFICATIONS & EXPERIENCE:

  • Bachelor's or Master's degree in Computer Science, Computer Engineering, Software Engineering or equivalent
  • 7 years of experience as a developer and in a production environment
  • Passionate about systems, security, automation and performance
  • Experience managing and supporting a large-scale, high-throughput system (requests per second)
  • Experience building and maintaining a complete system and multiple environments using Infrastructure as Code (IaC)
  • Experience in team management
  • English – Spoken & Written, French – Spoken

TECHNOLOGY:

  • Stack:
    • Virtualization: Kubernetes/OpenShift (on-prem), Docker, VMware
    • Automation and CI/CD: Ansible, AWX (Tower), Salt, Jenkins
    • Monitoring: Consul, Prometheus, Grafana, Datadog
    • Data: PostgreSQL, Elasticsearch (ELK stack), InfluxDB
  • Coding: Python (2 & 3), Bash, Go, JavaScript, SQL
  • GCP/GKE (cloud): Terraform, cloud networking
  • Tools: Git, Atlassian suite, Nexus Repository, G Suite
  • Experience scoping and deploying on-premises or hybrid HA architectures for various systems
  • Experience implementing IaC solutions connecting apps/systems together
  • Familiar with networking/application security best practices and pitfalls
  • Experience implementing caching solutions to improve system reliability and performance (Redis, Memcached, Logstash, Avere, etc.)

PROFILE & SKILLS

  • Accountability;
  • Adaptability;
  • Attention To Detail;
  • Creativity;
  • Focus on Quality;
  • Initiative;
  • Insight;
  • Planning and Organizing;
  • Problem Analysis;
  • Results Orientation;
  • Stress Management.

Additional Information
  • Full-time, Permanent Contract;
  • 5 paid sick days;
  • 2 additional statutory holidays during the winter holiday period;
  • Group Insurance, access to Dialogue online support and to an Employee Assistance Program (EAP);
  • RRSP with employer contribution;
  • Discounts with Bixi, Bota Bota Spa, Nautilus Plus and many more local businesses.

Diversity is a core value at Rodeo FX. We are passionate about building and sustaining an inclusive and equitable work environment where diversity is celebrated and valued. We believe every member of our team enriches our work by exposing us to a broad range of ways to perceive and interact with the world, identify challenges, and design and deliver projects.

Deadline: 27-07-2024