Basic Information

Ref Number

Req_00092084

Primary Location

Tampere - Main Office

Additional Locations

Home Office - Poland, Home Office - Spain, Infinity Tower

Country

Finland

Job Type

Support Positions

Work Style

Remote

Description and Requirements


As a Lead SRE, you will be the driver of a small team that creates, maintains, and owns the infrastructure that runs our AI data solutions platform, including production and non-production application environments, CI/CD, logging, monitoring, and so on. You should have extreme interest and curiosity in new technology. You are someone who enjoys experimenting and tinkering with different approaches, to consistently find the best possible solutions to our myriad needs. 


Why is this role exciting? 
  • Take part in advancing the state of humanity by enabling better AI.
  • You will have plentiful opportunities to make significant contributions in terms of architecture, infrastructure, and ways of working. 
  • Working in an international environment with a lot of cultural diversity. 

Responsibilities 

  • Provide people management and technical leadership for your team of 4-5
  • Build and maintain a 24/7 production environment at scale 
  • Proactive system monitoring, configuration, and tuning 
  • Incident resolution 
  • Implement, educate, and advocate for DevOps best practices 
  • Mercilessly reduce or eliminate toil 
  • Stay up to date with new technologies and trends to ensure we continue to use best-in-class technology and processes 
  • Write and maintain documentation 
  • Educate, mentor, and empower your team members 
  • Encourage an inclusive and open environment that makes space for diverse viewpoints and working styles 
  • Ensure the highest level of technical quality, security, scalability, and stability in all products and teams that you are part of

Minimum Qualifications 

  • Progressive experience including both software engineering and infrastructure / DevOps
  • Team leadership experience 
  • Project management experience 
  • Document extensively out of habit 
  • Experience building and running production systems at scale, with an understanding of the kinds of problems that can occur along with likely solutions 
  • Proven experience and current fluency in:

○ Amazon Web Services (AWS) 

○ Kubernetes 

○ Infrastructure as Code (IaC) 

○ Network architecture 

○ Serverless technologies 

  • Understanding of and ability to implement Continuous Integration and Continuous Delivery (CI/CD) systems and best practices 
  • Proven experience working with your team and stakeholders to reduce toil and improve Developer Experience (DX) 
  • Good verbal and written English communication skills

Nice to Have 

  • People management experience 
  • Understanding and ability to implement best practices for asynchronous work
  • Current ability in Python 
  • Terraform 
  • Ansible 
  • Packer 
  • GitOps 
  • Ability to tune and troubleshoot databases (SQL-based and MongoDB preferred)
  • Experience with Google Cloud Platform (GCP) 
  • Interest in AI and ML

Only shortlisted candidates will be contacted!


If this opportunity sounds appealing to you, apply now! 


For inquiries email: jobs.AI@telusinternational.com

Additional Job Description


TELUS International is seeking a Lead System Reliability Engineering to join our AI team! 



Required Language(s)
English