Apply Now

Site Reliability Engineer

Job ID241170-en_US-9708
CompanyRogers
LocationBrampton
ProvinceON
Date Posted2021-07-27
Job TypeFull-time
Job CategoryTelecommunications
Description

At Rogers, we connect Canadians to a world of possibilities and the memorable moments that matter most in their lives. Every day we wake up with one purpose in mind. To bring loved ones together from across the globe. To connect people to each other and the world around them. To help an entrepreneur realize their dream. A sports fan celebrate a special moment.

Because we believe connections unite us, possibilities fuel us, and moments define us.

 

As we grow our team, the well-being of our team members remains our top priority. To ensure the health and safety of our team members, including those in the recruitment process, our team members are temporarily working from home.

Who we are: 

The Continuous Service Validation & Reliability team manages new software release testing of all Connected Home Services (Ignite TV, Ignite Home Phone, Internet, and Smart Home Monitoring) for Rogers Communications. The team is responsible for ensuring high quality product updates are propagated to Rogers customers by applying industry leading automated testing practices, as well as daily validation of key product features and functionalities. In addition this team is part of the Rogers Tech Hub, Tech Hub is more than a new organization.  

 

Tech Hub is a re-imagined way of  delivering next-gen technology solutions with speed that delight our customers and inspire our teams.  

 

  • It brings industry-leading development and delivery practices in-house. 
  • It modernizes our foundational architecture. 
  • It unifies organizations into a common mission. And it revolutionizes our culture. 

 

The Opportunity:

Tech Hub is currently seeking a Site Reliability Engineer (SRE) to build and run large-scale, distributed, fault-tolerant systems. SRE ensures that Tech Hub's services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to users' needs. Additionally, SREs will keep an eye on our systems capacity and performance and demonstrate a mindset of driving continuous improvement. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation.

 

On the CSV&R team within Tech Hub, you’ll have the opportunity to manage the complex challenges of scale, while applying your expertise in coding, algorithms, complexity analysis and large-scale system design. Within this role you will be encouraged to showcase your ability to multi-task, experiment and innovate, take risks in a blame-free environment, and solve complex problems by collaborating with others within our diverse team who come from a wide variety of backgrounds.    

 

Responsibilities:

  • Engage in and improve the entire lifecycle of services (inception, design, deployment, operation, and continuous improvement).
  • Support services before they go live through defining and instrumenting SLOs (Service Level Objectives) and SLIs (Service Level Indicators) through system design, software development of micro-services and frameworks, and capacity planning.
  • Maintain services once they are live by measuring and monitoring overall system health via SLOs & SLIs and demonstrate leadership throughout the incident management process, including blameless post mortems.
  • Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and performance to ensure customer expectations are being met.

 

 Minimum Requirements:

  • Undergraduate degree in Engineering, Computer Science, a related technical field involving software/systems engineering, or equivalent practical experience.
  • Experience programming in at least one of the following languages: C, C++, Java, Python
  • Experience with algorithms, data structures, logging, monitoring, infrastructure configuration, application and network protocols.
  • Experience working within Agile software development lifecycle and Dev/Ops methodologies.

 

Additional Preferred Requirements:

  • Large-scale distributed systems analysis, design, troubleshooting, and systematic problem solving skills.
  • Ability to debug, optimize code, and automate routine tasks.
  • Effective communication skills (verbal and written).
  • Understanding of Unix/Linux operating systems.
  • Understanding of Telecommunication services and technologies (IT, Digital, Video, Voice, Data, Wireless)
Schedule: Full time
Shift: Day
Length of Contract: Not Applicable (Regular Position)
Work Location: 8200 Dixie Rd (341), Brampton, ON 
Travel Requirements: Up to 10%
Posting Category/Function: Technology & Software Development
Requisition ID: 241170
 
Together, we'll make more possible, and these six shared values guide and define our work:
 
  1. Our people are at the heart of our success
  2. Our customers come first. They inspire everything we do
  3. We do what’s right, each and every day
  4. We believe in the power of new ideas
  5. We work as one team, with one vision
  6. We give back to our communities and protect our environment
 

What makes us different makes us stronger. Rogers has a strong commitment to diversity and inclusion. Everyone who applies for a job will be considered. We recognize the business value in creating a workplace where each team member has the tools to reach their full potential. At Rogers, we value the insights and innovation that diverse teams bring to work. We work with our candidates with disabilities throughout the recruitment process to ensure that they have what they need to be at their best. Please reach out to our recruiters and hiring managers to begin a conversation about how we can ensure that you deliver your best work. You matter to us! For any questions, please visit the Rogers FAQ.

 
Posting Notes:  Information Technology & Engineering 


For more information, visit Rogers for Site Reliability Engineer