When you join Verizon

Verizon is one of the world’s leading providers of technology and communications services, transforming the way we connect across the globe. We’re a diverse network of people driven by our shared ambition to shape a better future. Here, we have the ability to learn and grow at the speed of technology, and the space to create within every role. Together, we are moving the world forward – and you can too. Dream it. Build it. Do it here.

What you’ll be doing...

The SRE Engineer for the VIES team is responsible for the reliability of our infrastructure platforms, with a focus on availability and stability. The SRE Engineer is responsible for designing upgrades, overseeing changes, monitoring releases, and improving platform performance to provide quality services to our customers. While there is a strong focus on traditional operations functions, such as resolving incidents and joining crisis calls, there is an equal focus on developing automated self-healing solutions, based on root cause analyses, that make the platform more resilient.

  • Metrics and Monitoring - Monitor the performance of server/storage platforms, design SLOs and dashboards, and build analytics to predict anomalies. Improve SLIs and show continuous improvement in metrics.
  • Capacity Planning - Forecast platform resource requirements and identify potential performance and demand bottlenecks beforehand.
  • Change Management - Create release plans and automated release engineering workflows to release and install patches on the platforms with zero downtime.
  • Emergency Response - On-call support, root cause analysis, and blameless postmortems.
  • Automation - Automate repetitive tasks (toil management) and build automated health checks and self-healing environments.
  • Agile/DevOps engineering - Work in a product operating model based on Agile/Scrum practices.
  • Identify scalability bottlenecks and areas for performance improvements.
  • Collaborate with the engineering team to propose features that solve recurring patterns of customer complaints.

What we’re looking for...

You’ll need to have:

  • Bachelor’s degree or four or more years of work experience.
  • Six or more years of relevant work experience.
  • Experience in SRE practices / AIOps.
  • Experience in desktop and server operating systems.
  • Experience in Linux and OpenStack.
  • Experience in scripting languages such as Shell and Python.
  • Experience in dashboard tools such as Grafana.
  • Knowledge of server, storage, network, security, and firewall technologies.
  • Knowledge of Agile/DevOps toolsets and automated release management.

Even better if you have:

  • Experience in Containers - Docker, Kubernetes.
  • Ability to handle multiple priorities in a fast-paced environment.
  • Teamwork, problem-solving, and good written and verbal communication skills.
  • Ability to design a solution for any kind of requirement.
  • Ability to design and implement infrastructure monitoring solutions using tools such as ControlUp, uberAgent, and Splunk.
  • Experience in MySQL, PostgreSQL.
  • Knowledge and experience in Moogsoft.
  • Knowledge of Serverless architecture.
  • Knowledge of any of the cloud platforms: AWS, Google Cloud, Azure.
  • Any of the infrastructure certifications: VMware, MCSE, Red Hat, OpenStack.