About the job
The Network Reliability Engineering (NRE) team is part of the Cisco Cloud Security business unit. This team drives the technology that is transforming the way networks are built, deployed, and managed. As an Engineering Manager, you will be a key member of the team supporting the existing network-as-a-service platform that runs our (and our customers') business, as well as building the team that is designing, developing, and running the next-gen systems that will carry Cisco's new security and network SASE offerings. At its core, the Cloud Infrastructure team provides Infrastructure as a Service for engineering in 40+ of our data centers globally.
What You'll Be Doing
Network Reliability Engineering (NRE) teams have experience building large network systems and data centers using the principles of DevNetOps to minimize the toil for highly scalable and resilient systems. Our systems process 250+ billion transactions per day! You will join a team that focuses on quality, pragmatic solutions to issues. Operating in an automation-first environment, you will work alongside our development team to understand the current architecture and moving pieces. Key responsibilities of this team are:
Designing, building & deploying the network infrastructure on a DevNetOps pipeline
Automating workflows and handling the network's dynamics
Integrating systems
Eliminating toil
Troubleshooting with proactive testing
Engineering reliability through automated response
Aligning error budgets and service-level objectives
Participating in a 24/7 on-call rotation
Who You'll Work With
The experience of members on the team is wide and multifaceted, allowing us to see multiple perspectives of any challenge that comes our way. We are a team that is supportive of learning and experimentation.
This full-stack engineering team is a mix of engineers who have come from network, systems, and software engineering roles and experiences.
We work closely with other Cisco Security product engineering teams developing customer-facing security services and with the Cloud Infrastructure Engineering teams building core Internet infrastructure and distributed systems to help deliver the services globally more optimally and efficiently at scale.
The team invests heavily in continual improvement in automation and optimisation. Everything needs to scale horizontally and autonomously, with stability, resiliency, and high performance, and with the flexibility to adapt. We take phenomenal pride in building and operating one of the most hyper-connected and highly optimized global Anycast networks in the world.
Who You Are
You are an experienced technical leader with a background in managing multi-disciplinary teams and working with Agile development programmes.
As the Engineering Manager of this team, you will lead engineers to develop solutions to extend and enhance the infrastructure services offered by our platform. You will guide the team in producing the best technical and product solutions for the platform, and work with product managers and business leaders to understand which customer problems we should solve and how to solve them.
Some of the things you will work on:
Collaborate with product management and business leadership to define platform strategy and roadmap
Strengthen engineering process, principles, and culture within your team and across the organisation
Provide technical and architectural vision for one or more platform services
Drive operational excellence, quality, agility, and ever-increasing scale
Support continuous learning and professional development for yourself and your team
The team could be a good fit for you if several of these apply to you:
You have 14+ years of experience leading engineering teams focused on delivery
You are energised by working on network infrastructure, distributed applications, platform and product development
You want to have a direct impact on the development of a new Network infrastructure platform
You have led a Site Reliability or Operations team that works on developing, automating, and running a highly available platform
You care deeply about creating solutions with security built-in from the beginning
You have excellent people management skills and have built high-performing teams
You have strong communication skills and the ability to work effectively across multiple business and technical teams
You are deeply customer-focused and love learning
Technical Lead, Network Reliability Engineer -G11
About the job
Who We Are
The Network Reliability Engineering (NRE) team is part of the Cisco Cloud Security business unit. This team drives the technology that's transforming the way networks are built, deployed, and managed. As an Engineer, you will be a member of the team supporting the existing network-as-a-service platform that runs our (and our customers') business, as well as providing technical leadership in designing, developing, and running the next-gen systems that will carry Cisco's new security and network SASE offerings. At its core, the Cloud Infrastructure team provides Infrastructure as a Service for engineering in 40+ of our data centers globally.
What You'll Be Doing
Our engineers have experience building large network systems and data centers using the principles of DevNetOps to minimize the toil for highly scalable and resilient systems. Our systems process 250+ billion transactions per day! You will join a team that focuses on quality, pragmatic solutions to issues. Operating in an automation-first environment, you will work alongside our development team to understand the current architecture and moving pieces. Key responsibilities are:
Designing, building & deploying the network infrastructure on a DevNetOps pipeline
Automating workflows and handling the network's dynamics
Integrating systems
Eliminating toil
Troubleshooting with proactive testing
Engineering reliability through automated response
Aligning error budgets and service-level objectives
Participating in a 24/7 on-call rotation
Who You Are
To join the team, we are looking for candidates with a background in large networking systems and operation at scale. Strong candidates can show skill in networking and automation, with Python experience. We also want to know that you've been responsible for time-sensitive, mission-critical systems with high attention to detail. Our services are the heart of the Cisco Umbrella product, and we take ownership of that very seriously -- our SLOs are high. You have proven all the vital skills to build and improve these systems over the course of your career -- this isn't your first rodeo.
12+ years of experience in building, deploying, and maintaining large network systems
Solid knowledge of networking and internet technologies
Enjoy troubleshooting and root-causing complex issues across complex systems
Automation using Python
Experience with API development and integration
Experience working with Linux systems
Experience with supporting 24/7 on-call rotation
Qualifications
Bachelor of Engineering with certifications (CCNA, CCNP)
Familiar with AWS or any cloud infrastructure
Experience running a highly visible, 24x7 mission-critical service using DevNetOps practices
Ability to participate in a 24/7/365 on-call rotation and resolve production issues within SLAs