Senior Site Reliability Engineer

Regular Employment

Location: Hyderabad, TG, IN

About the team 

Part of Swiss Re's Group Digital & Technology organisation, Shared Platform Services ​provides central infrastructural services and standardized common components, ranging from automation, secure access tokens, cryptographic keys to database infrastructures. That helps us to increase productivity of our clients, improve cost efficiency, and reduce time to market for new features. 

The Automation and Orchestration Team enables Swiss Re to digitize and optimize processes by providing a reliable and fast cloud-native orchestration platform, which uses a microservice architecture and open-source components. With the orchestration platform Business and IT units can run their processes/workflows to reduce manual effort and ensure consistency in their daily work life. 

About the role 

As a Senior Site Reliability Engineer, you'll play a pivotal role in ensuring the reliability and performance of our hybrid cloud-based orchestration solution. Collaborating closely with our team, you'll cultivate a culture of SRE/DevSecOps, breaking down silos and managing incidents and problems. Your role will involve developing and implementing innovative solutions for proactive detection, analysis, prevention, and resolution of issues. Working with our diverse and motivated team, you will: 

  • Work closely with the Product Owner and Product Reliability Engineer to ensure the quality and stability of the platform 

  • Lead and mentor a team of Site Reliability Engineers on engineering, architecture, and security topics. 

  • Handle change management and deployment support 

  • Define and monitor Service Level Objectives, propose performance tuning strategies, manage error budgets, and implement proactive monitoring for errors and system availability using observability tools and monitoring solutions 

  • Propose and / or implement changes to reduce the manual effort needed (Toil Management) 

  • Drive automation through Scripting, Configuration Management and Infrastructure as Code 

  • Manage capacity planning and scaling strategies (forecasting system demand, planning growth) 

  • Participate in development sprints as needed, balancing operational risks and tasks. 

  • Create technical support documentation, reusable assets, and guidelines for the Engineering Team 

About You 

We are looking forward to welcoming you to our team, particularly when you possess: 

  • 10+ years' system engineering experience, software development, continuous integration/deployment and in cloud-native ecosystems. 

  • Passion for sharing knowledge, through interactive sessions as well as documentation. 

  • Strong analytical and problem-solving skills on issues spanning applications, networks, and system, as well as the ability to focus on details without losing track of the bigger picture. 

  • Strong coding skills, where strong experience in Java is preferred, and experience in Golang would be an advantage. 

  • Experience with centralized logging, monitoring, and observability solutions. 

  • Hands-on experience with network, IT security on integrated systems and building highly available, distributed, and scalable infrastructure with good understanding of microservice architecture 

  • Good experience with automation scripting, containerization, CI/CD pipelines, Infrastructure as Code, SQL and NoSQL databases (performance tuning and backup), and Agile methodologies. 

  • Nice to have: 

  • Azure DevOps certification and experience with Microsoft Azure infrastructure solutions. 

  • IT Service Management Certification (i.e. ITIL) or at least deep understanding 

  • Hands-on experience in Disaster Recovery and Backup/Restore implementation 

  • Experience with open-source integration and orchestration platforms and technologies. 

  • Virtual Machines configuration and maintenance 

  • Excellent oral and written English skills 

Nobody is perfect and meets 100% of our requirements. If you, however, meet some of the criteria above and are curious about the world of process orchestration we'll be more than happy to meet you! 

About Swiss Re

Swiss Re is one of the world’s leading providers of reinsurance, insurance and other forms of insurance-based risk transfer, working to make the world more resilient. We anticipate and manage a wide variety of risks, from natural catastrophes and climate change to cybercrime. We cover both Property & Casualty and Life & Health. Combining experience with creative thinking and cutting-edge expertise, we create new opportunities and solutions for our clients. This is possible thanks to the collaboration of more than 14,000 employees across the world.

Our success depends on our ability to build an inclusive culture encouraging fresh perspectives and innovative thinking. We embrace a workplace where everyone has equal opportunities to thrive and develop professionally regardless of their age, gender, race, ethnicity, gender identity and/or expression, sexual orientation, physical or mental ability, skillset, thought or other characteristics. In our inclusive and flexible environment everyone can bring their authentic selves to work and their passion for sustainability.

Keywords:  
Reference Code: 129794 

Make an impact

Start your career journey with Swiss Re.

Tags

Tags