Senior Site Reliability Engineer
Build technologies that matter
This is us
At Qinshift and Avenga we are merging together to start a new era of technology that matter. Leveraging the power of innovations, we are on a journey to shape the future of work, and we are inviting you to co-create it with us.
This is the job
In Bangalore we are seeking a Senior Site Reliability Engineer to join an international team at a leading cloud security company. You’ll help ensure the stability, availability, and performance of production systems while contributing to monitoring, incident response, and operational best practices. This role bridges Operations, Engineering, and Product Management, driving product improvements and uptime. It’s a great opportunity to work on cutting-edge cloud security solutions, protecting data and users from evolving digital threats.
This is you
- Bachelor’s degree in computer science, electrical engineering or a related area, with 7+ years of SRE experience in a large enterprise organization
- System admin experience on Linux environments.
- Experience with end-to-end monitoring setup for infra and applications
- Experience with Prometheus, Grafana, ELK, Opensearch, Cloudwatch, PagerDuty and other monitoring tools.
- Solid experience with Cloud Technologies such as AWS and OCI.
- Good experience with containerized workloads tools like Kubernetes.
- Experience understanding and managing web servers (Apache, Tomcat, Nginx)
- Ability to script/program with one or more high level languages, such as Python, Go, etc.
- Experience with any configuration management tools like Salt or Puppet or Ansible or similar.
- Experience with source control tools such as Github and SVN.
- Experience with deployment tools Jenkins, Harness etc.
- Experience with SQL and NoSQL databases like Redis, CouchBase, Cassandra, Crate, Elasticsearch.
- Experience in performing and writing Root Cause Analysis documents
- Strong communication and analytical/problem-solving skills.
Nice-to-have skills:
- Network knowledge (TCP/IP, UDP, DNS, Load balancing) and prior network administration experience is a big plus.
- Experience in Security domain will be added advantage
This is your role
- Perform Incident Management and Change Management to maintain the continuous availability of all Cloud Infrastructure services.
- Ensure all SRE and operating procedures are maintained and executed.
- Maintain a 24x7 production environment with a high level of service availability and perform quality reviews, manage operational issues.
- Perform root cause analysis for major incidents and drive the process by involving required stakeholders.
- Perform problem management by analyzing metrics, alarms and dashboards to troubleshoot problem areas, report issues to assist in performance tuning and fault finding.
- Implementation of proactive monitoring, alerting, trend analysis, and self-healing solutions.
- Explore and innovate new technologies, features, and tools to improve the platform and automate operational tasks using Bash, Python or any other programming language.
- Manage and maintain Runbooks and Standard Operating procedures
- Manage, coordinate, and document all types of maintenance activities and outages.
- Perform patching and upgrades for vulnerability management.
- Work closely with the teams to initiate the development of new ideas into internal tools.
- Understand the existing architecture and work with various Engineering teams to develop and execute strategies to provide a high-quality production service.
What awaits you at Avenga x Qinshift?
- Everyone at Avenga is subject to professional growth via our mentorship program;
- Our specialists get regular performance reviews and technical assessments to identify development plans;
- The company provides extended training and certification opportunities;
- We stay up to date with the industry by embarking on tech talks, webinars, conferences, and hackathons;
- The company fosters a sense of professional belonging and an environment of togetherness: we achieve things together and celebrate our milestones.
We take pride in the diverse skills and character of our teams, welcoming everyone to apply and contribute to our collective strength.
- Locations
- India
- Remote status
- Hybrid
- Seniority
- Senior-level
Senior Site Reliability Engineer
Build technologies that matter
Loading application form