DevOps SRE Manager
Talentica Software is a boutique software development company started by industry veterans and ex- IITB grads. At Talentica, we help startups build products. We are techies at heart and thrive on using the latest tools and technologies to solve real-world problems. Owing to our unique space, we deal extensively with industry-defining technologies. Over the last 21 years, the company has worked with over 180+ startups, with most clients based in the US, ensuring many successful exits.
In 2022, Great Place to Work® recognized Talentica Software as India's Great Mid-Size Workplace.
What we're looking for?
We're seeking a DevOps SRE Manager to spearhead our cloud operations, with a primary focus on Google Cloud Platform (GCP) and secondary support for AWS. You'll lead two critical teams: a DevOps team focused on GCP infrastructure and a CloudOps/SRE team ensuring 24/7 uptime for our critical services.This role demands a blend of deep technical expertise, exceptional leadership, and strong customer relationship skills. You'll be the driving force behind seamless cloud operations, ensuring our infrastructure is reliable, scalable, and secure.
What you’ll be doing:
- Own and oversee DevOps operations in a GCP environment using Terraform, Kubernetes (GKE), Prometheus, and Grafana.
- Ensure timely execution of DevOps tasks while optimizing infrastructure automation.
- Drive CI/CD pipeline enhancements and cloud security best practices.
- Enhance monitoring, logging, and alerting capabilities to improve system reliability.
- Optimize cloud costs, scalability, and security for long-term efficiency.
- Manage and guide a 24x7 CloudOps/SRE team responsible for uptime and incident response.
- Create and maintain rosters to ensure continuous 24x7 support coverage.
- Oversee incident management, RCA (Root Cause Analysis), and SLAs.
- Implement observability best practices using Grafana, Prometheus, and Opsgenie.
- Reduce manual intervention by promoting automation and self-healing infrastructure.
- Build and maintain strong customer relationships, ensuring clear and transparent communication.
- Lead and mentor a cross-functional team of DevOps and CloudOps/SRE engineers.
- Ensure team productivity, performance reviews, and professional growth.
- Drive continuous improvement through feedback, training, and best practices.
- Maintain basic to intermediate AWS knowledge (IAM, EC2, EKS, S3, Lambda, CloudFormation).
- Assist in AWS networking, security, and infrastructure optimization when required.
- Provide support for AWS-based workloads where integration with GCP exists.
To be successful in this role, you should have:
- Qualification: BE/BTech from a reputable engineering institute.
- Experience: 8-12 years in DevOps, CloudOps, or SRE roles.
Technical Stack Expertise Required:
- Cloud platform: Google Cloud Platform (GCP) - Major, AWS-Minor
- Infrastructure as code (IaC): Terraform
- Containerization & orchestration: Kubernetes (GKE)
- CI/CD & automation: Jenkins, GitOps, Ansible
- Monitoring & observability: Prometheus, Grafana
- Incident & alerting tools: Opsgenie
- Big data & streaming technologies: Kafka, Airflow, Druid
- AWS services: IAM, EC2, S3, Lambda, CloudFormation, CloudWatch
Tech skills:
- Prior experience in handling 24x7 operations and multi-cloud environments.
- Proven experience in managing DevOps & CloudOps/SRE teams, ensuring smooth operations.
- Hands-on expertise with GCP infrastructure, Terraform, Kubernetes (GKE), and CI/CD pipelines.
- Experience in incident management, RCA, monitoring, and alerting tools (Prometheus, Grafana, Opsgenie).
- Strong understanding of reliability engineering, automation, and cloud security best practices.
Bonus points
- Experience with Kafka, Airflow, and Druid in large-scale environments.
- GCP Professional DevOps Engineer, AWS Solutions Architect, or Kubernetes certifications.
- Working knowledge of AWS cloud services, assisting in hybrid-cloud scenarios
What you’ll find here:
- A culture of innovation: We don't take up maintenance projects. Our customers come to us for our technology expertise.
- Endless learning opportunities: Continuously expand your skillset by exploring and applying the latest advancements in your field, creating better, faster, and simpler products.
- Talented peers: Work alongside experienced graduates from India's top engineering colleges (IITs, NITs, and a few select others).
- Work-life balance: We value your well-being and offer flexible schedules with remote work options.
- A great culture: Our employees love working here! 82% recommend Talentica to their friends, according to Glassdoor.
Talentica is the place to be, If you're looking to take ownership of large-scale, impactful projects and work with cutting-edge technologies, Talentica offers you the platform to make a real difference in shaping the future of our industry.
Ready to take the next step? Fill in the lead form below, and we will get in touch with you soon.