Job Description:
We are seeking a Senior Backend and DevOps Engineer with deep expertise in both backend development and infrastructure management. In this role, you will be responsible for architecting, building, and maintaining high-performance backend systems while also managing and monitoring infrastructure to ensure stability and scalability. You will work extensively in high-transaction environments, optimize system performance, and build resilient, automated deployment pipelines. Additionally, you will take ownership of critical production issues and continuously enhance system reliability by implementing industry-leading DevOps best practices.
The Senior Backend and DevOps Engineer will be responsible for the following tasks:
- Set up SLOs and monitoring practices
- Enhance system stability through database optimization and improvements to performance, scalability, and fault tolerance
- Lead backend stability and performance efforts, particularly in high-transaction environments
- Monitor system health to proactively identify and resolve issues, especially during peak usage periods
- Provide on-call support, mitigating production issues like crashes, errors, and outages in real time
- Analyze logs, error reports, and monitoring data to detect and address potential system problems
- Collaborate with cross-functional teams on automated CI/CD pipelines and incident management
- Optimize uptime, performance, and scalability
- Establish and maintain monitoring and alerting systems for rapid issue detection and response
- Document system changes, incident reports, and troubleshooting guidelines
The Senior Backend and DevOps Engineer is expected to have the following qualifications and skills:
- 5+ years of experience in backend engineering or DevOps roles
- Expertise in backend development with platforms like Supabase, Firebase, or similar
- Proficiency in Google Cloud Platform (GCP) services, including monitoring, scaling, and optimization
- Experience with queue systems like GCP Pub/Sub and GCP Cloud Tasks
- Experience optimizing database queries in PostgreSQL, MongoDB, or similar
- Hands-on experience with DevOps tools like CircleCI and GitHub
- Strong TypeScript skills and experience optimizing JavaScript
- Optional: Frontend experience with React and TypeScript
- Proven expertise in system monitoring, incident management, and large-scale optimization
- Strong troubleshooting skills and root cause analysis capabilities
- Experience with monitoring and alerting tools like Prometheus, Grafana, and ELK
- Desirable: Knowledge of real-time data systems, event-driven architectures, and security best practices
If interested, please send a CV and cover letter to hr@xenia.tech