[Remote] DevOps Engineer
Note: The job is a remote job and is open to candidates in USA. Time Doctor is a workforce analytics platform that provides AI-powered insights into work performance. As a DevOps Engineer, you will architect and manage cloud-native infrastructure primarily on Google Cloud Platform, leveraging modern technologies to ensure high availability and performance for enterprise-grade SaaS solutions.
Responsibilities
- Design, implement and maintain Terraform-based infrastructure across 40+ environments (production, staging, sandbox and customer-specific deployments)
- Manage GCP services including GKE, Cloud Run, Cloud Functions, Cloud Build, VPC networking, Cloud Armor and Cloud Load Balancers
- Develop and maintain reusable Terraform modules following best practices and DRY principles
- Manage multi-environment infrastructure using GitOps workflows and Terraform Cloud
- Administer MongoDB Atlas clusters including backup strategies, replication, network peering and performance optimization
- Manage Redis instances (Cloud Memorystore) for caching, session management and real-time features
- Configure and maintain BigQuery datasets, scheduled queries and data pipelines
- Implement data lifecycle policies for Cloud Storage and optimize storage costs
- Build and maintain CI/CD pipelines using GitHub Actions, Cloud Build and automated Terraform workflows
- Implement container build pipelines and manage container registries (GCR/Artifact Registry)
- Automate deployment processes with integration to GitHub via OIDC/Workload Identity Federation
- Develop infrastructure automation scripts in Python and Bash
- Configure and manage pre-commit hooks and code validation workflows
- Maintain comprehensive monitoring solutions using Google Cloud Monitoring and Sentry
- Create and manage custom alert policies for application and infrastructure metrics
- Implement distributed tracing and APM using Cloud Trace and OpenTelemetry
- Configure log-based metrics and structured logging with Cloud Logging
- Respond to incidents, perform root cause analysis and implement preventive measures
- Manage uptime checks and SLO/SLI monitoring across services
- Configure and maintain Cloud Armor WAF policies with rate limiting and DDoS protection
- Manage Identity-Aware Proxy (IAP) for secure access control
- Implement SSL/TLS automation using Let's Encrypt and Google-managed certificates
- Maintain secrets management using Google Secret Manager with proper IAM controls
- Configure VPC networking, private service access and Cloud NAT for secure communication
- Enforce security best practices including encryption at rest, network isolation and least-privilege IAM
- Work closely with engineering teams to optimize application performance and scalability
- Participate in architecture discussions and provide infrastructure expertise
- Create and maintain comprehensive documentation for infrastructure configurations, runbooks and procedures
- Mentor team members on DevOps best practices and infrastructure workflows
- Collaborate with cross-functional teams on feature deployments and system improvements
Skills
- Bachelor's degree in Computer Science, related technical field or equivalent practical experience
- 3-5 years of hands-on DevOps experience with production cloud environments
- Strong expertise in Google Cloud Platform (GCP), including: Cloud Run, Cloud Functions, GKE, VPC networking, Cloud Armor, Load Balancers, IAM, Secret Manager and security services
- Advanced Terraform proficiency: Infrastructure as code for complex multi-environment setups, Module development and state management, Terraform Cloud/Enterprise workflows
- MongoDB Atlas administration: Cluster configuration, sharding and replica sets, Backup/recovery strategies and performance tuning, Network peering and security configuration
- Container technologies: Docker containerization and multi-stage builds, Experience with Kubernetes or serverless container platforms, Container registry management
- CI/CD expertise: GitHub Actions workflows, Cloud Build pipelines, GitOps practices and automated deployments
- Scripting and automation: Strong Python and Bash scripting skills, Infrastructure automation and tooling development
- Monitoring and observability: Experience with Sentry/Datadog or similar APM/monitoring platforms, Cloud-native monitoring tools and practices, Log aggregation and analysis
Benefits
- 100% remote and async-first — work from anywhere
- Competitive pay + 30+ days of paid time off
- Full-time, 100% remote role
- Equal opportunity employer — we deeply value diversity and inclusion. We welcome applicants of all backgrounds and do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability.
- We hire globally. Depending on your location, you may join as a full-time employee or contractor. Our Talent Attraction team will walk you through all the details during your first interview.
Company Overview