Site Reliability Engineer

Work from home Full-time role Hiring

This role is for one of the Weekday's clients Min Experience: 5 years

JobType: full-time

We are looking for a skilled and proactive Site Reliability Engineer to help build and maintain highly reliable, scalable, and secure infrastructure and applications. This role will focus on automating operations, improving system performance, and ensuring overall service health by applying modern SRE practices.

Requirements

Key Responsibilities:

Design, implement, and manage Kubernetes-based infrastructure.
Utilize AWS services such as IAM, EC2, EKS, S3, and CloudWatch to build and support scalable cloud environments.
Develop and maintain automation scripts and tools using Shell scripting or Python.
Proactively identify, analyze, and troubleshoot complex application, network, and system-level issues.
Optimize system performance and reliability, with deep expertise in Linux debugging and performance tuning.
Build automation for system self-healing and recovery mechanisms.
Develop monitoring and alerting solutions for high-performance and low-latency applications.
Collaborate with development and operations teams to implement effective CI/CD pipelines.
Apply SRE principles including service monitoring, alerting, error budget tracking, capacity planning, fault tolerance, automation, and toil reduction.
Continuously seek opportunities to improve system reliability and engineering processes.

Qualifications:

Proven experience working with Kubernetes in production environments.
Strong command of AWS cloud services with hands-on experience in infrastructure provisioning and management.
Proficiency in scripting or programming (Shell or Python preferred).
In-depth Linux knowledge including tools for diagnostics and performance optimization.
Familiarity with modern observability tools for monitoring, logging, and alerting.
Strong troubleshooting and problem-solving skills.
Understanding and application of SRE concepts and best practices.

Key Skills:

Kubernetes · AWS (IAM, EC2, EKS, S3, CloudWatch) · Linux Debugging · Shell/Python Scripting · Monitoring & Alerting · Automation · CI/CD · Docker · Site Reliability Engineering (SRE) · Performance Tuning

Originally posted on Himalayas

Apply To this Job

Apply

Site Reliability Engineer

JobType: full-time

Requirements

Key Responsibilities:

Qualifications:

Key Skills:

You might like

Mobile Storm Damage Restoration Sales Professional

Expert Crypto Accountant

Nurse Program Manager - Atrium Donor Allocation Charlotte FT Days

Senior Machine Learning Engineer

Business Analyst

Growth & CRO Manager - fully remote within Europe (m/f/d)

Atlassian Consultant

C# Developer

Pharmacy Technician, Clinical Services (MTM & Adherence)

QA Specialist, External Manufacturing

Senior Partner Development Manager

Risk Adjustment Coding Specialist I

Part-time Remote Executive Assistant

Virtual Assistant - Data Entry Specialist for arenaflex: A Remote Opportunity for Career Growth and Development

Experienced Part-Time Data Entry Executive – Thriving in a Culture of Innovation and Teamwork at arenaflex

SVP, AI Platform & Automation

Psychiatrist - Massachusetts

Experienced Customer Success Specialist – Tech Support – Hybrid/Remote Work Opportunity

(Remote) Client Success Manager Job at SPECTRUM in Littleton

Software Support and Connectivity Specialist | M-F 8:30am-5pm (Remote, United States)

Site Reliability Engineer

JobType: full-time

Requirements

Key Responsibilities:

Qualifications:

Key Skills:

You might like

Mobile Storm Damage Restoration Sales Professional

Expert Crypto Accountant

Nurse Program Manager - Atrium Donor Allocation Charlotte FT Days

Senior Machine Learning Engineer

Business Analyst

Growth & CRO Manager - fully remote within Europe (m/f/d)

Atlassian Consultant

C# Developer

Pharmacy Technician, Clinical Services (MTM & Adherence)

QA Specialist, External Manufacturing

Senior Partner Development Manager

Risk Adjustment Coding Specialist I

Part-time Remote Executive Assistant

Virtual Assistant - Data Entry Specialist for arenaflex: A Remote Opportunity for Career Growth and Development

Experienced Part-Time Data Entry Executive – Thriving in a Culture of Innovation and Teamwork at arenaflex

SVP, AI Platform & Automation

Psychiatrist - Massachusetts

Experienced Customer Success Specialist – Tech Support – Hybrid/Remote Work Opportunity

(Remote) Client Success Manager Job at SPECTRUM in Littleton

Software Support and Connectivity Specialist | M-F 8:30am-5pm (Remote, United States)

Looking for more remote jobs?