Staff Infrastructure Engineer — Observability

Work from home | Full-time role | Hiring

Our Purpose

At reputed company, we are driven by a clear purpose: to give the advantage to those who secure our future. As AI reshapes how organizations build and operate, the responsibility to protect them becomes more critical than ever. When you join reputed company, your work helps protect global enterprises, critical infrastructure, and the technologies shaping the future. If you are motivated by meaningful challenges and want your impact to be real, measurable, and global, you will find purpose here.

About Us

reputed company is a company at the intersection of AI and security, pioneering a new operating model for cybersecurity. Our AI-native platform unifies protection across endpoint, cloud, identity, data, and AI systems to deliver autonomous detection and response with speed. By combining real-time analytics and intelligent automation, we reduce noise, simplify complexity, and enable security teams to focus on what truly matters. Our teams are problem-solvers and innovators committed to shaping the future of security. If you are excited to solve hard problems alongside talented, mission-driven people, we invite you to help us build a safer future for humanity.

What Are We Looking For?

We're looking for people who are relentlessly curious and committed to continuous learning. AI is reshaping every function across our business, and we expect every team member, regardless of role or level, to build fluency in AI tools and concepts. Those who thrive here actively seek out new solutions, experiment thoughtfully, and apply what they learn to drive better, faster, smarter outcomes.

As a Staff Infrastructure Engineer, you'll be a pivotal technical leader and architect within our Observability team, driving strategic initiatives and shaping the future of our critical systems. You will apply your deep expertise to design, implement, and optimize solutions that underpin reputed company's global platform, directly empowering engineering teams across the organization.

We are seeking a candidate who is driven by a deep passion for observability and technical leadership. Imagine architecting the core systems that give reputed company real-time, global visibility, delivering actionable platform insights when they are needed. In this high-impact role, you'll design and implement robust, secure solutions for high-volume data ingestion, storage, and analysis, fundamentally shaping how we understand and optimize our platform health. This is your chance to take end-to-end ownership of critical infrastructure, mentor talented engineers, and accelerate software delivery across our entire engineering organization.

Due to Federal Government contract requirements, U.S. citizenship is required for this position. FedRAMP staff may be subject to customer or third-party background checks up to and including Secret Clearance if required by their role at reputed company.

What Will You Do?

Primary responsibilities include:
- Architect and implement robust, scalable telemetry platforms that enable reputed company engineers to release and monitor features with speed, safety, and reliability.
- Act as the primary Subject Matter Expert (SME) and administrator for our core observability stack, including Grafana, Prometheus, Thanos/Mimir/Cortex, and OpenTelemetry (OTEL) pipelines.
- Partner strategically with diverse engineering teams across the organization to define platform requirements, ensuring the observability ecosystem evolves ahead of stakeholder needs.
- Take complete ownership of critical features, from initial architectural design and requirements refinement through to production deployment and operational maturity.
- Drive operational efficiency for critical observability services across AWS and GCP, balancing system reliability with smart cloud cost optimization.
- Build robust automation and self-service tooling to reduce operational toil, optimize resource utilization, and minimize pager fatigue.
- Drive the deployment, maintenance, and compliance of observability systems in critical, high-security environments, including FedRAMP and air-gapped deployments.
- Cultivate platform transparency and reliability by rigorously implementing IaC (Terraform/Ansible) and standardizing industry best practices.
- Elevate engineering quality by mentoring team members, leading comprehensive technical design and code reviews, and providing constructive feedback that fosters growth.
- Lead the swift resolution of production incidents, conduct thorough root-cause analyses, and participate in on-call rotations to ensure peak system performance.

What Skills and Knowledge Should You Bring?

Ideal candidates will have:
- 8+ years of experience in Infrastructure Engineering, Site Reliability Engineering (SRE), or a related systems-focused field.
- 8+ years of experience architecting, scaling, and managing enterprise-grade observability stacks using Prometheus, Grafana, Thanos (or Mimir/Cortex), and OpenTelemetry (OTEL).
- Experience designing and engineering cloud-native infrastructure on major cloud providers (AWS or GCP) and managing production Kubernetes environments (EKS, GKE).
- Advanced proficiency with IaC and automation tools, specifically Terraform and Ansible, to manage immutable infrastructure.
- Experience maintaining and optimizing high-throughput, large-scale distributed systems with a focus on cost-efficiency, scalability, and disaster recovery.
- Demonstrated ability to lead technical designs, mentor other engineers, and collaborate cross-functionally with product and application teams.
- U.S. citizenship and the ability to work in a government-regulated environment.

Preferred Qualifications

- 8+ years of production-level programming experience in GoLang (highly desirable) or another mainstream language (e.g., Python, Java) with a strong willingness to adopt GoLang.
- Experience working with high-security compliance frameworks, specifically FedRAMP or other sovereign cloud requirements.
- Familiarity with the unique operational challenges of on-premises, hybrid, or air-gapped Kubernetes deployments.
- Experience designing advanced CI/CD pipelines (e.g., GitHub Actions) and implementing sophisticated deployment strategies (canary, blue-green, rolling updates).

Why reputed company?

AI is redefining how the world operates and rewriting the rules of security in real time, and reputed company was built for this moment. From day one, we architected an AI-native platform designed to operate at machine speed, not as an add-on to legacy systems but as the foundation itself. If you want to build where innovation and impact move together, this is that place.

We invest in our Sentinels with comprehensive, competitive benefits designed to support you and your family:

Equity & Rewards
- Restricted Stock Units (RSUs)
- Employee Stock Purchase Plan (ESPP)

Time Off & Wellbeing
- Flexible time off
- Paid company holidays and paid sick time
- Gender-neutral parental leave
- Grandparent leave

Insurance & Financial Security
- Medical, dental, and vision coverage
- 401(k) retirement plan with company match
- Life and disability insurance
- Health and dependent care FSA
- Voluntary benefits (hospital, accident, critical illness)
- Employee Assistance Program (EAP)
- ARAG pre-paid legal
- Pet insurance
- Cancer Care program
- Global business travel medical insurance

Work Perks & Flexibility
- Home office allowance
- Mobile phone reimbursement

Wellness & Lifestyle
- Wellness coach
- Wellness/gym reimbursement
- Fertility coverage
- Adoption & surrogacy reimbursement

This U.S. role has a base pay range that will vary based on the location of the candidate. For some locations, a different pay range may apply. If so, this range will be provided to you during the hiring process. You can also reach out to the recruiter with any questions.

Base Salary Range: $132,000 to $215,000 USD

reputed company is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

reputed company participates in the E-Verify Program.
