See all roles

[Remote] Lead AI Engineer

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. IXOPAY is an enterprise-grade global payment infrastructure platform that utilizes AI-driven intelligence for payment solutions. The Lead AI Engineer will design and implement AI agent systems, manage performance and cost, and lead the team in delivering AI-powered products and strategies.

Responsibilities

  • Architect AI agent systems: Design, build, and deploy production single-agent and multi-agent systems using LangGraph on cloud AI platforms (Azure AI Foundry, AWS Bedrock, etc.), with Claude as the primary foundation model. Architect agent memory, state management, and context persistence for long-horizon reasoning tasks
  • Own evaluation and quality: Build and maintain evaluation harnesses that measure faithfulness, hallucination rate, retrieval quality, latency, and instruction-following accuracy across model versions and prompt changes. Define golden test sets, regression suites, and automated eval pipelines that gate every release
  • Build AI safety and guardrails: Implement safety guardrails, output filtering, and prompt injection defenses across all agent systems. Lead red-teaming exercises to identify failure modes before they reach production. Ensure responsible AI practices are embedded in every deployment
  • Ship RAG-powered intelligence: Build agentic RAG pipelines with retrieval, generation, validation, and ReAct-style reasoning loops that ground outputs in real payment data. Leverage the Claude SDK, Anthropic's tool-use APIs, and MCP protocol to build agents that interact with internal systems, payment gateways, and external data sources
  • Manage cost and performance: Own token cost modeling and inference optimization across agent workflows. Understand the cost profile of every agent loop at production scale and make architecture decisions that balance capability with spend
  • Design human-in-the-loop patterns: Design human-in-the-loop review checkpoints and escalation paths for high-stakes workflows. Define where agents operate autonomously and where human oversight is required — especially in payment-critical operations
  • Lead delivery streams: Own the AI delivery streams (Trust Score, StormTrooper, RFC Agent) from ideation through production deployment. Define technical direction, set quality standards, and make build-vs-buy decisions
  • Instrument observability: Set up and maintain LLM observability and tracing infrastructure (LangSmith, LangFuse, or equivalent) to monitor agent behavior, debug failures, and track quality metrics in production
  • Drive AI strategy: Partner with product, engineering, and payments teams to identify high-impact AI use cases — fraud scoring, intelligent routing, chargeback prediction, automated reconciliation, and beyond
  • Build the team: Mentor contractors and future hires. Establish coding standards, review patterns, and engineering culture for the AI function

Skills

  • Extensive software development experience, with many years of your career focused on data science, machine learning, and/or agentic AI development (or a combination)
  • Demonstrated track record of building and deploying production-ready AI agents — not proofs of concept, not notebooks, but systems running in production with real users and real data. You can describe a specific production failure you caught, what the failure mode was, and what you built to prevent it
  • Experience designing and running evaluation frameworks for LLM-powered systems — faithfulness scoring, hallucination detection, retrieval quality metrics, and regression testing across model and prompt changes
  • Hands-on experience building agents with LangGraph (state machines, conditional edges, tool nodes, memory management, human-in-the-loop patterns)
  • Production experience with the Claude SDK / Anthropic API — tool use, MCP protocol, structured outputs, and prompt engineering at scale
  • Understanding of AI safety practices: output filtering, guardrail implementation, prompt injection defense, and red-teaming methodologies
  • Strong Python skills. You write clean, testable, well-documented code
  • Experience with RAG architectures — vector stores (FAISS, Pinecone, OpenSearch), embedding models, chunking strategies, and retrieval evaluation
  • Practical understanding of token cost modeling, inference optimization, and the trade-offs between fine-tuning, retrieval-based approaches, and prompt engineering at production scale
  • Familiarity with cloud AI platforms (AWS Bedrock, Azure AI Foundry, GCP Vertex AI) and supporting cloud infrastructure services
  • Experience with LLM observability and tracing tools (LangSmith, LangFuse, Arize, or equivalent) for monitoring agent behavior and quality metrics in production
  • Ability to lead technical decisions and communicate trade-offs clearly to both engineering and business stakeholders
  • Experience in payments, fintech, or financial services
  • Background in NLP, information extraction, or document understanding
  • Experience with fine-tuning approaches (LoRA, SFT, DPO/RLHF) and the judgment to know when fine-tuning is the right call versus retrieval or better prompting
  • Prior experience scaling an AI/ML function from 0 to 1
  • Contributions to open-source AI/agent frameworks

Benefits

  • Competitive salary and benefits
  • Opportunities for growth and development
  • A collaborative and supportive team environment
  • Medical, Dental & Vision Insurance
  • Flexible Spending Account (FSA) & Health Savings Account (HSA)
  • Employer-paid Life, AD&D, STD & LTD Insurance
  • Unlimited PTO & Paid Holidays
  • 401(k) Plan with Employer Match

Company Overview

  • IXOPAY provides digital payment processing solutions enabling independent, flexible and global payment processing for enterprise merchants. It was founded in 2014, and is headquartered in Vienna, Wien, AUT, with a workforce of 51-200 employees. Its website is https://www.ixopay.com.
  • Apply To This Job

    You might like

    [Remote] Senior Copywriter & Content Strategist

    Work from home Full-time role

    [Remote] Software Engineer III

    Work from home Full-time role

    [Remote] Executive Director, Business Development - Patient HUB Services

    Work from home Full-time role

    [Remote] Senior Data Scientist

    Work from home Full-time role

    [Remote] Senior UX Designer

    Work from home Full-time role

    [Remote] Senior Project Manager (ITS - Software - Transport)

    Work from home Full-time role

    [Remote] Senior Product Manager II - AI Platform & Agentic Experience

    Work from home Full-time role

    [Remote] Senior Security Engineer

    Work from home Full-time role

    [Remote] Staff Software Engineer: Platform

    Work from home Full-time role

    [Remote] Bilingual Customer Service Representative

    Work from home Full-time role

    Sr. Field Clinical Representative - Atlanta, GA – Amazon Store

    Work from home Full-time role

    Hawk-Eye Systems Technician (NFL) - Kansas City, MO

    Work from home Full-time role

    Experienced Customer Support Representative (WFH) in Newark, NJ – Join arenaflex's Global Team

    Work from home Full-time role

    Wayfair Warehouse Jobs – Part Time $25/Hour

    Work from home Full-time role

    Experienced Customer Service Representative (Remote) – Delivering Exceptional Experiences at arenaflex

    Work from home Full-time role

    Netflix.Job Remote Careers

    Work from home Full-time role

    Third Party Review (TPR) Analyst - Quality Assurance

    Work from home Full-time role

    Experienced Remote Customer Service Representative - Delivering Exceptional Client Experiences with blithequark, Starting at $19/Hour

    Work from home Full-time role

    Associate Academic Designer, Science (Labs)

    Work from home Full-time role

    Associate, Claims Examiner

    Work from home Full-time role