Open position at ShipMonk Product Development

Staff Site Reliability Engineer

Work schedule
Full-time
Address
Rohanská nábřeží 23, Praha 8, Karlín

We are ShipMonk, and we provide order fulfillment logistics services. Our main business operates in the USA, but Prague is the development center for our modern platform.

We are seeking an influential Staff SRE to help architect and drive the strategic evolution of our core cloud and deployment infrastructure, shifting our operations toward a more robust, self-service developer platform. This is a highly strategic yet hands-on role for an engineer ready to challenge inefficiencies and contribute to continuous improvement initiatives, from concept to production.

About us

ShipMonk is a growing third-party logistics (3PL) company. Our engineering culture values ownership, automation, and continuous improvement, placing SRE at the core of our strategic development efforts. We believe in empowering our developers with best-in-class tooling and infrastructure.

The opportunity

You will be the key technical innovator defining our infrastructure's future state, specifically focused on scaling, optimizing, and enhancing our fully automated platform. While the current architecture is stable, you will be empowered to conduct deep analysis and implement strategic, iterative architectural changes to substantially improve developer velocity and system reliability.

This role is focused on strategic planning, persuasion, and execution to drive evolutionary improvements that result in a best-in-class developer experience, moving us forward one major step at a time.

Key responsibilities and scope

  • Platform Architecture: Propose the design, implementation, and maintenance of core cloud and deployment systems, advocating for self-service patterns.
  • Kubernetes and Cloud Orchestration: Take ownership of the scalability, security, and optimization of production Kubernetes clusters and the underlying AWS account management structure.
  • CI/CD Strategy: Drive best practices across our CI/CD pipelines, optimizing the performance and reliability of GitLab CI runners and standardizing deployment flows using Argo CD.
  • Infrastructure Core Services: Provide administrative expertise and reliability improvements for critical services, including RabbitMQ and the enterprise VPN.
  • Observability Leadership: Improve the organization’s vision for monitoring, tracing, and logging, and manage the strategic use and optimization of Datadog across all environments.

Skills and qualifications

  • 6+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering roles.
  • Deep expertise in AWS multi-account environments (Networking, Security, IAM).
  • Expert-level knowledge of Kubernetes administration, networking, and deployment strategies.
  • Strong operational experience with messaging systems (e.g., RabbitMQ) and GitOps tools (e.g., Argo CD).
  • Proficiency in modern CI/CD tooling, specifically GitLab CI/CD.
  • Expertise in Infrastructure as Code (IaC), preferably Terraform.
  • Demonstrated experience managing large-scale observability platforms like Datadog.

Ideal candidate

  • Evolution Driver: Possesses a strong internal drive and the conviction to push for continuous, significant improvements and to strategically refine existing processes and infrastructure.
  • Strategic Communicator: A great communicator who is skilled at listening to the needs of engineering teams, translating those needs into technical roadmaps, and then persuading other engineers and management that the resulting ideas are worth investing in.
  • Platform-Focused: Experienced in building internal developer platforms (IDPs) and services, focusing on APIs and tooling that enable developers to deploy and manage their services reliably and independently.
  • Technical Innovator: Acts as a force multiplier by bringing fresh ideas, challenging conventions, and raising the technical bar across the entire organization.

GET THE SH*T DONE

If you like what we do and you are interested in our "story", we look forward to your resume, profile, story, whatever. There are no limits to creativity. Our recruiter will contact you as soon as possible. We hope to hear from you soon.
