← All crypto jobsSite Reliability Engineer Core Infrastructure
Kraken
KrakenUSD 80k–101kvia web3.careerPosted 6/16/2026
Paraguayinfrastructureengineerreliabilityawscrypto
About the role
Building the Future of Open Finance
Payward - the parent company behind Kraken, NinjaTrader, Breakout, xStocks, Payward Services and CF Benchmarks - has spent the last 15 years building one of the most modern and globally accessible financial infrastructure platforms in the industry, built to advance an open, global financial system.Before you apply, we encourage you to explore our culture page to understand what drives us and how we work.The teamJoin our engineering team and play a pivotal role in upholding the reliability, scalability, and efficiency of our robust platform team. As a Site Reliability Engineer (SRE), you will collaborate closely with diverse cross-functional teams to conceive, execute, and oversee the foundational infrastructure systems that empower our array of applications and services.As a key member of our SRE team, you will guarantee the availability, high performance, scalability and cost efficiency of our critical services and platforms. This is an excellent opportunity for engineers who are passionate about automation, cloud technologies, distributed systems, monitoring, logging and maintaining highly available financial platforms.You will work closely with Software Engineers, Security Engineers, and Platform teams to improve operational excellence and support mission-critical financial services. You will participate in system monitoring, incident response, automation initiatives, and infrastructure improvements while learning best practices for operating large-scale, highly regulated environments. The Opportunity
Implement data infrastructure solutions (self service) that support the needs of dozens of business units and hundreds of engineers
Utilize Infrastructure as Code (IaC) principles to design, provision, and manage both on-premises and cloud (AWS) infrastructure components using tools such as Terraform
Develop and maintain automation scripts using bash/shell scripting and to automate operational tasks and deployments.
Enhance and manage CI/CD pipelines to facilitate consistent software deployments across the data infrastructure.
Implement robust data monitoring and alerting solutions to proactively detect anomalies and performance issues.
Manage and implement role-based access control (RBAC) and permissions for a multitude of user groups and machine workflows across different environments
Utilize Kubernetes and Nomad to manage containerized applications within the data infrastructure, ensuring efficient deployment, scaling, and orchestration.
Implement effective incident response procedures and participate in on-call rotations.
Collaborate with data analysts, engineers, and cross-functional teams to understand requirements and implement appropriate solutions.
Document architecture, processes, and best practices to enable knowledge sharing and support continuous improvement.
What you Bring
Bachelor’s degree in Computer Science, Software Engineering, or a related field (or equivalent experience).
Proven experience of 1+ year of working as a Site Reliability Engineer, Infrastructure/Platform/DevOps Engineer, Software Engineer or similar roles
Ability to leverage AI tools and agents such as Claude and OpenAI to efficiently deliver business value
Solid understanding of bash/shell scripting and proficiency in at least one programming language (preferably Python, Golang or Rust).
Experience with containerization tools such as Docker or Podman
Strong problem-solving skills and the ability to troubleshoot complex systems.
Nice to haves
Experience managing and operating data systems such as Kafka, Redis, ElasticSearch, MariaDB, AirFlow, Debezium, ScyllaDB, TiDB, Hashicorp Vault
Experience managing self-hosted and SaaS platforms such as Splunk, VictoriaMetrics, Grafana, Cloudflare, Ingresses, Gitlab
Experience running Kubernetes as a Platform offering for engineering teams
Kubernetes AWS or on-premises experience managing workloads at scale
Infrastructure as Code tools such as Terraform, Terragrunt...
This listing was sourced from web3.career and ranked for crypto candidates. Apply via the original source.