Site Reliability Engineer Core Infrastructure

Kraken
USD 80k–101k · via web3.career · Posted 6/16/2026
Paraguay · infrastructure · engineer · reliability · aws · crypto

About the role

Building the Future of Open Finance

Payward - the parent company behind Kraken, NinjaTrader, Breakout, xStocks, Payward Services and CF Benchmarks - has spent the last 15 years building one of the most modern and globally accessible financial infrastructure platforms in the industry, built to advance an open, global financial system.

Before you apply, we encourage you to explore our culture page to understand what drives us and how we work.

The team

Join our engineering team and play a pivotal role in upholding the reliability, scalability, and efficiency of our platform. As a Site Reliability Engineer (SRE), you will collaborate closely with cross-functional teams to design, build, and operate the foundational infrastructure systems that power our applications and services.

As a key member of our SRE team, you will ensure the availability, high performance, scalability and cost efficiency of our critical services and platforms. This is an excellent opportunity for engineers who are passionate about automation, cloud technologies, distributed systems, monitoring, logging and maintaining highly available financial platforms.

You will work closely with Software Engineers, Security Engineers, and Platform teams to improve operational excellence and support mission-critical financial services. You will participate in system monitoring, incident response, automation initiatives, and infrastructure improvements while learning best practices for operating large-scale, highly regulated environments.

The Opportunity

Implement self-service data infrastructure solutions that support the needs of dozens of business units and hundreds of engineers.
Utilize Infrastructure as Code (IaC) principles to design, provision, and manage both on-premises and cloud (AWS) infrastructure components using tools such as Terraform.
Develop and maintain automation scripts using bash/shell scripting to automate operational tasks and deployments.
Enhance and manage CI/CD pipelines to facilitate consistent software deployments across the data infrastructure.
Implement robust data monitoring and alerting solutions to proactively detect anomalies and performance issues.
Manage and implement role-based access control (RBAC) and permissions for a multitude of user groups and machine workflows across different environments.
Utilize Kubernetes and Nomad to manage containerized applications within the data infrastructure, ensuring efficient deployment, scaling, and orchestration.
Implement effective incident response procedures and participate in on-call rotations.
Collaborate with data analysts, engineers, and cross-functional teams to understand requirements and implement appropriate solutions.
Document architecture, processes, and best practices to enable knowledge sharing and support continuous improvement.

What you Bring

Bachelor’s degree in Computer Science, Software Engineering, or a related field (or equivalent experience).
1+ year of proven experience working as a Site Reliability Engineer, Infrastructure/Platform/DevOps Engineer, Software Engineer, or in a similar role.
Ability to leverage AI tools and agents such as Claude and OpenAI to efficiently deliver business value.
Solid understanding of bash/shell scripting and proficiency in at least one programming language (preferably Python, Golang or Rust).
Experience with containerization tools such as Docker or Podman.
Strong problem-solving skills and the ability to troubleshoot complex systems.

Nice to haves

Experience managing and operating data systems such as Kafka, Redis, Elasticsearch, MariaDB, Airflow, Debezium, ScyllaDB, TiDB, HashiCorp Vault
Experience managing self-hosted and SaaS platforms such as Splunk, VictoriaMetrics, Grafana, Cloudflare, Ingresses, GitLab
Experience running Kubernetes as a Platform offering for engineering teams
Kubernetes AWS or on-premises experience managing workloads at scale
Infrastructure as Code tools such as Terraform, Terragrunt...

This listing was sourced from web3.career and ranked for crypto candidates. Apply via the original source.