Oscilar Logo

Oscilar

Sr./Staff - Infrastructure/Site Reliability Engineer (SRE)

Reposted 11 Days Ago
Remote
2 Locations
Senior level
Remote
2 Locations
Senior level
Seeking a seasoned SRE to lead reliability for a cloud-native platform, overseeing infrastructure, CI/CD pipelines, observability, and mentoring engineers.
The summary above was generated by AI

Shape the future of trust in the age of AI
At Oscilar, we're building the most advanced AI Risk Decisioning™ Platform. Banks, fintechs, and digitally native organizations rely on us to manage their fraud, credit, and compliance risk with the power of AI. If you're passionate about solving complex problems and making the internet safer for everyone, this is your place.

Why join us:
  • Mission-driven teams: Work alongside industry veterans from Meta, Uber, Citi, and Confluent, all united by a shared goal to make the digital world safer.

  • Ownership and impact: We believe in extreme ownership. You'll be empowered to take responsibility, move fast, and make decisions that drive our mission forward.

  • Innovate at the cutting edge: Your work will shape how modern finance detects fraud and manages risk.

About the Role

Oscilar is growing fast, and so is the complexity of our systems. We’re looking for a experienced SRE to take ownership of reliability across our multi-region, cloud-native platform. You’ll have the mandate and autonomy to design, implement, and evolve systems that stay performant and resilient—through traffic spikes, dependency failures, and global deployments. You’ll be shaping how we scale, how we build observability, and how we run infrastructure that supports billions of events and large-scale data pipelines.

What You’ll Own
  • Architect and operate resilient cloud infrastructure (AWS, Pulumi, Kubernetes).

  • Lead initiatives to improve availability, latency, and performance at scale.

  • Design and evolve our CI/CD pipelines to optimize for speed, safety, and repeatability.

  • Define the metrics, alerts, and runbooks that form our observability backbone.

  • Run chaos experiments and failure simulations to harden the platform.

  • Mentor engineers and set best practices for SRE across the company.

What You Bring
  • Proven track record as a senior SRE or Infrastructure Engineer in high-scale environments.

  • Expert-level skills in AWS and Infrastructure as Code (Pulumi, Terraform).

  • Strong programming ability in Go or Python. We use Go.

  • Deep understanding of distributed systems (Kafka, ClickHouse) and microservices architecture.

  • Mastery of container orchestration (Kubernetes) and production debugging.

  • Strong sense of ownership, and the judgment to balance velocity with reliability.

Benefits
  • Compensation: Competitive salary and equity packages, including a 401k plan

  • Flexibility: Remote-first culture — work from anywhere

  • Health: 100% Employer covered comprehensive health, dental, and vision insurance with a top tier plan for you and your dependents (US)

  • Balance: Unlimited PTO policy

  • Technical: AI First company; both Co-Founders are engineers at heart; and over 50% of the company is Engineering and Product

  • Culture: Family-Friendly environment; Regular team events and offsites

  • Development: Unparalleled learning and professional development opportunities

  • Impact: Making the internet safer by protecting online transactions

Top Skills

AWS
Clickhouse
Go
Java
Kafka
Kubernetes
Pulumi
Terraform

Similar Jobs

17 Days Ago
In-Office or Remote
Toronto, ON, CAN
110K-270K Annually
Senior level
110K-270K Annually
Senior level
Big Data • Cloud • Healthtech • Software • Big Data Analytics
As a Senior Site Reliability Engineer, you'll ensure scalability and reliability of enterprise applications, lead incident management, and develop tools for automation.
Top Skills: AWSDockerGitHibernateJavaKubernetesLinuxMavenMySQLSolrSpringTomcatVagrant
8 Days Ago
In-Office or Remote
Toronto, ON, CAN
150K-330K Annually
Senior level
150K-330K Annually
Senior level
Travel
The Senior Site Reliability Engineer will enhance platform tooling, automate infrastructure workflows, improve scalability, and support engineering teams in incident response and collaboration.
Top Skills: BashDatadogGoogle Cloud PlatformHelmIstioKubernetesKustomizePythonTerraform
3 Days Ago
In-Office or Remote
Montréal, QC, CAN
Senior level
Senior level
Software
The Site Reliability Engineer will maintain and optimize the reliability of cloud infrastructure, focusing on automation, observability, and incident management in SaaS environments.
Top Skills: AWSBashDatadogGitlab Ci/CdJavaKubernetesPythonTerraform

What you need to know about the Seattle Tech Scene

Home to tech titans like Microsoft and Amazon, Seattle punches far above its weight in innovation. But its surrounding mountains, sprinkled with world-famous hiking trails and climbing routes, make the city a destination for outdoorsy types as well. Established as a logging town before shifting to shipbuilding and logistics, the Emerald City is now known for its contributions to aerospace, software, biotech and cloud computing. And its status as a thriving tech ecosystem is attracting out-of-town companies looking to establish new tech and engineering hubs.

Key Facts About Seattle Tech

  • Number of Tech Workers: 287,000; 13% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Amazon, Microsoft, Meta, Google
  • Key Industries: Artificial intelligence, cloud computing, software, biotechnology, game development
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Madrona, Fuse, Tola, Maveron
  • Research Centers and Universities: University of Washington, Seattle University, Seattle Pacific University, Allen Institute for Brain Science, Bill & Melinda Gates Foundation, Seattle Children’s Research Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account