Upstart

Director of Reliability

Job Posted 5 Days Ago Reposted 5 Days Ago
Remote
2 Locations
217K-301K Annually
Senior level
The Director of Reliability will lead the Site Reliability Engineering team to ensure platform reliability, performance, and scalability, implementing automation and observability practices while aligning SRE initiatives with business objectives.
The summary above was generated by AI

About Upstart

Upstart is the leading AI lending marketplace partnering with banks and credit unions to expand access to affordable credit. By leveraging Upstart's AI marketplace, Upstart-powered banks and credit unions can have higher approval rates and lower loss rates across races, ages, and genders, while simultaneously delivering the exceptional digital-first lending experience their customers demand. More than 80% of borrowers are approved instantly, with zero documentation to upload.

Upstart is a digital-first company, which means that most Upstarters live and work anywhere in the United States. However, we also have offices in San Mateo, California; Columbus, Ohio; and Austin, Texas.

Most Upstarters join us because they connect with our mission of enabling access to effortless credit based on true risk. If you are energized by the impact you can make at Upstart, we’d love to hear from you!

The Team

As the Director of Reliability, you’ll be the strategic force ensuring our platform is not only always-on but also relentlessly performant and scalable. You’ll lead the Site Reliability Engineering (SRE), Compute, Quality, Runtime, and Deployment teams to build resilient systems, all while championing automation, observability, and incident excellence. This role is at the heart of Upstart’s mission: making credit more accessible and fair by ensuring the tech behind it never misses a beat.

Your leadership will transform how we build, deploy, and maintain reliable systems, ensuring our technology is an enabler rather than a bottleneck. As the driving force behind resilient and scalable infrastructure, you will enable our company to deliver exceptional customer experiences, support innovation, and sustain long-term growth.

This isn’t just an operational role—it’s a strategic leadership position that will define the future of our platform’s reliability and performance.

How you’ll make an impact:

  • You will proactively prevent downtime and service disruptions by implementing robust monitoring, alerting, and automation strategies.
  • Your efforts in optimizing system performance will improve response times, reduce latency, and enhance overall customer satisfaction.
  • By championing automation and observability, you will reduce manual toil and free up engineering teams to focus on innovation.
  • Your leadership will help create self-healing systems, reducing the need for reactive firefighting and improving developer productivity.
  • You will lead the development of a world-class incident response process, ensuring quick resolution of outages and minimizing business impact.
  • You will empower teams with SRE best practices, breaking down silos between development, operations, and security teams.
  • By aligning SRE initiatives with business objectives, you will help balance reliability with speed of innovation, ensuring that engineering teams can ship features quickly without sacrificing stability.
  • Your contributions will directly support revenue growth by reducing service disruptions and ensuring a seamless user experience.

What we’re looking for: 

  • Minimum requirements:
    • 10+ years of experience in software engineering, DevOps, or Site Reliability Engineering, with at least 5 years in a leadership role.
    • Proven experience leading large-scale, mission-critical distributed systems with a focus on reliability, scalability, and security.
    • Expertise in cloud platforms such as AWS, Azure, or Google Cloud.
    • Strong background in observability tools like Prometheus, Grafana, Datadog, New Relic, or Splunk.
    • Experience with infrastructure as code (Terraform, CloudFormation) and containerization (Docker, Kubernetes).
    • Strong understanding of networking, security, and performance optimization.
    • Demonstrated success in building high-performing SRE teams and implementing best practices.
  • Preferred qualifications:
    • Experience building and leading teams that deliver big impact.
    • Experience developing and maintaining large-scale distributed systems in AWS.
    • Ability to influence and lead others without direct authority.
    • Strong product and analytical mindset that allows you to think in terms of ROI, risk, and trade-offs.
    • Experience working at companies that have gone through periods of rapid business or organizational growth while maintaining high standards.

Position Location - This role is available in the following locations: Remote, San Mateo, Columbus, Austin 

Time Zone Requirements - This team operates on the East/West Coast time zones.

Travel Requirements - As a digital-first company, the majority of your work can be accomplished remotely. The majority of our employees can live and work anywhere in the U.S. but are encouraged to still spend high-quality time collaborating in person via regular onsites. The cadence of these in-person sessions varies depending on the team and role; the Engineering team meets quarterly for 4-5 consecutive days at a time.


What you'll love: 

  • Competitive Compensation (base + bonus & equity)
  • Comprehensive medical, dental, and vision coverage with Health Savings Account contributions from Upstart 
  • 401(k) with 100% company match up to $4,500, immediate vesting, and after-tax savings
  • Employee Stock Purchase Plan (ESPP)
  • Life and disability insurance
  • Generous holiday, vacation, sick and safety leave  
  • Supportive parental, family care, and military leave programs
  • Annual wellness, technology & ergonomic reimbursement programs
  • Social activities including team events and onsites, all-company updates, employee resource groups (ERGs), and other interest groups such as book clubs, fitness, investing, and volunteering
  • Catered lunches + snacks & drinks when working in offices

 

At Upstart, your base pay is one part of your total compensation package. The anticipated base salary for this position is expected to be within the range below. Your actual base pay will depend on your geographic location: with our “digital first” philosophy, Upstart uses compensation regions that vary depending on location. Individual pay is also determined by job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

In addition, Upstart provides employees with target bonuses, equity compensation, and generous benefits packages (including medical, dental, vision, and 401k).

United States | Remote - Anticipated Base Salary Range
$217,400 - $300,900 USD

Upstart is a proud Equal Opportunity Employer. We are dedicated to ensuring that underrepresented classes receive better access to affordable credit, and are just as committed to embracing diversity and inclusion in our hiring practices. We celebrate all cultures, backgrounds, perspectives, and experiences, and know that we can only become better together. 

If you require reasonable accommodation in completing an application, interviewing, completing any pre-employment testing, or otherwise participating in the employee selection process, please email candidate_accommodations@upstart.com

https://www.upstart.com/candidate_privacy_policy

Top Skills

AWS
Azure
CloudFormation
Datadog
Docker
GCP
Grafana
Kubernetes
New Relic
Prometheus
Splunk
Terraform
