athenahealth

Lead Site Reliability Engineer

Reposted An Hour Ago
Remote
Hiring Remotely in USA
119K-203K Annually
Senior level
Lead Site Reliability Engineer responsible for ensuring cloud services reliability, automation, and performance while mentoring a team and collaborating cross-functionally. Drive initiatives to enhance incident management and enforce security compliance.
The summary above was generated by AI

Join us as we work to create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all.

Lead Site Reliability Engineer — Cloud Engineering

Position Summary:
Design, build, and operate highly available cloud infrastructure that supports critical healthcare services and scales to meet growth. Lead reliability, automation, monitoring, and incident response efforts to ensure system performance and resilience across private and public cloud environments. This role works in a hybrid environment based in Boston, MA and partners closely with development, security, and operations teams. This role reports to the Cloud Engineering Manager.

About the Team:
The Cloud Engineering team ensures continuous availability and scalability of the systems that power athenahealth’s products, managing compute, storage, and network services across hybrid cloud environments. The team partners with application engineering, security, product, and site reliability teams to design resilient architectures, automate operations, and reduce manual work through Infrastructure as Code and observability tooling. Key technologies include Terraform, Kubernetes, public cloud services, and monitoring/observability platforms.

Essential Job Responsibilities:

  • Define, measure, and maintain Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for cloud services and infrastructure components.
  • Lead improvements to system availability, fault tolerance, and disaster recovery capabilities.
  • Manage incident detection, conduct root cause analysis, and oversee timely resolution of production incidents.
  • Drive automation and Infrastructure as Code (IaC) initiatives using tools such as Terraform, CloudFormation, and Ansible to provision and manage cloud resources.
  • Design and maintain monitoring, logging, and alerting solutions to provide continuous visibility into infrastructure health and performance.
  • Identify performance bottlenecks and implement capacity, cost, and performance optimizations for cloud services.
  • Ensure cloud infrastructure meets security and compliance requirements in collaboration with security and risk teams.
  • Lead and mentor Site Reliability Engineers, setting technical direction and promoting operational best practices.
  • Collaborate with development, DevOps, and operations teams to align infrastructure with application and business needs.
  • Evaluate and pilot AI-assisted tools that help detect anomalies, prioritize incidents, automate routine remediation, and forecast capacity needs; recommend safe, human-centered adoption practices and guide the team in using these tools responsibly.

Additional Job Responsibilities:

  • Own post-incident reviews and implement preventive measures to reduce recurrence and recovery time.
  • Support onboarding and go-live activities, including runbooks, playbooks, and run-time documentation.
  • Contribute to technical documentation and knowledge sharing to improve team effectiveness.
  • Participate in cross-team forums to align priorities and remove delivery blockers.
  • Support continuous improvement initiatives focused on reducing operational toil and improving system reliability.

Expected Education & Experience:

  • 7+ years of experience in Site Reliability Engineering, Infrastructure Engineering, or DevOps roles, with at least 3 years in a technical lead capacity.
  • 10+ years of hands-on experience with cloud automation and configuration management tools (for example, Terraform, CloudFormation, Ansible, or Puppet) across hybrid cloud environments.
  • Strong practical experience with public cloud services (AWS, Google Cloud, Azure) and cloud-native technologies such as Kubernetes and container orchestration.
  • Proficiency in one or more scripting or programming languages (Python, Go, Bash, or similar).
  • Experience designing and operating monitoring, logging, and observability solutions (Prometheus, Grafana, Datadog, ELK, CloudWatch, or similar).
  • Demonstrated ability to build and operate highly available, scalable, and fault-tolerant systems in production.
  • Solid knowledge of networking, storage, compute, Linux administration, and cloud security best practices.
  • Experience with CI/CD pipelines and automating deployment and release processes.
  • Experience mentoring engineers and providing technical leadership across distributed teams.
  • Bachelor’s degree in Computer Science, Engineering, or related field preferred; equivalent practical experience accepted.
  • Certifications in cloud platforms (AWS, GCP, Azure) or relevant technologies are a plus.

Expected Compensation

$119,000 - $203,000

The base salary range shown reflects the full range for this role from minimum to maximum. At athenahealth, base pay depends on multiple factors, including job-related experience, relevant knowledge and skills, how your qualifications compare to others in similar roles, and geographical market rates. Base pay is only one part of our competitive Total Rewards package: depending on role eligibility, we offer both short- and long-term incentives by way of an annual discretionary bonus plan, variable compensation plan, and equity plans.


About athenahealth

Our vision: In an industry that becomes more complex by the day, we stand for simplicity. We offer IT solutions and expert services that eliminate the daily hurdles preventing healthcare providers from focusing entirely on their patients — powered by our vision to create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all.

Our company culture: Our talented employees (or athenistas, as we call ourselves) spark the innovation and passion needed to accomplish our vision. We are a diverse group of dreamers and doers with unique knowledge, expertise, backgrounds, and perspectives. We unite as mission-driven problem-solvers with a deep desire to achieve our vision and make our time here count. Our award-winning culture is built around shared values of inclusiveness, accountability, and support.

Our DEI commitment: Our vision of accessible, high-quality, and sustainable healthcare for all requires addressing the inequities that stand in the way. That's one reason we prioritize diversity, equity, and inclusion in every aspect of our business, from attracting and sustaining a diverse workforce to maintaining an inclusive environment for athenistas, our partners, customers and the communities where we work and serve.

What we can do for you:

Along with health and financial benefits, athenistas enjoy perks specific to each location, including commuter support, employee assistance programs, tuition assistance, employee resource groups, and collaborative workspaces. Some offices even welcome dogs.

We also encourage a better work-life balance for athenistas with our flexibility. While we know in-office collaboration is critical to our vision, we recognize that not all work needs to be done within an office environment, full-time. With consistent communication and digital collaboration tools, athenahealth enables employees to find a balance that feels fulfilling and productive for each individual situation.

In addition to our traditional benefits and perks, we sponsor events throughout the year, including book clubs, external speakers, and hackathons. We provide athenistas with a company culture based on learning, the support of an engaged team, and an inclusive environment where all employees are valued. 

Learn more about our culture and benefits here: athenahealth.com/careers  

https://www.athenahealth.com/careers/equal-opportunity
