Director, Site Reliability Engineering

Bellevue
Sorry, this job was removed at 11:02 a.m. (PST) on Monday, December 10, 2018

At Smartsheet, we are building the next-generation workspace collaboration platform. Our Technical Operations team is committed to operational excellence and delivering a world-class customer experience. We're in an exciting high-growth stage, and now is the best time to join our team. Learn more about us with this short video overview of Smartsheet: Smartsheet Overview Video.
We are currently looking for a Director of Site Reliability Engineering to join our Technical Operations team. In this position, you will directly impact the reliability and performance of our critical production application systems, supporting 24/7 delivery to over 70,000 customers worldwide.
This position will report to the VP of Technical Operations and is located at our Bellevue, WA headquarters.
Responsibilities:

  • Lead our Site Reliability program, which consists of 20+ engineers across three offices on two continents.
  • Manage key relationships with partners and internal customers, performing operational planning and forecasting.
  • Drive process initiatives to optimize delivery of computing resources – create work plans, gather and synthesize relevant data, lead analysis and develop final recommendations.
  • Identify delivery metrics and lead efforts to improve the efficiency, quality and cost of resource management to scale with the growth of Smartsheet.
  • Develop and oversee capacity, architecture and deployment plans related to the delivery and management of the Smartsheet production infrastructure.
  • Continuously optimize our Service Impacting Event (SIE) process and coordinate with internal stakeholders.
  • Maintain a relentless focus on the post-incident review process to ensure follow-through on the technology and process initiatives that increase the quality and availability of the Smartsheet platform.
  • Troubleshoot, investigate, and fix production issues in cloud and hosted environments, including both hardware and internal software issues.
  • Develop and improve automated system alerts, effectively troubleshoot system errors and work incidents to return systems to normal operating conditions.
  • Ensure production changes are documented, fully tested in non-production environments, and adhere to change control and audit requirements.
  • Identify and mitigate security, risk and compliance concerns, in accordance with company policies.
  • Special projects as assigned.


Requirements:

  • 10+ years of professional experience in a technically focused field (Systems / Application Engineering).
  • 8+ years of experience running a 24x7 mission-critical production service with 99.99% uptime requirements.
  • 6+ years of work experience with production Linux systems administration.
  • 5+ years in a supervisory/team leadership role.
  • 4+ years of data center/networking/operations experience, including experience in application environments.
  • Strong customer service focus; ability to communicate with and facilitate process with diverse groups including highly technical teams.
  • Demonstrated strong performance in prior roles, with increasing levels of responsibility and independence.
  • Strong analytical, problem-solving, negotiation and organizational skills.
  • Highly motivated, critical thinker with proven ability to manage a diverse team in a production support environment.
  • Ability to successfully manage competing priorities in critical incident situations.
  • Proficient with networking and internet protocols (e.g., HTTP, DNS, TCP/IP).
  • Proficient with config management, source control and containerization tools.
  • Experience with agile, scrum and ITIL service management methodologies.
  • Strong desire to learn, understand new technologies and mentor others.
  • Excellent verbal and written communication skills.
  • Ability to work in the U.S. on an ongoing basis.
  • BS degree in Engineering, IT-related field or equivalent practical experience.


About Smartsheet: 
In 2005, Smartsheet was founded on the idea that teams and millions of people worldwide deserve a better way to deliver their very best work. Today, the company delivers a leading cloud-based platform for work execution, empowering organizations to plan, capture, track, automate, and report on work at scale, resulting in more efficient processes and better business outcomes. Smartsheet went public on the New York Stock Exchange in April 2018 and currently enables collaboration, better decision making, and accelerated innovation for over 76,000 domain-based customers in 190 countries, including 96 of the Fortune 100.
Smartsheet is an Equal Opportunity Employer. Individuals seeking employment at Smartsheet are considered without regard to race, ethnicity, color, age, sex, religion, national origin, ancestry, pregnancy, sexual orientation, gender, gender identity, gender expression, genetic information, physical or mental disability, registered domestic partner status, caregiver status, marital status, veteran or military status, citizenship status, or any other legally protected category.


Location

10500 NE 8th St, Suite 1300, Bellevue, WA 98004
