OpenAI Logo

OpenAI

Senior Data Engineer, Core Experimentation

Posted 17 Days Ago
Be an Early Applicant
Hybrid
Seattle, WA, USA
293K-325K Annually
Senior level
Hybrid
Seattle, WA, USA
293K-325K Annually
Senior level
The role involves designing and managing data pipelines, developing canonical datasets, and ensuring data integration and compliance. Collaborate across teams to meet data needs and maintain data architecture.
The summary above was generated by AI

About the Team

The Statsig team at OpenAI builds and operates the experimentation platform that powers product development, measurement, and decision-making across the company. We partner closely with product, engineering, and infrastructure teams to ensure experiments are trustworthy, statistically rigorous, and scalable to the needs of frontier AI products.

Our mission is to help teams make better decisions through reliable experimentation. We care deeply about statistical correctness, pragmatic solutions, and building systems that researchers and engineers can trust at massive scale. The team operates at the intersection of experimentation methodology, data infrastructure, causal inference, and product analytics.

We are looking for experienced experimentation experts who want to shape the future of experimentation in the AI era.

About the role:

We're seeking a Data Engineer to take the lead in building our data pipelines and core tables for OpenAI. These pipelines are crucial for powering analyses, safety systems that guide business decisions, product growth, and prevent bad actors. If you're passionate about working with data and are eager to create solutions with significant impact, we'd love to hear from you. This role also provides the opportunity to collaborate closely with the researchers behind ChatGPT and help them train new models to deliver to users. As we continue our rapid growth, we value data-driven insights, and your contributions will play a pivotal role in our trajectory. Join us in shaping the future of OpenAI!

In this role, you will:

  • Design, build and manage our data pipelines, ensuring all user event data is seamlessly integrated into our data warehouse.

  • Develop canonical datasets to track key product metrics including user growth, engagement, and revenue.

  • Work collaboratively with various teams, including, Infrastructure, Data Science, Product, Marketing, Finance, and Research to understand their data needs and provide solutions.

  • Implement robust and fault-tolerant systems for data ingestion and processing.

  • Participate in data architecture and engineering decisions, bringing your strong experience and knowledge to bear.

  • Ensure the security, integrity, and compliance of data according to industry and company standards.

You might thrive in this role if you:

  • Have 3+ years of experience as a data engineer and 8+ years of any software engineering experience(including data engineering).

  • Proficiency in at least one programming language commonly used within Data Engineering, such as Python, Scala, or Java.

  • Experience with distributed processing technologies and frameworks, such as Hadoop, Flink and distributed storage systems (e.g., HDFS, S3).

  • Expertise with any of ETL schedulers such as Airflow, Dagster, Prefect or similar frameworks.

  • Solid understanding of Spark and ability to write, debug and optimize Spark code.

This role is based in Bellevue. We use a hybrid work model and value in-person collaboration for technical design, iteration, and cross-functional partnership.
Compensation Range: $293K - $325K USD

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity. 

We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement.

Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Similar Jobs

3 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
Senior level
Senior level
Legal Tech • Real Estate • Security • Software • Cybersecurity • PropTech
As a Senior Data Engineer, you will design and maintain scalable data models, build production data pipelines, and support analytics across various business functions.
Top Skills: Analytics ToolsCloud-Based EnvironmentsData ModelingData WarehousingElt PipelinesSQL
11 Days Ago
Hybrid
Seattle, WA, USA
140K-188K Annually
Senior level
140K-188K Annually
Senior level
eCommerce • Fintech • Logistics • Software • Transportation • Big Data Analytics
As a Senior Data Engineer, you will build and maintain data infrastructure, enabling product teams with data-driven insights and solutions.
Top Skills: AWSDbtDockerKubernetesPythonSnowflakeSQL
2 Hours Ago
In-Office
Seattle, WA, USA
75K-128K Annually
Junior
75K-128K Annually
Junior
Aerospace • Information Technology • Software • Cybersecurity • Design • Defense • Manufacturing
Responsible for managing suppliers' quality and delivery performance, issuing purchase orders, and supporting supplier reviews within the Boeing systems.
Top Skills: CsdtErpMicrosoft Office SuiteSAP

What you need to know about the Seattle Tech Scene

Home to tech titans like Microsoft and Amazon, Seattle punches far above its weight in innovation. But its surrounding mountains, sprinkled with world-famous hiking trails and climbing routes, make the city a destination for outdoorsy types as well. Established as a logging town before shifting to shipbuilding and logistics, the Emerald City is now known for its contributions to aerospace, software, biotech and cloud computing. And its status as a thriving tech ecosystem is attracting out-of-town companies looking to establish new tech and engineering hubs.

Key Facts About Seattle Tech

  • Number of Tech Workers: 287,000; 13% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Amazon, Microsoft, Meta, Google
  • Key Industries: Artificial intelligence, cloud computing, software, biotechnology, game development
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Madrona, Fuse, Tola, Maveron
  • Research Centers and Universities: University of Washington, Seattle University, Seattle Pacific University, Allen Institute for Brain Science, Bill & Melinda Gates Foundation, Seattle Children’s Research Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account