EXL Jobs

Senior Data Engineer

EXL

Senior Data Engineer

Posted 9 Days Ago

Be an Early Applicant

Remote or Hybrid

Hiring Remotely in United States

94K-154K Annually

Senior level

Remote or Hybrid

Hiring Remotely in United States

94K-154K Annually

Senior level

Design, build, and operate production-grade, event-driven data pipelines (Kafka/Flink) on GCP to deliver model-ready features. Optimize BigQuery SQL and Parquet performance, develop Python data workloads (Polars/Pandas), deploy ML pipeline components on Kubeflow/Vertex AI with Docker, design event store architectures, and collaborate with ML and platform teams while documenting architecture and standards.

The summary above was generated by AI

EXL is hiring a Senior Data Engineer to join a strategic AI / ML platform engagement with a leading specialty retailer. This is a hands-on build role embedded with the client's platform engineering team.

The role requires shipping production-grade data pipelines that feed real-time customer event data into machine learning workflows. The right person is comfortable owning the full lifecycle of pipeline design, build, and deployment: from streaming ingestion through event store design to model-ready feature delivery.

This is a high-visibility role with growth potential into a larger book of work as the engagement expands.

Salary Range: $93,900 - $154,200 annual base

The posted range is the hiring range for this role — a subset of the broader range available to employees over time — and reflects base salary across our national hiring scale. Final offers are based on several factors, including the candidate's skills and experience, internal pay equity, work location, market conditions for the role, and the specific scope and responsibilities of the position. The top of the range is reserved for candidates who notably exceed the requirements; the lower end applies to those with less experience or fewer preferred qualifications. For positions based in higher-cost zones (e.g., California, New York, New Jersey), actual compensation may exceed the posted range; your recruiter will share specifics during the process.

Responsibilities

What You'll Do

Design and operate event-driven data pipelines using Kafka consumers and Flink jobs to process high-volume customer events (clicks, purchases, returns) in near-real time.
Build and optimize large-scale data transformations on Google Cloud Platform — BigQuery SQL, query performance tuning, and partitioning strategy at scale.
Develop Python data engineering workloads using Polars or Pandas at scale, with rigorous attention to Parquet partitioning, join performance on large datasets, and memory efficiency.
Build, deploy, and maintain ML pipeline components on Kubeflow Pipelines (KFP) and Vertex AI; package and deploy services with Docker.
Design event store architecture: partitioning by customer, time-ordered event assembly across heterogeneous sources, and schema management for mixed event types.
Partner with ML engineers, platform engineers, and data scientists to deliver clean, performant, model-ready data products.
Document architecture decisions and contribute to engineering standards across the platform team.

Qualifications

Required Skills & Experience

6–12 years of experience in data engineering, platform engineering, or a closely related discipline.
Streaming: Production experience with Kafka consumers and Flink stream processing — building, deploying, and operating streaming jobs at meaningful scale.
GCP Data Stack: Strong SQL on BigQuery (or an equivalent cloud warehouse), with demonstrated query optimization, cost management, and partitioning chops.
Python Data Engineering: Hands-on with Polars or Pandas at scale; deep working knowledge of Parquet partitioning and performance on large joins.
ML Pipelines: Hands-on experience building and deploying components on Kubeflow Pipelines (KFP) and/or Vertex AI Pipelines; working proficiency with Docker.
Event Store Design: Demonstrated experience designing event stores — partitioning by customer, time-ordered event assembly across sources, schema strategy for mixed event types (clicks, purchases, returns).
Communication: Strong written and verbal communication; comfortable being the senior IC voice in design conversations with client stakeholders.

Nice to Have

Domain experience in Retail or E-commerce — customer journey data, transaction analytics, returns and exchanges modeling.
Exposure to schema registry tooling (e.g., Confluent), Iceberg, or Delta Lake.
Experience working in client-facing or consulting engagements.
Google Cloud certifications (Professional Data Engineer or equivalent).

Work Arrangement & Eligibility

This role requires 3–4 days per week onsite in Seattle, WA. Fully remote and out-of-state candidates will not be considered.
EXL is open to sponsoring H1B transfers for qualified candidates.

Similar Jobs

Block

Senior Data Engineer

Yesterday

In-Office or Remote

CA, USA

168K-297K Annually

Senior level

168K-297K Annually

Senior level

Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency

Design and maintain data architecture and pipelines to support compliance and risk teams. Build and optimize data models, standardize metrics, and create data dictionaries. Implement data quality, lineage monitoring, AI-driven agents for false-positive reduction and automation, and participate in on-call rotations to ensure SLAs are met.

Top Skills: AirflowDatabricksDbtGitOmniPrefectPythonSnowflakeSQLTerraform

Cash App

Senior Data Engineer

2 Days Ago

Remote or Hybrid

CA, USA

168K-297K Annually

Senior level

168K-297K Annually

Senior level

Blockchain • Fintech • Mobile • Payments • Software • Financial Services

Lead design and optimization of data models and pipelines for compliance and risk; standardize metrics and documentation; build data quality, lineage, and monitoring (including AI agents for automation); manage ETL scheduling, on-call pipeline support, and collaborate with product and non-technical partners to translate business needs into automated, production-ready data solutions.

Top Skills: AirflowDatabricksDbtGitOmniPrefectPythonSnowflakeSQLTerraform

Fusion Risk Management

Senior Data Engineer

5 Days Ago

Remote or Hybrid

135K-155K Annually

Senior level

135K-155K Annually

Senior level

Professional Services • Software

Lead architecture and buildout of a new graph-backed enterprise data platform: design ingestion, graph and relational storage, entity resolution pipelines, temporal models, ETL/ELT pipelines, governance, APIs, and production connectors. Ship scalable graph data models, traversal queries, and platform roadmap while enabling observability, security, and containerized deployments.

Top Skills: AirflowAzureCypherDagsterDbtDockerGremlinHelmJavaKubernetesPythonSalesforceServicenowSparqlSQL

What you need to know about the Seattle Tech Scene

Home to tech titans like Microsoft and Amazon, Seattle punches far above its weight in innovation. But its surrounding mountains, sprinkled with world-famous hiking trails and climbing routes, make the city a destination for outdoorsy types as well. Established as a logging town before shifting to shipbuilding and logistics, the Emerald City is now known for its contributions to aerospace, software, biotech and cloud computing. And its status as a thriving tech ecosystem is attracting out-of-town companies looking to establish new tech and engineering hubs.

Key Facts About Seattle Tech

Number of Tech Workers: 287,000; 13% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Amazon, Microsoft, Meta, Google
Key Industries: Artificial intelligence, cloud computing, software, biotechnology, game development
Funding Landscape: $3.1 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Madrona, Fuse, Tola, Maveron
Research Centers and Universities: University of Washington, Seattle University, Seattle Pacific University, Allen Institute for Brain Science, Bill & Melinda Gates Foundation, Seattle Children’s Research Institute