The Lead AI Engineer will design and implement LLM-powered applications, optimize RAG pipelines, and evaluate models while integrating AI systems into production environments.
ZS is a place where passion changes lives. As a management consulting and technology firm focused on improving life and how we live it, we transform ideas into impact by bringing together data, science, technology and human ingenuity to deliver better outcomes for all. Here you'll work side-by-side with a powerful collective of thinkers and experts shaping life-changing solutions for patients, caregivers and consumers, worldwide. ZSers drive impact by bringing a client-first mentality to each and every engagement. We partner collaboratively with our clients to develop custom solutions and technology products that create value and deliver company results across critical areas of their business. Bring your curiosity for learning, bold ideas, courage and passion to drive life-changing impact to ZS.
What you'll do: Lead AI Engineer in the Platforms and Products will...
We are seeking a highly motivated Applied AI Engineer with a strong foundation in Machine Learning and a deep interest in Large Language Models (LLMs) and Generative AI. This role focuses on building, optimizing, and evaluating production-grade LLM systems, including Retrieval-Augmented Generation (RAG), fine-tuning workflows, and scalable inference pipelines.
- Design and implement LLM-powered applications using state-of-the-art transformer models.
- Build and optimize RAG pipelines using embeddings, chunking strategies, and vector search.
- Experiment with prompt engineering, structured outputs (JSON schemas/function calling), and tool-augmented LLMs (agents/workflows).
- Fine-tune models using techniques such as LoRA, PEFT, and instruction tuning.
- Develop and evaluate embedding models for similarity search and semantic retrieval.
- Conduct LLM evaluation using automated and human-in-the-loop techniques, both offline and online.
- Optimize inference workflows for latency, GPU utilization, and cost efficiency (quantization, batching, caching).
- Build and maintain REST API services (e.g., FastAPI) to deploy LLM/RAG endpoints, integrate with product systems, and support scalable inference.
- Contribute to integration of AI systems into production software environments (CI/CD, monitoring, reliability).
- Research and prototype cutting-edge approaches in Generative AI and share learnings with the team.
What you'll bring:
- A master's or bachelor's degree in Computer Science or a related field from a top university
- 4+ years of hands-on experience in Machine Learning (ML) with production LLM systems
- Strong fundamentals in machine learning, deep learning, and LLM fine-tuning, including:
  - Understanding of transformer architectures
  - Prompt engineering expertise
  - Embeddings and vector search
- Experience in backend API design with FastAPI, async patterns, and rate limiting
- Experience with vector databases, including:
  - Pinecone, Weaviate, or Chroma
  - Embedding storage and similarity search
  - Hybrid search implementations
- Strong programming expertise in Python is a must, including:
  - Async programming (asyncio, async/await)
  - Type hints and Pydantic
  - SOLID principles and design patterns
- Experience in MLOps to measure and track model performance, including:
  - MLflow for model tracking
  - Langfuse for LLM observability (strongly preferred)
  - Model versioning and A/B testing
- Experience working with NLP and computer vision
- Fluency in English
- Client-first mentality
- Intense work ethic
- Collaborative spirit and problem-solving approach
At ZS, your growth matters. We offer a comprehensive total rewards package that supports your health and well-being, financial future, time away, and professional development. With robust skills-building programs, multiple career progression paths, internal mobility, and a deeply collaborative culture, you'll have the opportunity to do meaningful work, expand your capabilities, and thrive as part of a global community. For details on total rewards in the United States, see the ZS US office locations page (Where we work).
Salary: $155,000.00 - $167,750.00
Top Skills
FastAPI
Generative AI
Langfuse
Large Language Models
Machine Learning
MLflow
Python
Retrieval-Augmented Generation
ZS Bellevue, Washington, USA Office
ZS Seattle Office

One of ZS’s newest offices, Seattle is at the center of ZS’s digital, technology and data science innovation.