Prime Intellect Logo

Prime Intellect

Research Engineer - Distributed Training

Job Posted 16 Days Ago Reposted 16 Days Ago
Be an Early Applicant
In-Office or Remote
2 Locations
Mid level
In-Office or Remote
2 Locations
Mid level
The Research Engineer will lead research on decentralized AI training, optimize performance, contribute to open-source libraries, and communicate technical outcomes to a broad audience.
The summary above was generated by AI

At Prime Intellect, we are on a mission to accelerate open and decentralized AI progress by enabling anyone to contribute compute, code or capital to train powerful, open models. Our ultimate goal? Openly accessible AGI that benefits everyone. But we can't do it alone and we want to do this together with you.

We are building the infrastructure for decentralized AI development at scale. We aggregate global compute and enable researchers to collaboratively train state-of-the-art models through distributed training across clusters.

As a Research Engineer working on Distributed Training, you'll play a crucial role in shaping our technological direction, focusing on our decentralizing AI training stack. If you love scaling things and maximizing training efficiency, this role is for you.

Responsibilities
  • Lead and participate in novel research to build a massive scale, highly reliable and secure decentralized training orchestration solution

  • Optimize the performance, cost, and resource utilization of AI workloads by leveraging the most recent advances for compute & memory optimization techniques.

  • Contribute to the development of our open-source libraries and frameworks for distributed model training.

  • Publish research in top-tier AI conferences such as ICML & NeurIPS.

  • Distill highly technical project outcomes in layman approachable technical blogs to our customers and developers.

  • Stay up-to-date with the latest advancements in AI/ML infrastructure and tools, decentralized training research and proactively identify opportunities to enhance our platform's capabilities and user experience.

Requirements
  • Strong background in AI/ML engineering, with extensive experience in designing and implementing end-to-end pipelines for training and deploying large-scale AI models.

  • Deep expertise in distributed training techniques, frameworks (e.g., PyTorch Distributed, DeepSpeed, MosaicML’s LLM Foundry), and tools (e.g. Ray) for optimizing the performance and scalability of AI workloads.

  • Experience in large-scale model training incl. distributed training techniques such as data, tensor & pipeline parallelism

  • Solid understanding of MLOps best practices, including model versioning, experiment tracking, and continuous integration/deployment (CI/CD) pipelines.

  • Passion for advancing the state-of-the-art in decentralized AI model training and democratizing access to AI capabilities for researchers, developers, and businesses worldwide.

  • If you're not familiar with these, but feel like that you can contribute to our mission and you're a high-energy person, get familiar with these resources (here, here and here) and please reach out!

Benefits & Perks
  • Competitive compensation, including equity and token incentives, aligning your success with the growth and impact of Prime Intellect.

  • Flexible work arrangements, with the option to work remotely or in-person at our offices in San Francisco.

  • Visa sponsorship and relocation assistance for international candidates.

  • Quarterly team off-sites, hackathons, conferences and learning opportunities.

  • Opportunity to work with a talented, hard-working and mission-driven team, united by a shared passion for leveraging technology to accelerate science and AI.

We recently raised $15mm in funding (total of $20mm raised) led by Founders Fund, with participation from Menlo Ventures and prominent angels including Andrej Karpathy (Eureka AI, Tesla, OpenAI), Tri Dao (Chief Scientific Officer of Together AI), Dylan Patel (SemiAnalysis), Clem Delangue (Huggingface), Emad Mostaque (Stability AI) and many others.

If you're excited about the opportunity to build the foundation for the future of decentralized AI and create a platform that empowers developers and researchers to push the boundaries of what's possible, we'd love to hear from you.

Top Skills

AI
Ci/Cd
Deepspeed
Ml
Mosaicml
Pytorch Distributed
Ray

Similar Jobs

An Hour Ago
Remote or Hybrid
IL, USA
Expert/Leader
Expert/Leader
Artificial Intelligence • eCommerce • Information Technology • Internet of Things • Automation
The Principal Microsoft Instructor delivers training programs, oversees quality assurance, mentors instructors, and develops technical courses in cyber security and IT.
Top Skills: AzureCRMCyber SecurityLmsM365Microsoft CertificationsSecurityVMwareVsphereWeb Conferencing Platforms
An Hour Ago
Remote or Hybrid
United States
109K-203K Annually
Mid level
109K-203K Annually
Mid level
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
As a Senior Developer Advocate, you will enhance developer experience using SailPoint technologies, engage with communities, write documentation, and build applications.
Top Skills: GoJavaScriptPowershellPythonRest ApisWebhooks
4 Hours Ago
Remote or Hybrid
4 Locations
175K-175K
Senior level
175K-175K
Senior level
eCommerce • Legal Tech • Professional Services • Software • Data Privacy
The Senior DevOps Engineer will manage and scale on-prem infrastructure, automate workflows, implement CI/CD pipelines, and ensure system reliability and security.
Top Skills: AnsibleBashDockerGitlab CiGrafanaJenkinsLinuxPostgresPrometheusProxmoxPythonTerraformVMwareZabbix

What you need to know about the Seattle Tech Scene

Home to tech titans like Microsoft and Amazon, Seattle punches far above its weight in innovation. But its surrounding mountains, sprinkled with world-famous hiking trails and climbing routes, make the city a destination for outdoorsy types as well. Established as a logging town before shifting to shipbuilding and logistics, the Emerald City is now known for its contributions to aerospace, software, biotech and cloud computing. And its status as a thriving tech ecosystem is attracting out-of-town companies looking to establish new tech and engineering hubs.

Key Facts About Seattle Tech

  • Number of Tech Workers: 287,000; 13% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Amazon, Microsoft, Meta, Google
  • Key Industries: Artificial intelligence, cloud computing, software, biotechnology, game development
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Madrona, Fuse, Tola, Maveron
  • Research Centers and Universities: University of Washington, Seattle University, Seattle Pacific University, Allen Institute for Brain Science, Bill & Melinda Gates Foundation, Seattle Children’s Research Institute
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account