Appsilon

Data Engineer

Posted Yesterday
In-Office or Remote
Hiring Remotely in Warszawa, Mazowieckie
23K-23K Annually
Mid level
Design, build, and maintain scalable data pipelines; integrate diverse data sources into warehouses or lakes; collaborate with data scientists and engineers to ensure quality and availability; optimize schemas, models, and performance; implement governance, security, and compliance; monitor reliability and document systems.
The summary above was generated by AI
Why do we need you?

At Appsilon, we empower global organizations to make smarter decisions with data. Our solutions help Fortune 500 companies discover new drugs, save lives, optimize operations, and unlock millions in value. To do this, we rely on robust, scalable, beautifully engineered data systems.

We're looking for a Data Engineer who can elevate how our clients collect, process, and leverage massive datasets — someone who loves building modern data pipelines and wants their work to power meaningful, real-world impact.

Your responsibilities:
  • Design, build, and maintain scalable data pipelines across diverse environments.

  • Integrate data from multiple internal and external sources into data warehouses or data lakes.

  • Collaborate closely with Data Scientists, ML Engineers, and Developers to ensure data quality, structure, and availability.

  • Monitor and improve data integrity, performance, and reliability.

  • Build and optimize database schemas, data models, and documentation.

  • Implement data governance, security best practices, and compliance standards.

We’re looking for somebody with:

Backend Python Development
  • Strong experience building scalable backend systems in Python.

  • Comfortable with modern language features (type hints, decorators, generators).

  • Able to design clean, maintainable APIs using FastAPI, Django REST Framework, or Flask.

  • Good understanding of performance optimization and Python internals.

  • A collaborative mindset — you enjoy working closely with cross-functional teams.

Data Engineering
  • Hands-on experience designing and operating ETL/ELT pipelines.

  • Solid SQL skills and ability to model, optimize, and maintain database structures.

  • Experience integrating data from multiple sources (databases, APIs, streaming).

  • Familiarity with large-scale data processing tools or distributed systems.

Nice to have:
  • Experience with cloud platforms (AWS/Azure/GCP).

  • Knowledge of R.

  • Experience with Docker, Kubernetes, and CI/CD tools (GitHub Actions, GitLab CI).

  • Understanding of data governance, metadata management, and security.

  • Experience in life sciences, biotech, genomics, or enterprise data environments.

  • Prior remote work experience with international teams.

Life science skills:
  • Molecular Biology & Bioinformatics: Leverages molecular biology and bioinformatics to analyze data and communicate biological insights.

  • Clinical Trials - Data Tools & Flow: Builds and analyzes clinical trial data pipelines, ensuring auditability and delivering insights through collaboration and visualization tools.

  • CDISC & Clinical Data Standards: Applies and designs clinical data structures using CDISC standards, ensuring compliance and supporting best practices across teams.

  • Nextflow: Develops scalable, reproducible bioinformatics pipelines with Nextflow across local, HPC, and cloud environments.

What we offer:
  • Competitive B2B compensation with clear salary ranges (up to 23,000 PLN net B2B).

  • Modern equipment (MacBook / ThinkPad + Linux environment).

  • Work on high-impact, cutting-edge projects in biotech, pharma, research, and enterprise analytics.

  • Budget for professional development (certifications, courses, conferences).

  • Opportunity to collaborate with industry experts on innovative data products.

  • A supportive, ambitious, and friendly team that cares about excellence.

  • Fully remote work.

Important note: To complete the hiring process, you need to have a valid government-issued ID (for Polish citizens) or a valid passport (for non-Polish citizens).

What you can expect during the process:
  • Intro call with our People Team.

  • Technical task.

  • Technical interview with the Engineering Team. 

  • Final interview with Head of Technology + offer.

Does this sound like a great opportunity for you? Use the Apply button below!

Appsilon is committed to being a diverse and inclusive workplace. We encourage applicants of different backgrounds, cultures, genders, experiences, abilities, and perspectives to apply. All qualified applicants will receive consideration for employment without regard to race, color, national origin, religion, sexual orientation, gender, gender identity, age, physical disability, or length of time spent unemployed.
