Join Sanoma Learning as a Senior Data Engineer to design, develop, and maintain data pipelines for our Content as a Service (CaaS) platform. You will work on the central storage layer, collaborate with data scientists and software engineers, and use AWS services to manage and query data. You will also contribute to projects involving Generative AI and support data-driven initiatives.
Job Description
This position is open only to candidates who are legally residing in Poland and can work with us through a registered business entity in Poland (e.g., sole proprietorship/JDG or limited liability company/sp. z o.o.).
As part of onboarding, we kindly ask new joiners to visit our Warsaw office on the first day for a short introduction, identity verification, and equipment pick-up.
Location: Warsaw, Poland
Employment type: B2B contract
Work model: 100% remote
Business travel: Occasional, up to once per quarter (e.g., onboarding sessions or workshops)
Seniority level: Mid-level or Senior (4 open positions)
About Us
Sanoma Learning is the leading European learning company, serving over 20 million students in 11 countries. We offer printed and digital learning materials as well as digital learning and teaching platforms for primary, secondary, and vocational education. We develop our methodologies from deep insight into teachers and students and a thorough understanding of their needs. By combining our educational technologies and pedagogical expertise, we create learning products and services with the highest learning impact. In our Technology organization, you will join the largest cross-cultural community of Sanoma Learning and contribute to the digital transformation and future of education in Europe.
Project Description
Content as a Service (CaaS) is a strategic central capability that enables Sanoma Learning to efficiently scale and innovate its digital offerings. It provides a single, enterprise-grade service to ingest, enrich, and deliver all learning content and educational metadata, from both print and digital sources, for use in customer-facing digital products and method creation.
Role Responsibilities
- Design, develop, and maintain data pipelines to ensure reliable, scalable, and high-performance data flows.
- Work on the central storage layer, ensuring data availability, consistency, and security.
- Collaborate with data scientists, analysts, and software engineers to support data-driven initiatives.
- Implement and enforce best development practices in code quality, testing, monitoring, and deployment.
- Optimize data infrastructure for performance and cost-efficiency in AWS environments.
- Leverage AWS services such as S3, Glue, Lambda, DynamoDB, and Athena to manage and query data.
- Contribute to projects involving Generative AI (GenAI) by enabling data access, preparation, and integration with AI-driven solutions.
- Troubleshoot and resolve issues across the data pipeline and storage systems.
Requirements
- Proficiency in at least one programming language, preferably Python.
- Strong knowledge of AWS cloud services (S3, Glue, Lambda, DynamoDB, Athena, etc.).
- Solid understanding of databases (SQL and NoSQL), including schema design, optimization, and query performance.
- Hands-on experience with data processing pipelines (batch and/or streaming).
- Strong foundation in software engineering best practices, including version control, CI/CD, and automated testing.
- Experience with data modeling, storage formats, and ETL/ELT workflows.
- Familiarity with Generative AI technologies and how data engineering supports AI-driven applications.
- Strong problem-solving skills and ability to work in a collaborative, agile environment.
Nice to Have
- Knowledge of data orchestration tools (Airflow, Step Functions, Prefect).
- Exposure to big data frameworks (Spark, Hadoop).
- Understanding of data governance and security best practices.
- Experience working with Content Management Systems (CMS) and content-centric data (e.g. articles, learning materials, metadata, versions, publishing workflows).
Benefits & Perks
- B2B contract for an indefinite period
- Work-life balance and a supportive, informal atmosphere
- Opportunities for professional growth and skill development
- Work on modern data platforms (cloud environments, AWS stack)
- Build and maintain data pipelines supporting AI-driven solutions in education
- Hands-on experience with a modern data stack (ETL/ELT, CI/CD, orchestration tools)
- Collaborate with Data Engineers, Data Scientists, Product teams, and AI Engineers
- Work in a flexible, results-oriented, and collaborative environment
- Be part of an international team working across European markets
- Contribute to projects with real impact on digital education