Main information
Developer for QUDA (Quality Data Analytics) (m/f/n)
Position: Not specified
Start: 1 January 2026
End: 31 December 2026
City: Ingelheim am Rhein, Germany
Type of cooperation: Project-based only
Hourly rate: 2000 CZK
Last updated: 15 December 2025
Task description and requirements
Tasks
The service is requested as part of the QUDA project. The purpose of the project is to build and continuously enhance an internal dashboard for reviewing and analyzing quality data from various sources. This includes implementing diverse visualizations, enabling continuous reporting, and integrating data science solutions.
The contractor independently performs the following tasks:
Development of an ETL pipeline using Python, PySpark/Spark, and AWS, including the integration of new features and resolution of software bugs.
Designing, improving and implementing the architecture and architectural adjustments of the Data Stack (including ETL pipeline).
Data modelling and database setup with relational databases (PostgreSQL).
Implementation and maintenance of unit and integration tests using Pytest and Unittest.
Testing of ETL pipelines during development and post-development phases using Pytest.
Documentation of technical implementations and test results, while following the provided guidelines to ensure compliance with GxP standards.
Application of DevOps practices, including the use of Terraform and AWS for deployment and orchestration.
The goal after completion of the service is the availability of a robust, well-documented, and tested data stack (including the ETL pipeline) that meets the functional and technical requirements of the project.
All code, work, and documentation will be turned over to Boehringer for review and further use.
Requirements
Python: 5+ years
Data Modeling: 5+ years
ETL: 5+ years
PySpark/Spark: 3+ years
Batch processing pipelines: 3+ years
Real-time data processing: 1+ years
Pipeline Orchestration: 3+ years
AWS Step Functions: 3+ years
Apache Airflow: basic experience
SQL: 3+ years
Designing Data Storage Architecture: 3+ years
Relational databases: 3+ years
PostgreSQL: 1+ years
Database Migrations: 1+ years
Data Lakes: 1+ years
Data warehouses: 1+ years
Unit testing: 5+ years
pytest: would be beneficial
GxP: experience would be beneficial
DevOps: 3+ years
Jenkins/CI: basic experience
Terraform: 1+ years
AWS CDK: basic experience
AWS: 3+ years
Glue: 3+ years
RDS: basic experience
Lambda: 3+ years
S3: 3+ years
Step Functions: 3+ years
Athena: 1+ years
SQS/Kafka: basic experience
IAM: basic experience
Node.js/TypeScript: basic experience would be beneficial
Additional Information
Location: remote (Ingelheim)
Remote share: 100%
Project start: 01.01.2026, mid-January at the latest
Duration: 12 months until 31.12.2026
Availability: 40 hours/week
ODR-ID: OCR-FL-B2863
Please note that we can only consider applications with an hourly rate of less than €80.