Hlavní informace

Ref. č.: FREELANCE_1245324_OCR-FL-B2863

Developer for QUDA (Quality Data Analytics) (m/f/n)

Pozice: Nezadáno

Začátek: 1. 1. 2026

Konec: 31. 12. 2026

Město: Ingelheim am Rhein, Německo

Způsob spolupráce: Pouze na projektu

Hodinová sazba: 2000 Kč

Poslední aktualizace: 15. 12. 2025

Popis úkolů a požadavky

Tasks
The service is requested as part of the project QUDA. The project has the purpose to build and continuously enhance an internal dashboard for reviewing and analyzing quality data from various sources. This includes implementing diverse visualizations, enabling continuous reporting, and integrating data science solutions.
The contractor independently performs the following tasks:Development of an ETL pipeline using Python and PySpark/Spark and AWS, including the integration of new features and resolution of software bugs.
Designing, improving and implementing the architecture and architectural adjustments of the Data Stack (including ETL pipeline).
Data modelling and database setup with relational databases (PostgreSQL).
Implementation and maintenance of unit and integration tests using the tools Pytest, Unittest.
Testing of ETL pipelines during development and post-development phases using Pytest
Documentation of technical implementations and test results, while following the provided guidelines to ensure compliance with GxP standards.
Application of DevOps practices, including the use of Terraform and AWS for deployment and orchestration.
The goal after completion of the service is the availability of robust, well-documented, and tested data stack (including ETL pipeline) that meet the functional and technical requirements of the project.
All code, work and documentation will be turned over to Boehringer for review and further usage.

Requirements
Python: 5+ years
Data Modeling: 5+ years

ETL: 5+ years
PySpark/Spark: 3+ years
Batch processing pipelines: 3+ years
Realtime data processing: 1+ years
Pipeline Orchestration: 3+ years
AWS Step Functions: 3+ years
Apache Airflow: basic experience

SQL: 3+ years

Designing Data Storage Architecture: 3+ years
Relational databases: 3+ years
PostgreSQL: 1+ years
Database Migrations: 1+ years
Data Lakes: 1+ years
Data warehouses: 1+ years

Unit testing: 5+ years
pytest: would be beneficial

GxP: experience would be beneficial

DevOps: 3+ years
Jenkins/CI: basic experience
Terraform : 1+ years
AWS CDK: basic experience

AWS: 3+ years
Glue: 3+ years
RDS: basic experience
Lambda: 3+ years
S3: 3+ years
Step Functions: 3+ years
Athena: 1+ years
SQS/Kafka: basic experience
IAM: basic experience

Node.js/Typescript: basic experience would be beneficial

Additional Information
Location: remote (Ingelheim)
Remote share: 100%
Project start: 01.01.2026, latest mid of january
Duration: 12 months until 31.12.2026
Availability: 40 hours/week
ODR-ID: OCR-FL-B2863

Please note that we can only consider applications with an hourly rate of less than €80.

Kategorie

Amazon Web Services (AWS) Data Engineer Datenmodelierung ETL Python