01 Zakres zadań
- Architecture & Development: Design and implement scalable data pipelines using Apache Spark (PySpark) on Databricks. Build solutions, not just run scripts.
- Optimization: Analyze and optimize complex Spark jobs for performance and cost (partitioning, z-ordering, Photon engine utilization).
- Modernization: Migrate legacy data warehouses to the Lakehouse architecture using Delta Lake.
- Standards: Enforce high code quality standards through rigorous code reviews and CI/CD implementation (Databricks Asset Bundles).
- Mentorship: Share knowledge with the team and act as a technical advisor for clients.
