Data Scientist (Remote)
29400-33600 PLN miesięcznie (B2B)
Infolet
Czym będziesz się zajmować?
PROJECTWe are looking for an experienced Data Scientist to support the development and continuous enhancement of a large-scale data and machine learning ecosystem used in next-generation automotive solutions.
The project involves building Spark-based data pipelines, improving data quality processes, implementing model evaluation workflows, and developing robust monitoring for ML models running in production.
The role includes both initial development of the platform components and long-term maintenance and optimization.
YOU WILLInitial Development:
- Develop Spark jobs for data ingestion and feature engineering
- Implement data quality monitoring (metrics, dashboards, alerting)
- Build logic for model evaluation and automated deployment decisions
- Develop model monitoring with visualized KPIs and technical metrics
Further Development / Maintenance:
- Continuously extend data pipelines and feature engineering workflows
- Enhance data quality metrics and monitoring coverage
- Expand model monitoring logic and dashboards
- Troubleshoot and fix code issues, including edge cases
- Experiment with new ML algorithms and additional data attributes
- Optimize performance and cost (algorithms, data structures, storage formats)
- Adjust training/deployment pipeline configurations (frequency, resources, etc.)
Kogo poszukujemy?
MUST HAVE- 5+ years of experience
- Strong commercial experience with PySpark
- Excellent knowledge of Python
- Practical experience with GitHub
- Strong data analysis skills (Jupyter, Seaborn, exploratory analytics)
- Solid SQL knowledge
- Experience with Kubeflow or MLflow (MLOps frameworks for training, deployment & monitoring)
- Understanding of MLOps practices, including continuous training
- Experience with ML frameworks: scikit-learn, Pandas, Optuna
- Ability to create Grafana dashboards
- General knowledge of AWS services (S3, IAM, etc.)
- In-depth understanding of statistics and machine learning (missing data, outliers, model validation, algorithms)
- Fluent in Polish and good English
- Experience optimising data pipelines (Iceberg, Parquet, DynamoDB, etc.)
- Background in automotive or IoT data projects
- Experience with cost optimisation for ML systems
- Experience with large-scale model deployment pipelines
Czego wymagamy?
Znajomości:
Języki:
- Polski
- Angielski
Jakie warunki i benefity otrzymasz?
- 175-200 PLN godzinowo (B2B)
- B2B - Elastyczne godziny pracy (100%)
- 20000-24000 PLN miesięcznie (Umowa o pracę)
- Umowa o pracę - Elastyczne godziny pracy (100%)
- Praca zdalna: W całości
- Szkolenia wewnętrzne
- Pakiet medyczny, Ubezpieczenie, Pakiet sportowy
- Zimne napoje
- Parking rowerowy
- Pakiet relokacyjny
Gdzie będziesz pracował?
Zdalnie
Kim jesteśmy?
Od 20 lat wspieramy liderów IT, dostarczając technologie, ekspertów i pełne wsparcie operacyjne - w tym legalizację pobytu i pracy międzynarodowych specjalistów IT
Do naszych projektów poszukujemy specjalistów Java, JavaScript, C embedded, C++, PHP, specjalistów od mobile, testerów oprogramowania, administratorów sieci i systemów i wielu innych.