Senior Data Scientist/AI Engineer (Reinforcement Learning) (Remote)
28000-32000 PLN per month (B2B)
TeamQuest Sp. z o.o.
What will you be doing?
Requirements:
- Over 5 years of experience in software engineering in Python.
- At least 3 years of experience as a Data Scientist or Machine Learning/Environment Engineer.
- Working hours from 2:00 PM to 10:00 PM.
- Practical knowledge of AI frameworks (LangChain, LangGraph, mcp-server).
- Extensive practical experience working with AI, including prompt engineering and vibe coding.
Additional advantages:
- Knowledge of Codex or Claude Code.
- Experience integrating AI into existing systems.
- Understanding of RL concepts: reward modeling, environment dynamics, verifiability, evaluation, and agent interaction loops.
- Knowledge of tools, metrics, and data pipelines for evaluating RL.
- Strong skills in planning your own work.
Who are we looking for?
Responsibilities:
- Design and deploy RL environments for large-scale agent evaluation and reinforcement learning experiments.
- Create pipelines for task generation, dynamic datasets, and scripted environments with controlled complexity and stochasticity.
- Develop validators and reward models to automatically evaluate trajectories and assess model reasoning.
- Collaborate with infrastructure and systems engineers to ensure scalability and reproducibility, and instrument environments for detailed telemetry.
- Design APIs and orchestration layers for running, resetting, and evaluating agents in various environments.
- Optimize environment performance, reward logging, and reproducibility in distributed setups.
We offer:
- Attractive salaries
- Possibility of full remote work
- Participation in interesting projects
What do we require?
Knowledge of:
Languages:
- Polish
- English
What terms and benefits will you receive?
- 28000-32000 PLN per month (B2B)
- B2B - Flexible working hours (100%)
- Remote work: Fully
- Medical package, sports package
Where will you work?
Remotely
Who are we?
Our client is a rapidly growing company specializing in delivering modern cloud solutions and Kubernetes-based applications aimed at enhancing operational efficiency and reducing costs for businesses. Established in 2021, the company has quickly gained recognition in the market by creating advanced SaaS platforms supported by data engineering and machine learning. With offices in San Jose (USA) and Warsaw (Poland), our client collaborates with renowned partners such as Devtron and Tigera to offer corporate clients and startups from the USA and Europe robust, scalable solutions that support digital transformation, improve operational efficiency, and stimulate innovation. The company is currently seeking talented IT professionals ready to work on top-level projects, offering excellent working conditions and the opportunity for development in an international environment.