28,000 – 32,000 PLN
net/month
B2B, full-time (100%)

New | TQ0102140
Senior Data Scientist/AI Engineer (Reinforcement Learning)

TeamQuest
100% remote (Warsaw)
Data Science
LangChain, LangGraph, mcp-server, Python
Senior
Polish, English
min. 5 years of experience

Who are we looking for?

Responsibilities:

  • Design and deploy RL environments for large-scale agent evaluation and reinforcement learning experiments.
  • Create pipelines for task generation, dynamic datasets, and scripted environments with controlled complexity and stochasticity.
  • Develop validators and reward models that automatically evaluate trajectories and assess model inference.
  • Collaborate with infrastructure and systems engineers to ensure scalability and reproducibility, and to equip environments with tools for detailed telemetry.
  • Design API interfaces and orchestration structures for running, resetting, and evaluating agents in various environments.
  • Optimize environment performance, reward logging, and reproducibility in distributed configurations.

We offer:

  • Attractive salaries
  • Possibility of full remote work
  • Participation in interesting projects

What will you be doing?

Requirements:

  • Over 5 years of experience in software engineering in Python.
  • At least 3 years of experience as a Data Scientist or in Machine Learning/Environment Engineering.
  • Working hours from 2:00 PM to 10:00 PM.
  • Practical knowledge of AI frameworks (LangChain, LangGraph, mcp-server).
  • Extensive practical experience in working with artificial intelligence, including prompt engineering and vibe coding.

Additional advantages:

  • Knowledge of the Code of Conduct or Claude Code.
  • Experience in integrating artificial intelligence with existing systems.
  • Understanding of RL concepts: reward modeling, environment dynamics, verifiability, evaluation, and agent interaction loops.
  • Knowledge of tools, metrics, and data pipelines for evaluating RL.
  • Ability to plan your own work independently.

What benefits will you receive?

Medical package, sports package

Where and how will you work?

City center, Warsaw
Work schedule: flexible working hours
Office hours: 7:00-20:00
Work model
On-site
Hybrid
100% remote

Who are we?

TeamQuest Sp. z o.o.
Company size: 20+
Our client is a rapidly growing company specializing in delivering modern cloud solutions and Kubernetes-based applications aimed at enhancing operational efficiency and reducing costs for businesses. Established in 2021, the company has quickly gained recognition in the market by creating advanced SaaS platforms supported by data engineering and machine learning. With offices in San Jose (USA) and Warsaw (Poland), our client collaborates with renowned partners such as Devtron and Tigera to offer corporate clients and startups from the USA and Europe robust, scalable solutions that support digital transformation, improve operational efficiency, and stimulate innovation. The company is currently seeking talented IT professionals ready to work on top-level projects, offering excellent working conditions and the opportunity for development in an international environment.