35,000 – 74,000 PLN
net/month
B2B, full-time (100%)

RL Environments Engineer

Verita HR
100% remote (San Francisco)
Data Science
Python, Docker, LLM, ML, Reinforcement Learning, AI
Senior
English
min. 5 years of experience

Who are we looking for?

  • Client: US startup
  • Recruitment process: 2 meetings with hiring managers, followed by a phone screen with our recruiter and technical test
  • Fully remote work

Skills:

  • Strong Python (engineering-quality)
  • Docker and production mindset
  • Understanding of LLMs and their limitations
  • Ability to meet throughput expectations
  • Advanced English (C1/C2) and ≥4 hours overlap with US time zones

Nice-to-have:

  • Deep knowledge of transformer internals and LLM training/inference
  • Experience with inference libraries (vLLM, SGLang, etc.)
  • CUDA or Pallas kernel development experience
  • Publications or open-source contributions in active DL/ML research
  • Experience building interactive RL environments and RL-based learning systems

What's in it for you?

  • Fully remote, flexible work schedule with some overlap with US time zones
  • Direct impact on how LLMs learn
  • Collaboration with top AI researchers and labs
  • Exposure to cutting-edge RL and ML projects

What will you be doing?

About the company: US-based AI startup focused on building the next generation of training data for LLMs. The team partners with top AI labs to create realistic RL environments where models encounter research and engineering challenges, iterate, and learn from feedback, pushing AI closer to its full potential.

Project: Design and build reinforcement learning environments to teach LLMs advanced reasoning and modern ML concepts. Candidates will work on realistic feedback loops where models encounter research and engineering problems and iterate on solutions.

What you will do:

  • Build and maintain RL/ML environments for LLM training
  • Implement robust, production-quality Python code (not just notebooks)
  • Deploy and run environments in Docker with focus on reliability and iteration speed
  • Analyze model performance and respond to feedback efficiently
  • Collaborate with research teams to translate papers and ideas into RL problems

Where and how will you work?

City center, San Francisco
Work type: Project-based
Office hours: 00-24
Work model: 100% remote

Who are we?

Verita HR
Company size: 80

Work for the largest bank in Europe, which operates in more than 65 countries around the world, giving us access to over 90% of all world trade flows. Don’t hesitate to apply, and create the future of banking with us!

Who we are

Verita HR is an international company providing recruitment support for the #Fintech, #Finance and #Banking markets in EMEA. We connect the most innovative organizations with the best people in the market. We conduct systematic market research, which allows our Digital Teams to stay a step ahead of the competition.