Hidden
Senior/Staff Machine Learning Engineer, Agents

Created:
€7,00012,000/month
Remote or office
Full-time

ai agentshuman-ai collaborationmlllmevaluations

Moderation Review

In the archive

Brief description of the vacancy

We believe the future of work lies in effective AI–human collaboration, not in purely autonomous systems operating on their own.

That’s why we’re building Hybrid Agents – systems where AI and human experts work side-by-side, solve tasks together, and leverage each other’s strengths. From general-purpose tasks to highly specialized professional domains, our platform unites people and models, backed by state-of-the-art technology and a growing marketplace of expert talent.

As an AI/ML Engineer, you will be a core contributor to the AI layer of our product.

About the company

Company Toloka AI

At Toloka AI we create data that powers leading GenAI models and innovations. We work with frontier labs, big tech, renowned AI startups, enterprises and non-profit research organizations worldwide. We use a combination of Experts + Crowd + Tech Platform to teach AI models to reason and evaluate their efficacy and safety. We have experts in more than 50 different domains—from doctors and lawyers to physicists and engineers—and boast one of the most diverse global crowds, representing over 100 countries and speaking 40+ languages. We are a well-funded startup with an enviable portfolio of clients including Anthropic, Amazon, Microsoft, poolside, Recraft, and Shopify.

Recently, we secured strategic investment led by Bezos Expeditions with participation from Mikhail Parakhin, CTO of Shopify and board advisor to leading GenAI companies, who now serves as our Chairman of the Board. Our remote-first team is globally distributed around the world: USA, UK, the Netherlands, Israel, Czech Republic, Serbia, and more. We are headquartered in Amsterdam.

Responsibilities

  • Build agentic systems and develop skills such as search, browsing, coding, and more
  • Design and experiment with hybrid-execution mechanics, including human hand-off and result validation
  • Develop automated evaluations that measure system quality on real-world tasks
  • Research and integrate state-of-the-art techniques to improve core quality and agent skills
  • Develop and maintain production services
  • Contribute to agent infrastructure – tools, environments, and third-party integrations

Requirements

Qualifications

  • 3+ years of industry experience in machine learning
  • Solid understanding of how modern deep-learning models work, including their strengths and limitations
  • Experience developing software and building products with language models
  • Strong engineering skills with proficiency in Python and commonly used ML frameworks
  • Proven track record of delivering ML-driven products

Preferred but not required

  • Experience building complex agentic systems
  • Experience designing or implementing evaluation metrics
  • Contributions to ML research projects
  • Fluency in English

Working conditions

  • You’ll have the opportunity to work on highly innovative, projects at the leading edge of AI development, and work with a genuinely dedicated and dynamic team of experts;
  • Personal and career development: we support employee’s ambition for professional development and encourage them to implement their ideas in their projects;
  • Flexibility: we offer remote or hybrid employment. You will also design with your manager a workday that works best for you;
  • Paid parental leave and sick leave;
  • 25 vacation days per year.

Contacts

Posted:

Our website uses cookies, including web analytics services. By using the website, you consent to the processing of personal data using cookies. You can find out more about the processing of personal data in the Privacy policy