Hidden
Data Scientist NLP/ Mid

RUB 150,000300,000/month
Remote or office
Full-time

dsdata scientistnlp

Moderation Review

In the archive

Brief description of the vacancy

WaveAccess is looking for a Data Scientist to join our team and contribute to innovative projects in the pharmaceutical domain. This role involves working with real-world pharmaceutical data and leveraging the power of Large Language Models (LLMs) to drive impactful insights and solutions.

About the company

WaveAccess is an international results-driven company that provides high-quality custom software development services for hundreds of emerging and established companies globally. By supporting customers with talented software engineers and also vast experience in advanced technologies, WaveAccess builds innovative software solutions while minimizing development risks and costs.

Throughout its 22-year history, the company’s highly skilled specialists have implemented over 500 successful projects for market leaders, ambitious startups, and government institutions.

Responsibilities

  • LLM Integration: Develop, fine-tune, and implement Large Language Models to analyze and process diverse sets of text and medical data.
  • Data Analysis: Perform advanced data analysis on real-world pharmaceutical datasets to extract meaningful insights and support decision-making processes.
  • Text Mining and NLP: Utilize natural language processing techniques to extract relevant information from large volumes of text, including medical literature, patient records, and clinical trial data.
  • Model Development: Build and validate predictive models to address key challenges in the pharmaceutical industry, such as drug efficacy, patient outcomes, and adverse event prediction.
  • Innovation: Stay up-to-date with the latest advancements in LLMs and NLP, and apply innovative approaches to solve complex problems in the pharmaceutical field.

Requirements

  • At least 3 years of experience in a Data Scientist position
  • English - B2
  • Deep knowledge of Neural Networks and architectures for working with sequences, in particular (RNN, LSTM, Transformers, CNN, attention).
  • Experience with Large Language Models (LLMs) and their application. Familiarity with modern LLM techniques such as Retrieval-Augmented Generation (RAG) and LLM agents.
  • Solid Python skills
  • Experience in presenting achieved results

Technologies

  • Python
  • Transformers
  • LLM
  • Standard NLP stack
  • Standard ML stack
  • Basic SQL
  • Git
  • Vector databases(Postgres+pgvector / Milvus/ Qdrant/ Faiss)

Preferred

  • Knowledge of general Machine Learning approaches
  • Knowledge of mathematical statistics.
  • Experience with AWS (EC2, S3)
  • Linux + bash, ssh
  • Experience in written and verbal communication with business stakeholders
  • Experience with full-cycle development

Nice to have

  • RestAPI development experience
  • Snowflake
  • Docker
  • Understanding of CI/CD
  • Java/C++/Other languages

Working conditions

  • We can cooperate with you through an individual entrepreneur/self-employment if you are outside of Russia
  • Medical support
  • Provision of equipment
  • Democratic management, flexible start of the working day
  • Corporate training programs
  • Work in a dynamic international team
  • Wide opportunities for self-realization, professional and career growth

Contacts

Cookies help us deliver our services. By using our services, you agree to our use of cookies.