Relocate. Ми з України
Post Jobs
Countries

Select a Country

Australia

Austria

Belgium

Canada

Denmark

Estonia

Finland

Germany

Ireland

Japan

Netherlands

Singapore

Spain

Sweden

United Kingdom

United States

blog

Blog

Expat Stories Visas & Immigration Money & Taxes Working Abroad

Read our blog

Visas Taxes Salaries Cost of Living Relocation Companies Jobs
Blog
Expat Stories Visas & Immigration Taxes & Money Working & Money Read our blog
Post Jobs
Menu
  • Home
  • International Jobs in Spain
  • Senior Data Scientist (LLM)

Senior Data Scientist (LLM)

San Sebastian, Spain

Multiverse Computing

Multiverse Computing logo

Advanced relocation package

Language courses
Language courses
Visa services
Visa services
Signing bonus
Signing bonus
Relocation bonus
Relocation bonus

About Multiverse Computing

Multiverse is a well-funded, fast-growing deep-tech company founded in 2019. We are the largest quantum software company in the EU and have been recognized by CB Insights (2023 and 2025) as one of the 100 most promising AI companies in the world.

With 180+ employees and growing, our team is fully multicultural and international. We deliver hyper-efficient software for companies seeking a competitive edge through quantum computing and artificial intelligence.
Our flagship products, CompactifAI and Singularity, address critical needs across various industries:

  • CompactifAI is a groundbreaking compression tool for foundational AI models based on Tensor Networks. It enables the compression of large AI systems—such as language models—to make them significantly more efficient and portable.
  • Singularity is a quantum- and quantum-inspired optimization platform used by blue-chip companies to solve complex problems in finance, energy, manufacturing, and beyond. It integrates seamlessly with existing systems and delivers immediate performance gains on classical and quantum hardware.

You’ll be working alongside world-leading experts to develop solutions that tackle real-world challenges. We’re looking for passionate individuals eager to grow in an ethics-driven environment that values sustainability and diversity.

We’re committed to building a truly inclusive culture—come and join us.

Position

We are seeking a Senior Data Scientist with deep expertise in creating high-quality datasets for training and fine-tuning Large Language Models (LLMs). You will be responsible for designing and implementing scalable data pipelines and strategies to support all stages of LLM development: pretraining, supervised fine-tuning, and reinforcement learning with human feedback (RLHF).

This role is critical to ensuring the robustness, safety, and alignment of our AI models. You will have the autonomy to explore innovative data sourcing and curation methods and the opportunity to directly influence the capabilities of state-of-the-art LLMs.

As a Senior Data Scientist, you will:

  • Design and implement strategies for creating, sourcing, and augmenting datasets tailored for LLM training and fine-tuning.
  • Develop scalable pipelines to collect, clean, filter, annotate, and validate large volumes of text data.
  • Conduct data audits to ensure quality, diversity, ethical compliance, and bias mitigation.
  • Collaborate with ML engineers and researchers to align datasets with training objectives and model evaluation needs.
  • Use tools like Active Learning, synthetic data generation, and self-supervised learning to maximize dataset efficiency.
  • Leverage human-in-the-loop (HITL) workflows for data labeling and validation where necessary.
  • Contribute to building data documentation and metadata standards (e.g., Datasheets for Datasets).
  • Keep up to date with research trends in dataset curation, LLM pretraining data, and benchmarking.

Your qualification

Required qualifications:

  • Bachelor’s, Master’s, or Ph.D. in Computer Science, AI, Data Science, or a related field.
  • 3+ years of experience in data science, machine learning, or related roles, with demonstrated experience in dataset creation for NLP or LLMs.
  • In-depth knowledge of the LLM lifecycle: pretraining, fine-tuning, alignment, and evaluation.
  • Proficient in Python and data tooling ecosystems (Pandas, NumPy, spaCy, Hugging Face Datasets & Transformers).
  • Hands-on experience with text data collection from diverse sources: web scraping, APIs, proprietary corpora, etc.
  • Strong understanding of data quality metrics, including bias detection, toxicity, and readability.
  • Experience working with annotation tools (e.g., Prodigy, Label Studio) and managing annotation teams or workflows.

Preferred qualifications:

  • Experience building or contributing to datasets used in LLM pretraining or supervised fine-tuning.
  • Familiarity with RLHF workflows and alignment techniques (e.g., preference modeling, reward modeling).
  • Exposure to multilingual and low-resource language datasets.
  • Contributions to open-source datasets, tools, or publications in dataset-centric research.
  • Knowledge of ethical AI, data governance, privacy laws (e.g., GDPR), and responsible data use.

What we offer

  • Indefinite contract
  • Equal pay guaranteed
  • Variable performance bonus
  • Signing bonus
  • We offer work visa sponsorship (If applicable)
  • Relocation package (if applicable)
  • Private health insurance
  • Eligibility for the educational budget according to internal policy
  • Hybrid opportunity
  • Flexible working hours
  • Language classes and discounted lunch options
  • Working in a high-paced environment, working on cutting-edge technologies
  • Career plan. Opportunity to learn and teach.
  • Progressive company. Happy people culture

Additional details

As an equal opportunity employer, Multiverse Computing is committed to building an inclusive workplace. The company welcomes people from all different backgrounds, including age, citizenship, ethnic and racial origins, gender identities, individuals with disabilities, marital status, religions and ideologies, and sexual orientations to apply.


Python Data Scientist Data Scientist APIs Data Science pandas NLP NumPy LLM LLMs
Archive vacancy
Archive vacancy
Facts about San Sebastian
Cost of Living Index 50 /100
Median for apartment
rent in city centre
(1-3 bedroom) $ 1154 - $ 1942
Safety Index 71 /100
Check if your resume is a good fit
25/100
Get Full Report Arrow right
These jobs may fit you

Spain

Multiverse Computing

Engineering Director - Platform (Cloud Infrastructure) in San Sebastian
logo

As an Engineering Director - Platform (Cloud Infrastructure), you will Build a robust team. Communica...

Spain

Multiverse Computing

Manager - Machine Learning in San Sebastian
logo

We are looking for an experienced and innovative Data Scientist with a strong background in Natural Language Processing (NLP) and Large Language Mo...

Spain

Multiverse Computing

Senior Software Engineer in San Sebastian
logo

As a Senior Software Engineer, you will: Join a world-class team of Quantum ...

Relocate. Ми з України

Relocation made easy: country guides, visa overviews, tax calculators, and more – Relocate.me has everything you need in one place.

Resources

Blog Webinars Visas Taxes Cost of living Salaries Healthcare Relocation companies

For job seekers

Browse international jobs Companies hiring International job search guide

For employers

Post jobs Global hiring guide

Legal

Privacy policy Terms of service

Newsletter

Curated tech jobs and content for relocation seekers

Subscribe

© 2024 Relocate.me | All Rights Reserved

Proudly built by Ukrainians 🇺🇦

Jobseeker Login

Create a Jobseeker account to apply for jobs.

Forgot password?

Or
Register
Login
Continue with Google Continue with LinkedIn
Back to Login
Jobseeker Register

Create a Jobseeker account to apply for jobs.

Or
Continue with Google Continue with LinkedIn

Check your email and follow the instructions to restore access to your account

Restore access