Data Set Curator

$12,000 per year
Permanent

Snaphunt

Dataset Curator

Job Summary: We are seeking a Dataset Curator to design, maintain, and optimize high-quality datasets for AI, Machine Learning (ML), and Large Language Model (LLM) projects. The Dataset Curator works closely with data scientists, AI trainers, and engineers to gather, clean, validate, and annotate datasets across multiple domains, ensuring the data supports robust AI model training and evaluation. The Dataset Curator must understand dataset diversity, bias detection, quality assessment, and metadata management. This role is critical to improving AI performance, dataset reliability, and data-driven decision-making.

 

Key Responsibilities

· Curate, collect, and structure datasets for AI and ML training purposes.

· Validate dataset accuracy, completeness, and consistency.

· Annotate and label datasets according to project-specific guidelines.

· Identify and correct data inconsistencies, duplicates, and anomalies.

· Maintain metadata and documentation for datasets.

· Collaborate with AI trainers and data engineers to define dataset requirements.

· Ensure datasets are ethically sourced and free from biases.

· Continuously monitor dataset quality and propose improvements.

 

Job Requirements

· Bachelor’s or Master’s degree in Data Science, Computer Science, Statistics, Information Systems, or a related field.

· Proficiency in Excel, Google Sheets, SQL, and/or Python for dataset handling.

· Knowledge of data cleaning, normalization, and transformation techniques.

· Familiarity with data annotation tools and platforms.

· Understanding of structured, semi-structured, and unstructured datasets.

· Experience with database management and version control systems.

· Awareness of AI dataset ethics and bias mitigation.

· Required certifications such as the Google Data Analytics Certificate (an advantage), a Data Management or Curation certification, and AI/Data Annotation training (optional but preferred).

· 3-5 years of proven experience in dataset curation, data analysis, or data management roles.

· Experience handling large-scale datasets for AI, ML, or analytics projects.

Job posted 2 months ago