Summary
Overview
Work History
Education
Skills
Certification
Languages
Timeline
Additional Information
Generic

Lars Bakker

Amersfoort

Summary

Multilingual Data Annotation and Transcription Specialist with expertise in Dutch and English. Skilled in speech-to-text transcription, dataset annotation, labeling, segmentation, timestamping, and metadata tagging. Experienced in delivering high-quality datasets for AI/NLP systems through projects with OneForma, Clickworker, and Patronus. Adept at maintaining data quality and cultural accuracy, with proven ability to meet OneForma POLY project standards for transcription and annotation tasks.

Overview

1
1
Certification

Work History

Data Scientist & Audio Annotator

Patronus
  • Delivered 98%+ transcription accuracy across Dutch and English audio datasets.
  • Executed timestamping, labeling, segmentation for large AI projects.
  • Enhanced datasets by identifying accent variations and background interference.
  • Collaborated with international QA teams to ensure OneForma-compatible dataset standards.

AI Evaluation & Annotation Specialist

Clickworker / OneForma
  • Conducted OneForma POLY project annotation tasks, ensuring accuracy and compliance.
  • Performed transcription, segmentation, and labeling for multilingual datasets.
  • Evaluated AI-generated outputs for cultural appropriateness and linguistic precision.
  • Applied entity recognition, intent detection, and sentiment classification.
  • Processed high-volume data entries while ensuring annotation consistency and inclusivity.

Education

M.S. - Computational Linguistics

Universidad de La Habana

B.S. - Computer Science

Eindhoven University of Technology (TU/e)

Skills

  • Transcription & Annotation – audio transcription, labeling, segmentation, timestamping, metadata tagging
  • AI/NLP Project Support – experience in OneForma POLY, Clickworker, and Patronus datasets
  • Quality Assurance – validation, error detection, and refinement of annotated datasets
  • Annotation Platforms & Tools – OneForma annotation system, ELAN, Praat, Express Scribe, proprietary tools
  • Multilingual Linguistic Accuracy – Dutch & English subtleties, dialect/accent variations
  • Cross-Cultural Communication – handling multilingual and multicultural datasets for inclusivity
  • Remote Project Delivery – strong organization, time management, and high-volume annotation

Certification

  • Certified Artificial Intelligence Practitioner (CAIP) – CertNexus
  • BCS Foundation Certificate in Artificial Intelligence
  • IBM AI Developer Professional Certificate – Coursera

Languages

Dutch – Native/Fluent
English – Native/Fluent

Timeline

Data Scientist & Audio Annotator

Patronus

AI Evaluation & Annotation Specialist

Clickworker / OneForma

M.S. - Computational Linguistics

Universidad de La Habana

B.S. - Computer Science

Eindhoven University of Technology (TU/e)

Additional Information

  • Experienced in OneForma POLY project annotation workflows.
  • Equipped for remote and hybrid roles with secure professional setup.
  • Flexible for urgent, high-volume annotation and transcription projects.
  • Skilled in bridging linguistic and cultural contexts for AI/NLP dataset development.
Lars Bakker