
Data Scientist – AI Model Training & Evaluation
Role summary
Alignerr is seeking a remote Data Scientist based in Germany for an hourly contract role to train, evaluate, and enhance next-generation AI systems. The role involves leveraging expertise in statistics, machine learning, and data analysis to assess AI model outputs, design evaluation frameworks, and ensure system accuracy and reliability. Responsibilities include analyzing AI responses for errors and biases, creating high-quality training data, and providing structured feedback for model improvement. Proficiency in Python, R, or SQL, along with experience in data wrangling and model evaluation, is required. Familiarity with deep learning frameworks and AI evaluation workflows is a plus.
About The Role
We're looking for data scientists based in Germany to help train, evaluate, and improve next-generation AI systems. You'll leverage your expertise in statistics, machine learning, and data analysis to assess AI model outputs, design evaluation frameworks, and ensure AI systems deliver accurate, reliable results.
- Organization: Alignerr
- Type: Hourly Contract
- Compensation: $25–$40/hour
- Location: Remote (must be based in Germany)
- Commitment: 10–40 hours/week
What You'll Do
- Evaluate AI model outputs for accuracy, reasoning quality, and statistical soundness
- Design and apply data-driven evaluation criteria and scoring rubrics
- Analyze patterns in AI-generated responses to identify systematic errors or biases
- Create high-quality training data including prompts, solutions, and annotations in your areas of expertise
- Provide structured, detailed feedback to improve model performance across data science and analytical tasks
- Review AI-generated code, visualizations, and statistical analyses for correctness
- Work independently and asynchronously on your own schedule
Who You Are
- Based in Germany
- Degree in Data Science, Statistics, Computer Science, Mathematics, or a related quantitative field (MS or PhD preferred)
- Strong foundation in statistics, probability, and machine learning concepts
- Proficiency in Python, R, SQL, or similar data analysis tools
- Experience with data wrangling, exploratory data analysis, and model evaluation
- Excellent analytical thinking and attention to detail
- Strong written communication in English, with the ability to explain complex technical concepts simply
- Self-motivated and comfortable working independently
Nice to Have
- Experience with deep learning frameworks (PyTorch, TensorFlow)
- Familiarity with NLP, LLMs, or AI evaluation workflows
- Published research or industry experience in applied machine learning
- Background in A/B testing, causal inference, or experimental design
Why Join Us
- Work on cutting-edge AI projects shaping the future of technology
- Collaborate with top research labs and AI teams globally
- Freelance perks: full autonomy, flexible scheduling, and a remote-first culture
- Gain deep exposure to how state-of-the-art LLMs are trained and evaluated
- Potential for ongoing work and long-term contract extension
- Be part of a growing community of expert contributors making AI smarter
Application Process (Takes 10–15 min)
- Submit your resume
- Complete a short screening
- Get matched to a project and complete onboarding
*PS: Our team reviews applications daily. Please complete your application steps to be considered for this opportunity.*