AI QA Trainer - LLM Evaluation - Freelance Project
Tasks
- Design and run test plans and regression suites
- Evaluate language models on factual accuracy and logical soundness
- Identify and document failure modes and error traces
- Partner on adversarial red-teaming and automation
- Suggest improvements for prompt engineering and evaluation metrics
Perks/Benefits
Skills/Tech-stack
Adversarial Testing | Bias fairness auditing | Bias/fairness | Evaluation rubric design | Fairness auditing | Grounding verification | Prompt engineering | Python | Regression testing | Rubric Design | SQL | Test automation
Education
Roles
AI | AI QA | AI QA Engineer | Engineer | QA Engineer
Related jobs
-
Data Engineer / BI Developer (Power BI, Domo) PHP 420K-480KData Modeling | Data Quality | Data Warehousing | Data pipeline | DomoFully remote | Remote work from PhilippinesMid-level ContractAnywhere in the Philippines, Philippines R2h ago
-
Data Engineer (Cloud) GBP 52K-52KApache Airflow | Apache Spark | Azure | CI/CD | Data GovernanceHybrid workingMid-level Full TimeGB-ENG-LAN-Preston R4h ago
-
AI Engineer GBP 52K-54KAI Search | Azure AI | Azure AI Search | CI/CD | DatabricksHybrid work arrangementsSenior-level Full TimeGB-ENG-LAN-Preston R4h ago
-
AI Testing | Agentic AI | Azure Repos | CI/CD | Conversational AIHybrid workSenior-level Contract Full TimeGlasgow, Scotland, United Kingdom R4h ago
-
Senior GenAI Developer INR 2500K-4800KAPI Gateway | AWS Bedrock | Agent systems | Agentic RAG | Amazon LambdaCertification programs | Global client projects | Health insurance | Internship opportunities | Language coursesSenior-level Full TimeAll Cities, India R5h ago
-
Azure | CI/CD | Cloudera | DataStage | DatabricksCareer development opportunities | Employee benefits council | Employee bonus | Employee referral bonus | Health insuranceSenior-level Full TimeNantes, Pays de la Loire, France R7h ago
-
Lead Data Engineer EUR 54K-78KAgile | Apache NiFi | Cloud platform | DBT | Data EngineeringHybrid workSenior-level Full TimeFinland R8h ago
-
QA Engineer (SQL, Python & Pyspark) AUD 110K-112KAgile | Bamboo | Bitbucket | CI/CD | DatabricksDiversity and inclusion workplace | Educational benefits | Financial benefits | Flexible work arrangements | Health and wellbeing benefitsMid-level Full TimeSydney, NSW, Australia R9h ago
-
Associate Principal Engineer, Data Science INR 1500K-2000KANOVA | CI/CD | Cloud Architecture | Deep learning | GCPMid-level Full TimeRemote, India R9h ago
-
Associate Principal Engineer, Data Science INR 1500K-2000KANOVA | AWS | Azure | CI/CD | Cloud ArchitectureMid-level Full TimeRemote, India R9h ago
-
Business Intelligence Engineer - Power BI INR 712K-1200KAgile | DAX | Data Modeling | Data Quality | Data WarehousingFlexible hybrid schedule | Health insurance | Life insurance | Paid time off | Personal/family care leaveMid-level Full TimeHyderabad, India R10h ago
-
Member of Technical Staff: Machine Learning Engineer USD 174K-252KAWS | C++ | CUDA | Convolutional Neural Networks | Distributed TrainingSenior-level Full TimeRemote R12h ago
-
Mid-level Full TimeRemote, United States R12h ago
-
Senior/Staff Data Analyst USD 180K-210KAnomaly Detection | Automated reporting | DBT | Dashboarding | Data Modeling401k | Disability coverage | Flexible time off | Health coverage | Home office supportSenior-level Full TimeRemote R14h ago
-
AI Support Engineer II JPY 8000K-8000KAI Platform | AI and ML | AI platform support | AWS | Agentic WorkflowsDental insurance | Flexible time off program | Global Employee Assistance Program EAP | Medical insurance | Paid HolidaysMid-level Full TimeRemote Japan R16h ago
-
Mid-level Full TimePune - Tower 6, India R16h ago
-
Data Engineer GBP 45K-52KAmazon Web Services | Apache Spark | CloudWatch | Data Modeling | DatabricksAnnual bonus | Annual leave | Early Finish Friday | Electric vehicle scheme | Employee assistance programmeMid-level Full TimeOxford/ Hybrid, GB, OX4 4DQ R16h ago
-
API Integration | Agent logic | Claude API | Gemini API | JavaScriptRemote work | US time zone scheduleSenior-level Full TimePhilippines - Remote R16h ago
-
Computer Vision | Deep learning | Diffusion Models | Foundation Models | Generative ModelsHybrid work setup | Mentorship | Onsite Days Per WeekEntry-level Internship Part TimeGM Israel - Technical Center Israel … R16h ago
-
(Summer Internship) AI and Data Analytics Intern TWD 504K-636KAnomaly Detection | CP | Control Charts | Control Limits | CpkComprehensive benefits package | Hybrid work model | Work from home flexibilityEntry-level Full Time InternshipHsinchu, Taiwan R16h ago
-
Associate, Data Engineer (Test Automation) INR 1524K-2156KAPI Testing | AWS | Azure | CI/CD | Cause analysisComprehensive healthcare | Flexible time off | Hybrid work model | Retirement plan | Support for working parentsMid-level Full TimeBL4 - KNG Tower 1, Indiqube … R16h ago
-
Staff Software Engineer (Java or Scala) HUF 10627K-17818KAWS | Apache Atlas | Apache Ranger | Azure | Cloud platformContinued Career Development | Employee resource groups | Flexible WFH | Generous PTO | Paid volunteer timeSenior-level Full TimeHungary-Budapest R16h ago
-
Engenheiro(a) de dados Sênior BRL 54K-72KAWS | Apache Airflow | Apache Spark | Code review | Data CatalogSenior-level Full TimeRemote R17h ago
-
Assistant Projets IA & Machine Learning H/F EUR 14K-21KGenerative AI | Jupyter | Language Models | Large Language Models | PyTorchCareer support | HR follow-up | Modern campus services | Remote work optionEntry-level InternshipEurope, France, Ile-de-France, 92 - Hauts-De-Seine R18h ago
-
Member of Technical Staff (Data): World Models SGD 165K-214KAccess Control | Annotations | Apache Spark | Backward Compatibility | Data CatalogSenior-level Full TimeUS, Singapore, Remote R18h ago