AI QA Trainer - LLM Evaluation - Freelance Project
Tasks
- Design and run test plans and regression suites
- Evaluate language models on factual accuracy and logical soundness
- Identify and document failure modes and error traces
- Partner on adversarial red-teaming and automation
- Suggest improvements for prompt engineering and evaluation metrics
Perks/Benefits
Skills/Tech-stack
Adversarial Testing | Bias fairness auditing | Bias/fairness | Evaluation rubric design | Fairness auditing | Grounding verification | Prompt engineering | Python | Regression testing | Rubric Design | SQL | Test automation
Education
Roles
AI | AI QA | AI QA Engineer | Engineer | QA Engineer
Related jobs
-
Senior GenAI Solution Engineer (all genders) EUR 65K-84KAWS Bedrock | Anthropic API | Async | Azure OpenAI | CI/CDCompany bike | Company events | Fitness membership subsidy | Flexible working hours | Home officeMid-level Full TimeHamburg, München, Düsseldorf, remote R1h ago
-
AI Engineering Leader USD 137K-195KArtificial Intelligence | Compliance | Compliance Management | Data Engineering | DatabricksCell phone allowance | Equity grants | Growth-focused environment | Health coverage | Home office setup allowanceSenior-level Full TimeRemote- US R8h ago
-
Sr. Embedded Detection Analyst USD 170K-205KAI tools | Alert Correlation | Cause analysis | Data Analysis | Detection engineeringSenior-level Full TimeRemote - USA R9h ago
-
Machine Learning Engineer - Perception USD 161K-237K3D data | 3D data processing | Cloud processing | Computer Vision | Data Generation401k retirement plan | Dental insurance | Employee referral bonus | Flexible PTO | Free lunchSenior-level Full TimeColumbus, Ohio or Remote R14h ago
-
Senior AI Software Engineer USD 215K-250KAWS Bedrock | AWS CloudFront | AWS Cognito | AWS ECR | AWS S3401k plan | Dental insurance | Disability insurance | Flexible PTO | Health insuranceSenior-level Full TimeRemote R14h ago
-
Staff Software Engineer, Databases USD 180K-220KC++ | Computer Architecture | Distributed Storage | Distributed Systems | Graph DatabaseEquity | Health benefitsSenior-level Full TimeRemote R14h ago
-
Amazon QuickSight | Business Continuity | Business Intelligence | Data Backup | Data GovernanceAdditional paid time off | Early stage company equity potential | Health insurance | Indefinite term contract | Learning budgetSenior-level Full TimeTrabajo a distancia R14h ago
-
AI Engineering Intern USD 70K-96KAI Safety | Artificial Intelligence | Evaluation Methodologies | Experiment design | Machine LearningEntry-level Internship1 Remote R14h ago
-
Staff Data Engineer BRL 325K-443KAWS Glue | AWS Glue Catalog | Amazon MWAA | Amazon S3 | Apache AirflowRemote workSenior-level Full TimeSão Paulo, SP, Brazil R15h ago
-
Senior AI Engineer CAD 120K-150KAgent Orchestration | Agent systems | Artificial Intelligence | Autonomous Agents | Dynamic routingBirthday off | Generous vacation package | Health days | Health spending account | Hybrid workSenior-level Full TimeToronto R15h ago
-
Power BI Engineer USD 70K-136KAzure AD | Column-Level Security | DAX | Data Gateways | Data GovernanceMid-level Full TimeWashington, DC R17h ago
-
Data Engineer PHP 420K-480KAWS | Amazon Redshift | Apache Airflow | Apache Spark | BigQueryEmployee discount | Health insurance | Mentorship | Sports voucher | Training opportunitiesMid-level Full TimeAlabang, Philippines (Hybrid) R18h ago
-
Data Engineers EUR 30K-36KAmazon S3 | Apache Airflow | Apache Spark | CI/CD | Data QualityCompany training | Conference attendance | Flexible compensation benefits | Flexible working hours | Indefinite contractMid-level Full TimeMadrid, MD, Spain R19h ago
-
Sr. Data and AI Engineer USD 180K-200KAgile | Amazon Web Services | Azure | Big Data | Data ArchitecturePublic trust clearance support | Remote workSenior-level Full TimeWork from home, VA, United States R19h ago
-
AWS | Agile | Amazon Web Services | Apache Spark | Data EngineeringAccess to cutting-edge technologies | Collaborative team environment | Flexible work hours | Professional development opportunities | Remote work within United StatesSenior-level Full TimeMassachusetts R19h ago
-
AWS | Agile | Apache Kafka | Apache Spark | DatabricksAccess to cutting-edge technologies | Flexible work hours | Inclusive culture | Professional development | Remote work within the U.SSenior-level Full TimeMinnesota R19h ago
-
AWS | Agile | Apache Spark | Databricks | DevOpsAccess to cutting-edge technologies | Autonomy | Flexible work hours | Inclusive culture | Professional developmentSenior-level Full TimeIdaho R19h ago
-
AWS | Agile | Big Data | Data Pipelines | DatabricksAccess to cutting-edge technologies | Autonomy in role | Flexible work hours | Inclusive company culture | Professional developmentSenior-level Full TimeColumbia R19h ago
-
AWS Cloud | Agile | Amazon Web Services | Apache Spark | DatabricksAccess to cutting-edge technologies | Autonomy in role | Flexible work hours | Inclusive company culture | Professional developmentSenior-level Full TimeFlorida R19h ago
-
AWS | Agile | Apache Spark | Databricks | DevOpsAccess to cutting-edge technologies | Autonomy | Equity opportunities | Flexible work hours | Inclusive cultureSenior-level Full TimeCalifornia R19h ago
-
AWS | Agile | Apache Spark | Databricks | GitLabAccess to cutting-edge technologies | Autonomy | Collaborative team environment | Flexible work hours | Inclusive company cultureSenior-level Full TimeConnecticut R19h ago
-
AWS | Agile | Apache Spark | Batch Processing | Big DataAccess to cutting-edge technologies | Autonomy | Flexible work hours | Inclusive company culture | Professional developmentSenior-level Full TimeArizona R19h ago
-
Senior Data Engineer (Perm, Ireland, Hybrid ) EUR 46K-79KAmazon Redshift | Amazon Web Services | Apache Spark | Automation | CI/CDFlexible working hours | Income protection | Paid time off | Pension match | Private healthcareSenior-level Full TimePermanent R19h ago
-
Senior Data Engineer (Perm, Italy, Hybrid) EUR 51K-79KAgile | Amazon Redshift | Amazon Web Services | Apache Spark | AutomationDeath in service cover | Health insurance | Paid time off | Pension match | Remote working allowanceSenior-level Full TimePermanent R19h ago
-
Senior Data Engineer (Perm, UK, Hybrid) EUR 44K-79KAWS | Amazon Redshift | Apache Hadoop | Apache Spark | AutomationDeath in service coverage | Income protection | Marriage leave | Paid time off | Pension matchSenior-level Full TimePermanent R19h ago