Generative AI Automation Tester - Remote / Telecommute

Remote Full-time
About the position Responsibilities • Evaluate and test Generative AI POC models built using Open AI and Vertex LLM Models. • Design, develop, and execute detailed test plans to validate the model performance with real customer data. • Perform data validation, ensuring accuracy and completeness of the input/output data flow through AI models. • Collaborate with data scientists and engineers to ensure the model's output is aligned with the expected results. • Create test cases to assess model accuracy, bias, performance, and edge cases. • Identify model weaknesses, inaccuracies, and areas for optimization. • Report bugs, issues, and improvement areas, providing detailed feedback to development teams. • Use automated testing tools and frameworks for model testing. • Maintain comprehensive documentation of the QA process and results. Requirements • Proven experience in testing Machine Learning/AI models. • Familiarity with Generative AI models like Open AI GPT, Vertex AI models. • Strong knowledge of data validation, model performance, and quality assurance practices. • Experience with model evaluation metrics such as accuracy, precision, recall, F1 score, and bias analysis. • Proficiency in Python or other relevant programming languages. • Familiarity with testing frameworks and automated testing tools for AI models. • Strong analytical and problem-solving skills. Nice-to-haves • Experience in testing AI models with real-world datasets. • Knowledge of model versioning, deployment, and monitoring. Apply tot his job
Apply Now

Similar Opportunities

Quick Reminder - Lead Manual and Automation Tester, Scrum Master - Remote, USA

Remote Full-time

Automation Tester/Developer w/Top Secret Clearance Remote / Telecommute Jobs

Remote Full-time

QA Tester (Manual and Automation)

Remote Full-time

[Remote] Salesforce automation tester with security

Remote Full-time

QA Selenium Automation Tester - ONLY W2

Remote Full-time

QE Tester

Remote Full-time

Automation Tester/Engineer

Remote Full-time

Principal QA Automation Engineer

Remote Full-time

Automation Tester (Remote – US)

Remote Full-time

Automated Test Engineer - US Citizenship Required

Remote Full-time

Administrative Specialist 2, Bilingual Spanish/English

Remote Full-time

**Experienced Full Stack Data Entry Operator – Utility Account Management and Customer Service Support**

Remote Full-time

Risk, Governance & Compliance Consultant

Remote Full-time

Vulnerability and Application Scanning Lead - REMOTE (Fort Knox (REMOTE), KY, US)

Remote Full-time

Experienced Data Entry and Administrative Support Specialist – Remote Work Opportunity with blithequark

Remote Full-time

Experienced Customer Support Associate - Remote Role at blithequark

Remote Full-time

Dental Billing Success Consultant

Remote Full-time

Experienced Live Chat Agent / Guest Relation Officer - Remote Customer Service Excellence at blithequark

Remote Full-time

Crisis Triage Specialist - Regional Crisis/988 Line - WEEKDAY NIGHT 11PM-7:30AM

Remote Full-time

**Experienced Data Entry Analyst – Financial Services Industry**

Remote Full-time
← Back to Home