AI/ML Statistical Analyst (Volunteering Position - Unpaid Role)

Remote Full-time
Role Overview: We are seeking a highly analytical and methodologically rigorous AI Statistical Analyst. This individual will play a critical role in applying statistical inference tools to evaluate, validate, and improve the outputs of machine learning and generative AI models. The ideal candidate will bridge the gap between classical statistical analysis and cutting-edge AI by helping us assess the reliability of probabilistic claims generated by large language models (LLMs). You will be tasked with answering questions such as: “When an LLM reports that the probability of an individual attending Yale in the 21st century is X%, how do we know that this estimate is valid, testable, and actionable?” You will help us formalize methodologies for extracting, validating, and justifying such probabilistic outputs, and you will design experiments that ensure model-generated insights are statistically grounded and interpretable. Key Responsibilities • Develop methodologies for validating probabilistic outputs from LLMs using classical statistical techniques. • Design experiments and statistical tests to evaluate the calibration and reliability of probabilistic claims made by AI models. • Work cross-functionally with ML engineers, data scientists, and prompt engineers to integrate statistical validation into generative AI workflows. • Apply statistical inference techniques (e.g., regression analysis, Bayesian inference, significance testing) to model outputs and ground-truth data. • Construct and test hypotheses about LLM behaviors under different prompt and context configurations. • Communicate findings and methodologies clearly to both technical and non-technical audiences. Required Skills and Qualifications • Strong foundation in probability theory and its applications to uncertainty quantification in AI systems. • Proven experience with regression analysis, including linear, logistic, and multivariate techniques in real-world machine learning or AI projects. • Deep understanding of hypothesis testing and statistical significance, with practical experience in applying these concepts to AI/ML evaluation. • Familiarity with generative AI models such as LLMs (e.g., GPT-4, Claude, Gemini), including experience in crafting, refining, and interpreting prompts. • Proficiency with Python and statistical tools such as statsmodels, scipy, R, or similar. • Familiarity with Bayesian inference frameworks and uncertainty quantification methods (e.g., Monte Carlo simulation, posterior predictive checks). • Ability to design and interpret statistical experiments involving LLM-generated outputs. • Excellent communication skills, including the ability to convey statistical findings in clear, actionable terms. Preferred Qualifications • Graduate degree (MS or PhD) in Statistics, Data Science, Applied Mathematics, Econometrics, or a closely related field. • Prior experience in validating or auditing generative AI systems or developing statistical evaluation frameworks for ML outputs. • Experience working with LLM APIs and evaluating probabilistic reasoning in LLM-generated content. Apply tot his job
Apply Now

Similar Opportunities

Artificial Intelligence/Machine Learning - Adjunct Faculty - AIU Online/DigitalCrafts

Remote Full-time

[Work From Home] Machine Learning Engineers for Mechanical

Remote Full-time

Machine Learning / AI Senior Developer – REMOTE in Plano, TX

Remote Full-time

Machine Learning Research Scientist

Remote Full-time

Principal Product Manager in Responsible AI

Remote Full-time

Data ML Program Manager- Product Operations

Remote Full-time

Program Manager, GeminiApp Data Science

Remote Full-time

Senior Manager, Technical Program Management - AI and Platform

Remote Full-time

Principal Machine Learning Developer

Remote Full-time

Program Manager III OSC, Global Operations Artifitial Inteligence

Remote Full-time

Legal Analyst-Mergers & Acquisitions | Miami, FL, USA

Remote Full-time

Experienced Remote Live Chat Specialist – Customer Service and Support Expert for arenaflex

Remote Full-time

**Experienced Data Entry Specialist – Application Support – arenaflex**

Remote Full-time

Director, Sales Worldwide Accounts (Strategic Accounts)

Remote Full-time

Remote Hotel Regional Director of Sales (RDOS) - Hilton & Marriott Experience Required

Remote Full-time

**Experienced Remote Customer Service Representative – Delivering Exceptional Travel Experiences for blithequark**

Remote Full-time

Finance Manager (Remote from Florida)

Remote Full-time

Data Entry Specialist (Remote) - Part-Time Opportunity with blithequark: Accurate Data Management from the Comfort of Your Home

Remote Full-time

Process Improvement Specialist (Clinical)

Remote Full-time

Dental Assistant III- Closes 09/11/2025

Remote Full-time
← Back to Home