Role: Senior AI / ML Engineer - Data Scientist
Location: Remote - US
Direct Client
Contract: W2
Minimum Requirements for Role
Need proven enterprise-grade experience with agentic AI architectures -not theoretical knowledge, but real deployments at scale. Specifically, we are looking for expertise across multi-agent orchestration (e.g. AutoGen, Semantic Kernel, LangGraph or similar), LLM evaluation frameworks including LLM-as-a-judge methodologies, content safety tooling and responsible AI practices, and strong data science foundations to support model selection, evaluation, and performance analysis.
Product and Functional Skills
• You''ll be embedded in a team building complex agentic AI systems from the ground up, contributing across design, development, and deployment. This includes architecting and implementing multi-agent orchestration frameworks, integrating LLM-based evaluation pipelines (LLM-as-a-judge), and ensuring robust content safety practices are baked into the solution.
• Experience working in professional services or M&A technology environments is a strong advantage. Given the caliber expected, you should be comfortable navigating ambiguity and helping define the approach, not just executing against a pre-built spec.
Mandatory Skills:
• LLM Model with experience in OpenAl on Azure platform
• Expertise in Agentic Al architecture
• With good knowledge of Langchain, langraph
• Good understanding of Hallucination prevention using LLM as Judge
• Hands on experience in Training and fine tuning LLM models & Model context protocol (MCP)
• Good background in deep learning & transformer architecture