Data Engineer (AI Enablement)

Remote Full-time
THE JOB / Data Engineer (AI Enablement) STRATEGY / Responsible for building and operating the data foundations that power Octagon’s AI solutions and enterprise search. • **Our headquarters are in Stamford, CT, but the location of this position can be 100% remote for qualified candidates. You’re a systems-minded builder who turns messy, multi-source data into reliable, searchable, and governed knowledge. Your mission is to stand up the pipelines, vector search, and metadata standards that make AI tools accurate, fast, and safe. You’ll partner closely with the Solutions Engineer (peer role) to take prototypes and ship durable infrastructure—ingestion, embeddings, indexing, and APIs—so teams can find and use what they need. You’ll report to the Director, Data Strategy and work across departments to reduce manual effort, improve data quality, and enable AI-powered workflows at scale. THE WORK YOU’LL DO • Data foundations: Design and operate the vector database/search layer (e.g., FAISS/pgvector/Milvus) and document-chunking/embedding pipelines that make Octagon’s content discoverable and auditable. • Scalable pipelines for AI/ML/LLM: Implement and maintain ELT/ETL to support downstream workflows such as data labeling, classification, and document parsing; build robust validations, lineage, and observability. • Retrieval APIs: Expose governed retrieval endpoints that respect permissions (ACLs), support metadata filters, and return source snippets/IDs for grounding and citations. • Data structuring & manipulation: Normalize, transform, and move JSON and other structured payloads cleanly through workflows to ensure reliable handoffs and automation outputs. • Align & collaborate: Align product peers, design, data science, engineering, and commercial teams around a unified roadmap and shared data contracts. • Operationalize prototypes: Take MVPs from the Solutions Engineer and productionize with CI/CD, telemetry, cost/usage guardrails, and pilot → rollout gating. • Reliability & security: Build monitoring (freshness, re-index SLAs, retrieval quality), secrets management, access controls, and audit logging aligned with enterprise governance. • Flexibility and willingness to travel and work weekends or holidays as needed. Anticipated travel level: Low (0–15%). THE BIGGER TEAM YOU’LL JOIN Recognized as one of the “Best Places to Work in Sports”, Octagon is the global sports, entertainment, and experiential marketing arm of the Interpublic Group. We take pride in being Playmakers – finding insightful, bold ways to create play in our work, our lives, and in the world. We believe in the power of play to create big ideas and unlock potential for our clients and talent. We can put ourselves in the shoes of fans because we ARE fans – of sports, entertainment, and culture at large. This expertise allows us to continually evolve the fan experience across sports and entertainment alongside some of the biggest brands and talent in the world. The world needs play more than ever. Are you a Playmaker? WHO WE’RE LOOKING FOR • 3+ years (or equivalent portfolio) building data systems: data modeling, ELT/ETL, Python + SQL; experience with cloud object storage and relational databases. • Hands-on with embeddings and vector databases (e.g., FAISS/pgvector/Milvus) and document processing pipelines for RAG-style retrieval. • Scalable pipeline experience supporting AI/ML/LLM use cases (labeling, classification, doc parsing) and partnering closely with Data Science and Data Labeling teams. • Data structuring & manipulation expertise: cleanly normalizing and transforming JSON/Parquet/CSV payloads; designing resilient data contracts and schemas. • Orchestration/ops: Airflow/Prefect (or similar), CI/CD, structured logging/monitoring, cost/usage guardrails; secure secrets management. • Strong collaboration and communication skills; proven ability to align product/design/engineering/commercial stakeholders around a unified roadmap. Nice-To-Haves • Enterprise connectors and productivity stacks (e.g., Microsoft 365/SharePoint/Teams/Graph, Copilot or Copilot Studio/Power Automate; Google Workspace; Salesforce; DAMs). • Experience implementing LLM inference patterns, similarity search, guardrails, and memory; familiarity with agent frameworks or custom orchestration. • Additional languages for systems work (e.g., C++, C#, Java, or Go). • Containers (Docker), GitHub Actions, IaC; lightweight internal UIs (Streamlit or R Shiny) to expose services. • Familiarity with marketing/media-measurement datasets and associated normalization/quality checks. The base range for this position is $90,000 – $100,000. Where an employee or prospective employee is paid within this range will depend on, among other factors, actual ranges for current/former employees in the subject position; market considerations; budgetary considerations; tenure and standing with the company (applicable to current employees); as well as the employee’s/applicant’s background pertinent experience, and qualifications We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, sex, sexual orientation, age, disability, gender identity, marital or veteran status, or any other protected class. Apply tot his job
Apply Now

Similar Opportunities

AI Automation Developer for Workflow 1 (SEO Blogs, Images & AI Videos)

Remote Full-time

[Remote] Data Strategist & Automation Specialist

Remote Full-time

AI Agent Developer Needed – Social Media Automation & Lead Management

Remote Full-time

IT Application Developer (AI-Driven IT Ops) Intern

Remote Full-time

AI Automation & Agent Developer (OpenAI, Zapier/n8n) - Project-Based - Contract to Hire

Remote Full-time

[Remote] Automation Engineers

Remote Full-time

[Remote] Devin.ai Consultant (Enterprise AI Implementation Specialist)

Remote Full-time

[Remote] Sr. Advanced ETL Data Engineer / AWS Data Engineer-Dimensional Modeling for Analytics & reporting.(W2)

Remote Full-time

Senior Data/AI Engineer ($55/hr )Lead AI and Data Solutions Engineer ($60/hr)

Remote Full-time

[Remote] Cloud Data Engineer (Strictly W2 ONLY)

Remote Full-time

[Remote] Insurance Specialist, Fully Remote

Remote Full-time

**Experienced Online Typist / Data Entry Clerk / Transcriptionist – Remote Opportunity at blithequark**

Remote Full-time

[Remote] Associate Director, Performance Media & Measurement (NY, 21916)

Remote Full-time

**Experienced Healthcare Customer Service Representative – Work At Home Opportunity with blithequark**

Remote Full-time

AI Agent Developer / Automation Builder (Standalone AIPA Agents – Capitalise Digital Co.) - Contract to Hire

Remote Full-time

Paid Media Coordinator

Remote Full-time

Specialty Benefits, Financial Protection Practice Leader - Remote

Remote Full-time

Senior Strategic Planning Analyst

Remote Full-time

**Experienced Customer Service Representative – Remote Walmart Reseller Chat Support – Up To $27 per hour**

Remote Full-time

Entry Level Data Entry Specialist – Immediate Hiring Opportunity for Career Growth and Development at blithequark

Remote Full-time
← Back to Home