We just published research on a new pattern: Machine Learning as a Tool (MLAT) [Research]

We just published our research on what we’re calling “Machine Learning as a Tool” (MLAT) – a design pattern for integrating statistical ML models directly into LLM agent workflows as callable tools.

The Problem:

Traditional AI systems treat ML models as separate preprocessing steps. But what if we could make them first-class tools that LLM agents invoke contextually, just like web search or database queries?

Our Solution – PitchCraft:

We built this for the Google Gemini Hackathon to solve our own problem (manually writing proposals took 3+ hours). The system:

– Analyzes discovery call recordings

– Research Agent performs parallel tool calls for prospect intelligence

– Draft Agent invokes an XGBoost pricing model as a tool call

– Generates complete professional proposals via structured output parsing

Result: 3+ hours → under 10 minutes

Technical Highlights:

– XGBoost trained on just 70 examples (40 real + 30 synthetic) with R² = 0.807

– 10:1 sample-to-feature ratio under extreme data scarcity

– Group-aware cross-validation to prevent data leakage

– Sensitivity analysis showing economically meaningful feature relationships

– Two-agent workflow with structured JSON schema output

Why This Matters:

We think MLAT has broad applicability to any domain requiring quantitative estimation + contextual reasoning. Instead of building traditional ML pipelines, you can now embed statistical models directly into conversational workflows.

Links:

– Full paper: Zenodo, ResearchGate

Would love to hear thoughts on the pattern and potential applications!

submitted by /u/okay_whateveer
[link] [comments]

Liked Liked