AMLC of the Rockies - December 2025
Your domain-specific model achieves 85.3% accuracy, but production requires 95%+. What do you do? This talk explores how ensemble models and multi-source data synthesis can bridge the gap when single GenAI models fall short.
Through a real-world case study of classifying customer support tickets for a building products company, discover how the "judge pattern" achieved 100% accuracy while maintaining explainability and cost-effectiveness. Learn when to use ensemble approaches, how to synthesize multiple data sources for greater value, and practical deployment strategies including shadow mode validation.
Watch the preview of this talk:
Download the complete slide deck from this presentation to dive deeper into the technical details, case studies, and implementation strategies.
Download Slides (PDF)
Client: A building products company
Problem: Classify customer support tickets into categories
Two Approaches Tested:
Challenges:
"When one model isn't enough, use multiple models working together"
Three Strategies Tested:
Result: All three achieved 100% accuracy! Judge pattern provides explainability + accuracy.
Advantages:
When Judge Runs:
"Don't just ensemble models. Ensemble data sources."
Manufacturing companies have data silos:
Key Insight: Synthesized answers provide specific recommendations, technical justification, part compatibility, installation guidance, risk mitigation, and source citations. Value > Sum of Parts.
Use Ensembles When:
Don't Use Ensembles When:
Founder, Applied Industrials
Building AI systems for industrial companies with a focus on domain-specific NLP challenges. Rachael specializes in solving the hard problems that arise when off-the-shelf AI models aren't enough for specialized industrial applications.
Connect with Rachael on LinkedIn