Traversaal/Labs/Case studies/Enterprise learning
Case study · Enterprise learning · 2025

Turning ten million pages of leadership expertise into a learning tool users can trust.

We built an agentic retrieval system for a global leadership development firm that answers a learner's question in conversation, plans its own search across decades of research, and shows the exact page behind every answer.

Overview

Decades of frameworks, assessments and facilitation guides. Ten million pages of it. And almost none of it reachable when a learner actually needed an answer. The body of expertise was the firm's greatest asset — a global leadership development organisation serving Fortune 500 clients and individual learners alike. It was also, in practice, locked inside documents.

Conversational Q&A over a proprietary corpus, every answer cited back to source. No hallucinated frameworks, no orphaned claims.

Challenge  | Ten million pages is an asset — and a liability.

The firm's content library had grown over decades. The same leadership concept might appear in a 1990s framework, a 2010s revision, a case study, and a facilitator's guide — four documents, overlapping but not identical, each written for a different purpose. A learner got all four back with no signal about which to trust.

  • Overlapping content, no hierarchy. The same concept lived across multiple document types — framework, case study, facilitation guide — each authoritative for a different reason. Standard retrieval treated them as interchangeable text.
  • Format authority mattered. A competency model is trustworthy because of its research methodology. A facilitation guide because of who wrote it and how it's used in the room. The retrieval layer had to weight these differently.
  • Every answer had to be defensible. Any answer given to a Fortune 500 learning director had to be cited, traceable, and grounded in a passage the firm had written. Not 'probably right' — defensible.

The work  | An agentic retrieval layer built on the firm's own content.

We worked through the entire archive, removed duplicates, organised material by topic and authority, and gave every passage a clear lineage back to its original source. On top of that sits an agentic retrieval layer that plans its own search — deciding which document types to pull, weighing them by the authority their format carries, and running follow-up retrievals when an answer needs more than one source to stand up.

Conversational Q&A

Threaded sessions, suggested prompts, source-backed answers.

Citations & references

100% of answers grounded in the underlying corpus.

Ingestion at scale

10M+ pages chunked, embedded, indexed in Qdrant.

The product feels like talking to someone who has read everything we've ever published, and can show you the page.
Chief Product OfficerGlobal leadership development organisation

Outcomes  | Pilot-ready, with the metrics to defend it.

10M+
pages of proprietary content made conversational
100%
of answers cited back to trusted source material
B2B+D2C
enterprise licensing & individual subscriptions live

Customer-facing app, ingestion infrastructure, and the benchmarks the client uses to measure their own retrieval quality going forward.

About the client

Decades of expertise locked inside documents. We built an agentic retrieval system for a global leadership development firm that answers a learner's question in conversation, plans its own search across decades of research, and shows the exact page behind every answer.

Industry
Enterprise learning · L&D
Engagement
Discovery → pilot · 2025
Services
RAG · Ingestion at scale · Product discovery · Benchmarking
Stage
◈ Pilot-ready product
Next engagement

Got a corpus gathering dust?

Two or three clients at a time. 30-minute call, no pitch deck.

© 2026 Traversaal AI, Inc.