Skip to content
Back to the workshop

Search Beyond Keywords

How semantic search changes what "finding things" means

C
Cleo's TeamBuilding Cleo
3 min read

When a user searches for "summer campaign ideas" in a keyword-based system, they get documents that contain those specific words. When they search in a semantic system, they get documents about warm-weather marketing, seasonal promotions, outdoor lifestyle content, and holiday-adjacent campaigns - even if none of those documents contain the word "summer."

This is the difference between matching strings and understanding meaning. In an AI marketing platform where the quality of context directly determines the quality of output, this difference is enormous.

Embeddings as understanding

Semantic search works by converting text into high-dimensional vector representations - points in a mathematical space where distance corresponds to meaning. Two pieces of text that are about the same topic will be close together in this space, regardless of whether they share vocabulary.

We embed every piece of knowledge in the system: brand documents, content pieces, campaign results, audience insights, product descriptions. When the AI needs context for a task, the retrieval system finds the semantically closest knowledge to the current need, not the most keyword-similar.

Why this matters for AI context

The context assembly pipeline depends on finding the right information for the current task. If the user asks the AI to "draft something for the holiday sale," the system needs to retrieve pricing information, past promotional content, audience segments that respond to discounts, and brand voice guidelines for promotional communication. Keyword search would require the user to have labeled all of these with "holiday sale" - which they did not, because they created them in different contexts.

Semantic search bridges the gap between how information was created and how it is needed. The pricing document was created during product setup. The audience segment was defined during onboarding. The brand voice was established in the first conversation. None of them were tagged for this specific use case. Semantic search finds them anyway because it understands the relationship between the current task and the stored knowledge.

The freshness dimension

Semantic relevance is not the only dimension of retrieval. Freshness matters - recent campaign results are more relevant than results from six months ago. Specificity matters - a document about this exact product is more relevant than one about the product category. Authority matters - a brand guide created by the user carries more weight than inferred brand patterns.

We combine semantic similarity with these additional signals to produce a ranked retrieval that reflects what is genuinely most useful for the current task. Pure semantic distance would surface many relevant results. Weighted, multi-dimensional retrieval surfaces the right ones.

The invisible infrastructure

The user never sees any of this. They do not know that their question triggered a vector search that retrieved four documents ranked by semantic similarity, freshness, and authority. They just see an AI response that references their pricing, their brand voice, and their recent campaign performance as if it remembered all of it.

That feeling of "it remembers" is the product experience. The semantic search infrastructure is what makes it real.

- Cleo's Team

C

Written by Cleo's Team

Building Cleo, an AI marketing operating system. These posts cover the architecture decisions, technical challenges, and lessons learned along the way.

More from the workshop