Skip to content

Search Process Overview

The search engine follows a structured eight-step process when handling a query:

Query Handling Process

  1. Preprocessing the Search Term Before processing the search, the system prepares the term by:

    • Removing special characters
    • Normalizing terms (e.g., "Levi's""levis", "s.oliver""soliver")
    • Removing hyphens and thousand separators
    • Converting negation words (e.g., "without""-" prefix)
    • Removing superscript numbers/letters
    • Removing stop words (e.g., "the", "and", "also")
    • Replacing umlauts
    • Converting to lowercase
    • Splitting compound words
    • Applying stemming and spelling correction
  2. Checking for Direct Links If a predefined direct link exists for the query, the user is redirected to a:

    • Content page
    • Product detail page
    • Specific internal or external URL If no link is found, the process continues.
  3. Searching for Replacements The system checks for predefined replacements for the term. If found, they affect Step 7 (result merging). Regardless, search continues as normal.

  4. Finding Synonyms The system checks for stored synonyms and adds them to the query. They are included with a small ranking penalty.

  5. Finding Synonyms for Misspellings Spellchecker-corrected terms are also added as synonyms. A penalty is applied based on the distance between the original and the corrected word.

  6. Searching the Search Index The search uses previously generated wordList and cutWordList. Every result starts at 100 points. Points are deducted based on where the term appears.

    Data FieldPenaltyDescription
    Brand0Product brand
    Short Description0Shown in result overview
    Target Group0Men, Women, Children
    Categories4Category placement
    Product Group4Internal classification
    Market Identifier (MKZ)4Internal marker
    Primary Color8Main product color
    Filter Values8Matched filterable attributes
    Dimensions8Size categories like “Short”, “Normal”, “Long”
    Long Description40Full text description
  7. Merging Results (if applicable) If replacements were found in Step 3, multiple result sets are merged and re-ranked.

    See Replacements for details TODO

  8. Outputting the Results Before the final results are returned:

    • The product ranking is finalized
    • Filters are calculated
    • Results are serialized as JSON The structure can be tenant-specific
    • Short-term caching applies only for large result sets

Ranking of Variations

Many products have multiple variants (e.g., color, size). The system ranks these variations using:

  • Rules from the Rule Engine
  • A BI-based scoring system to prioritize the best variant

Only the top-ranked variant is shown in the results.

Additional Factors in Search Ranking

Availability (Stock Level)

Stock LevelPenalty
≥ 80% availableNo penalty
45% – 80% available-10 points
< 45% available-50 points
0% (out of stock)Excluded entirely

Business Rule Value

  • Adds 5% weight to the product ranking
  • Values are normalized logarithmically
  • Top product gets a 100% boost, others scale accordingly