Hybrid Search Troubleshooting Checklist

Symptom checker

Use this guide to diagnose and fix search quality issues. Start by identifying the symptom, then follow the corresponding troubleshooting steps.

Common symptoms:

No matches found - Customer questions get no results or fallback message
Wrong answer returned - Chatbot matches an incorrect Q&A pair
Low confidence scores - Matches exist but scores are below 0.70
Too many similar matches - Multiple Q&A pairs compete for the same question
Technical terms ignored - Specific product codes or technical terms don't match correctly
Paraphrases don't match - Questions phrased differently fail to match

Testing Tool: Use Knowledge Base > Test to reproduce issues and see detailed scoring information for debugging.

Issue: No matches found

Symptom: Customer questions receive no results, or the chatbot uses a fallback message like "I don't know" or "Let me connect you with support."

Diagnostic checklist:

□ Check if Q&A pair exists: Search your knowledge base for content related to the question
□ Verify embeddings are generated: Semantic search won't work without embeddings
□ Check Q&A pair status: Ensure the Q&A is enabled/active, not disabled or in draft
□ Test in Knowledge Base test tool: Reproduce the issue to see if any matches appear with scores
□ Review confidence threshold: Matches may exist but fall below your chatbot's minimum threshold

Solutions:

If no Q&A pair exists:

Add a new Q&A pair covering this topic
Use customer's actual phrasing for the question
Write a clear, concise answer
Generate embeddings after adding

If Q&A exists but embeddings are missing:

Go to Knowledge Base
Click "Generate Embeddings"
Wait for generation to complete
Re-test the question

If Q&A exists but is disabled:

Edit the Q&A pair and set status to "Active"
Ensure the "Enabled" toggle is on
Save changes

If matches exist but scores are too low:

See Low confidence scores section below
May need to lower confidence threshold or improve Q&A phrasing

Common Mistake: Adding Q&A pairs but forgetting to generate embeddings. Semantic search won't work until embeddings exist.

Issue: Wrong answer returned

Symptom: The chatbot returns an answer, but it's not relevant or addresses a different question than the customer asked.

Diagnostic checklist:

□ Test the question: Use Knowledge Base test tool to see all matches and their scores
□ Check if correct answer exists: Verify you have a Q&A for what the customer is actually asking
□ Compare scores: See if the correct answer appears but ranks lower than the wrong one
□ Review question phrasing: Check if questions in knowledge base use different terminology
□ Look for duplicates: Multiple similar Q&A pairs may confuse matching

Solutions:

If correct answer doesn't exist:

Add a new Q&A pair with precise question phrasing
Include key terms the customer used
Generate embeddings

If correct answer exists but ranks lower:

Improve the question phrasing: Edit to better match how customers ask
Add keywords to the question: Include specific terms customers use
Consider search weights: If technical terms are ignored, increase BM25 weight (see Tuning Search Weights)
Regenerate embeddings after editing

If duplicate Q&A pairs exist:

Identify semantic duplicates (similar questions with different wording)
Merge or delete duplicates, keeping only the best-phrased one
See Knowledge Base Hygiene for duplicate management

Example: Customer asks "How much does shipping cost?" but gets an answer about return policy. Check if your knowledge base has a shipping Q&A, and verify its question includes "shipping cost" or "delivery fee."

Issue: Low confidence scores

Symptom: Questions match Q&A pairs, but combined scores are below 0.70. The chatbot may use fallback responses even though a match exists.

Diagnostic checklist:

□ Test the question: Check semantic, BM25, and combined scores
□ Compare customer question to Q&A question: How different is the phrasing?
□ Check for keyword overlap: Does the Q&A question include terms from customer question?
□ Review semantic score: Low semantic score indicates phrasing/meaning mismatch
□ Review BM25 score: Low BM25 score indicates missing keywords

Solutions:

If semantic score is low (< 0.60):

Rephrase the question to better match customer language
Use more natural, conversational phrasing
Consider creating multiple Q&A pairs for different phrasings
Regenerate embeddings after editing

If BM25 score is low (< 0.50):

Add key terms from the customer question into your Q&A question
Include synonyms and alternate terminology
Ensure important technical terms or product names appear in the question

If both scores are moderate but combined is low:

This is expected - no single improvement needed
Consider if this truly should match, or if a new Q&A is needed
You may need to adjust confidence threshold settings

Example improvement:

Customer question: "Can I get a refund?"

Before (low scores):
  Q: "What is our refund policy?"
  Semantic: 0.65  (okay match)
  BM25: 0.40      (missing "refund" keyword prominently)
  Combined: 0.58  (below threshold)

After (improved):
  Q: "Can I get a refund or return?"
  Semantic: 0.88  (much better match)
  BM25: 0.75      (includes "refund" keyword)
  Combined: 0.84  (above threshold) ✓

Important: Always regenerate embeddings after editing questions. The semantic score won't improve until new embeddings are generated.

Issue: Too many similar matches

Symptom: A customer question matches multiple Q&A pairs with similar scores. The chatbot may return inconsistent answers or show low confidence.

Diagnostic checklist:

□ Test the question: Check how many Q&A pairs match with similar scores
□ Review matched Q&As: Are they duplicates or genuinely different topics?
□ Compare answers: Do the matched Q&As provide the same or conflicting information?
□ Check for vague questions: Overly broad Q&A questions match too many customer questions

Solutions:

If matched Q&As are semantic duplicates:

Merge duplicates: Keep the best-phrased question and delete the rest
If answers differ slightly, combine into one comprehensive answer
See Knowledge Base Hygiene for duplicate detection

If Q&As cover different topics but seem similar:

Make questions more specific: Add distinguishing details to each question
Include context in questions: "How do I return a damaged item?" vs. "How do I return an unwanted item?"
Add specific keywords that differentiate the topics

If Q&As conflict (provide different answers to similar questions):

This is a serious knowledge base issue
Identify the correct answer and delete or fix the incorrect one
If both are correct for different scenarios, clarify the questions to distinguish them

Example: Customer asks "How long does shipping take?" Multiple Q&As match: "Domestic shipping times," "International shipping times," "Express shipping options." Make each question more specific to reduce ambiguity.

Issue: Technical terms ignored

Symptom: Questions with specific technical terms, product codes, or model numbers match incorrect or overly general answers.

Diagnostic checklist:

□ Test with technical term: Check if the specific term appears in matched Q&A questions
□ Review BM25 score: Should be high if technical term matches
□ Check search weights: May be too semantic-heavy for your use case
□ Verify Q&A exists: Do you have a Q&A pair specifically for this technical term?

Solutions:

If Q&A with technical term doesn't exist:

Add specific Q&A pairs for each technical term, product code, or model
Include the exact technical term in the question
Example: "How do I configure OAuth2?" not just "How do I configure authentication?"

If Q&A exists but doesn't rank first:

Increase BM25 weight: Change from 70/30 to 50/50 or 40/60 (see Tuning Search Weights)
This gives keyword matching more influence
Test with multiple technical questions before deploying

If technical term appears in question but isn't prominent:

Rephrase question to emphasize the technical term
Put technical terms early in the question
Example: "OAuth2 configuration steps" rather than "How do I set up authentication using OAuth2?"

Best Practice: For technical content, API documentation, or product catalogs, use 50/50 or 40/60 semantic/BM25 weights to ensure precise matching of technical terms.

Issue: Paraphrases don't match

Symptom: Customers rephrase questions in different ways, and the chatbot fails to match even though you have the information.

Diagnostic checklist:

□ Verify embeddings exist: Semantic matching requires embeddings
□ Test the paraphrase: Check semantic score in test tool
□ Review search weights: May be too keyword-heavy (high BM25 weight)
□ Check question phrasing: Is your Q&A question too specific or formal?

Solutions:

If embeddings are missing:

Generate embeddings immediately (semantic search won't work without them)
Re-test after generation completes

If semantic score is low for paraphrases:

Rephrase Q&A question to be more natural and conversational
Use common, everyday language instead of formal or technical phrasing
Consider creating multiple Q&A pairs for common variations
Regenerate embeddings after editing

If BM25 weight is too high:

Increase semantic weight: Change from 70/30 to 80/20 or 85/15
This prioritizes meaning over exact keyword matches
See Tuning Search Weights

Example improvement:

Customer variations:
  "Can I send this back?"
  "What if I don't like it?"
  "How do I get my money back?"

Before (formal phrasing - low semantic match):
  Q: "What is the return policy for purchased items?"

After (natural phrasing - high semantic match):
  Q: "Can I return or get a refund?"

Pro Tip: Review real customer questions from support tickets or chat logs. Use their exact language when writing Q&A questions.

General diagnostics

If none of the specific issues above apply, use these general troubleshooting steps.

Quick diagnostic workflow:

Reproduce the issue: Use Knowledge Base > Test to test the problematic question
Review all matches: See what Q&A pairs match and with what scores
Analyze score breakdown: Check semantic, BM25, and combined scores
Compare to expectations: Determine which Q&A should match and why it doesn't
Apply targeted fix: Use appropriate solution from above sections
Regenerate embeddings: If you edited any questions or answers
Re-test: Verify the issue is resolved

Common root causes:

70% of issues: Missing Q&A pairs or poor question phrasing
20% of issues: Embeddings not generated after changes
10% of issues: Search weight configuration or duplicates

When to contact support:

Reach out to SoundMinds.ai support if:

Embeddings generation consistently fails
Search behaves inconsistently (same question, different results)
All troubleshooting steps fail to improve search quality
You need help analyzing complex scoring issues

Prevention: Regular knowledge base maintenance prevents most issues. See Knowledge Base Hygiene for best practices.

Symptom checker

Common symptoms:

Issue: No matches found

Diagnostic checklist:

Solutions:

If no Q&A pair exists:

If Q&A exists but embeddings are missing:

If Q&A exists but is disabled:

If matches exist but scores are too low:

Issue: Wrong answer returned

Diagnostic checklist:

Solutions:

If correct answer doesn't exist:

If correct answer exists but ranks lower:

If duplicate Q&A pairs exist:

Issue: Low confidence scores

Diagnostic checklist:

Solutions:

If semantic score is low (< 0.60):

If BM25 score is low (< 0.50):

If both scores are moderate but combined is low:

Example improvement:

Issue: Too many similar matches

Diagnostic checklist:

Solutions:

If matched Q&As are semantic duplicates:

If Q&As cover different topics but seem similar:

If Q&As conflict (provide different answers to similar questions):

Issue: Technical terms ignored

Diagnostic checklist:

Solutions:

If Q&A with technical term doesn't exist:

If Q&A exists but doesn't rank first:

If technical term appears in question but isn't prominent:

Issue: Paraphrases don't match

Diagnostic checklist:

Solutions:

If embeddings are missing:

If semantic score is low for paraphrases:

If BM25 weight is too high:

Example improvement:

General diagnostics

Quick diagnostic workflow:

Common root causes:

When to contact support:

Related Guides

Search Modes Explained: Semantic, BM25, and Hybrid

Tuning Search Weights for Better Answers

Improving Search Quality with Knowledge Base Hygiene