[product documentation] experiment with a "highlight" summarizer #205921
Labels: Feature:AI Product Docs (Product Documentation for AI workflows), Team:AI Infra (AppEx AI Infrastructure Team)

Comments
pgayvallet added the Feature:AI Product Docs (Product Documentation for AI workflows) and Team:AI Infra (AppEx AI Infrastructure Team) labels on Jan 8, 2025
Pinging @elastic/appex-ai-infra (Team:AI Infra)
This was referenced Jan 14, 2025
kibanamachine pushed a commit to kibanamachine/kibana that referenced this issue on Jan 16, 2025:
## Summary

Fix elastic#205921

- Implements a new summary strategy for the product documentation, based on `semantic_text` highlights
- Sets that new strategy as the default one

### Why?

Until now, in case of excessive token count, we used an LLM-based summarizer. Realistically, highlights will always be worse than calling an LLM for an in-context summary, but from my testing, highlights seem good enough, and the speed difference (instant for highlights versus multiple seconds, up to a dozen, for the LLM summary) is very significant and seems worth it overall.

The main upside of this change, given that requesting the product doc will be much faster, is that we can then tweak the assistant's instructions to call the product_doc tool more aggressively between user messages without risking the user experience (waiting much longer between messages). *This will be done as a follow-up.*

### How to test?

Install the product doc, ask questions to the assistant, and check the tool calls (sorry, there is no better option at the moment).

Note: this works with both versions of the product doc artifacts, so the dev repository is not needed.

(cherry picked from commit c9286ec)
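The strategy described above can be sketched as follows. This is a hypothetical illustration, not the actual Kibana implementation: the names (`SummaryStrategy`, `summarizeDoc`, `countTokens`) and the 4-characters-per-token estimate are assumptions made for the example.

```typescript
// Hypothetical sketch of choosing between the highlight-based and
// LLM-based summary strategies when a document exceeds the token budget.

type SummaryStrategy = 'highlight' | 'llm';

interface RetrievedDoc {
  content: string;
  // semantic_text highlight fragments already returned by the search response
  highlights?: string[];
}

// Crude token estimate (~4 characters per token), for illustration only.
const countTokens = (text: string): number => Math.ceil(text.length / 4);

function summarizeDoc(
  doc: RetrievedDoc,
  maxTokens: number,
  strategy: SummaryStrategy = 'highlight' // highlights as the default strategy
): string {
  if (countTokens(doc.content) <= maxTokens) {
    return doc.content; // under budget: return the full document as-is
  }
  if (strategy === 'highlight' && doc.highlights?.length) {
    // Instant: reuse the most relevant fragments from the search response,
    // instead of paying for a multi-second LLM summarization call.
    return doc.highlights.join('\n\n');
  }
  // LLM fallback would go here; truncation stands in for it in this sketch.
  return doc.content.slice(0, maxTokens * 4);
}
```

The design trade-off is exactly the one described in the commit message: highlight fragments are a cheaper, lower-fidelity summary, but they arrive with the search response at no extra latency.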
viduni94 pushed a commit to viduni94/kibana that referenced this issue on Jan 23, 2025:
(same commit message as above)
`semantic_text` will support highlighting in 8.18 / 9.0. We should experiment with a highlight-based summarizer that could replace the LLM summarizer we're currently using. Even if it is probably less powerful, the upside is that it would be significantly faster than calling the LLM for summarization, which could make it a great default.
kibana/x-pack/platform/plugins/shared/ai_infra/llm_tasks/server/tasks/retrieve_documentation/retrieve_documentation.ts
Lines 52 to 63 in a0f5a7f
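As a rough sketch of what such a highlight-based retrieval could look like, the request below uses the `semantic` highlighter on a `semantic_text` field to fetch only the top-scoring fragments. This is an assumption-laden illustration: the index name `product-docs`, the field name `content`, and the query text are all made up for the example, and the exact request shape should be checked against the Elasticsearch docs for the target version.

```typescript
// Hedged sketch: a search request using the `semantic` highlighter
// (available for semantic_text fields from 8.18 / 9.0) to retrieve the most
// relevant fragments, avoiding an LLM summarization call entirely.
// Index and field names are illustrative, not the actual Kibana ones.

const searchRequest = {
  index: 'product-docs',
  query: {
    semantic: {
      field: 'content',
      query: 'how do I configure index lifecycle policies?',
    },
  },
  highlight: {
    fields: {
      content: {
        type: 'semantic',       // semantic highlighter for semantic_text fields
        number_of_fragments: 3, // keep only the top-scoring chunks
        order: 'score',         // best fragments first
      },
    },
  },
};
```

The returned highlight fragments could then be concatenated and handed to the assistant directly, which is what makes this approach effectively instant compared to an LLM summary.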