Compress the HistoryRetriever context by asking a summary to the LLM
-
Try to ask a summary by keeping the messages pertaining to the user request.
If it takes too long, maybe try to summarize in background or at the end of the answer
Edited by Sébastien DA ROCHA