Description
Describe the bug
The following functions in https://github.com/microsoft/chat-copilot/blob/main/webapi/Plugins/Chat/ChatPlugin.cs take a long time to complete.
Model: GPT-4o in Azure
- GetAudienceAsync = 22965 ms
https://github.com/microsoft/chat-copilot/blob/main/webapi/Plugins/Chat/ChatPlugin.cs#L363
- ExtractChatHistory = 9623 ms
https://github.com/microsoft/chat-copilot/blob/main/webapi/Plugins/Chat/ChatPlugin.cs#L111
- GetUserIntentAsync = 10615 ms
https://github.com/microsoft/chat-copilot/blob/main/webapi/Plugins/Chat/ChatPlugin.cs#L406
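For anyone who wants to reproduce per-call numbers like the ones above, here is a minimal sketch of a timing wrapper built on `System.Diagnostics.Stopwatch`. The `TimingSketch` class and `TimeAsync` helper are hypothetical names used only for illustration; they are not part of chat-copilot or Semantic Kernel, and this is not necessarily how the figures above were collected.

```csharp
using System;
using System.Diagnostics;
using System.Threading.Tasks;

public static class TimingSketch
{
    // Hypothetical helper: awaits the supplied async operation and returns
    // its result together with the elapsed wall-clock time in milliseconds.
    public static async Task<(T Result, long ElapsedMs)> TimeAsync<T>(Func<Task<T>> operation)
    {
        var stopwatch = Stopwatch.StartNew();
        T result = await operation();
        stopwatch.Stop();
        return (result, stopwatch.ElapsedMilliseconds);
    }
}
```

Each of the three calls could be wrapped in a lambda and passed to `TimeAsync`, with the returned `ElapsedMs` value written to the log.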
To Reproduce
Steps to reproduce the behavior:
Run the web API app and ask a question that includes context.
Expected behavior
Faster response time
Platform
- Windows
- Visual Studio, VS Code
- Language: C#, JS
- Source: [e.g. latest version
Additional context
- What can be done to improve the response time of these functions?
- How can ExtractChatHistory, GetAudienceAsync, and GetUserIntentAsync be made to work more efficiently with Semantic Kernel?
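One direction that might be worth exploring is starting the audience and intent extraction at the same time instead of awaiting them one after the other. This assumes that neither call needs the other's output, which I have not verified against the current ChatPlugin.cs. The sketch below uses hypothetical stand-in methods (`FakeGetAudienceAsync`, `FakeGetUserIntentAsync`) that only simulate latency with `Task.Delay`; the real methods have different signatures.

```csharp
using System;
using System.Diagnostics;
using System.Threading;
using System.Threading.Tasks;

public static class ConcurrentExtractionSketch
{
    // Hypothetical stand-in for the audience extraction call.
    public static async Task<string> FakeGetAudienceAsync(int delayMs, CancellationToken ct = default)
    {
        await Task.Delay(delayMs, ct); // simulates model latency
        return "audience";
    }

    // Hypothetical stand-in for the user intent extraction call.
    public static async Task<string> FakeGetUserIntentAsync(int delayMs, CancellationToken ct = default)
    {
        await Task.Delay(delayMs, ct); // simulates model latency
        return "intent";
    }

    // Starts both calls before awaiting either one, so the total latency is
    // roughly that of the slower call rather than the sum of both.
    public static async Task<(string Audience, string Intent, long ElapsedMs)> RunConcurrentlyAsync(
        int audienceDelayMs, int intentDelayMs, CancellationToken ct = default)
    {
        var stopwatch = Stopwatch.StartNew();
        Task<string> audienceTask = FakeGetAudienceAsync(audienceDelayMs, ct);
        Task<string> intentTask = FakeGetUserIntentAsync(intentDelayMs, ct);
        await Task.WhenAll(audienceTask, intentTask);
        stopwatch.Stop();
        return (await audienceTask, await intentTask, stopwatch.ElapsedMilliseconds);
    }
}
```

With simulated delays of 200 ms and 300 ms, the concurrent version should finish in roughly 300 ms instead of about 500 ms. Whether the same applies to the real calls depends on whether they are truly independent and on the rate limits of the Azure deployment.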