Hi, this is great work!
My question is: in the "Inference" part of the paper, you introduce the process for generating $C_\text{compressed}$. Which model is then used in the following step (i.e., producing the output answer from the query and $C_\text{compressed}$)?
Thanks a lot!
zhiyuan5986 changed the title from "How is the lantancy evaluation implemented?" to "What is the completed inference process?" on Feb 27, 2025
zhiyuan5986 changed the title from "What is the completed inference process?" to "What is the complete inference process?" on Feb 27, 2025
Judging from the script mentioned in the README, I guess it's gpt-3.5-turbo. Using that model, I got 92% of the average LongBench score reported in the paper. The paper also reports scores with gpt-4o.
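For anyone following along, here is a minimal sketch of the second step as I understand it: the query and the already-compressed context are joined into one prompt and passed to a separate answer model. The prompt template, the function names (`build_prompt`, `answer_query`) and the stub `echo_model` are my own assumptions for illustration, not code from this repo; in practice `generate` would wrap a call to whichever chat model you use (gpt-3.5-turbo is only my guess, as noted above).

```python
from typing import Callable


def build_prompt(query: str, compressed_context: str) -> str:
    """Join the compressed context and the query into one prompt.

    The template is hypothetical; the repo's script may format it differently.
    """
    return f"Context:\n{compressed_context}\n\nQuestion: {query}\nAnswer:"


def answer_query(
    query: str,
    compressed_context: str,
    generate: Callable[[str], str],
) -> str:
    """Step 2: send query + compressed context to the downstream answer model."""
    prompt = build_prompt(query, compressed_context)
    return generate(prompt).strip()


def echo_model(prompt: str) -> str:
    """Stand-in for a real chat-model call; always returns a canned reply."""
    return " stub answer "
```

Swapping `echo_model` for a function that calls a real model would leave the rest unchanged, which makes it easy to compare answer models on the same compressed context.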