What is the complete inference process? #4

Open
zhiyuan5986 opened this issue Feb 27, 2025 · 1 comment
@zhiyuan5986
Hi, this is great work!

My question is: in the paper's "Inference" section, you describe the process for generating $C_\text{compressed}$. Which model is used in the following step (i.e., generating the output answer from the query and $C_\text{compressed}$)?

Thanks a lot!

@zhiyuan5986 zhiyuan5986 changed the title How is the lantancy evaluation implemented? What is the completed inference process? Feb 27, 2025
@zhiyuan5986 zhiyuan5986 changed the title What is the completed inference process? What is the complete inference process? Feb 27, 2025
@No13Judas
Referring to the script mentioned in the README, I guess it's gpt-3.5-turbo. Using that model, I got 92% of the LongBench average score reported in the paper; the paper also reports scores with gpt-4o.
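For anyone else piecing the second step together, here is a minimal sketch of how the answering stage could look: the compressed context and the query are packed into a chat prompt and sent to a model such as gpt-3.5-turbo. The function and message wording below are illustrative assumptions, not code from this repository.

```python
# Hypothetical sketch of the answering step: combine the compressed
# context with the user query into chat messages for a model such as
# gpt-3.5-turbo. Names and prompt wording are illustrative only.

def build_messages(query: str, compressed_context: str) -> list[dict]:
    """Assemble chat messages from the compressed context and the query."""
    return [
        {"role": "system",
         "content": "Answer the question using only the context below."},
        {"role": "user",
         "content": f"Context:\n{compressed_context}\n\nQuestion: {query}"},
    ]

messages = build_messages("Who wrote the report?", "<C_compressed tokens>")

# The messages would then go to the answering model, e.g. with the
# OpenAI client (network call omitted in this sketch):
# client.chat.completions.create(model="gpt-3.5-turbo", messages=messages)
```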
