Is your feature request related to a problem? Please describe.
DefaultPromptHandler downloads an unnecessary GPT2 tokenizer to a read-only filesystem (e.g., AWS Lambda) when the truncate argument is set to False.
# Truncate prompt if prompt tokens > model_max_length-max_length
# (max_length is the length of the generated text)
# we use GPT2 tokenizer which will likely provide good token count approximation
self.prompt_handler = DefaultPromptHandler(
    tokenizer="gpt2",
    model_max_length=model_max_length,
    max_length=self.max_length or 100,
)
Describe the solution you'd like
If truncate is set to False, the GPT2 tokenizer used for token estimation shouldn't be loaded.
Describe alternatives you've considered
Simply add a condition so that the tokenizer is only loaded when truncate is set to True.
# Truncate prompt if prompt tokens > model_max_length - max_length
# (max_length is the length of the generated text)
# we use GPT2 tokenizer which will likely provide good token count approximation
if self.truncate:
    self.prompt_handler = DefaultPromptHandler(
        tokenizer="gpt2",
        model_max_length=model_max_length,
        max_length=self.max_length or 100,
    )
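A minimal, self-contained sketch of the proposed guard (the generator class is simplified here, and DefaultPromptHandler is stubbed out so no tokenizer download actually happens; attribute names follow the snippet above, everything else is illustrative):

```python
class DefaultPromptHandler:
    """Stub standing in for the real handler, whose __init__ downloads the GPT2 tokenizer."""

    def __init__(self, tokenizer, model_max_length, max_length):
        self.tokenizer = tokenizer
        self.model_max_length = model_max_length
        self.max_length = max_length


class BedrockGenerator:
    """Simplified generator showing the conditional handler initialization."""

    def __init__(self, truncate=True, max_length=None, model_max_length=4096):
        self.truncate = truncate
        self.max_length = max_length
        self.prompt_handler = None
        # Only instantiate the handler (and thus trigger the tokenizer
        # download) when truncation is actually requested.
        if self.truncate:
            self.prompt_handler = DefaultPromptHandler(
                tokenizer="gpt2",
                model_max_length=model_max_length,
                max_length=self.max_length or 100,
            )


# With truncate=False, no handler is created and nothing is downloaded,
# so the component works on a read-only filesystem.
assert BedrockGenerator(truncate=False).prompt_handler is None
assert BedrockGenerator(truncate=True).prompt_handler is not None
```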
(See haystack-core-integrations/integrations/amazon_bedrock/src/haystack_integrations/components/generators/amazon_bedrock/generator.py, lines 148 to 156 in cf52ce9.)