Skip to content

performance improvements of [SD]DOT on A64FX. #5352

Open
@iha-taisei

Description

@iha-taisei

The current [SD]DOT kernel uses 2x loop unrolling.
There is room for optimization on A64FX, increasing this factor should improve performance.
I will prepare and submit a patch with this optimization.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions