Optimization: Cash kernels and move creation to correct device #227

zakajd · 2021-02-04T18:33:18Z

Is your feature request related to a problem? Please describe.
Most metrics use some kind of kernels for extraction of image features.
Now those kernels are created each time when metric is called. That's slow and redundant when the function is called hundreds of times.

Another problem related to performance is creation of temporal tensors first on CPU and later moving them to GPU if needed (.to(x.device)).

Describe the solution you'd like

Add support for cashing kernels between runs. Add kernel param to functional API. Create and store kernel in class API
Replace .to(x.device) and simmular calls to explicit creation of tensor on target device. Like torch.ones(N, N, device=x.device).

Additional context
For metrics that use more than one kernel implementation details can be discussed.

The text was updated successfully, but these errors were encountered:

denproc · 2023-02-05T03:02:54Z

Hi @zakajd,

The second part of the proposed solution is partially implemented in #334.

While the proposed changes cover the tensor creation related to filters and colour conversions, some parts of the library could still use .to (piq/base.py, fid.py, perceptual.py, pieapp.py and piq/functional/resize.py).

zakajd added feature New feature or request enhancement Making some part of the codebase better without introduction of new features labels Feb 4, 2021

zakajd mentioned this issue Sep 2, 2021

High CPU usage for FSIM and FSIMc #266

Closed

zakajd mentioned this issue Mar 2, 2022

SSIM non-rectangular kernel size #306

Open

denproc self-assigned this Feb 5, 2023

denproc mentioned this issue Feb 5, 2023

FSIM Optimisation to Reduce CPU Usage #334

Merged

denproc linked a pull request Feb 5, 2023 that will close this issue

FSIM Optimisation to Reduce CPU Usage #334

Merged

denproc removed a link to a pull request Feb 5, 2023

FSIM Optimisation to Reduce CPU Usage #334

Merged

denproc mentioned this issue Feb 5, 2023

Redundant mesh grid creation for VSI #335

Closed

denproc removed their assignment Feb 5, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimization: Cash kernels and move creation to correct device #227

Optimization: Cash kernels and move creation to correct device #227

zakajd commented Feb 4, 2021

denproc commented Feb 5, 2023 •

edited

Loading

Optimization: Cash kernels and move creation to correct device #227

Optimization: Cash kernels and move creation to correct device #227

Comments

zakajd commented Feb 4, 2021

denproc commented Feb 5, 2023 • edited Loading

denproc commented Feb 5, 2023 •

edited

Loading