Releases: CESNET/GPUJPEG
GPUJPEG 0.27.5
Releases 0.27.2 to 0.27.5 bring the following changes:
- support non-ASCII file names in Windows
- additional encoder/decoder options (e.g. TGA RLE)
- support for UYVY files (.uyvy extension)
Other changes can be found in NEWS.
Binary builds target CUDA compute capability 3.0 on Linux and 3.5 on Windows (see the continuous release description for more details on the binary build specification).
Full changelog: https://github.com/CESNET/GPUJPEG/commits/v0.27.5
continuous build
Builds target CC 3.0 (real + virtual) on Linux and CC 3.5 on Windows.
Builds marked as “all CUDA architectures” target all real architectures and the highest virtual architecture supported by the most recent CUDA toolkit (so the minimum CC is 5.0 as of CUDA 12). These builds are useful if application start-up time is critical, because the CUDA runtime doesn't need to compile the kernels for the current architecture. Otherwise they may not provide any advantage, while increasing the binary size.
GPUJPEG 0.27.1
This release mainly brings support for additional RAW image formats. Other changes can be found in NEWS.
Binary builds target CUDA compute capability at least 3.0.
API changes
gpujpeg_image_save_to_file() - take (const char *) for filename instead of (char *)
Full changelog: https://github.com/CESNET/GPUJPEG/commits/v0.27.1
GPUJPEG 0.26.0
This release mainly brings much faster start-up (and reinitialization) times on Linux, which is significant especially when encoding a small number of images. The speed-up for the first frame is as much as 20x (e.g. 43 ms for 10000x10000.tst compared to the original 810 ms). Subsequent images are still faster, but the gap is narrower. The improvement is especially noticeable for big images.
Builds require CUDA compute capability at least 3.5.
API changes
- [changed] log level numbers (verbose and debug are +1)
- [added] symbolic names for log levels
- [renamed] GPUJPEG_VERBOSE to GPUJPEG_INIT_DEV_VERBOSE
- [added] init functions for struct gpujpeg_encoder_input that return the struct (like gpujpeg_encoder_input_gpu_image), so that a variable can be initialized directly
- if verbosity is at least GPUJPEG_LL_STATUS, gpujpeg_parameters.perf_stats doesn't need to be set to output coding duration
Other
- new patterns (noise, blank) for test images (.tst extension)
- CMake: add a hint to set the CUDA architecture to native (to speed up start-up)
- improved logging of JPEG reader in debug mode to inspect JPEG structure
- print also coded image size and properties (useful if autodeduced)
- report also (re)initialization duration (in verbose mode)