Add support for reading and writing compressed blobs #8106

arpad-m · 2024-06-19T00:30:23Z

Add support for reading and writing zstd-compressed blobs for use in image layer generation, but maybe one day useful also for delta layers. The reading of them is unconditional while the writing is controlled by the image_compression config variable allowing for experiments.

For the on-disk format, we re-use some of the bitpatterns we currently keep reserved for blobs larger than 256 MiB. This assumes that we have never ever written any such large blobs to image layers.

After the preparation in #7852, we now are unable to read blobs with a size larger than 256 MiB (or write them).

TODO:

Maybe introduce a new version so that we give better errors should we encounter legacy image layers with such large blobs. This is to insure us in the case the assumption above is wrong, so there is > 256MiB large images. eventually chosen against as image layers and delta layers have different ways of storing the version number.

A non-goal of this PR is to come up with good heuristics of when to compress a bitpattern. This is left for future work.

Parts of the PR were inspired by #7091.

cc #7879

Part of #5431

github-actions · 2024-06-19T00:47:26Z

No tests were run or test report is not available

Test coverage report is not available

_{The comment gets automatically updated with the latest test results
5fc3b3c at 2024-06-21T15:07:46.442Z :recycle:}

pageserver/src/tenant/blob_io.rs

libs/pageserver_api/src/models.rs

pageserver/src/tenant/blob_io.rs

VladLazar

Test needs updating, but otherwise looks good to me. It would have been interesting to make the de-compression lazy (i.e. decompress right before walredo).

pageserver/src/tenant/blob_io.rs

Add support for reading and writing compressed blobs

3d6bc95

arpad-m requested a review from VladLazar June 19, 2024 00:30

Delta layers should never use compression for now

2d87a15

arpad-m force-pushed the arpad/compression_1 branch from 22c35d8 to 2d87a15 Compare June 19, 2024 00:34

arpad-m mentioned this pull request Jun 19, 2024

Epic: pageserver image layer compression #5431

Open

6 tasks

VladLazar reviewed Jun 19, 2024

View reviewed changes

pageserver/src/tenant/blob_io.rs Outdated Show resolved Hide resolved

pageserver/src/tenant/blob_io.rs Show resolved Hide resolved

pageserver/src/tenant/blob_io.rs Show resolved Hide resolved

jcsp reviewed Jun 20, 2024

View reviewed changes

libs/pageserver_api/src/models.rs Outdated Show resolved Hide resolved

jcsp reviewed Jun 20, 2024

View reviewed changes

pageserver/src/tenant/blob_io.rs Outdated Show resolved Hide resolved

jcsp reviewed Jun 20, 2024

View reviewed changes

pageserver/src/tenant/blob_io.rs Show resolved Hide resolved

arpad-m added 4 commits June 20, 2024 23:25

Use an enum with a level option instead

734f5cb

Add comment

203e460

Add a bit mask for length and compression

45c8114

Also use constants for the maximum length

a831ba2

arpad-m marked this pull request as ready for review June 20, 2024 22:29

arpad-m requested a review from a team as a code owner June 20, 2024 22:29

arpad-m requested review from jcsp and VladLazar June 20, 2024 22:29

VladLazar reviewed Jun 21, 2024

View reviewed changes

pageserver/src/tenant/blob_io.rs Show resolved Hide resolved

pageserver/src/tenant/blob_io.rs Outdated Show resolved Hide resolved

Fix test

5fc3b3c

arpad-m requested a review from VladLazar June 21, 2024 15:06

VladLazar approved these changes Jun 21, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for reading and writing compressed blobs #8106

Add support for reading and writing compressed blobs #8106

arpad-m commented Jun 19, 2024 •

edited

Loading

github-actions bot commented Jun 19, 2024 •

edited

Loading

VladLazar left a comment

Add support for reading and writing compressed blobs #8106

Are you sure you want to change the base?

Add support for reading and writing compressed blobs #8106

Conversation

arpad-m commented Jun 19, 2024 • edited Loading

github-actions bot commented Jun 19, 2024 • edited Loading

No tests were run or test report is not available

Test coverage report is not available

VladLazar left a comment

Choose a reason for hiding this comment

arpad-m commented Jun 19, 2024 •

edited

Loading

github-actions bot commented Jun 19, 2024 •

edited

Loading