ZRAM configuration and other tweaks for potatoes running Linux

wolf@lemmy.zip · edit-2 3 months ago

seaQueue@lemmy.world · edit-2 3 months ago

I wrote this years ago when I was doing a bunch of work with low ram (1gb) potato SBCs and I use it everywhere, including my 32/64gb SFF proxmox nodes: https://github.com/foundObjects/zram-swap

You might find the comments re: swap sizing and compression ratios handy. I’ve found that lz4 averages roughly a 2.5:1 compression ratio during most workloads. On your 4GB potato I’d give something like ~2GB of RAM to lz4 zram, which works out to a ~5GB zram device. I never bothered with sysctl tuning; you generally don’t need to.

Edit: just about every Chromebook under the sun, and like 90%+ of all Android devices, runs lzo/lzo-rle zram swap at ~(½ramsize*3). Change that to *2.5 for lz4 and you’re set.
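
The sizing rule above can be sketched in a few lines of Python. This is only an illustration of the arithmetic from the post (half of RAM times the expected compression ratio); the function name and its parameters are made up for this example and are not taken from the zram-swap script.

```python
GIB = 1024 ** 3

def zram_disksize_bytes(ram_bytes: int, ram_fraction: float = 0.5, ratio: float = 2.5) -> int:
    """Uncompressed zram device size for a RAM budget and an expected compression ratio.

    ram_fraction: share of physical RAM you are willing to fill with compressed pages.
    ratio: assumed compression ratio (about 2.5 for lz4, about 3 for lzo/lzo-rle per the post).
    """
    return int(ram_bytes * ram_fraction * ratio)

# 4 GiB machine with lz4: half the RAM times 2.5
print(zram_disksize_bytes(4 * GIB) / GIB)             # 5.0
# Same machine with lzo-rle: half the RAM times 3
print(zram_disksize_bytes(4 * GIB, ratio=3.0) / GIB)  # 6.0
```

The real ratio depends on the workload, so treat the result as a starting point rather than a guarantee.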

wolf@lemmy.zip · 3 months ago

Thank you for your answer and your insights.

In my unscientific tests, sysctl/vm.page-cluster made a measurable difference (15% faster when setting it to 0), and it seems everyone else (PopOS, ChromeOS) tweaks at least this setting with ZRAM. I would assume the engineers at PopOS/ChromeOS also did some benchmarks before using these settings.

Now I would really be interested in whether you measure a difference on your 1gb potato SBCs, because IMHO it should have an even bigger impact on them. (Of course, your workload/use cases might make any difference irrelevant, and potato SBCs have other bottlenecks like WiFi/IO, which might make this totally irrelevant.)

seaQueue@lemmy.world · edit-2 3 months ago

I don’t have my potato lab up and running at the moment, but my Android devices and SFF hypervisors are all using page-cluster=0. I think that’s the default setting on Android and ChromeOS; I probably tuned it on the Proxmox machines years ago and forgot about it.

Edit: that’s basically swap read-ahead, right? I.e. the number of pages to read from swap at a time.

wolf@lemmy.zip · 3 months ago

To my understanding it is swap read-ahead, and the number is an exponent with base 2. This means the default of 3 reads 2^3 = 8 pages at a time. According to what I read, the default of 3 was set in the age of rotating discs and never adapted for RAM swap devices.
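
To make the power-of-two relationship concrete, here is a tiny sketch. The 4 KiB page size is an assumption (it is common, but not universal), and the function name is invented for this example.

```python
PAGE_SIZE = 4096  # assumed page size in bytes; check `getconf PAGESIZE` on your system

def swap_readahead_pages(page_cluster: int) -> int:
    """Number of pages read from swap in one go for a given vm.page-cluster value."""
    return 2 ** page_cluster

for value in (0, 1, 2, 3):
    pages = swap_readahead_pages(value)
    print(f"page-cluster={value}: {pages} page(s), {pages * PAGE_SIZE // 1024} KiB")
```

With page-cluster=0 only the single faulting page is read, which is why it suits swap devices where each read is cheap.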

seaQueue@lemmy.world · 3 months ago

Yeah, that’s my understanding of that sysctl too. If IOPS are cheap (and they are when dealing with ram or high IOPS NVMe) there’s no real point in performing extra read ahead.

Samueru@lemmy.ml · 3 months ago

You likely saw this already, but if you haven’t: https://www.reddit.com/r/Fedora/comments/mzun99/new_zram_tuning_benchmarks/

wolf@lemmy.zip · 3 months ago

Thanks a lot! You are right, I saw this already.

I can confirm the findings with my benchmarks: zstd has the best compression, lz4 is the fastest.

Samueru@lemmy.ml · 3 months ago

Here is what I ended up using for my sysctl conf, iirc I got some of these from popos default config:

vm.swappiness = 180
vm.page-cluster = 0
vm.watermark_boost_factor = 0
vm.watermark_scale_factor = 125
vm.dirty_bytes = 268435456
vm.dirty_background_bytes = 134217728
vm.max_map_count = 2147483642
vm.dirtytime_expire_seconds = 1800
vm.transparent_hugepages = madvise
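
The two dirty_* values are raw byte counts, which are hard to read at a glance. A small sketch that parses those two lines (copied from the config above) and prints them in MiB; the parser is a simplified assumption of the `key = value` format, not a full sysctl.conf implementation.

```python
SYSCTL_CONF = """\
vm.dirty_bytes = 268435456
vm.dirty_background_bytes = 134217728
"""

def parse_sysctl(text: str) -> dict[str, str]:
    """Parse simple `key = value` lines, skipping blanks and comment lines."""
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith(("#", ";")):
            continue
        key, _, value = line.partition("=")
        settings[key.strip()] = value.strip()
    return settings

settings = parse_sysctl(SYSCTL_CONF)
for key in ("vm.dirty_bytes", "vm.dirty_background_bytes"):
    print(f"{key}: {int(settings[key]) // 2**20} MiB")
# vm.dirty_bytes: 256 MiB
# vm.dirty_background_bytes: 128 MiB
```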

wolf@lemmy.zip · 3 months ago

Could you ELI5 the last five settings? I saw that Chrome OS sets vm.overcommit_memory = 1, it seems to make sense but is missing here.

Samueru@lemmy.ml · 3 months ago

I really don’t know lol

Increasing the max_map_count is needed for some Steam games, iirc Arch is now doing this by default.

iirc the dirty_bytes settings prevent the system from hanging if there is too much disk IO

And setting transparent_hugepages to madvise was something I did when archlinux had this bug in the kernel: https://old.reddit.com/r/archlinux/comments/1atueo0/higher_ram_usage_since_kernel_67_and_the_solution/

It was eventually fixed but I later ran into the issue again and I decided to keep it on madvise.

wolf@lemmy.zip · 3 months ago

Nice, thanks a lot, especially the dirty_bytes settings are interesting to me, because I experience hangs with too much disk IO :-P.

Cheers!

GustavoM@lemmy.world · 3 months ago

I’m definitely not a “potato expert”, but what I use (on my orange pi zero 3 w/ 1 GiB of ram, at least) is simply:

zram size = 100% of available ram, zstd, priority set to 100. Because apparently if there’s more zram swap than available ram, it’ll lead to memory leaks and/or slowdowns.
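
A rough sketch of how a "percent of RAM" size could be derived from the MemTotal line of /proc/meminfo. The sample text below is made-up data for illustration, and the helper names are hypothetical; on a real system you would read the file instead.

```python
import re

# Made-up sample in the /proc/meminfo format (values are in KiB, labelled "kB")
SAMPLE_MEMINFO = """\
MemTotal:        1000000 kB
MemFree:          250000 kB
"""

def mem_total_bytes(meminfo_text: str) -> int:
    """Extract MemTotal from /proc/meminfo-style text and return it in bytes."""
    match = re.search(r"^MemTotal:\s+(\d+)\s+kB$", meminfo_text, re.MULTILINE)
    if match is None:
        raise ValueError("MemTotal not found")
    return int(match.group(1)) * 1024

def zram_size_for_percent(meminfo_text: str, percent: int = 100) -> int:
    """zram device size in bytes as a percentage of total RAM."""
    return mem_total_bytes(meminfo_text) * percent // 100

print(zram_size_for_percent(SAMPLE_MEMINFO))      # 100% of RAM
print(zram_size_for_percent(SAMPLE_MEMINFO, 50))  # 50% of RAM
```

On a real machine, something like `zram_size_for_percent(open("/proc/meminfo").read())` would give the 100% figure.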