this post was submitted on 23 Dec 2024
76 points (100.0% liked)

Technology

37804 readers
177 users here now

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:


This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] klangcola@reddthat.com 17 points 3 days ago (2 children)

PSA about mini PCs: They might not come with adequate cooling for RAM, leading to potential data corruption.

(I'm in the middle of troubleshooting/fixing overheating RAM causing memory errors, will post on /c/selfhosted when I have more conclusions).

TLDR: Bought 3 Minisforum HM90 mini PCs (for Proxmox), equipped them with 64gb (2x32gb) RAM, with a different brand RAM in each PC. All 3 give sporadic errors in Memtest86. The RAM overheats due to the 2 SSDs mounted in the lid blocking natural airflow. With the lid off, or an extra fan installed, there are no errors. The errors were very sporadic: 1 PC gave errors after 1-2 passes, then almost 24hours. Second PC gave errors after more than 24 hours and some cases more than 48 hours between errors. The last PC gave hundreds of errors on the first pas. To be fair, memtest is a synthetic test and the RAM is unlikely to see 100% utilisation in real life, on the other hand the two adjacent SATA SSDs and the NVMe SSD are completely idle during memtest, and will generate extra heat during production use.

[–] DdCno1@beehaw.org 6 points 3 days ago

Take this seriously, people. I've been there and it caused tons of issues on an older server of mine. That's why I was very adamant about my current system having built-in error correction for its RAM.

[–] DrFuggles@feddit.org 2 points 3 days ago

Had an NVME fritz out on me on a passively cooled NUC because of thermals, I suspect. That sucked.