As a plus, if you run them on ZFS with de-dup, even the disk cost for new machines is miniscule.
Score: 5 (Troll)
Darn already. For I was thinking to myself: "Imagine a Beowulf cluster of these!".
[1] https://www.proxmox.com/en/products/proxmox-virtual-environm...
For completeness, the job in the article is not strictly parallel, instead, they are separable tasks, which fall under high-throughput computing. HTCondor is a popular choice and is widely used in physics. If the cluster has a shared filesystem such as NFS, it does not even need ssh to each node. Moreover, nodes can share the same R installation on NFS, remove the complexity of installing same set of R packages on all nodes. HTCondor is available on Ubuntu.
Pay attention to your SRAM (L3 unified cache), DRAM and swap space tilings.
[Snark] In practice: With memory access latency depending on both the square root of the memory size and the physical lengths of the wires in your cluster this sounds like a case for Adam Drake:
https://adamdrake.com/command-line-tools-can-be-235x-faster-...
The "problem" with Pi-like devices is that they're usually not very "normal". The process of provisioning is different, IIRC they only "recently" supported booting off something other than the SD Card, and in the case of the Zeros, you'll either be using Wifi or an external USB Ethernet dongle (over USB OTG no less). Sometimes they need specially compiled version of linux, so you're stuck far from mainline (this was a big component of the RPis success) This may be distracting from your goals of learning about clustering.
I suspect the $10 Pi Zero is about as cheap as you'll get though, depending on your personal costs of case + ethernet dongle + USB power supply, etc.
I think it looks quite cool: https://x.com/andreer/status/2007509694374691230
Instead of having to use lots of dongles and usb ethernet, I just wired them all up using brass rods, a small 5V power supply in the base, and boot them over WiFi (just the kernel and wifi config on the sd card).
Raspberry pi's are cheap, easily available, and there is an absolutely massive trove of information about them on the internet. And the scale means that the linux implementation is very stable and "just works" to a degree that is extremely hard for other SBCs to match.
Sure, VMs are the logical choice, but not everything has to be logical. Real hardware does feel more real :-)
Actually, you raise a good point: I should spend some time browsing the Armbian supported hardware list...
Intel NUC's are probably much better value for money these days but the 3B's were pretty cheap at the time I bought them.
Think https://www.servethehome.com/introducing-project-tinyminimic... was where I originally came across them