It’s not uncommon to see people manually running a script or program over a series of files, one file at a time. I was thinking of showing how to automate that and run it in parallel across many files, with very little effort or knowledge of parallel programming. Basically, three stages:
- Shell for loops/xargs
- GNU Parallel on local machine
- GNU Parallel across remote machines
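To give a feel for it, here is a rough sketch of the three stages. The scratch directory, the `sample*.txt` files, the `wc -l` stand-in for "your script", and the hosts `server1`/`server2` are all made up for illustration. The remote stage is left commented out because it assumes passwordless SSH to those hosts.

```shell
#!/bin/sh
set -eu

# Hypothetical setup: a scratch directory with a few small input files.
workdir=$(mktemp -d)
for i in 1 2 3 4; do
    printf 'line one\nline two\n' > "$workdir/sample$i.txt"
done

# Stage 1a: a plain for loop, one file after another.
# `wc -l` stands in for whatever script you would normally run by hand.
for f in "$workdir"/*.txt; do
    wc -l < "$f" > "$f.loop"
done

# Stage 1b: xargs, running up to 4 jobs at once.
# -print0/-0 keep file names with spaces intact; -P sets the job count
# (both are supported by GNU and BSD xargs, but are not in POSIX).
find "$workdir" -name '*.txt' -print0 |
    xargs -0 -P 4 -I{} sh -c 'wc -l < "$1" > "$1.xargs"' _ {}

# Stage 2: GNU Parallel on the local machine (one job per CPU core by default).
# Skipped if GNU Parallel is not installed.
if parallel --version 2>/dev/null | grep -q 'GNU parallel'; then
    parallel 'wc -l < {} > {}.par' ::: "$workdir"/*.txt
fi

# Stage 3: GNU Parallel across remote machines (not run here).
# -S lists the SSH logins; --trc {}.out transfers each input file,
# returns {}.out, and cleans up the remote copies afterwards.
# parallel -S server1,server2 --trc {}.out 'wc -l < {} > {}.out' ::: *.txt

echo "results written to $workdir"
```

The point is that the command barely changes between stages: the loop body becomes the quoted command, and going remote is mostly a matter of adding `-S` and telling Parallel which files to move. Remove the scratch directory when you are done with it.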
Let me know what you think!