bash

Processing Big Data on the Linux Shell

Recently, I needed to generate and process multi-billion record synthetic datasets for some data structure benchmarking. Rather than write a bunch of custom code to do this processing, I decided to try using shell scripts and standard system utilities.

Douglas Rumbaugh

Oct 18, 2023 8 min read