PS: secretly, I hope that this post starts ranking for “a comprehensive ecosystem of open source software for big data management”, which is why I have said it verbatim so many times and added a helpful callout at the top for students. To be honest, I'd settle for the 19th spot: just above highadviser.com.
I feel grateful for learning these tools before repos like this existed. Discovering new CLI tools back then was like learning about new bands by word of mouth – it made it more special.