I recently learned about the Linux command line utility shuf from browsing The Art of Command Line. This could be useful for random sampling.
Given just a file name, shuf randomly permutes the lines of the file.
With the option -n you can specify how many lines to return. So it’s doing sampling without replacement. For example,
shuf -n 10 foo.txt
would select 10 lines from foo.txt.
Actually, it would select at most 10 lines. You can’t select 10 lines without replacement from a file with less than 10 lines. If you ask for an impossible number of lines, the -n option is ignored.
You can also sample with replacement using the -r option. In that case you can select more lines than are in the file since lines may be reused. For example, you could run
shuf -r -n 10 foo.txt
to select 10 lines drawn with replacement from foo.txt, regardless of how many lines foo.txt has. For example, when I ran the command above on a file containing
alpha
beta
gamma
I got the output
beta
gamma
gamma
beta
alpha
alpha
gamma
gamma
beta
I don’t know how shuf seeds its random generator. Maybe from the system time. But if you run it twice you will get different results. Probably.
Back in the ’80’s we used shuf to generate both our login MOTD and finger responses.
Ah, those were the days.
Apropos the seed: GNU shuf has a flag –random-source=FILE to set a custom random source. I am not sure that I would use it to generate secure passwords, but still, it is a start.
Hi John, about your statement “I don’t know how shuf seeds its random generator.”
This web page may be insightful: https://maizure.org/projects/decoded-gnu-coreutils/shuf.html
Tony Patti
Thanks, Tony.
In the 1980s you would have needed a time machine to use the shuf command as it was written for the GNU utilities in 2006.
What you might have used in the 1980s was fortune (6):
https://en.wikipedia.org/wiki/Fortune_(Unix)
I don’t do lottery, but there is 6 out of 49 in Germany:
$ shuf –random-source=/dev/urandom -n 6 -e `for((i=1;i<=49;++i)); do echo $i; done`
7
33
46
30
1
32
$ shuf –random-source=/dev/zero -n 6 -e `for((i=1;i<=49;++i)); do echo $i; done`
1
2
3
4
5
6
$