Pretty interesting… I’d just point out that you don’t need ‘cat’ to feed your file to grep, you can always ‘grep somepattern file.txt’. Similarly, ‘egrep’ shortens your command a lot:
egrep -o “http://.*$” links.txt | sort | uniq
To get all the links from links.txt.
]]>