Easiest way to extract the URLs from an HTML page using sed or awk only

You could also do something like this (provided you have lynx installed):

Lynx versions < 2.8.8

lynx -dump -listonly my.html

Lynx versions >= 2.8.8 (courtesy of @condit)

lynx -dump -hiddenlinks=listonly my.html
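Since the question asks for sed or awk, here is a rough sketch of how the lynx output could be trimmed down to bare URLs with awk. It assumes the link list is printed as numbered lines such as "   1. http://example.com/" under a "References" heading; strip_link_numbers is a hypothetical helper name, and the sample text only stands in for real lynx output.

```shell
# Hypothetical helper: keep only the URL column of a numbered link list.
# Assumes each link line looks like "   1. http://example.com/".
strip_link_numbers() {
  # Print the second field of lines whose first field is "<digits>."
  awk '$1 ~ /^[0-9]+\.$/ { print $2 }'
}

# Sample text standing in for the output of: lynx -dump -listonly my.html
sample_output='References

   1. http://example.com/
   2. http://example.com/about.html'

printf '%s\n' "$sample_output" | strip_link_numbers
```

With lynx installed, the same filter would sit at the end of either command above, for example: lynx -dump -listonly my.html | strip_link_numbers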
