Hit a snag with my plan to just use curl to archive webpages.

curl -sL [URL] | sed 's/<script[^>]*<\/script>//'

This won’t strip JavaScript between <script> tags on multiple lines. I think I need to install something like jq for SGML.