A REAL feature that should be in GSA PI
"The reason we have the Min./Max. file size is that sometimes you get pages that are maybe 1 or 2 KB, which probably means it’s a 404 page that says something like “this is a 404 page, click here to go to homepage.”
Obviously that’s no use to us. We want a page that actually has some HTML on it. So that’s why we set the Min. file size to 10 KB. You can set it lower if you want, but you’ve been warned!"
GSA PI could filter out all URLs whose pages are smaller than a chosen minimum (1 KB, 2 KB or 10 KB, for example) or larger than a chosen maximum (200 KB, for example). This would eliminate most of the sites with 404 errors!
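To make the idea concrete, here is a rough Python sketch of the kind of size filter I mean. It is not GSA PI code and says nothing about how GSA PI works internally; the function name `filter_by_size`, its parameters and the example URLs are all made up for illustration, and it assumes the page bodies have already been downloaded.

```python
from typing import Iterable, List, Tuple

KB = 1024  # bytes per kilobyte


def filter_by_size(
    pages: Iterable[Tuple[str, bytes]],
    min_bytes: int = 10 * KB,   # hypothetical default, taken from the 10 KB suggestion
    max_bytes: int = 200 * KB,  # hypothetical default, taken from the 200 KB example
) -> List[str]:
    """Return the URLs whose downloaded body size lies within [min_bytes, max_bytes]."""
    kept = []
    for url, body in pages:
        # Tiny pages are often "soft" 404s; oversized pages are skipped as well.
        if min_bytes <= len(body) <= max_bytes:
            kept.append(url)
    return kept


if __name__ == "__main__":
    # Fake page bodies standing in for real downloads.
    sample = [
        ("http://example.com/missing", b"x" * (2 * KB)),
        ("http://example.com/article", b"x" * (50 * KB)),
        ("http://example.com/huge", b"x" * (300 * KB)),
    ]
    print(filter_by_size(sample))
```

With the defaults, only the 50 KB page survives. Lowering `min_bytes` lets the small pages back in, which is exactly the trade-off the quote warns about.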