Skip to content

Scrape images selectively : new filter

KaineKaine thebestindexer.com
edited September 2018 in GSA Image Spider
Hi @Sven

It would be possible to activate a selective scrape?
Currently all images in the pages are scraped, I think we could choose portions of html code or the images would be scraped. 

For exemple here :

https://www.gsa-online.de/product/image_spider/

<figure class="image"> ... </figure>

Regex : (?<=<figure\ class="image">).*</figure>

Comments

  • SvenSven www.GSA-Online.de
    edited September 2018
    might be too complex for most users. But worth a thought though. Maybe you can simply use the filter by now to limit scraping.
  • KaineKaine thebestindexer.com
    I think that if there is a place where people know the minimum html is certainly here :)
Sign In or Register to comment.