Is there a way to filter out certain domains by name or by mask?

Hello, everyone. I am trying to filter out certain websites when scraping: some by name, others by wildcards and pattern matching, and maybe exclude certain TLDs.

How can this be done?  Thanks, kindly!

Sven, sorry for so many questions!!

Comments

  • Sven www.GSA-Online.de
    Just use * to match any number of characters (including none) and ? to match exactly one character of any kind.

    • gsa-online.de would only filter out this particular domain
    • *gsa-online.de would filter out gsa-online.de and any of its subdomains, like forum.gsa-online.de
    • *.de will filter out any domain under the German top-level domain (.de)
    • ???.de will filter out any domain that has exactly 3 chars before the .de ending
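
To try out masks before adding them to the filter, the wildcard rules above can be sketched in a few lines of Python. This is only an illustration of the `*` and `?` semantics described in this thread, not the tool's actual implementation. The names `mask_to_regex` and `is_filtered` are made up for this example, and matching case-insensitively against the whole domain is an assumption.

```python
import re
from typing import Iterable


def mask_to_regex(mask: str) -> "re.Pattern[str]":
    """Translate a mask into a regex: * = any or no chars, ? = exactly one char."""
    parts = []
    for ch in mask:
        if ch == "*":
            parts.append(".*")
        elif ch == "?":
            parts.append(".")
        else:
            # Everything else (letters, digits, dots, dashes) is matched literally.
            parts.append(re.escape(ch))
    # Assumption: masks are compared case-insensitively.
    return re.compile("".join(parts), re.IGNORECASE)


def is_filtered(domain: str, masks: Iterable[str]) -> bool:
    """Return True if the whole domain matches at least one mask."""
    return any(mask_to_regex(m).fullmatch(domain) for m in masks)


if __name__ == "__main__":
    masks = ["*gsa-online.de", "???.de"]
    for domain in ["forum.gsa-online.de", "abc.de", "abcd.de", "example.com"]:
        print(domain, is_filtered(domain, masks))
```

Because `*` also matches zero characters, a mask like `*gsa-online.de` matches the bare domain too, and it would also match an unrelated name that merely ends in the same text (for example `mygsa-online.de`). If only real subdomains should be caught, `*.gsa-online.de` is the narrower mask under these rules.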