Exclude Engine Identification

Hello there again. I have an engine that sometimes can't be identified, neither from the source code nor from the URL; it simply hides its existence.

I want to assume that every website I scrape uses this engine, and skip the GSA identification.
I'm not too worried, but there MAY be a problem while running Identify Platform and sort in >

Is there a way to exclude engine identification in the .ini file?

Comments

  • Sven www.GSA-Online.de

    Do you mean disabling the detection and using a fixed engine while submitting? Yes, there is a way.

    You have to import the URL like this...

    URL#EngineName

    EngineName is case-sensitive and is written without the .ini ending.

  • Thanks a lot :)
  • @Sven, why does SER match engines again when the URLs come from the Identified list files? Why waste time on re-identifying?
  • Sven www.GSA-Online.de
    This re-identification does not take many resources, and it makes sure the site is still what SER expects to get (not down, engine changed, page moved...).
  • oook
    :-bd
  • andrzejek Polska
    edited June 2015
    @sven

    So pasting links into the identified list like this:

    URL#EngineName

    will skip identification?
  • Sven www.GSA-Online.de
    yes
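
For a larger scraped list, the URL#EngineName format described above could be produced with a small script before importing. This is only a rough sketch, not anything shipped with SER: the function name `tag_urls` and the engine name `MyEngine` are hypothetical, and it assumes one URL per line. The thread does not say how URLs that already contain a `#` are treated, so the sketch does not handle them specially.

```python
def tag_urls(urls, engine_name):
    """Return each non-empty URL with '#EngineName' appended.

    Assumption: engine_name is the engine's .ini file name; the thread says
    it must be given without the .ini ending and that it is case-sensitive,
    so the ending is stripped and the case is left untouched.
    """
    engine = engine_name.strip()
    if engine.lower().endswith(".ini"):
        engine = engine[:-4]  # drop the ".ini" ending, keep original case

    tagged = []
    for url in urls:
        url = url.strip()
        if not url:
            continue  # skip blank lines from the scraped list
        tagged.append(f"{url}#{engine}")
    return tagged


if __name__ == "__main__":
    # "MyEngine" is a placeholder, not a real engine name.
    scraped = ["http://example.com/page1", "", "http://example.org/forum\n"]
    for line in tag_urls(scraped, "MyEngine.ini"):
        print(line)
```

The resulting lines can then be saved to a text file and imported as described in the replies above.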