Skip to content

Trim to root in GSA PI

seodamageseodamage internetzzz
edited January 2016 in GSA Platform Identifier
Hello, i buy GSA Platform Identifier about 2 weaks ago, and it work pretty well.

I have some private soft, xrumer, zenno-poster etc. When i check all my links i have urls like http://domain.com/threads/thread_name.html etc.
If i make trim url to root, GSA PI make this:

i have example url http://trastovie-saiti.ru/forum/ and cms is xenforo.
After trim to root it make my url like http://trastovie-saiti.ru/ and, if i check this url in GSA PI again, it tells me dat this is joomla k2.
If it possible to make trim to engine root? Becose i need to have base like:

http://iron-club.ru/forum/index.php
http://dota2.ru/forum/
http://www.aikiforum.ru/
http://www.bmwclub.ru/index.php?forums/
http://smart-tv-news.ru/forum/
http://phpclub.ru/talk/
http://forum.beersfan.ru/

another words if i trim this url's i will lose many links, becose it will change http://domain.com/ForumOrCMSFolder to just http://domain.com/
thanks


p.s. where can i read about Deep matching chekbox? Am i right i can only paste raw domains and GSA PI will try to find all cms in each domain?
p.s.2 Have a nice vacation and good profit.

Comments

  • s4nt0ss4nt0s Houston, Texas
    Do you mean you are wanting to do an engine check for sub URL's? 

    Like if URL is www.domain.com/a/b/c 

    You want it to check 

    domain.com/a
    domain.com/b
    domain.com/c

    And trim it to the root that is the same engine as the original URL?

    If so, that would require a lot of extra work and slow things down quite a bit so we won't be able to add this right now, but possibly in the future.

    For deep matching its checking the source code of the page more thoroughly. E.G. Musthave/MustNotHave strings in URL are checked in source code too. 
  • seodamageseodamage internetzzz
    When checking url in GSA PI is complete, i simply take all forums for example. After i need to trim the forums to index page(of forum). When i use "tools > trim to root" this feature make all url like just http://domain.com/ and no forum subfolder so i lose many links. But i need very much to trim to the forum index page so i could parse some data after this for example online, forum subforums, language and many other thing, but i cant becose i got domain main page and it not forum :(
  • s4nt0ss4nt0s Houston, Texas
    Yes, I understand what you mean. but when you "trim to root" it trims the URL to the root level so its going to be http://www.domain.com/ instead of http://www.domain.com/page. This is how trim to root works in all tools as far as I know.

    There might be other tools to help you trim the URL's in mass to the forum index page you need, but Pi wasn't designed to do that. 

    We might be able to add a "trim to first page" in a future update in the tools menu.

    I know Scrapebox has some trim URL options but I'm waiting for it to be activated on my server to check exactly what trim options it offers. 

    There's some free online text manipulation tools that might work, but I haven't had time to dig into it: http://text-filter.com/Free-Online-Text-Manipulation-Tools.htm 
  • web4youweb4you store web4you.pro
    Use regx. Only this way will be working

    Notepad++ and REGX filter :-)
Sign In or Register to comment.