Howdy, Stranger!

It looks like you're new here. If you want to get involved, click one of these buttons!

In this Discussion

GSA Content Generator can extract content from link list

kingbinkingbin Hanoi
edited April 2018 in GSA Content Generator
Hi Seven,

I have a list link and i want GSA Content Generator extract content, image, video from this list (in case with keyword and without keyword). Can GSA Content Generator do that?

I try to import my list to Source Option (in Custom) but when i run campain, it wasn' t working. You can see my log

[11:26:54] Starting "Scraping Articles"...
[11:26:55] Result 01. Length:    869 Chars | Title: Americas Oak / 9663M | URL: http://laminatevietnam.com/index.php?route=product/product&product_id=107
[11:26:55] Result 01. Length:    869 Chars | Title: Arctic Maple / 6009M | URL: http://laminatevietnam.com/index.php?route=product/product&product_id=111
[11:26:55] Result 01. Length:    869 Chars | Title: Amazon Walnut / 6861G | URL: http://laminatevietnam.com/index.php?route=product/product&product_id=135
[11:26:55] Result 01. Length:    869 Chars | Title: Bangkok White / 9201G | URL: http://laminatevietnam.com/index.php?route=product/product&product_id=147
[11:26:55] Result 01. Length:   1051 Chars | Title: Natural Zebrawood / 0585 NC | URL: http://laminatevietnam.com/index.php?route=product/product&product_id=88
[11:26:56] Result 01. Length:    869 Chars | Title: Bangkok White / 9201M | URL: http://laminatevietnam.com/index.php?route=product/product&product_id=156
[11:26:56] Result 01. Length:    869 Chars | Title: Anodised Aluminium / D8122 | URL: http://laminatevietnam.com/index.php?route=product/product&product_id=143
[11:26:56] Result 01. Length:    869 Chars | Title: Bleached Cherry / 8633 IM | URL: http://laminatevietnam.com/index.php?route=product/product&product_id=90
[11:26:56] Saving content to disk...
[11:26:56] Removing duplicates...
[11:26:56] Starting "Removing duplicate content"...
[11:26:56] Starting "Filtering Content"...
[11:26:56] Starting "Generating Articles"...
[11:26:56] SameArticle: extracting paragraphs and titles...
[11:26:56] Sorry, not enough content to create any article. Try to select more sources or use more keywords.
[11:26:56] Starting "Generating Articles"...
[11:26:56] Starting "Generating Articles"...
[11:26:56] Starting "Generating Articles"...
[11:26:56] Finished.

Answers

  • SvenSven www.GSA-Online.de
    As far as I can see, it has extracted content from the custom source at least. Just not enough to build quality content I guess. Your settings for length/words might be out of range. Try to use very little data here to accept all scraped articles.
  • Hi Seven,

    Tks you for reply. You are right, i test again and increase range number of word, it work perfectly ^^

    Another question, GSA Content Generator can run without select language? I run some campain with my language, in log, i saw that GSA Content Generator scraped, found a lot of link but hum, GSA Content Generator was wasted them because of "Unwanted language". Althought i checked again, link is not any wrong, same language and found keyword.

    [16:16:42] Result 01 - Unwanted language "en" detected for https://sachiepvien.net/luc-trieu-yen-ca-hanh-1-2-tap/

    In example, it set my campain language (Vietnamese) and website is in Vietnamese. What wrong?

    Reguard,

    Tuan Anh
  • SvenSven www.GSA-Online.de
    that site has lang="en.." in it's source.
  • I change setting to "English" but when run, it error:

    [08:09:55] Result 01 - Unwanted unicode-language "vi|pt" detected for https://sachiepvien.net/luc-trieu-yen-ca-hanh-1-2-tap/
  • SvenSven www.GSA-Online.de
    If it's English as project language, it is performing the unicode test as there are a lot of site with this lang="en" thing where it is not really English content.

    But if you use anything other than English and the lang-tag tells you it's English, it better skips it as well.

    Sorry, thats all for your safety to not generate crap.
Sign In or Register to comment.