Make builtin spinner
Would you add a builtin spinner to SER? You don't have to ship it with a dictionary; I can feed it with my own, for example in a format:
a;synonym_of_a;second_synonym_of_a
b;synonym_of_b
etc.
(I'm not forced to use English this way).
Tagged:
Comments
whats multispinner?
I have actually thought about a build in spinner...if people could join together collecting sources with data like @matt_rankman posted above...I can build something up.
it has a built in "MY thesaurus" that can be edited offline to create and store your own thesaurus in select it in addition to system thesaurus
at articlequeen the syntax for the "my thesaurus" is simple and makes sense = is plain spintax with the keyword to replace always being first
@sven
if you need thesaurus
may be have a look at various sources below
free thesaurus
free thesaurus download
etc
or download / get thesaurus from various sites such as
http://thesaurus.com/browse/free
http://wordweb.info/free/
http://synonyme.woxikon.de/ = also in other languages
http://www.openthesaurus.de/
OR
much easier !!!
search in the open source community = openoffice = now called libreoffice !!
= may be more complex but a fully and professionally maintained FREE thesaurus in MANY languages such as en, de_CH, de_AT, pl, ru, .... and many more, see
http://extensions.services.openoffice.org/en/dictionaries
download FREE office suite and look at your thesaurus db, download additional languages as needed = it's FREE because open source
in your local Linux install you find in path (on a win install just search for the path to thesaurus db
all thesaurus db is clear language human readable
/usr/share/ooo/thesaurus/
any installed languages installed from your libreoffice suite
can be found here
/usr/share/ooo/thesaurus/th_en_US_v2.dat
etc
and depending on install language of course also other languages, currently approx 24 languages (in my install selection)
libreoffice being one of the worlds best FREE professional office suite since many years
high quality, free, just follow / obey the licenses
the syntax is different from simple hobby thesaurus db (noun, adj, adv, verb, ..)
possible downside might be for win-users
liibreoffice (previous staroffice = then openoffice = now libreoffice) for windows and inux (may be others)
it is a global commercial quality product and thus most likely all such thesaurus db in UTF8
hence may be lack of current compatibility in charset between SER ant thesaurus possoble
or may be just use existing words and convert into OTHER format
open source = free to use or modify = just follow open source license
advantage of libreoffice thesaurus system vs other thesaurus systems
ONE location / source = MANY updated languages with same syntax for easier maintenance vs many sources for each language
other public domain thesaurus sources might be (english)
http://icon.shef.ac.uk/Moby/
see thesaurus
http://icon.shef.ac.uk/Moby/mthes.html
or FREE english thesaurus
http://wordnet.princeton.edu/
or google for more FREE sources
thesaurus dictionary open source
an additional feature to add "my thesaurus" or even "my phrases" certainly makes publishing clean high quality spintax MUCH more convenient as each one can add HIS professional expert words / phrases and thus create even more variations
That OpenOffice/LibreOffice I already have...it's sub-optimal but might work as a quick thing.
Thanks all for the sources...will try to come up with a solution. Of course I will keep the format open to modify as always.
any thesaurus vs libreoffice
ibreoffice might be more work at beginning because more complex syntax of thesaurus db
but
the simple fact that ilibreoffice is a global widespread professional application with maintained thesaurus
leads to the result that once imported into SER or SER adapted to libreoffice thesaurus (some may already have their language thesaurus installed ! hence no need to have it twice on a computer
it is easy to update and expand to ever new languages or adapted to personal dictionary
libreoffice contains approx all EU languages, plus ru and a few other NON-EU / non-latin languages
@matt_rankman
and of course there is a
Polish Dictionary Pack with thesaurus for you as well
the choice is up to @sven
create a new project->tools->spin convert -> last option there.
Though the results are a bit disappointing in my eyes. The produced text is hardly readable. The reason is the problem to decide what type of word it is (noun or verb).
very nice Spin converter, works great
Some ideas:
1. Spell Check Multilingual: http://www.openoffice.org/de/spellcheck/
2. Grammer Check: http://www.languagetool.org/
3. Original Text in english and translate Spinned Text in other languages.
probably creating spin rules is important so sentences do not become fragmented and unreadable. Like simple sentence rules like like if previous word was "an" then you cant have "car".
In fact there are 2 new patent that work together "Identifying gibberish" and "Synonym identification", I believe that this is the hummingbird? from what i see on my network that has an 90%+ accuracy.
from what i see you have to have readable content (because of Identifying gibberish) and you can't massively spin it (because Synonym identification).
Here are the 2 patents:
Identifying gibberish content in resources: http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&Sect2=HITOFF&d=PALL&p=1&u=/netahtml/PTO/srchnum.htm&r=1&f=G&l=50&s1=8,554,769.PN.&OS=PN/8,554,769&RS=PN/8,554,769
Synonym identification based on co-occurring terms: http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&Sect2=HITOFF&p=1&u=/netahtml/PTO/search-adv.htm&r=1&f=G&l=50&d=PALL&S1=08538984&OS=PN/08538984&RS=PN/08538984
I am still trying to find a way, any suggestion will be much appreciated.
thesaurus and can you please tell me how to add those?