xml - what would be the best approach to extract one language form wiktionary? -
i have searched not found want, is:
the best , efficient extract italian words, etymologies , parts of speech... including plural forms of words (amico, amichi) wiktionary. put either csv (maybe larg though) or mysql db pure text (not blobs).
i want essential record each italian word in english.
mwdumper keeps crashing too.
any advice welcome!
i created small java program extracts part of speech (verb, nound, adjective, adn on) en.wiktionary xml dump, here, uses tsv can adapted easily.
Comments
Post a Comment