details | Anonymous message posted by j.rabenschlag@gmail.com
Hi,
the current stemmer for the German language in your package removes the umlauts.
Example:
words<-"groß Größe größer"
SnowballC::wordStem(words, language = "german")
[1] "gross Grosse gross"
The Snowball project provides a stemming function called "german2" to prevent this problem: http://snowball.tartarus.org/algorithms/german2/stemmer.html
Could you implement this?
Some more info:
> sessionInfo()
R version 3.5.2 (2018-12-20)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows >= 8 x64 (build 9200)
Thanks,
Johannes | 2019-02-22 18:27 | milanbv |