bangiolo20 wrote: Hm... ? I don't remember giving any software hints.

Only yesterday I figured out at least how to use TextStat. Heh... it hates diacritics.

Solved.
Try importing *.txt (plain text), *.doc or *.docx files only. Improper XML markup inside LibreOffice's and OpenOffice's *.odt files splits the affected word within the file's XML. So it's an ODT file format "bug". It only happens if you modify the text inside the ODT file. If, for example, you only copy text in without any further modification, the problem does not occur and TextSTAT produces proper output.
You can check this yourself. ODT files are ZIP archives of XML files. You can extract one and look at the affected words inside the XML markup.
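If you don't want to unzip by hand, here is a rough Python sketch of that check. It assumes the document text sits in content.xml inside the archive and that the extra markup is a text:span element. The function names read_content_xml and find_split_words are made up for this example, and the regex is only a heuristic, not a real XML parser.

```python
import re
import zipfile


def read_content_xml(odt_path):
    """Return the raw content.xml of an ODT file as a string."""
    with zipfile.ZipFile(odt_path) as archive:
        return archive.read("content.xml").decode("utf-8")


# Heuristic: letters, then one or more opening/closing text:span tags,
# then letters again, with no whitespace in between.
SPLIT_WORD = re.compile(r"(\w+)((?:</?text:span[^>]*>)+)(\w+)")


def find_split_words(xml_text):
    """Return (left_part, right_part) pairs for words broken by span markup."""
    return [(m.group(1), m.group(3)) for m in SPLIT_WORD.finditer(xml_text)]
```

Calling find_split_words(read_content_xml("mytext.odt")) (the file name is a placeholder) lists the word halves that sit on either side of a span tag. Those are the words a word counter would likely see as two.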
===
There is a bug in LibreOffice's/OpenOffice's XML markup. If a word is set with diacritics, that word gets wrapped into two parts by some additional XML markup. The additional markup makes the word look like two words. So in the XML source view the word is split, while in the normal (non-source) view it still looks like one word.
In TextSTAT, try importing *.txt (plain text), *.doc or *.docx files only.
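If the text only exists as an *.odt file, one possible way to get a clean *.txt is to pull the text out of content.xml yourself and glue the pieces of each paragraph back together. This is just a sketch under assumptions: the helper name odt_to_plain_text is hypothetical, it only looks at text:p and text:h elements, and it ignores special elements such as text:s (repeated spaces), tabs and line breaks, so the spacing may differ from the original document.

```python
import zipfile
import xml.etree.ElementTree as ET

# Namespace of the ODF "text" elements (text:p, text:h, text:span, ...).
TEXT_NS = "urn:oasis:names:tc:opendocument:xmlns:text:1.0"
BLOCK_TAGS = {f"{{{TEXT_NS}}}p", f"{{{TEXT_NS}}}h"}


def odt_to_plain_text(odt_path):
    """Return the paragraphs and headings of an ODT file, one per line."""
    with zipfile.ZipFile(odt_path) as archive:
        root = ET.fromstring(archive.read("content.xml"))
    lines = []
    for element in root.iter():
        if element.tag in BLOCK_TAGS:
            # itertext() joins the text of nested spans, so a word that
            # is split by inline markup becomes one word again.
            lines.append("".join(element.itertext()))
    return "\n".join(lines)
```

The returned string can then be saved as a *.txt file and imported into TextSTAT. Which encoding to pick when saving depends on your TextSTAT settings, so that part is left out here.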