[Rspamd-Users] Error 500 on rspamc learn_spam
Will van Gulik
mailing-porcus at porcus.ch
Sun Aug 16 19:06:13 UTC 2020
Hi,
I finally got further with that case. I think my case occured with the
upgrade from debian stretch to buster. During that process, redis was
not installed correctly. I also speculate that on the line somewhere, I
was upgraded from 1.8 to 2.5. Maybe that's also related.
Anyhow, I tried this fix that actually corrected the error message on
learning :
- https://github.com/rspamd/rspamd/issues/2678
I did that on 2 of my servers, where the second didn't had redis
installed either (also an upgrade), and then realized that installing
redis-server would create the file bayes classifier file as expected.
Now my rspamc learn_spam outputs a nice statistics message.
Hope this helps for anyone who might have a similar case,
Cheers,
Will
On Wed, Jun 03, 2020 at 01:59:15PM +0200, Will van Gulik wrote:
> Hi,
>
> I'm running rspamd 2.5 from rspamd's repo and I have the following issue for some
> days/month : (couldn't log when it exactly started, sorry). Distrib is
> debian buster, up to date.
>
> ----8<----
> $cat my_spam_mail_from_my_imap_junk_folder.eml | /usr/bin/rspamc learn_spam
> Results for file: stdin (0.088 seconds)
> HTTP error: 500, Unknown statistics error, found when storing data on backend; classifier: (null)
> ----8<----
>
> I noticed that was happening when my sieve script to learn spam / ham
> didn't worked anymore on my dovecot. I upgraded recently from 1.8 to
> 2.5 but I also have that on a fresh install of a 2.5 .
>
> Here is my output for the classifier :
>
> ----8<----
> $rspamadm configdump classifier
> 2020-06-03 13:56:58 #0(main) cfg; rspamd_config_post_load: CPU doesn't have SSSE3 instructions set required for hyperscan, disable it
> 2020-06-03 13:56:58 #0(main) <9unm9u>; cfg; rspamd_language_detector_init: cannot compile stop words for 0 language group: regexp parsing error: 'character code point value in \x{} or \o{} is too large' at position 7; pattern: a\x{10d}koli
> 2020-06-03 13:56:58 #0(main) <9unm9u>; cfg; rspamd_language_detector_init: cannot compile stop words for 1 language group: regexp parsing error: 'character code point value in \x{} or \o{} is too large' at position 6; pattern: \x{441}\x{430}\x{43c}
> 2020-06-03 13:56:58 #0(main) <9unm9u>; cfg; rspamd_language_detector_init: cannot compile stop words for 2 language group: regexp parsing error: 'character code point value in \x{} or \o{} is too large' at position 6; pattern: \x{906}\x{92b}\x{942}
> 2020-06-03 13:56:58 #0(main) <9unm9u>; cfg; rspamd_language_detector_init: cannot compile stop words for 3 language group: regexp parsing error: 'character code point value in \x{} or \o{} is too large' at position 6; pattern: \x{647}\x{646}\x{627}\x{644}\x{643}
> 2020-06-03 13:56:59 #0(main) <9unm9u>; lua; rbl.lua:1166: invalid config for blocklistde_received: extra fields: "ignore_whitelists", RBL is DISABLED
> 2020-06-03 13:56:59 #0(main) <9unm9u>; lua; rbl.lua:1166: invalid config for spamhaus_received: extra fields: "ignore_whitelists", RBL is DISABLED
> *** Section classifier ***
> bayes {
> backend = "redis";
> store_tokens = false;
> min_tokens = 11;
> min_learns = 200;
> cache {
> }
> new_schema = true;
> statfile {
> spam = false;
> symbol = "BAYES_HAM";
> }
> statfile {
> spam = true;
> symbol = "BAYES_SPAM";
> }
> tokenizer {
> name = "osb";
> }
> learn_condition = "return require(\"lua_bayes_learn\").can_learn";
> signatures = false;
> }
>
> *** End of section classifier ***
> -----8<----
>
> Where should I look to debug that one ? I'm a bit lost on what
> rspam_learn is expecting from whom.
>
> Happy to take any pointers to this interesting case,
>
> Cheers,
>
> Will
> --
> Users mailing list
> Users at lists.rspamd.com
> https://lists.rspamd.com/mailman/listinfo/users
More information about the Users
mailing list