[Rspamd-Users] Error 500 on rspamc learn_spam

Sun Aug 16 19:06:13 UTC 2020

Hi, 

I finally got further with that case. I think my case occured with the
upgrade from debian stretch to buster. During that process, redis was 
not installed correctly. I also speculate that on the line somewhere, I
was upgraded from 1.8 to 2.5. Maybe that's also related.

Anyhow, I tried this fix that actually corrected the error message on 
learning :
- https://github.com/rspamd/rspamd/issues/2678

I did that on 2 of my servers, where the second didn't had redis
installed either (also an upgrade), and then realized that installing
redis-server would create the file bayes classifier file  as expected.

Now my rspamc learn_spam outputs a nice statistics message.

Hope this helps for anyone who might have a similar case,

Cheers,

Will

On Wed, Jun 03, 2020 at 01:59:15PM +0200, Will van Gulik wrote:
> Hi, 
> 
> I'm running rspamd 2.5 from rspamd's repo and I have the following issue for some
> days/month : (couldn't log when it exactly started, sorry). Distrib is
> debian buster, up to date.
> 
> ----8<----
> $cat my_spam_mail_from_my_imap_junk_folder.eml | /usr/bin/rspamc learn_spam
> Results for file: stdin (0.088 seconds)
> HTTP error: 500, Unknown statistics error, found when storing data on backend; classifier: (null)
> ----8<----
> 
> I noticed that was happening when my sieve script to learn spam / ham
> didn't worked anymore on my dovecot. I upgraded recently from 1.8 to
> 2.5 but I also have that on a fresh install of a 2.5 . 
> 
> Here is my output for the classifier :
> 
> ----8<----
> $rspamadm configdump classifier
> 2020-06-03 13:56:58 #0(main) cfg; rspamd_config_post_load: CPU doesn't have SSSE3 instructions set required for hyperscan, disable it
> 2020-06-03 13:56:58 #0(main) <9unm9u>; cfg; rspamd_language_detector_init: cannot compile stop words for 0 language group: regexp parsing error: 'character code point value in \x{} or \o{} is too large' at position 7; pattern: a\x{10d}koli
> 2020-06-03 13:56:58 #0(main) <9unm9u>; cfg; rspamd_language_detector_init: cannot compile stop words for 1 language group: regexp parsing error: 'character code point value in \x{} or \o{} is too large' at position 6; pattern: \x{441}\x{430}\x{43c}
> 2020-06-03 13:56:58 #0(main) <9unm9u>; cfg; rspamd_language_detector_init: cannot compile stop words for 2 language group: regexp parsing error: 'character code point value in \x{} or \o{} is too large' at position 6; pattern: \x{906}\x{92b}\x{942}
> 2020-06-03 13:56:58 #0(main) <9unm9u>; cfg; rspamd_language_detector_init: cannot compile stop words for 3 language group: regexp parsing error: 'character code point value in \x{} or \o{} is too large' at position 6; pattern: \x{647}\x{646}\x{627}\x{644}\x{643}
> 2020-06-03 13:56:59 #0(main) <9unm9u>; lua; rbl.lua:1166: invalid config for blocklistde_received: extra fields: "ignore_whitelists", RBL is DISABLED
> 2020-06-03 13:56:59 #0(main) <9unm9u>; lua; rbl.lua:1166: invalid config for spamhaus_received: extra fields: "ignore_whitelists", RBL is DISABLED
> *** Section classifier ***
> bayes {
>     backend = "redis";
>     store_tokens = false;
>     min_tokens = 11;
>     min_learns = 200;
>     cache {
>     }
>     new_schema = true;
>     statfile {
>         spam = false;
>         symbol = "BAYES_HAM";
>     }
>     statfile {
>         spam = true;
>         symbol = "BAYES_SPAM";
>     }
>     tokenizer {
>         name = "osb";
>     }
>     learn_condition = "return require(\"lua_bayes_learn\").can_learn";
>     signatures = false;
> }
> 
> *** End of section classifier ***
> -----8<----
> 
> Where should I look to debug that one ? I'm a bit lost on what
> rspam_learn is expecting from whom.
> 
> Happy to take any pointers to this interesting case,
> 
> Cheers,
> 
> Will
> -- 
> Users mailing list
> Users at lists.rspamd.com
> https://lists.rspamd.com/mailman/listinfo/users