[Rspamd-Users] Error 500 on rspamc learn_spam

Will van Gulik mailing-porcus at porcus.ch
Fri Jun 5 15:22:16 UTC 2020


I could fix the ignore_whitelist errors, however I'll need some
additional help with the "rspamd_language_detector_init", reinstalling
the package didn't help. Any other ways I can dig to fix that ?

Also note that on a machine where I don't have the additional errors
with my classifier output, I also have the 500 error. Any ideas on that
one ?

Will

On Wed, Jun 03, 2020 at 09:10:16PM +0200, Stefan Bauer wrote:
> How about starting to fix the obvious errors/warnings configdump reports
> first?
> 
> Am Mittwoch, 3. Juni 2020 schrieb Will van Gulik <mailing-porcus at porcus.ch>:
> 
> > Hi,
> >
> > I'm running rspamd 2.5 from rspamd's repo and I have the following issue
> > for some
> > days/month : (couldn't log when it exactly started, sorry). Distrib is
> > debian buster, up to date.
> >
> > ----8<----
> > $cat my_spam_mail_from_my_imap_junk_folder.eml | /usr/bin/rspamc
> > learn_spam
> > Results for file: stdin (0.088 seconds)
> > HTTP error: 500, Unknown statistics error, found when storing data on
> > backend; classifier: (null)
> > ----8<----
> >
> > I noticed that was happening when my sieve script to learn spam / ham
> > didn't worked anymore on my dovecot. I upgraded recently from 1.8 to
> > 2.5 but I also have that on a fresh install of a 2.5 .
> >
> > Here is my output for the classifier :
> >
> > ----8<----
> > $rspamadm configdump classifier
> > 2020-06-03 13:56:58 #0(main) cfg; rspamd_config_post_load: CPU doesn't
> > have SSSE3 instructions set required for hyperscan, disable it
> > 2020-06-03 13:56:58 #0(main) <9unm9u>; cfg; rspamd_language_detector_init:
> > cannot compile stop words for 0 language group: regexp parsing error:
> > 'character code point value in \x{} or \o{} is too large' at position 7;
> > pattern: a\x{10d}koli
> > 2020-06-03 13:56:58 #0(main) <9unm9u>; cfg; rspamd_language_detector_init:
> > cannot compile stop words for 1 language group: regexp parsing error:
> > 'character code point value in \x{} or \o{} is too large' at position 6;
> > pattern: \x{441}\x{430}\x{43c}
> > 2020-06-03 13:56:58 #0(main) <9unm9u>; cfg; rspamd_language_detector_init:
> > cannot compile stop words for 2 language group: regexp parsing error:
> > 'character code point value in \x{} or \o{} is too large' at position 6;
> > pattern: \x{906}\x{92b}\x{942}
> > 2020-06-03 13:56:58 #0(main) <9unm9u>; cfg; rspamd_language_detector_init:
> > cannot compile stop words for 3 language group: regexp parsing error:
> > 'character code point value in \x{} or \o{} is too large' at position 6;
> > pattern: \x{647}\x{646}\x{627}\x{644}\x{643}
> > 2020-06-03 13:56:59 #0(main) <9unm9u>; lua; rbl.lua:1166: invalid config
> > for blocklistde_received: extra fields: "ignore_whitelists", RBL is DISABLED
> > 2020-06-03 13:56:59 #0(main) <9unm9u>; lua; rbl.lua:1166: invalid config
> > for spamhaus_received: extra fields: "ignore_whitelists", RBL is DISABLED
> > *** Section classifier ***
> > bayes {
> >     backend = "redis";
> >     store_tokens = false;
> >     min_tokens = 11;
> >     min_learns = 200;
> >     cache {
> >     }
> >     new_schema = true;
> >     statfile {
> >         spam = false;
> >         symbol = "BAYES_HAM";
> >     }
> >     statfile {
> >         spam = true;
> >         symbol = "BAYES_SPAM";
> >     }
> >     tokenizer {
> >         name = "osb";
> >     }
> >     learn_condition = "return require(\"lua_bayes_learn\").can_learn";
> >     signatures = false;
> > }
> >
> > *** End of section classifier ***
> > -----8<----
> >
> > Where should I look to debug that one ? I'm a bit lost on what
> > rspam_learn is expecting from whom.
> >
> > Happy to take any pointers to this interesting case,
> >
> > Cheers,
> >
> > Will
> > --
> > Users mailing list
> > Users at lists.rspamd.com
> > https://lists.rspamd.com/mailman/listinfo/users
> >
> -- 
> Users mailing list
> Users at lists.rspamd.com
> https://lists.rspamd.com/mailman/listinfo/users


More information about the Users mailing list