Provided by: rspamd_3.8.1-1.2ubuntu2_amd64 bug

NAME

       rspamd_stats - analyze Rspamd rules by parsing log files

SYNOPSIS

       rspamd_stats [options] [--symbol=SYM1 [--symbol=SYM2...]] [--log file]

DESCRIPTION

       rspamd_stats  will  read  the given log file (or standard input) and provide statistics for the specified
       symbols:

              Symbol: BAYES_SPAM (weight 3.763) (381985 hits, 26.827%)
              Ham hits: 184557 (48.315%), total ham: 1095487 (ham with BAYES_SPAM: 16.847%)
              Spam hits: 15134 (3.962%), total spam: 16688 (spam with BAYES_SPAM: 90.688%)
              Junk hits: 182294 (47.723%), total junk: 311699 (junk with BAYES_SPAM: 58.484%)
              Spam changes (ham/junk -> spam): 7026 (1.839%), total percentage (changes / spam hits): 42.102%
              Junk changes (ham -> junk): 95192 (24.920%), total percentage (changes / junk hits): 30.540%

       Where there are the following attributes:

       • Weight: average score for a symbols

       • Total hits: total number of hits and percentage of symbol hits divided by total number of messages

       • HAM hits: provides the following information about HAM messages with the specified symbol (from left to
         right):

       1. total symbol hits: number of messages that has this symbol and are HAM

       2. ham percentage: number of symbol hits divided by overall HAM messages count

       3. total ham hits: overall number of HAM messages

       4. ham with symbol percentage: percentage of number of hits with specified symbol in HAM messages divided
          by total number of HAM messages.

       • SPAM hits: provides the following information about SPAM messages -  same  as  previous  but  for  SPAM
         class.

       • Junk  hits:  provides  the  following  information  about Junk messages - same as previous but for JUNK
         class.

       • Spam changes: displays data about how much messages switched their class because of the specific symbol
         weight.

       • Junk changes: displays data about how much messages switched their class because of the specific symbol
         weight.

OPTIONS

       --log  Specifies log file or directory to read data from.   If  a  directory  is  specified  rspamd_stats
              analyses files in the directory including known compressed file types.  Number of log files can be
              limited using --num-logs and --exclude-logs options.  This assumes that files in the log directory
              have  newsyslog(8)-  or logrotate(8)-like name format with numeric indexes.  Files without indexes
              (generally it is merely one file) are considered the most recent and files with lower indexes  are
              considered newer.

       --reject-score
              Specifies the reject (spam) threshold.

       --junk-score
              Specifies the junk (add header or rewrite subject) threshold.

       --alpha-score
              Specifies the minimum score for a symbol to be considered by this script.

       --symbol
              Add symbol or pattern (pcre format) to analyze.

       --num-logs
              If set, limits number of analyzed logfiles in the directory to the specified value.

       --exclude-logs
              Number of latest logs to exclude (0 by default).

       --correlations
              Additionally  print  correlation  rate  for each symbol displayed.  This routine calculates merely
              paired correlations between symbols.

       --search-pattern
              Do not process input unless finding the specified regular expression.  Useful to skip  logs  to  a
              certain position.

       --exclude
              Exclude  log lines if certain symbols are fired (e.g.  GTUBE).  You may specify this option multi‐
              ple time to skip multiple symbols.

       --start
              Select log entries after this time.  Format: "YYYY-MM-DD HH:MM:SS" (can be truncated  to  any  de‐
              sired  accuracy).   If used with --end select entries between --start and --end.  The omitted date
              defaults to the current date if you supply the time.

       --end  Select log entries before this time.  Format: "YYYY-MM-DD HH:MM:SS" (can be truncated to  any  de‐
              sired accuracy).  If used with --start select entries between --start and --end.  The omitted date
              defaults to the current date if you supply the time.

       --help Print a brief help message and exits.

       --man  Prints the manual page and exits.

AUTHORS

       Vsevolod Stakhov.

                                                  March 5, 2018                                  RSPAMD_STATS(8)