Provided by: pcp_5.3.6-1build1_amd64 bug

NAME

       pmlogger_check, pmlogger_daily - administration of Performance Co-Pilot archive log files

SYNOPSIS

       $PCP_BINADM_DIR/pmlogger_check [-CNPpqsTV?]  [-c control] [-l logfile]
       $PCP_BINADM_DIR/pmlogger_daily  [-EfKMNoprRV?]   [-c  control]  [-k time] [-l logfile] [-m addresses] [-s
       size] [-t want] [-x time] [-X program] [-Y regex]

DESCRIPTION

       These shell scripts and  associated  control  files  may  be  used  to  create  a  customized  regime  of
       administration and management for Performance Co-Pilot (see PCPIntro(1)) archive log files.

       pmlogger_check  may  be  run  at  any  time  of  the  day  and is intended to check that a desired set of
       pmlogger(1) processes are running.  If not, it (re-)starts any missing logger processes.

       pmlogger_daily is intended to be run once per day,  preferably  in  the  early  morning,  as  soon  after
       midnight  as  practicable.  Its task is to aggregate, rotate and perform general housekeeping one or more
       sets of PCP archives.

       To accommodate the evolution of PMDAs and changes in production logging environments,  pmlogger_daily  is
       integrated with pmlogrewrite(1) to allow optional and automatic rewriting of archives before merging.  If
       there are global rewriting rules to be applied across all archives mentioned in the control file(s), then
       create  the directory $PCP_SYSCONF_DIR/pmlogrewrite and place any pmlogrewrite(1) rewriting rules in this
       directory.  For rewriting rules that are specific to only one family of archives, use the directory  name
       from  the control file(s) - i.e. the fourth field - and create a file, or a directory, or a symbolic link
       named pmlogrewrite within this directory and place the required rewriting  rule(s)  in  the  pmlogrewrite
       file  or  in files within the pmlogrewrite subdirectory.  pmlogger_daily will choose rewriting rules from
       the archive directory if they exist, else rewriting  rules  from  $PCP_SYSCONF_DIR/pmlogrewrite  if  that
       directory exists, else no rewriting is attempted.

       As  an  alternate  mechanism,  if  the file $PCP_LOG_DIR/pmlogger/.NeedRewrite exists when pmlogger_daily
       starts   then   this   is   treated   the   same   as   specifying   -R   on   the   command   line   and
       $PCP_LOG_DIR/pmlogger/.NeedRewrite will be removed once all the rewriting has been done.

OPTIONS

       -c control, --control=control
            Both  pmlogger_check  and pmlogger_daily are controlled by PCP logger control file(s) that specifies
            the pmlogger instances to be managed.  The default control file is $PCP_PMLOGGERCONTROL_PATH, but an
            alternate may be specified using the -c option.  If the  directory  $PCP_PMLOGGERCONTROL_PATH.d  (or
            control.d from the -c option) exists, then the contents of any additional control files therein will
            be appended to the main control file (which must exist).

       -C   This option causes pmlogger_check to query the system service runlevel information for pmlogger, and
            use that to determine whether to start processes or not.

       -E, --expunge
            This  option causes pmlogger_daily to pass the -E flag to pmlogger_merge in order to expunge metrics
            with metadata inconsistencies and continue rather than fail.  This is intended for  automated  daily
            log  rotation  where  it  is  highly  desirable  for unattended daily archive merging, rewriting and
            compression to succeed.  For further details, see pmlogger_merge(1) and description for the -x  flag
            in pmlogextract(1).

       -f, --force
            This  option  causes  pmlogger_daily  to  forces  action.   Using  this  option in production is not
            recommended.

       -k time, --discard=time
            After some period, old PCP archives are discarded.  time is a time specification in  the  syntax  of
            find-filter(1),  so  DD[:HH[:MM]].   The  optional  HH  (hours)  and MM (minutes) parts are 0 if not
            specified.  By default the time is 14:0:0 or 14 days, but may be changed using this option.

            Some special values are recognized for the time, namely 0 to keep no archives beyond  the  the  ones
            being  currently  written  by  pmlogger(1),  and  forever  or  never  to  prevent any archives being
            discarded.

            The time can also be set using the $PCP_CULLAFTER variable, set in either the environment  or  in  a
            control  file.  If both $PCP_CULLAFTER and -k specify different values for time then the environment
            variable value is used and a warning is issued.  I.e., if $PCP_CULLAFTER is set in the control file,
            it overrides -k given on the command line.

            Note that the semantics of time are that it is measured from the time of last modification  of  each
            archive,  and  not  from  the  original  archive  creation  date.   This has subtle implications for
            compression (see below) - the compression process results in the creation of new archive files which
            have new modification times.  In this case, the time period (re)starts from the time of compression.

       -K   When this option is specified for pmlogger_daily then only the compression tasks are  attempted,  so
            no pmlogger rotation, no culling, no rewriting, etc.  When -K is used and a period of 0 is in effect
            (from  -x on the command line or $PCP_COMPRESSAFTER in the environment or via the control file) this
            is intended for environments where compression of archives is desired  before  the  scheduled  daily
            processing happens.  To achieve this, once pmlogger_check has completed regular processing, it calls
            pmlogger_daily  with  just  the  -K  option.  Provided $PCP_COMPRESSAFTER is set to 0 along with any
            other required compression options to match the scheduled invocation of  pmlogger_daily,  then  this
            will   compress   all   volumes  except  the  ones  being  currently  written  by  pmlogger(1).   If
            $PCP_COMPRESSAFTER is set to a value greater than zero, then manually  running  pmlogger_daily  with
            the  -x  option  may  be used to compress volumes that are younger than the $PCP_COMPRESSAFTER time.
            This may be used to reclaim filesystem space by compressing volumes earlier  than  they  would  have
            otherwise  been  compressed.  Note that since the default value of $PCP_COMPRESSAFTER is 0 days, the
            -x option has no effect unless the control file has been edited and $PCP_COMPRESSAFTER has been  set
            to a value greater than 0.

       -l file, --logfile=file
            In  order  to  ensure  that mail is not unintentionally sent when these scripts are run from cron(8)
            diagnostics    are    always    sent    to    log    files.     By    default,    this    file    is
            $PCP_LOG_DIR/pmlogger/pmlogger_check.log or $PCP_LOG_DIR/pmlogger/pmlogger_daily.log but this can be
            changed  using  the  -l  option.  If this log file already exists when the script starts, it will be
            renamed with a .prev suffix  (overwriting  any  log  file  saved  earlier)  before  diagnostics  are
            generated to the log file.  The -l and -t options cannot be used together.

       -m addresses, --mail=addresses
            Use  of  this  option  causes  pmlogger_daily to construct a summary of the ``notices'' file entries
            which were generated in the last 24 hours, and e-mail that summary to  the  set  of  space-separated
            addresses.  This daily summary is stored in the file $PCP_LOG_DIR/NOTICES.daily, which will be empty
            when no new ``notices'' entries were made in the previous 24 hour period.

       -M   This  option  may  be  used  to disable archive merging (or renaming) and rewriting (-M implies -r).
            This is most useful in cases  where  the  archives  are  being  incrementally  copied  to  a  remote
            repository,  e.g.  using  rsync(1).   Merging,  renaming  and  rewriting all risk an increase in the
            synchronization load, especially immediately after pmlogger_daily has run, so -M may  be  useful  in
            these cases.

       -N, --showme
            This  option enables a ``show me'' mode, where the programs actions are echoed, but not executed, in
            the style of ``make -n''.  Using -N in conjunction with -V maximizes the diagnostic capabilities for
            debugging.

       -o   By default all possible archives will be merged.  This option reinstates the old behaviour in  which
            only  yesterday's archives will be considered as merge candidates.  In the special case where only a
            single input archive needs to be merged,  pmlogmv(1)  is  used  to  rename  the  archive,  otherwise
            pmlogger_merge(1) is used to merge all of the archives for a single host and a single day into a new
            PCP archive and the individual archives are removed.

       -p, --skip-primary
            If  this option is specified for pmlogger_check then any line from the control files for the primary
            pmlogger will be ignored.  This option is intended for environments where some system  daemon,  like
            systemd(1),  is  responsible  for  controlling  (starting,  stopping,  restarting, etc.) the primary
            pmlogger.

       -P, --only-primary
            If this option is specified for pmlogger_check then only the primary logger  entry  in  the  control
            files  will  be  processed.   This  is  the logical opposite of the -p option described above and is
            intended for use by RC scripts that start only the primary  logger,  such  as  the  pmlogger.service
            unit.  The -p and -P options to pmlogger_check are mutually exclusive.

       -p   If this option is specified for pmlogger_daily then the status of the daily processing is polled and
            if  the  daily pmlogger(1) rotation, culling, rewriting, compressing, etc.  has not been done in the
            last 24 hours then it is done now.  The intent is to have pmlogger_daily called regularly  with  the
            -p  option  (at  30  mins  past  the hour, every hour in the default cron(8) set up) to ensure daily
            processing happens as soon as possible if it was missed at the regularly scheduled  time  (which  is
            00:10  by  default),  e.g.  if  the  system  was  down  or suspended at that time.  With this option
            pmlogger_daily simply exits if the previous day's processing has already been done.  Note that  this
            option  is not used on platforms supporting systemd(1) because the pmlogger_daily.timer service unit
            specifies a timer setting with Persistent=true.   The  -K  and  -p  options  to  pmlogger_daily  are
            mutually exclusive.

       -q, --quick
            If  this  option  is  specified  for pmlogger_check then the script will ``quickstart'' avoiding any
            optional processing like file compression.

       -r, --norewrite
            This command line option acts as an override and prevents all archive rewriting with pmlogrewrite(1)
            independent of the presence of any rewriting rule files or directories.

       -R, --rewriteall
            Sometimes PMDA changes require all archives to be rewritten, not  just  the  ones  involved  in  any
            current  merging.   This  is  required  for  example  after  a PCP upgrade where a new version of an
            existing PMDA has revised metadata.  The -R command line forces this universal-style  of  rewriting.
            The -R option to pmlogger_daily is mutually exclusive with both the -r and -M options.

       -s size, --rotate=size
            If  the  PCP ``notices'' file ($PCP_LOG_DIR/NOTICES) is larger than 20480 bytes, pmlogger_daily will
            rename the file with a ``.old'' suffix, and start a new ``notices'' file.  The rotate threshold  may
            be changed from 20480 to size bytes using the -s option.

       -s, --stop
            Use  of  this option provides the reverse pmlogger_check functionality, allowing the set of pmlogger
            processes to be cleanly shutdown.

       -t period
            To assist with debugging or diagnosing intermittent failures the -t option may be used.   This  will
            turn   on   very   verbose   tracing   (-VV)   and   capture  the  trace  output  in  a  file  named
            $PCP_LOG_DIR/pmlogger/daily.datestamp.trace, where datestamp is the time pmlogger_daily was  run  in
            the  format  YYYYMMDD.HH.MM.   In addition, the period argument will ensure that trace files created
            with -t will be kept for period days and then discarded.

       -T, --terse
            This option to pmlogger_check produces less verbose output than the default.  This is most  suitable
            for a pmlogger ``farm'' where many instances of pmlogger are expected to be running.

       -V, --verbose
            The output from the cron execution of the scripts may be extended using the -V option to the scripts
            which  will  enable  verbose  tracing  of their activity.  By default the scripts generate no output
            unless some error or warning condition is encountered.  Using -N in conjunction  with  -V  maximizes
            the diagnostic capabilities for debugging.

       -x time, --compress-after=time
            Archive  data  files can optionally be compressed after some period to conserve disk space.  This is
            particularly useful for large numbers of pmlogger processes under the control of pmlogger_check.

            time is a time specification in the syntax of find-filter(1),  so  DD[:HH[:MM]].   The  optional  HH
            (hours) and MM (minutes) parts are 0 if not specified.

            Some  special values are recognized for the time, namely 0 to apply compression as soon as possible,
            and forever or never to prevent any compression being done.

            If transparent_decompress is enabled when libpcp was built (can be checked with the  pmconfig(1)  -L
            option),  then  the default behaviour is compression ``as soon as possible''.  Otherwise the default
            behaviour is to not compress files (which matches the historical default behaviour  in  earlier  PCP
            releases).

            The  time can also be set using the $PCP_COMPRESSAFTER variable, set in either the environment or in
            a control file.  If both $PCP_COMPRESSAFTER and -x  specify  different  values  for  time  then  the
            environment  variable  value  is  used  and a warning is issued.  For important other detailed notes
            concerning volume compression, see the -K and -k options (above).

       -X program, --compressor=program
            This option specifies the program  to  use  for  compression  -  by  default  this  is  xz(1).   The
            environment  variable  $PCP_COMPRESS  may be used as an alternative mechanism to define program.  If
            both $PCP_COMPRESS and -X specify different compression programs then the environment variable value
            is used and a warning is issued.

       -Y regex, --regex=regex
            This option allows a regular expression to be specified causing files in the set  of  files  matched
            for  compression  to be omitted - this allows only the data file to be compressed, and also prevents
            the  program  from  attempting  to  compress  it   more   than   once.    The   default   regex   is
            ".(index|Z|gz|bz2|zip|xz|lzma|lzo|lz4)$"  - such files are filtered using the -v option to egrep(1).
            The environment variable $PCP_COMPRESSREGEX may be used as an alternative mechanism to define regex.
            If both $PCP_COMPRESSREGEX and -Y specify different values for regex then the  environment  variable
            value is used and a warning is issued.

       -?, --help
            Display usage message and exit.

CONFIGURATION

       Warning:  The  $PCP_PMLOGGERCONTROL_PATH  file and files within the $PCP_PMLOGGERCONTROL_PATH.d directory
       must not be writable by any user other than root.

       The control file(s) should be customized according to the following rules that  define  for  the  current
       version (1.1) of the control file format.

       1.  Lines  beginning  with  a  ``#''  are comments.  A special case is lines beginning ``#!#''; these are
           control lines for a pmlogger(1) that has been stopped using pmlogctl(1).
       2.  Lines beginning with a ``$'' are assumed to be assignments to environment variables in the  style  of
           sh(1),  and  all text following the ``$'' will be eval'ed by the script reading the control file, and
           the corresponding variable exported into the environment.  This is particularly  useful  to  set  and
           export variables into the environment of the administrative scripts, e.g.
               $ PMCD_CONNECT_TIMEOUT=20
       3.  There must be a version line in the initial control file of the form:
               $ version=1.1
       4.  There should be one line in the control file(s) for each pmlogger instance of the form:

               host y|n y|n directory args

       5.  Fields  within  a  line  of  the  control file(s) are usually separated by one or more spaces or tabs
           (although refer to the description of the directory field for some important exceptions).
       6.  The first field is the name of the host that is the  source  of  the  performance  metrics  for  this
           pmlogger instance.
       7.  The  second field indicates if this is a primary pmlogger instance (y) or not (n).  Since the primary
           logger must run on the local host, and there may be at most one primary logger for a particular host,
           this field can be y for at most one pmlogger instance, in which case the host name must be  the  name
           of the local host.
       8.  The  third  field  indicates  if  this  pmlogger  instance  needs  to be started under the control of
           pmsocks(1) to connect to a pmcd through a firewall (y or n).
       9.  The fourth field is a directory name.  All files associated  with  this  pmlogger  instance  will  be
           created  in  this directory, and this will be the current directory for the execution of any programs
           required in the maintenance of those archives.  A useful convention is that primary  logger  archives
           for the local host with hostname myhost are maintained in the directory $PCP_ARCHIVE_DIR/myhost (this
           is  where  the  default  pmlogger start-up script in $PCP_RC_DIR/pcp will create the archives), while
           archives for the remote host mumble are maintained in $PCP_ARCHIVE_DIR/mumble.
       10. The directory field may contain embedded shell syntax that will be evaluated by sh(1) to produce  the
           real directory name to be used.  The allowed constructs are:
           • Any text (including white space) enclosed with $( and ).
           • Any text (including white space) enclosed with ` and ` (back quotes).
           • Any text (including white space) enclosed with " and " (double quotes).
           • Any word containing a $ (assumed to introduce an environment variable name).
       11. All other fields are interpreted as arguments to be passed to pmlogger(1).  Most typically this would
           be the -c option.

       The  following  sample  control  lines specify a primary logger on the local host (bozo), and non-primary
       loggers to collect and log performance metrics from the hosts wobbly and boing.

       $version=1.1
       bozo   y  n  $PCP_ARCHIVE_DIR/bozo   -c config.default
       wobbly n  n  "/store/wobbly/$(date +%Y)"  -c ./wobbly.config
       boing  n  n  $PCP_ARCHIVE_DIR/boing  -c ./pmlogger.config

       Typical crontab(5) entries for periodic execution of  pmlogger_daily  and  pmlogger_check  are  given  in
       $PCP_SYSCONF_DIR/pmlogger/crontab (unless installed by default in /etc/cron.d already) and shown below.

       # daily processing of archive logs
       14      0       *       *       *       $PCP_BINADM_DIR/pmlogger_daily
       # every 30 minutes, check pmlogger instances are running
       25,55   *       *       *       *       $PCP_BINADM_DIR/pmlogger_check

       When  using systemd(1) on Linux, no crontab entries are needed as the timer mechanism provided by systemd
       is used instead.

FILES

       $PCP_PMLOGGERCONTROL_PATH
            the PCP logger control file.  For a new installation this file contains no pmlogger(1) control lines
            (the real control files are all in the $PCP_PMLOGGERCONTROL_PATH.d  directory),  but  this  file  is
            still processed to support any legacy configurations therein from earlier PCP releases.
            Warning: this file must not be writable by any user other than root.

       $PCP_PMLOGGERCONTROL_PATH.d
            optional directory containing additional PCP logger control files, typically one per host
            Warning: the files herein must not be writable by any user other than root.

       $PCP_SYSCONF_DIR/pmlogger/crontab
            sample  crontab  for automated script execution by $PCP_USER (or root).  Exists only if the platform
            does not support the /etc/cron.d mechanism.

       $PCP_VAR_DIR/config/pmlogger/config.default
            default pmlogger configuration file location for  the  local  primary  logger,  typically  generated
            automatically by pmlogconf(1).

       $PCP_ARCHIVE_DIR/<hostname>
            default location for archives of performance information collected from the host hostname

       $PCP_ARCHIVE_DIR/<hostname>/lock
            transient  lock  file  to  guarantee  mutual  exclusion  during pmlogger administration for the host
            hostname - if present, can be safely  removed  if  neither  pmlogger_daily  nor  pmlogger_check  are
            running

       $PCP_ARCHIVE_DIR/<hostname>/Latest
            PCP  archive  folio created by mkaf(1) for the most recently launched archive containing performance
            metrics from the host hostname

       $PCP_LOG_DIR/NOTICES
            PCP ``notices'' file used by pmie(1) and friends

       $PCP_LOG_DIR/pmlogger/pmlogger_check.log
            if the previous execution of pmlogger_check produced any output it is saved here.  The  normal  case
            is no output in which case the file does not exist.

       $PCP_LOG_DIR/pmlogger/pmlogger_daily.log
            if  the  previous execution of pmlogger_daily produced any output it is saved here.  The normal case
            is no output in which case the file does not exist.

       $PCP_ARCHIVE_DIR/<hostname>/SaveLogs
            if this directory exists, then the log file from the -l argument of a newly launched pmlogger(1) for
            hostname will be linked into this directory with the name archive.log where archive is the  basename
            of  the  associated  pmlogger(1)  PCP  archive files.  This allows the log file to be inspected at a
            later time, even if several pmlogger(1) instances for hostname have been launched  in  the  interim.
            Because  the  cron-driven  PCP  archive  management  scripts  run under the uid of the user ``pcp'',
            $PCP_ARCHIVE_DIR/hostname/SaveLogs typically needs to be owned by the user ``pcp''.

       $PCP_LOG_DIR/pmlogger/.NeedRewrite
            if this file exists, then this is treated as equivalent to using -R on the command line and the file
            will be removed once all rewriting has been done.

PCP ENVIRONMENT

       Environment variables with the prefix PCP_ are used to parameterize the file and directory names used  by
       PCP.   On  each  installation, the file /etc/pcp.conf contains the local values for these variables.  The
       $PCP_CONF variable may be used to specify an alternative configuration file, as described in pcp.conf(5).

       The default behaviour, when pmlogger(1) configuration comes  from  pmlogconf(1),  is  to  regenerate  the
       configuration  file  and  check  for changes whenever pmlogger(1) is started from pmlogger_check.  If the
       PMDA configuration is stable, this is not necessary,  and  setting  $PMLOGGER_CHECK_SKIP_LOGCONF  to  yes
       disables the regeneration and checking.

COMPATIBILITY ISSUES

       Earlier versions of pmlogger_daily used find(1) to locate files for compressing or culling and the -k and
       -x  options  took only integer values to mean ``days''.  The semantics of this was quite loose given that
       find(1) offers different precision and semantics across platforms.

       The current implementation of pmlogger_daily uses find-filter(1) which provides high precision  intervals
       and semantics that are relative to the time of execution and are consistent across platforms.

SEE ALSO

       egrep(1),  find-filter(1),  PCPIntro(1),  pmconfig(1),  pmlc(1),  pmlogconf(1), pmlogctl(1), pmlogger(1),
       pmlogger_daily_report(1), pmlogger_merge(1), pmlogextract(1),  pmlogmv(1),  pmlogrewrite(1),  pmsocks(1),
       systemd(1), xz(1) and cron(8).

Performance Co-Pilot                                   PCP                                     PMLOGGER_CHECK(1)