SiteScope User's Guide


SiteScope Health Monitor Reference

SiteScope Health Monitors are deployed by using the Health group page. The error, warning, and good status thresholds for these monitors are set in the same way as for other monitor types.

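This section describes:

  • MG Health Monitor
  • History Health Monitor
  • Master Health Monitor
  • Log Event Health Monitor
  • Monitor Load Monitor
  • Health of SiteScope Server Monitor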

MG Health Monitor

This monitor checks all monitor group (*.mg) files currently defined in the local SiteScope installation. You can edit the update frequency and the display name for this monitor type. You can use the Advanced Options to disable the monitor individually and to select the other options shown on the monitor setup page.

The status thresholds for this monitor are based on the number of errors (numErrors) detected in the monitor group files.
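As an illustration of how a numErrors-based status could be evaluated, here is a minimal Python sketch. It is not SiteScope code: the `Thresholds` class, the `classify` function, and the threshold values are hypothetical, chosen only to show error conditions being checked before warning conditions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Thresholds:
    """Hypothetical threshold values, not SiteScope defaults."""
    error_at: int = 5    # numErrors >= error_at   -> "error"
    warning_at: int = 1  # numErrors >= warning_at -> "warning"


def classify(num_errors: int, thresholds: Thresholds = Thresholds()) -> str:
    """Map a numErrors value to a status string.

    The error condition is tested first, then warning; anything else is good.
    """
    if num_errors < 0:
        raise ValueError("num_errors must be non-negative")
    if num_errors >= thresholds.error_at:
        return "error"
    if num_errors >= thresholds.warning_at:
        return "warning"
    return "good"
```

With these assumed values, a configuration file with no errors is reported as good, one to four errors as warning, and five or more as error. The same pattern would apply to the other monitors in this section whose thresholds are based on numErrors.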

History Health Monitor

This monitor checks the history.config file for reports currently defined in the local SiteScope installation. You can edit the update frequency and the display name for this monitor type. You can use the Advanced Options to disable the monitor individually and to select the other options shown on the monitor setup page.

The status thresholds for this monitor are based on the number of errors (numErrors) detected in the reports configuration file.

Master Health Monitor

This monitor checks the master.config file for the local SiteScope installation. You can edit the update frequency and the display name for this monitor type. You can use the Advanced Options to disable the monitor individually and to select the other options shown on the monitor setup page.

The status thresholds for this monitor are based on the number of errors (numErrors) detected in the master configuration file.

Log Event Health Monitor

This monitor checks the error.log file for the local SiteScope installation. You can edit the counters, the update frequency, and the display name for this monitor type. Use the choose counters link to edit the counters selected for the monitor. You can use the Advanced Options to disable the monitor individually and to select the other options shown on the monitor setup page.

The status thresholds for this monitor are based on the counters selected. The counters for this monitor type are listed in the table below.

Note: Only the first 15 selected counters will be configured and monitored. A maximum of 10 measurements can be used as status threshold criteria for alerting.

Log Event Monitor Counters

Counter: Description

  • skipped #1: A monitor has skipped its scheduled run once.
  • skipped #2: A monitor has skipped its scheduled run two times.
  • skipped #3: A monitor has skipped its scheduled run three times.
  • skipped #4: A monitor has skipped its scheduled run four times.
  • skipped #5: A monitor has skipped its scheduled run five times.
  • SiteScope shutting down: SiteScope has been shut down.
  • Reached the limit of processes in the process pool: The number of processes requested from the process pool exceeds the number of processes available in the pool.
  • Error. data reporter failed to report chunk of data: There was a fault in the transfer of SiteScope monitor measurement data to Mercury Application Management.
  • Error. config reporter failed to report chunk of data: There was a fault in the transfer of SiteScope configuration data to Mercury Application Management Monitor Administration.
  • Error. Topaz failed to process data: Mercury Application Management reported a fault in processing data sent from SiteScope.
  • Error. CacheSender. Got to the max number of cached files: SiteScope has reached the maximum number of cached data files awaiting transfer to Mercury Application Management. This may occur if data transfer between SiteScope and Mercury Application Management has been interrupted.
  • Error. CacheSender. Got to the max old dir size: SiteScope has reached the maximum directory size for cached data files awaiting transfer to Mercury Application Management. This may occur if data transfer between SiteScope and Mercury Application Management has been interrupted.
  • Topaz SEVERE: Mercury Application Management reported a data transfer or processing fault with a status of SEVERE.

Status thresholds are set on the counters listed above.
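To make the counters above more concrete, the following Python sketch tallies how often each counter string appears in a block of log text. It rests on assumptions: that each counter corresponds to a line containing the counter name verbatim, and that the sample lines resemble error.log entries. Neither is confirmed by this guide, and the `count_log_events` function and `SAMPLE_LOG` text are hypothetical.

```python
from collections import Counter

# Counter names as listed for the Log Event Health Monitor.
LOG_EVENT_COUNTERS = [
    "skipped #1",
    "skipped #2",
    "skipped #3",
    "skipped #4",
    "skipped #5",
    "SiteScope shutting down",
    "Reached the limit of processes in the process pool",
    "Error. data reporter failed to report chunk of data",
    "Error. config reporter failed to report chunk of data",
    "Error. Topaz failed to process data",
    "Error. CacheSender. Got to the max number of cached files",
    "Error. CacheSender. Got to the max old dir size",
    "Topaz SEVERE",
]


def count_log_events(log_text, counters=LOG_EVENT_COUNTERS):
    """Count the lines of log_text that contain each counter string.

    Assumption: a simple substring match is enough to recognize an event.
    """
    counts = Counter({name: 0 for name in counters})
    for line in log_text.splitlines():
        for name in counters:
            if name in line:
                counts[name] += 1
    return counts


# Hypothetical log lines; the real error.log layout may differ.
SAMPLE_LOG = """\
10:00:01 monitor CPU skipped #1
10:10:01 monitor CPU skipped #2
10:12:30 Error. Topaz failed to process data
10:15:00 monitor Disk skipped #1
"""

if __name__ == "__main__":
    for name, count in count_log_events(SAMPLE_LOG).items():
        if count:
            print(f"{name}: {count}")
```

A threshold such as "error if skipped #5 is greater than zero" could then be evaluated against the resulting counts.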

Monitor Load Monitor

This monitor checks several SiteScope load statistics reported by the Progress Report for the local SiteScope installation. You can edit the counters, the update frequency, and the display name for this monitor type. Use the choose counters link to edit the counters selected for the monitor. You can use the Advanced Options to disable the monitor individually and to select the other options shown on the monitor setup page.

The status thresholds for this monitor are based on the counters selected. The counters for this monitor type are listed below.

Note: Only the first 15 selected counters will be configured and monitored. A maximum of 10 measurements can be used as status threshold criteria for alerting.

Monitor Load Counters

  • Current Monitors Run Per Minute
  • Current Monitors Running
  • Current Monitors Waiting
  • Maximum Monitors Run Per Minute
  • Maximum Monitors Running
  • Maximum Monitors Waiting

Health of SiteScope Server Monitor

This monitor checks several SiteScope server resource and process statistics for the local SiteScope installation. You can edit the counters, the update frequency, and the display name for this monitor type. Use the choose counters link to edit the counters selected for the monitor. You can use the Advanced Options to disable the monitor individually and to select the other options shown on the monitor setup page.

The status thresholds for this monitor are based on the counters selected. The counters available depend on the platform on which SiteScope is running. The counters for this monitor type are listed below.

Note: Only the first 15 selected counters will be configured and monitored. A maximum of 10 measurements can be used as status threshold criteria for alerting.

Health of SiteScope Server Counters on UNIX

The following are default Health of SiteScope Server Monitor counters for SiteScope on UNIX platforms:


Used Disk Space on SiteScope Drive
MegaBytes Available on SiteScope Drive
Used Disk Space on /
MegaBytes Available on /
Disk Blocks Written/sec
Disk Blocks Read/sec
Physical Memory Free
Physical Memory Free Megabytes
Swap Free
Swap Free Megabytes
Load Avg 5min
SiteScope Process Memory
SiteScope Process Thread Count
SiteScope Process Handle Count
Average CPU
PageIns/sec
PageOuts/sec
SwapIns/sec
SwapOuts/sec
ContextSwitches/sec
Net_TotalPacketsIn/sec
Net_TotalPacketsOut/sec
Net_TotalCollisions/sec
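The note about counter limits (15 counters configured, 10 measurements usable as threshold criteria) can be illustrated with the UNIX counter list, which is longer than 15 entries. The Python sketch below is hypothetical: `check_selection` is not a SiteScope function, and it assumes that "first 15" refers to the order in which the counters were selected.

```python
MAX_COUNTERS = 15                # only the first 15 selected counters are monitored
MAX_THRESHOLD_MEASUREMENTS = 10  # at most 10 measurements may drive thresholds

# Counter names from the UNIX list in this section.
UNIX_COUNTERS = [
    "Used Disk Space on SiteScope Drive",
    "MegaBytes Available on SiteScope Drive",
    "Used Disk Space on /",
    "MegaBytes Available on /",
    "Disk Blocks Written/sec",
    "Disk Blocks Read/sec",
    "Physical Memory Free",
    "Physical Memory Free Megabytes",
    "Swap Free",
    "Swap Free Megabytes",
    "Load Avg 5min",
    "SiteScope Process Memory",
    "SiteScope Process Thread Count",
    "SiteScope Process Handle Count",
    "Average CPU",
    "PageIns/sec",
    "PageOuts/sec",
    "SwapIns/sec",
    "SwapOuts/sec",
    "ContextSwitches/sec",
    "Net_TotalPacketsIn/sec",
    "Net_TotalPacketsOut/sec",
    "Net_TotalCollisions/sec",
]


def check_selection(selected, threshold_counters):
    """Split a selection into monitored and ignored counters and list problems.

    Returns (monitored, ignored, problems), where problems is a list of
    human-readable strings describing threshold settings that would not work.
    """
    monitored = list(selected[:MAX_COUNTERS])
    ignored = list(selected[MAX_COUNTERS:])
    problems = []
    if len(threshold_counters) > MAX_THRESHOLD_MEASUREMENTS:
        problems.append(
            f"{len(threshold_counters)} threshold measurements requested; "
            f"the maximum is {MAX_THRESHOLD_MEASUREMENTS}"
        )
    for name in threshold_counters:
        if name not in monitored:
            problems.append(f"threshold on '{name}' refers to an unmonitored counter")
    return monitored, ignored, problems
```

Under these assumptions, selecting every counter in the list would leave the entries from PageIns/sec onward unmonitored, so a threshold on one of them would never be evaluated. Placing the counters that matter most first in the selection avoids this.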

Health of SiteScope Server Counters on Windows

On the Windows platform, the counters for this monitor type are presented in an expandable tree selection menu. Use the navigation features to expand and collapse the selection menu and to select the counters to monitor. The following are the default Health of SiteScope Server Monitor counters for SiteScope on Windows platforms.

System Component

Available Counters

Memory

Page Faults/sec
Pool Paged Bytes
Pool Nonpaged Bytes
% Committed Bytes In Use
Available MBytes

System

Context Switches/sec
File Data Operations/sec
System Up Time
Processor Queue Length
Processes
Threads

Processor

_Total
% Processor Time
% DPC Time

Process

java
Thread Count
Pool Paged Bytes
Pool Nonpaged Bytes
Handle Count

Process

perfex
% Processor Time
Thread Count
Pool Paged Bytes
Pool Nonpaged Bytes
Handle Count

Network Interface

MS TCP Loopback interface
Bytes Total/sec
Current Bandwidth
Bytes Received/sec
Bytes Sent/sec

<Ethernet_hardware> (hardware specific to the particular SiteScope server)
Bytes Total/sec
Current Bandwidth
Bytes Received/sec
Bytes Sent/sec

LogicalDisk

<logical_drive> (hardware specific to the particular SiteScope server)
% Free Space
Free Megabytes
Avg. Disk Bytes/Transfer

_Total
% Free Space
Free Megabytes
Avg. Disk Bytes/Transfer

PhysicalDisk

_Total
Current Disk Queue Length
Disk Transfers/sec

<physical_disk(s)> (hardware specific to the particular SiteScope server)
Current Disk Queue Length
Disk Transfers/sec

Server

Bytes Total/sec
Errors Logon
Errors Access Permissions
Errors System
Files Open
Server Sessions