Manual Chapter : Background diagnostics overview

Applies To:

  • F5OS-A

    1.2.0

Background diagnostics overview

Background diagnostics run continuously in the background, do not affect production traffic, and do not require you to take the system out of service to run them.

Background diagnostics tasks cover specific components and functions, at varying schedules.

Subsystem Task Schedule
Hardware LOP monitors sensor status Asynchronous
TPM TPM status check System start-up
CPLD CPLD check System start-up
Drives NVME SMART attributes Interval
Drives Disk usage task Interval
Memory Temperature Interval
Memory Memory controller events Interval
CPU Temperature Interval
CPU Memory controller event records Interval
CPU Extended event log events Interval
LOP LOP task - firmware diagnostic data Interval
Sensors LOP - sensors Interval
LCD LCD - health Interval
LCD LCD - sensors Interval

The TPM (Trusted Platform Module) status check runs when the system starts up and verifies the TPM health status. For more information, see “Trusted Platform Module (TPM) overview” in F5 rSeries Systems: Administration and Configuration.

System start-up

The CPLD (Complex Programmable Logic Device) status check runs when the system starts up and verifies the ability to read the CPLD magic byte.

System start-up

The NVME SMART attributes task verifies the internal status of installed storage drives, including this information:

  • Drive critical warnings
  • Drive media errors
  • Drive media percentage of storage used
  • Drive media percentage of available storage
  • Drive temperature within normal operating range

Interval

The disk usage task calculates the percentage used of disk capacity and reports alarms when drives reach low capacity of unused space.

Interval

The memory temperature task gathers errors produced by dual in-line memory module (DIMM) sensors when temperature is out of an acceptable range.

Interval

The memory controller task gathers errors reported by the system and errors reported by the error detection and correction (EDAC) mechanism for dual in-line memory modules (DIMMs).

Interval

The CPU temperature task monitors the CPU core temperatures and reports alarms when the temperature is out of an acceptable range.

Interval

The CPU memory controller event records task gathers errors reported by the memory reliability, availability, and serviceability (RAS) daemon.

Interval

The CPU extended event log check task gathers extended event log errors reported by the memory reliability, availability, and serviceability (RAS) daemon.

Interval

The LOP - Firmware diagnostic data task monitors the lights-out processor (LOP) firmware diagnostic data and stores data in a JSON log file on the file system at /var/F5/diag-agent/logs. These log files are added when generating a QKView file.

Interval

The LOP - Sensors task monitors the lights-out processor (LOP) sensor status for the fan tray, power supply unit (PSU) controller, blades, and system controllers.

Interval

The LCD Health Check task monitors the health of the LCD and reports alarms when the LCD is unhealthy.

Interval

The LCD Sensors task periodically monitors the various LCD sensors and reports their values.

Interval