Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

The Uptime Infrastructure Monitor IRIX agent collects the following performance metrics from the systems on which it is installed: CPU
Multi-CPU Memory Disk
Image Removed Network Image Removed Process Image Removed User

Table of Contents
maxLevel2
minLevel2

The IRIX agent uses a number of utilities to gather these metrics, including: Image Removed

  • sar

...

  • . Collects information about system activity.

...

  • ifconfig

...

  • . Configures the parameters for network interfaces.

...

  • ps

...

  • . Reports on the status of processes.

...

  • netstat

...

  • . Reports on network status.

Each set of performance metrics is averaged between the interval at which the Uptime Infrastructure Monitor UIM monitoring station polls the agent (e.g. every 10 minutes).

...

...

CPU

The Uptime Infrastructure Monitor UIM agent uses the sar utility (with the -u and -f options) to collect the metrics listed below from an IRIX system. The statistics returned by the agent are averaged for all CPUs on the system.

Metric
Explanation
Description
% Usr
The amount
Amount of time that the CPU spends in user mode.
% Sys
The amount
Amount of time that the kernel spends processing system calls.
%
WIO
WHO
The amount
Amount of waiting time that a runable process for a device takes to perform an I/O operation.
Multi CPU UsageWhether
or not
a system with multiple CPUs is effectively balancing tasks between CPUs, or if processes are being forced off CPUs in certain circumstances.
Run Queue Length
The percentage
Percentage of time that one or more services or processes are waiting to be served by the CPU.
Run Queue Occupancy
The percentage
Percentage of time that one or more services or processes are waiting to be served by the CPU.

...

...

Multi-CPU

The Uptime Infrastructure Monitor UIM agent collects the metrics listed below from IRIX systems with multiple CPUs. The CPU statistics output by the agent are an average of all the CPUs on the server.

Me tric
Metric
Explanation
Description
Us er
User %
The percentage
Percentage of CPU user processes that are in use.
Sys tem
System %
The percentage
Percentage of CPU kernel processes that are in use.
Wa it
Wait I

/O

%
The percentage
Percentage of time that a process which can be run must wait for a device to perform an I/O operation.
SM TX
SMTX
The number
Number of read or write locks that a thread was not able to acquire on the first attempt, as reported by the mpstat command.
XC AL
XCAL
The number
Number of interprocess cross-calls. In a multi-processor environment, one processor sends cross-calls to another processor to get that processor to do work. Cross-calls can also be used to ensure consistency in virtual memory. Heavy file system activity such as NFS can result in a high number of cross-calls.
Int err
Interrupts
The number
Number of CPU interrupts.

upts

 

Tot al
Total %
The total
Total amount of User %, System %, and Wait I/O%.

...

Memory

The Uptime Infrastructure Monitor UIM agent uses the the sar utility  utility with the following options to collect memory metrics from an IRIX system: Image Removed

  • -w -f (swap activity)

...

  • -b -f (buffer activity)

...

  • -p -f (paging activity)

The statistics the agent returns are for the entire system. 

Explanation
MetricDescription
Free MemoryThe amount Amount of physical memory available to the operating system, system library files, and applications.
Cache Hit RateHow often the system accesses the CPU cache.
Page-outs/sThe rate Rate at which pages were written to disk.
Page-ins/sThe rate Rate at which pages were read from or written to the disk.
Page Frees/sThe number Number of pages that are freed from memory each second.
Attaches/sThe number Number of pages that get attached to memory each second.
Page-out Requests/sThe number Number of requests to perform a write operation that occur each second.
Page-in reqs/sThe number Number of requests to perform a read operation that occur each second.
PageScans/sThe number Number of pages that are scanned each second.
PageFaults/sThe number Number of page faults that occur each second.
Software Locks/sThe number Number of software locks that are issued each second.

...

...

Disk

The Uptime Infrastructure Monitor UIM agent uses the the sar utility with the -d and -f options to collect the disk metrics listed below from an IRIX system. The agent collects volume capacity statistics from each filesystem, while the disk statistics (%busy, Read/Write/s) returned are for each disk.

Metric
Explanation
Description

Disk (Spindle) Name

The names

Names of each disk on the system.

Usage (% Busy)

The percentage

Percentage of time during which the disk drive is handling read or write requests.

Blocks per second

The number

Number of read and write operations on the disk that occur each second.

Transfers/s

The average

Average number of bytes that have been transferred to or from the disk during write or read operations.

Average Queued Requests

The number

Number of threads that are waiting for processor time.

Average Service Time

The average

Average amount of time, in milliseconds, that is required for a request to be carried out.

Average Wait Time

The average

Average time, in milliseconds, that a transaction is waiting in a queue. The wait time is directly proportional to the length of the queue.

...

Network

The Uptime Infrastructure Monitor UIM agent uses the following utilities to collect network data from an IRIX system: Image Removed

  • ifconfig -a

...

  • netstat -s

Except for TCP retransmits, the agent averages all statistics per interface. Explanation

MetricDescription

Receive Rate

The rate

Rate, in kilobytes per seconds, at which data is received over a specific network adapter.

Send Rate

The rate

Rate, in kilobytes per seconds, at which data is sent over a specific network adapter.

Packets Inbound Errors

The number

Number of inbound packets that contained errors, which preventing those packets from being delivered to a higher-layer protocol.

Packets Outbound Errors

The number

Number of outbound packets that could not be transmitted because of errors.

Collisions

The number

Number of signals from two separate nodes on the network that have collided.

TCP Retransmits

The number

Number of packets that have been re-sent over a network interface.

...

Process

The Uptime Infrastructure Monitor UIM agent uses the ps utility with the -ef option to collect the process data listed below from an IRIX system. By default, the agent only gathers the top 20 processes, and sorts them by the highest CPU usage.

Metric
Explanation
Description

Number of Processes

The number

Number of processes that are currently running on a system.

Process Creation Rate

This metric determines

Determines whether

or not

there are runaway processes on a system or if a forking-based process (like a Web server) is spawning too many processes over a specified period of time.

Processes Running

The number

Number of processes that are currently running.

Processes Blocked

The number

Number of processes that are currently being blocked from running.

Processes Waiting

The number

Number of processes that are currently waiting to run.

Workload - User

The demand

Demand that network and local services are putting on the system, based on the IDs of the users who are logged into a system.

Workload - Group

The demand

Demand that network and local services are putting on the system, based on the IDs of the user groups that are logged into a system.

Workload - Process Name

The demand

Demand that network and local services are putting on a system, based on the processes that are running.

Workload Top 10 - User

The 10 network and local services that are are putting the most load on the system, based on the IDs of the users who are logged into a system.

Workload Top 10 - Group

The 10 network and local services that are are putting the most load on the system, based on the IDs of the user groups who are logged into a system.

Workload Top 10 - Process Name

The 10 network and local services that are are putting the most load on the system, based on the processes that are running.

...

User

The Uptime Infrastructure Monitor UIM agent uses the following utilities to collect user information from an IRIX system: Image Removed

  • ps -eo

...

  • /usr/bsd/last /usr/bsd/head -10 (login history for the last 10 users on the system)

...

  • /usr/bin/who (lists who is currently logged into the system) 
Metric
Explanation
Description

Login History

The number of times or frequency at which a user has logged into a system during any 30 minute time interval.

Sessions

The number of sessions or number of distinct users who are logged into a system during any 30 minute time interval.