Introduction
Most current day IT infrastructures are heterogeneous environments including a mix of different server hardware and operating systems. Sometimes, administrators might not want to monitor any of the applications executing on these operating systems, but would be interested in knowing how healthy the operating system hosting the application is. To cater to these needs, eG Enterprise offers 100% web-based, integrated moni of heterogeneous IT infrastructures. Administrators can monitor and manage a variety of Unix, Windows, and legacy operating systems from a common console. A novel layer model representation is used to analyze and depict the performance of different protocol layers of the infrastructure – network, operating system, TCP/IP stack, critical application processes and services, etc. By using a common performance model representation across heterogeneous infrastructures, eG Enterprise ensures that administrators are not exposed to the differing nature of each operating system and hence, have a short learning curve.
The monitoring can be done in an agent-based or in an agentless manner, and administrators can pick and choose the servers that have to be monitored with agents (e.g., critical production servers) and those that can be monitored in an agentless manner (e.g., staging servers).
A single agent license suffices to monitor a server and the agent license is transportable across operating systems. Agent-based and agentless monitoring is supported for Microsoft Windows 2000/2003, Sun Solaris, Red Hat Linux, Free BSD, SuSE Linux, HPUX, Tru64, and AIX operating systems. Agentless monitoring is also available for Novell Netware, OpenVMS, and OS/400 operating systems.
The following table summarizes the system monitoring capabilities of the eG Enterprise.
Capability | Metric | Description |
---|---|---|
CPU Monitoring
|
CPU utilization per processor of a server |
|
Run queue length of a server |
Determine how many processes are contending for CPU resources simultaneously |
|
Top 10 CPU consuming processes on a server |
Know which processes are causing a CPU spike on the server |
|
Top 10 servers by CPU utilization |
Know which servers have high CPU utilization, and which ones are under-utilized |
|
Memory Monitoring |
Free memory availability |
|
Swap memory usage |
Determine servers with high swap usage |
|
Top 10 processes consuming memory on the server |
Know which processes are taking up memory on a server |
|
Top 10 servers by memory usage |
Know which servers have the lowest free memory available and hence, may be candidates for memory upgrades |
|
I/O Monitoring |
Blocked processes |
|
Disk activity |
|
|
Disk read/write times |
Monitor disk read and write times to detect instances when a disk is slowing down (Windows only) |
|
Disk queue length |
Track the number of processes queued on each disk drive to determine disk drives that may be responsible for slow downs |
|
Top 10 processes by disk activity |
Determine which processes are causing disk reads/writes |
|
Uptime Monitoring |
Current uptime |
|
Top 10 servers by uptime |
Know which servers have not been rebooted for a long time; |
|
Disk Space Monitoring |
Total capacity |
Know the total capacity of each of the disk partitions of a server |
Free space |
|
|
Page File Usage |
Current usage |
Monitor and alert on page file usage of a Windows server; |
Network Traffic Monitoring |
Bandwidth usage |
|
Outbound queue length |
|
|
Incoming and outgoing traffic |
|
|
Network Monitoring |
Packet loss |
|
Average delay |
Determine the average delay of packets to a server; |
|
Availability |
Determine times when a server is not reachable over the network; |
|
TCP Monitoring |
Current connections |
Track currently established TCP connections to a server; |
Incoming/outgoing TCP connection rate |
Monitor the server workload by tracking the rate of TCP connections to and from a server |
|
TCP retransmissions |
|
|
Process Monitoring |
Processes running |
|
CPU usage |
|
|
Memory usage |
|
|
Threads |
Track the number of threads running for an application’s process (Windows only); |
|
Handles |
|
|
Windows Services Monitoring |
Availability |
Determine if a service is running or not |
Server Log Monitoring |
New events |
|
Security success and failure events |
|
|
Events in /var/adm/messages log |
Track and be alerted of all errors logged in the /var/adm/messages log of a Unix system |
|
Auto-correction |
Automatic restart of failed services |
Determine Windows services that should be running automatically; Monitor if these services are up or not, and restart any failed service automatically |