Free Resources

Back to resources.
Back to web site.

Monitoring Dell PowerEdge Servers with MRTG

4/1/2003 (updated 4/22/2003)

This document describes monitoring Dell PowerEdge specific hardware in both the Windows and Linux environments. Items that can be monitored appear in the Dell OpenManage application, these include various temperatures, fan RPMs, and power supply voltages.

1. Disclaimer / Links
2. Dell SNMP OID Definitions
3. Examples
4. Conclusion


Disclaimer / Links

This document assumes an intermediate familiarity with the MRTG software. For more general information about MRTG, please see the main website: http://people.ee.ethz.ch/~oetiker/webtools/mrtg/ .

Dell SNMP OID Definitions

These will vary depending on the hardware configuration of your Dell server. These examples are from a fairly robust PowerEdge 1600SC. (Dual processor, hot-swap bays, etc.) A lesser 600SC produces near-identical results however.

Temperatures
.1.3.6.1.4.1.674.10892.1.700.20.1.6.1.1 ESM Frt I/O Temp
.1.3.6.1.4.1.674.10892.1.700.20.1.6.1.2 ESM CPU 1 Temp
.1.3.6.1.4.1.674.10892.1.700.20.1.6.1.3 ESM CPU 2 Temp
.1.3.6.1.4.1.674.10892.1.700.20.1.6.1.4 ESM Riser Temp
.1.3.6.1.4.1.674.10892.1.700.20.1.6.1.5 BP Bottom Temp
.1.3.6.1.4.1.674.10892.1.700.20.1.6.1.6 BP Top Temp
Fan RPMs
.1.3.6.1.4.1.674.10892.1.700.12.1.6.1.1 ESM MB Fan1 RPM
.1.3.6.1.4.1.674.10892.1.700.12.1.6.1.2 ESM MB Fan2 RPM
.1.3.6.1.4.1.674.10892.1.700.12.1.6.1.3 ESM MB Fan3 RPM
.1.3.6.1.4.1.674.10892.1.700.12.1.6.1.4 ESM MB Fan4 RPM
.1.3.6.1.4.1.674.10892.1.700.12.1.6.1.5 ESM MB Fan7 RPM
Power System
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.1 ESM MB Bat Volt
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.2 ESM 3.3 FP Volt
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.3 ESM MB 3.3 Volt
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.4 ESM MB 5 Volt
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.5 ESM CPU Volt
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.6 ESM MB +12 Volt
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.7 ESM MB -12 Volt
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.8 ESM MB 2.5 Volt
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.9 ESM GB0 2.5 Volt
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.10 ESM GB1 2.5 Volt
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.11 ESM 5 AUX Volt
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.12 ESM ROMB PK Volt
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.13 ESM GB0 1.8 Volt
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.14 ESM GB1 1.8 Volt
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.15 ESM GB0 1.4 Volt
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.16 ESM GB1 1.4 Volt
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.17 ESM VTT Volt
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.18 BP 5V
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.19 BP 12V
.1.3.6.1.4.1.674.10892.1.600.20.1.6.1.20 BP 3.3V

If you look closely at the OID results of an SNMP dump of a Dell server (or the table above) you'll notice the categorical groupings of the various fields (700 for the temperatures, 600 for the power levels, etc.). Here is a comprehensive explanation of those groupings:

The 10892 MIB definition:
systemManagementSoftwareGroup 100
systemStateGroup 200
chassisInformationGroup 300
operatingSystemGroup 400
systemResourceGroup 500
powerGroup 600
thermalGroup 700
userSecurityGroup 800
remoteFlashBIOSGroup 900
portGroup 1000
deviceGroup 1100
slotGroup 1200
memoryGroup 1300
biosSetUpControlGroup 1400
lraGroup 1500
cooGroup 1600
alertGroup 5000

Examples

As you know, when you're pulling straight SNMP values, it doesn't really matter whether you're running MRTG from Windows NT or from Unix. We run it successfully on both platforms on PowerEdge hardware with no problems. Here are a few sample MRTG configurations based on the above values.

Example #1 - CPU Temperature


`Daily' Graph (5 Minute Average)
day
Max  In: 93.0 °F Average  In: 85.0 °F Current  In: 86.0 °F
Max  Out: 93.0 °F Average  Out: 85.0 °F Current  Out: 86.0 °F

`Monthly' Graph (2 Hour Average)
month
Max  In: 86.0 °F Average  In: 82.0 °F Current  In: 86.0 °F
Max  Out: 86.0 °F Average  Out: 82.0 °F Current  Out: 86.0 °F

MRTG Configuration file:

# CPU 1 Temp
Target[pt1]: ( 1.3.6.1.4.1.674.10892.1.700.20.1.6.1.2&.1.3.6.1.4.1.674.10892.1.700.20.1.6.1.2:public@localhost / 10 ) * 1.8 + 32
XSize[pt1]: 360
YSize[pt1]: 75
Title[pt1]: Processor 1 - Temperature (F)
MaxBytes[pt1]: 212
AbsMax[pt1]: 212
Options[pt1]: growright,gauge,nopercent
PageTop[pt1]: <h1>Processor 1 - Temperature (F)</h1>
YLegend[pt1]: Deg F
ShortLegend[pt1]: &deg;F
Legend1[pt1]: CPU 1 - TEMP
Colours[pt1]: RED#F90000,RED#F90000,RED#F90000,RED#F90000

In this example, we're converting the SNMP result from Celsius * 10 to Farenheight. Note the math syntax around the Target...

Example #2 - Fan RPMs


`Daily' Graph (5 Minute Average)
day
Max  In: 7620.0  rpms Average  In: 7606.0  rpms Current  In: 7560.0  rpms
Max  Out: 7620.0  rpms Average  Out: 7606.0  rpms Current  Out: 7560.0  rpms

`Yearly' Graph (1 Day Average)
year
Max  In: 7620.0  rpms Average  In: 7554.0  rpms Current  In: 7615.0  rpms
Max  Out: 7620.0  rpms Average  Out: 7554.0  rpms Current  Out: 7615.0  rpms

MRTG Configuration File:

# ESM MB Fan2 RPM
Target[fan2]: .1.3.6.1.4.1.674.10892.1.700.12.1.6.1.2&.1.3.6.1.4.1.674.10892.1.700.12.1.6.1.2:public@localhost
XSize[fan2]: 360
YSize[fan2]: 75
Title[fan2]: ESM MB Fan2 RPM
MaxBytes[fan2]: 9120
AbsMax[fan2]: 9660
Options[fan2]: growright,gauge,nopercent
PageTop[fan2]: <h1>ESM MB Fan2 RPM</h1>
YLegend[fan2]: rpms
ShortLegend[fan2]: &nbsp;rpms
Legend1[fan2]: FAN RPMs
Colours[fan2]: DKBLUE#000099,DKBLUE#000099,RED#F90000,RED#F90000

Now, this example is much more straightforward, as we don't have to do any math on the SNMP result. It is the actual RPM number....

 

Conclusion

Again, for more general information about MRTG, please visit the main MRTG site at http://people.ee.ethz.ch/~oetiker/webtools/mrtg/ .

If you have any specific questions regarding this page, feel free to e-mail contact@csma.biz.

 

Copyright (c) 2003, Corey S. McFadden & Associates. All rights reserved.
 For more information, please contact contact@csma.biz. Served via ewrgga-hst01 .