network fault management and it automation training

32
OpManager Performance monitoring, health monitoring, WAN monitoring, workflow automation Network Configuration Manager Configuration management, change management, compliance auditing, user activity tracking OpUtils IP address management, switch port management, rogue detection, MAC IP mapping IT operations management solutions Netflow Analyzer Bandwidth monitoring, forensics, traffic shaping, DPI monitoring Firewall Analyzer Compliance management, firewall policy management, log analysis, network forensic audits

Upload: manageengine-zoho-corporation

Post on 22-Jan-2018

133 views

Category:

Technology


2 download

TRANSCRIPT

Page 1: Network fault management and IT automation training

OpManager

Performance monitoring, health monitoring, WAN

monitoring, workflow automation

Network Configuration Manager

Configuration management, change

management, compliance auditing, user activity

tracking

OpUtils

IP address management, switch port

management, rogue detection, MAC IP

mapping

IT operations management

solutions

Netflow Analyzer

Bandwidth monitoring, forensics, traffic

shaping, DPI monitoring

Firewall Analyzer

Compliance management, firewall

policy management, log analysis,

network forensic audits

Page 2: Network fault management and IT automation training

Welcome to a free OpManager training

session

Page 3: Network fault management and IT automation training

TrainerSuresh Bagavathy

Evangelist for ManageEngine's ITOM suite

10 years' experience

Page 4: Network fault management and IT automation training

Training schedule

Week Module Date Schedule Status

1st Discovery and

classification

October 25th 11:30 am EDT Completed

2nd Network/server

performance

monitoring

November 1st 11:30 am EDT Completed

3rd Effective fault

management and

IT automation

November 8th 11:30 am EST In progress

4th Dashboard and

widgets; business

views; 3D data

center builder;

reports

November 15th 11:30 am EST Upcoming

www.opmanager.com/training.html

Page 5: Network fault management and IT automation training

Week 3

Effective fault management and IT

automation

Page 6: Network fault management and IT automation training

1. How to identify the faults

quickly?

2. How to prioritize the problems?

Page 7: Network fault management and IT automation training

All services are

currently UP

1. How to identify the faults

quickly?

2. How to prioritize the problems?

3. How do you get it resolved

quickly?

Page 8: Network fault management and IT automation training

Agenda

• Alarm severity levels

• Threshold violation alarms

• VMware events

• Event log alarms

• Syslog alarms

• SNMP trap alarms

• Notifications

• Using an IT workflow to automate repetitive or scheduled tasks

• Tips and tricks

• Questions

Page 9: Network fault management and IT automation training

Alarm severity levels

Page 10: Network fault management and IT automation training

Severity Color code

Attention

Trouble

Critical

Service down

Clear

Page 11: Network fault management and IT automation training

Device down

Interface down

Severity: predefined

Page 12: Network fault management and IT automation training

Process down

Service down

URL down

Severity: predefined

Page 13: Network fault management and IT automation training

Event log

Syslog

SNMP trap

Severity: configurable

Page 14: Network fault management and IT automation training

Threshold-based alarms

Page 15: Network fault management and IT automation training

• Configuring threshold values on an individual device

• Configuring consecutive times

• Configuring rearm value to remove cleared faults

• Using device templates to configure thresholds globally based on device

type

Threshold-based alarms

Page 16: Network fault management and IT automation training

VMware events

Page 17: Network fault management and IT automation training

Alarms for inventory changeso vMotion

o Host added/removed

o Host or VMs connected/disconnected

o VMs powered on/off

o VMs orphaned

o Scheduled task removed

o Etc.

VMware events

Page 18: Network fault management and IT automation training

Event log alarms

Page 19: Network fault management and IT automation training

Event log alarms

Prerequisiteso Check if WMI and RPC services are enabled on the Windows servers

o Default WMI ports: 135 & 445, 5000 to 6000 (TCP)

• Configuring event logs for a Windows server in OpManager

• Ignoring a specific event log from a Windows server

• Configuring OpManager to handle event floods

(http://help.opmanager.com/stopping-event-flood)• serverparameters.conf (OpManager/conf/OpManager)

• EVENTS_PER_HOUR 1000

• EVENT_FLOOD_SEVERITY Critical

• Creating a custom event log

Page 20: Network fault management and IT automation training

Syslog alarms

Page 21: Network fault management and IT automation training

Syslog alarms

Prerequisiteso Configure devices to forward syslog events to OpManager's server

o Default ports: 514 & 519 (UDP); configurable

• Creating a syslog rule

• Syslog receiver

• Using facility name, severity, or match text to filter and

clear syslog alarms (regex format)

• Identifying the syslog flow rate from OpManager

• Forwarding OpManager events (as syslogs) or received

syslog messages to another NMS platform

Page 22: Network fault management and IT automation training

SNMP trap alarms

Page 23: Network fault management and IT automation training

Prerequisites o Configure devices to forward SNMP trap events to OpManager's server

o Default port: 162 (UDP); configurable

• Creating an SNMP trap processor rule

• Using the failure component to combine SNMP traps received

from two different OIDs

• Using varbinds to filter and clear SNMP trap alarms

• Loading SNMP traps from vendor MIBs

• Processing unsolicited traps

• Configuring OpManager to handle event floods

• Forwarding OpManager events (as traps) or received SNMP

traps to another NMS platform

SNMP trap alarms

Page 24: Network fault management and IT automation training

Notifications

Page 25: Network fault management and IT automation training

Notification

cycle

Profile type- Send email or SMS

- Send modern SMS

- Run system

command

- Run program

- Log a ticket

- Web alarm

- Syslog

- Trap

Alarm criteria- Device down

- Service down

- Hardware fault

- Threshold violation

- Virtual device fault

- UCS fault

Device selection- Category

- Business view

- Devices

Schedule- All the time

- Selected time window

- Delayed trigger

- Recurring trigger

Preview- Verify inputs

- Add a profile

Page 26: Network fault management and IT automation training

SMS: modem/app SMS/Clickatell/email-based

Log a ticket

Web alarm

Run system command

Run a program

Syslog

SNMP trap

Email

Notification types

Page 27: Network fault management and IT automation training

IT workflow automation

Page 28: Network fault management and IT automation training

• Instant device check

• Test SNMP service

• Export/ Import available templates

o site to siteo https://resources.manageengine.com/resources/forum/opmanager/workflows

IT workflow automation

Create a workflow Associate devices Schedule/trigger tasks

1 2 3

Page 29: Network fault management and IT automation training

Tips and tricks

Page 30: Network fault management and IT automation training

Tips and tricks

• Configure device dependencies to stop polling a dependent

device when its parent device is down

• Suppress known alarms from an individual device

• Configure the downtime scheduler and stop polling devices

during maintenance windows

• Configure alarm escalation and notify the super admin when a

critical alarm is not cleared within a given amount of time

• Unified alarm console

• Widgets for alarms

Page 31: Network fault management and IT automation training

youtube.com/opmanagertechvideos

help.opmanager.com

opmanager-

[email protected]

+1 (888) 720-9500 / +1 (408) 916-

9400

Need more help?

forums.manageengine.com/opmanager

Page 32: Network fault management and IT automation training

www.manageengine.com

THANK YOUSuresh Bagavathy

[email protected]