- 10 IT security companies to watch
- Mobile phone chargers are energy vampires
- Smartphone smackdown: Storm vs. iPhone
- Video game collisions I'd like to see
- Court slams door on sale of spyware
It sounds simple. Instead of spending hours or days troubleshooting an application slowdown or system outage, why not just avoid it to begin with?
Until recently, the only way for IT organizations to resolve problems was to sift through alerts, log files and trouble tickets and burn the midnight oil on conference calls. Today, powerful analytics and automation capabilities built into system management tools can help organizations identify and resolve issues before they become problems.
Interconnected business services have made management exponentially more difficult. Collecting more data isn’t the answer because:
* Monitoring static thresholds triggers a flood of alerts, most of which do not represent actual problems.
* Problems are identified by groups of abnormal behaviors, not a solitary metric.
* With tens of thousands of devices and millions of metrics, the correlation effort required to identify problems is impossible.
This deterministic approach is not only ineffective but also cannot scale to accommodate increasing complexity. Highly complex service infrastructures demand a new approach, a probabilistic approach.
Intelligent system-management solutions now employ sophisticated correlation algorithms to sample subsets of metric data and deliver accurate information about potential system behavior. In addition, new learning technologies continuously refine alert thresholds — providing dynamic thresholds that recognize and accommodate the normal ebbs and flows of business. A probabilistic approach allows organizations to solve problems faster and with far less manual effort.
Intelligent management solutions integrate with existing monitoring infrastructures, automatically collecting and analyzing metrics from across all tiers of an application — such as Web server, application server and database tiers.
The first job for the intelligent management solution is to learn the normal behavior of the application. It should be possible to build behavior models for each resource in your infrastructure by using dynamic thresholding algorithms to continuously collect data. This makes it possible to compare the real-time measurements of metrics with the expected range of values to determine when a metric should trigger a threshold violation.
Partner Content
NetScout and analyst Jim Metzler have teamed to deliver a series of IT Briefs on Network and Application Performance Management leveraging research from NetScout’s nGenius & Sniffer users.
www.netscout.com
Metzler on CIO Priorities
The top five CIO priorities based on a survey of NetScout users revealing CIOs' top priorities and what they think they should be. Also includes interviews with CIOs of large organizations.
Read the Report
Metzler on Application Delivery
How to eliminate the stovepiped or siloed nature of application delivery from both an organization and a technological perspective.
Read the Brief
Metzler on Network Troubleshooting
Overview of network troubleshooting that provides an assessment of where we are, and where we need to be relative to the complexities of today's IT challenges.
Read the Brief
Comment