<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://www.communities.hp.com/online/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>Application Management : advanced analytics</title><link>http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/advanced+analytics/default.aspx</link><description>Tags: advanced analytics</description><dc:language>en</dc:language><generator>CommunityServer 2008.5 SP1 (Build: 31106.3070)</generator><item><title>Fighting or friendly, Problem Isolation and OMi</title><link>http://www.communities.hp.com/online/blogs/mbsmreality/archive/2009/09/22/fighting-or-friendly-problem-isolation-and-omi.aspx</link><pubDate>Tue, 22 Sep 2009 20:27:00 GMT</pubDate><guid isPermaLink="false">964d1d0f-bea0-4201-a2aa-8aa369a35a46:115652</guid><dc:creator>Michael_Procopio</dc:creator><slash:comments>2</slash:comments><wfw:commentRss xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://www.communities.hp.com/online/blogs/mbsmreality/rsscomments.aspx?PostID=115652</wfw:commentRss><comments>http://www.communities.hp.com/online/blogs/mbsmreality/archive/2009/09/22/fighting-or-friendly-problem-isolation-and-omi.aspx#comments</comments><description>&lt;p&gt;by Michael Procopio &lt;/p&gt;
&lt;p&gt;In the post&amp;nbsp; &lt;a href="http://www.communities.hp.com/online/blogs/managementsoftware/archive/2009/09/25/event-correlation-omi-tbec-and-problem-isolation-what-s-the-difference-part-1-of-3.aspx"&gt;&lt;i&gt;&lt;span style="font-family:&amp;#39;MS Shell Dlg 2&amp;#39;;font-size:10pt;color:#000000;"&gt;Event Correlation OMi TBEC and Problem Isolation What&amp;#39;s the Difference&lt;/span&gt;&lt;/i&gt;&lt;/a&gt;, my fellow blogger, Jon Haworth, discussed the differences between TBEC and Problem Isolation. To be consistent, I&amp;#39;ll use the acronyms &lt;b&gt;PI for Problem Isolation&lt;/b&gt; and &lt;b&gt;TBEC to refer to OMi (Operations Manager i series)&amp;nbsp;Topology Based Event Correlation.&lt;/b&gt; &lt;/p&gt;
&lt;p&gt;Briefly, he mentioned that TBEC works &amp;ldquo;bottom up&amp;rdquo;, that is starting &lt;b&gt;from the infrastructure&lt;/b&gt;, with events. PI works &amp;ldquo;top down&amp;rdquo;, that is, starting &lt;b&gt;from an end user experience problem&lt;/b&gt;, primarily with metric (time series) data. &lt;/p&gt;
&lt;p&gt;Jon did an excellent job describing TBEC; I&amp;rsquo;ll do my best on PI because like Jon I have a conscience to settle. &lt;/p&gt;
&lt;p&gt;Problem Isolation is a tool to: &lt;/p&gt;
&lt;p&gt;1. automate the steps a troubleshooter would go through &lt;/p&gt;
&lt;p&gt;2. run additional tests that might uncover the problem &lt;/p&gt;
&lt;p&gt;3. look at all metric/performance data from the end user experience monitoring and all the infrastructure it depends &lt;/p&gt;
&lt;p&gt;4. find the infrastructure metric the most closely matches the end user problem using behavior learning and regression analysis techniques (developed by HP Labs) &lt;/p&gt;
&lt;p&gt;5. bring additional data such as events, help/service desk tickets and changes to the troubleshooter &lt;/p&gt;
&lt;p&gt;6. allow the troubleshooter to execute Run books to potentially solve the problem &lt;/p&gt;
&lt;p&gt;Potentially the biggest difference in the underlying technology is that Problem Isolation does not require any correlation rules or thresholds to be set for it to do the regression analysis to point to the problem. Like TBEC, it does require that an application be modeled in a &lt;a href="http://en.wikipedia.org/wiki/CMDB"&gt;&lt;span style="color:#b85b5a;"&gt;CMDB&lt;/span&gt;&lt;/a&gt;. &lt;/p&gt;
&lt;p&gt;An example: Presume a situation with a typical composite application - web server, application server and database. No infrastructure thresholds were violated; therefore, there are no infrastructure alerts. Again, as mentioned in the previous post, end user monitoring (EUM) is the back stop. EUM alerts on slow end user performance, now what? &lt;/p&gt;
&lt;p&gt;Here is what Problem Isolation does: &lt;/p&gt;
&lt;p&gt;1. determines which infrastructure elements (ITIL configurations items or CIs) support the transaction &lt;/p&gt;
&lt;p&gt;2. reruns the test(s) that caused the alert &amp;ndash; this validates it is not transient problem &lt;/p&gt;
&lt;p&gt;3. runs any additional tests defined for the CIs &lt;/p&gt;
&lt;p&gt;4. collects Service Level Agreement information &lt;/p&gt;
&lt;p&gt;5. collects all available infrastructure performance metrics (web server, application server, database server and operating systems for each) and compares them to the EUM data using behavior and regression analysis &lt;/p&gt;
&lt;p&gt;&lt;a&gt;&lt;/a&gt;&lt;a href="http://www.communities.hp.com/online/cfs-file.ashx/__key/CommunityServer.Blogs.Components.WeblogFiles/mbsmreality/PI_2D00_event_2D00_correlation_2D00_500.jpg"&gt;&lt;/a&gt;&lt;a href="http://www.communities.hp.com/online/cfs-file.ashx/__key/CommunityServer.Blogs.Components.WeblogFiles/mbsmreality/PI_2D00_metric_2D00_correlation_2D00_500.jpg"&gt;&lt;img src="http://www.communities.hp.com/online/resized-image.ashx/__size/550x0/__key/CommunityServer.Blogs.Components.WeblogFiles/mbsmreality/PI_2D00_metric_2D00_correlation_2D00_500.jpg" border="0" alt="" /&gt;&lt;/a&gt; &lt;/p&gt;
&lt;p&gt;&lt;i&gt;Problem Isolation screen show performance correlation between end user response and SQL Server database locks&lt;/i&gt; &lt;/p&gt;
&lt;p&gt;&lt;i&gt;-------------------------------------------------------------------------------------------&lt;/i&gt;&lt;/p&gt;
&lt;p&gt;6. determines and displays the most probable suspect CI and alternates &lt;/p&gt;
&lt;p&gt;7. displays run books available for all infrastructure CIs for the PI user to run directly from the tool &lt;/p&gt;
&lt;p&gt;8. allows the PI user to attach all the information to a service ticket, either existing or create a new one &lt;/p&gt;
&lt;p&gt;Another key differentiator of OMi/TBEC and PI is the target user. There is such a wide variance in how organizations work that it is hard to name the role but let me do a brief description and I think will be able to determine the title in your organization. &lt;/p&gt;
&lt;p&gt;There are some folks in the organization whose job is to take a quick look (typically &amp;lt; 10 minutes, in one organization I interviewed &amp;lt; 1 minute) at a situation and determine if they have explicit instructions on what to do via scripts or run books. When they have no instructions for a situation they pass it on to someone who has a bit more experience and does some free form triage. &lt;/p&gt;
&lt;p&gt;This person might be able to fix the problem or may have to pass it on to a subject matter expert, for example if they believe it is an MS Exchange problem to an Exchange admin. It is this second person that Problem Isolation is targeted at. This is helping automate her job, reducing what might take tens of minutes to hours and performing it in seconds. If it ends up she can&amp;rsquo;t solve the problem it automatically provides full documentation of all information collected. That alone might take someone five minutes to write-up. &lt;/p&gt;
&lt;p&gt;OMi&amp;rsquo;s target is the operations bridge console user. Ops Bridge operators tend to be lower skilled and face hundreds if not thousands of events per hour. Jon described how OMi helps them work smarter. &lt;/p&gt;
&lt;p&gt;TBEC and Problem Isolation both work to find the root cause of an incident but in different ways. Much like a doctor might use an MRI or CAT scan to diagnose a patient based on what the situation is, TBEC and Problem Isolation are complementary tools each with unique capabilities. &lt;/p&gt;
&lt;p&gt;Problem Isolation will not find problems in redundant infrastructure that OMi will. Conversely, OMi can&amp;rsquo;t help with EUM problems when no events are triggered, where Problem Isolation will. &lt;/p&gt;
&lt;p&gt;We know this can be a confusing area. We welcome your questions to help us do a better job of describing the difference. But these two are definitely friendly. &lt;/p&gt;
&lt;p&gt;For &lt;i&gt;&lt;a href="http://www.communities.hp.com/online/controlpanel/blogs/In%20the%20post%20%3Cxyz%3E,%20my%20fellow%20blogger,%20Jon%20Haworth,%20discussed%20the%20differences%20between%20TBEC%20and%20Problem%20Isolation.%20To%20be%20consistent,%20I&amp;#39;ll%20use%20the%20acronyms%20PI%20for%20Problem%20Isolation%20and%20TBEC%20to%20refer%20to%20OMi%20Topology%20Based%20Event%20Correlation."&gt;&lt;span style="color:#0066cc;"&gt;Business Availability Center&lt;/span&gt;&lt;/a&gt;&lt;/i&gt;, &lt;i&gt;&lt;a href="http://www.communities.hp.com/online/members/Michael_5F00_Procopio/default.aspx"&gt;&lt;span style="color:#b85b5a;"&gt;Michael Procopio&lt;/span&gt;&lt;/a&gt;&lt;/i&gt; &lt;/p&gt;
&lt;p&gt;Get the latest updates on our Twitter feed @HPITOps &lt;a href="http://twitter.com/HPITOps"&gt;&lt;span style="color:#0066cc;"&gt;http://twitter.com/HPITOps&lt;/span&gt;&lt;/a&gt; &lt;/p&gt;
&lt;p&gt;Join the &lt;a href="http://www.linkedin.com/groups?gid=65439&amp;amp;trk=myg_ugrp_ovr"&gt;&lt;span style="color:#b85b5a;"&gt;HP Software group on LinkedIn&lt;/span&gt;&lt;/a&gt;&amp;nbsp;and/or the&amp;nbsp;&lt;a href="http://www.linkedin.com/groups?gid=848997&amp;amp;trk=myg_ugrp_ovr"&gt;&lt;span style="color:#b85b5a;"&gt;Business Availability Center group on LinkedIn&lt;/span&gt;&lt;/a&gt;. &lt;/p&gt;
&lt;p&gt;&lt;b&gt;Related Items&lt;/b&gt; &lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;a href="http://www.communities.hp.com/online/blogs/mbsmreality/archive/2009/04/15/advanced-analytics-reduces-downtime-costs-detection.aspx"&gt;&lt;span style="color:#0066cc;"&gt;Advanced analytics reduces downtime costs - detection&lt;/span&gt;&lt;/a&gt; &lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.communities.hp.com/online/blogs/mbsmreality/archive/2009/05/08/advanced-analytics-reduces-downtime-costs-isolation.aspx"&gt;&lt;span style="color:#0066cc;"&gt;Advanced analytics reduces downtime costs &amp;ndash; isolation&lt;/span&gt;&lt;/a&gt; &lt;/li&gt;
&lt;li&gt;&lt;a href="https://h10078.www1.hp.com/cda/hpms/display/main/hpms_content.jsp?zn=bto&amp;amp;cp=1-11-15-25%5E924_4000_100__"&gt;&lt;span style="color:#0066cc;"&gt;Problem Isolation page&lt;/span&gt;&lt;/a&gt; &lt;/li&gt;
&lt;li&gt;&lt;a href="https://h10078.www1.hp.com/cda/hpms/display/main/hpms_content.jsp?zn=bto&amp;amp;cp=1-11-15-28%5E37673_4000_100__"&gt;&lt;span style="color:#0066cc;"&gt;Operations Manager i page&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;
&lt;/ol&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://www.communities.hp.com/online/aggbug.aspx?PostID=115652" width="1" height="1"&gt;</description><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/IT+operations/default.aspx">IT operations</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/Application+Management/default.aspx">Application Management</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/infrastructure+management/default.aspx">infrastructure management</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/application+performance+management/default.aspx">application performance management</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/User+Experience+Monitoring/default.aspx">User Experience Monitoring</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/Michael+Procopio/default.aspx">Michael Procopio</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/Problem+Isolation/default.aspx">Problem Isolation</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/advanced+analytics/default.aspx">advanced analytics</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/Business+Availability+Center/default.aspx">Business Availability Center</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/operations+manager+i+series/default.aspx">operations manager i series</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/OMi/default.aspx">OMi</category></item><item><title>HP Software Universe – Mainstage Andy Isherwood</title><link>http://www.communities.hp.com/online/blogs/mbsmreality/archive/2009/06/17/hp-software-universe-mainstage-andy-isherwood.aspx</link><pubDate>Wed, 17 Jun 2009 03:57:00 GMT</pubDate><guid isPermaLink="false">964d1d0f-bea0-4201-a2aa-8aa369a35a46:92325</guid><dc:creator>Michael_Procopio</dc:creator><slash:comments>0</slash:comments><wfw:commentRss xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://www.communities.hp.com/online/blogs/mbsmreality/rsscomments.aspx?PostID=92325</wfw:commentRss><comments>http://www.communities.hp.com/online/blogs/mbsmreality/archive/2009/06/17/hp-software-universe-mainstage-andy-isherwood.aspx#comments</comments><description>&lt;p&gt;&lt;a href="http://www.communities.hp.com/online/cfs-file.ashx/__key/CommunityServer.Blogs.Components.WeblogFiles/mbsmreality/mainstage_2D00_Andy_2D00_0751.jpg"&gt;&lt;img src="http://www.communities.hp.com/online/resized-image.ashx/__size/550x0/__key/CommunityServer.Blogs.Components.WeblogFiles/mbsmreality/mainstage_2D00_Andy_2D00_0751.jpg" border="0" alt="" /&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt;Andy Isherwood VP, Support &amp;amp; Services 
kicked off Mainstage. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt;There are four key areas shown in the picture above. HP announced this week its&amp;nbsp;&lt;i&gt; &lt;a href="http://www.hp.com/hpinfo/newsroom/press/2009/090616xa.html?mtxs=rss-corp-news"&gt;IT Financial Management&lt;/a&gt;&lt;/i&gt; offering. Andy likened ITFM to an ERP system for IT. &lt;a href="http://www.information-management.com/news/hp_new_solution_financial_management_transparency-10015607-1.html"&gt;&lt;i&gt;Information Management magazine&lt;/i&gt;&lt;/a&gt; wrote an article on HP ITFM.&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt;HP has had offerings in IT Performance Analytics and IT Resource Optimization for awhile. HP Cloud Assure was announced was announced in May 2009, &lt;/span&gt;&lt;a href="http://www.hp.com/hpinfo/newsroom/press/2009/090331xa.html" class="l"&gt;&lt;i&gt;HP&lt;/i&gt; Unveils &amp;ldquo;&lt;i&gt;Cloud Assure&lt;/i&gt;&amp;rdquo; to Drive Business Adoption of &lt;i&gt;Cloud&lt;/i&gt; Services&lt;/a&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt;Some key points from his opening remarks:&lt;/span&gt; 
&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt;Prepare for coming out of the 
recession when cutting costs and innovating. &lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt;Best in class means being good at all 
four - aligning to the business, taking out costs, increasing efficiency and 
consolidation. &lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt;Jetblue, Altec and T-Mobile were the 
winners of the HP Software&amp;nbsp;Award of Excellence. &lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt;As an example of the quick ROI 
companies can get, Altec produced 10% application downtime reduction, 20% faster 
response time, 15% increase in customer satisfaction and a 300% improve 
application transaction time in 6 months. &lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt;Last year we were HP Software, this 
year HP Software and Solutions. This was the combining HP Software with HP 
Consulting and Integration. The net result increased our delivery options. In 
addition to offering software for in-house use, HP now has EDS, SaaS and 
continues with it Partners &lt;/span&gt;
&lt;/li&gt;
&lt;li&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt;HP SaaS business is seven years old 
this year and has 650 customers. &lt;/span&gt;&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt;You can read other coverage of HP Software Universe in the &lt;i&gt;&lt;a href="http://hp.com/go/itopsblog"&gt;ITOpsBlog&lt;/a&gt;&lt;/i&gt;. There are a variety of Twitter accounts 
you can follow: &lt;br /&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://twitter.com/hpitops" target="_blank"&gt;&lt;span style="font-size:small;"&gt;&lt;span style="font-family:Bookman Old Style;"&gt;&lt;i&gt;HPITOps&lt;/i&gt; &lt;/span&gt;&lt;/span&gt;&lt;/a&gt;&lt;span style="font-size:small;"&gt;&lt;span style="font-family:Bookman Old Style;"&gt;&lt;i&gt;&amp;nbsp;&lt;/i&gt;&amp;ndash; Covers BSM, ITFM, ITSM, Operations and 
Network Management&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://twitter.com/hpsu09" target="_blank"&gt;&lt;i&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt;HPSU09&lt;/span&gt;&lt;/i&gt;&lt;/a&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt; &lt;i&gt;&amp;nbsp;&lt;/i&gt;&amp;ndash; show logistics and other 
information&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://twitter.com/HPSoftwareCTO" target="_blank"&gt;&lt;span style="font-size:small;"&gt;&lt;span style="font-family:Bookman Old Style;"&gt;&lt;i&gt;HPSoftwareCTO&lt;/i&gt; &lt;/span&gt;&lt;/span&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://twitter.com/informationCTO" target="_blank"&gt;&lt;i&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt;informationCTO&lt;/span&gt;&lt;/i&gt;&lt;/a&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://twitter.com/hpsoftware" target="_blank"&gt;&lt;i&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt;HPSoftware&lt;/span&gt;&lt;/i&gt;&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://twitter.com/BTOCMO" target="_blank"&gt;&lt;i&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt;BTOCMO&lt;/span&gt;&lt;/i&gt;&lt;/a&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt; &amp;ndash; HP BTO Chief Marketing Officer &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family:Bookman Old Style;font-size:small;"&gt;as well as the Twitter hashtag #HPSU09&lt;/span&gt;&lt;span style="font-size:small;"&gt;&lt;span style="font-family:Bookman Old Style;"&gt; &lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;"&gt;&lt;span style="font-family:Bookman Old Style;"&gt;For &lt;/span&gt;&lt;/span&gt;&lt;a href="http://hp.com/go/BSM" target="_blank"&gt;&lt;i&gt;&lt;span style="font-size:small;"&gt;&lt;span style="font-family:Bookman Old Style;"&gt;HP BSM&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;/a&gt;&lt;span style="font-size:small;"&gt;&lt;span style="font-family:Bookman Old Style;"&gt;, Michael Procopio&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://www.communities.hp.com/online/aggbug.aspx?PostID=92325" width="1" height="1"&gt;</description><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/IT+operations/default.aspx">IT operations</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/IT+efficiency/default.aspx">IT efficiency</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/Cloud+Assure/default.aspx">Cloud Assure</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/announcment/default.aspx">announcment</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/cloud+computing/default.aspx">cloud computing</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/Michael+Procopio/default.aspx">Michael Procopio</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/SaaS/default.aspx">SaaS</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/advanced+analytics/default.aspx">advanced analytics</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/recession/default.aspx">recession</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/HPSU09/default.aspx">HPSU09</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/Las+Vegas/default.aspx">Las Vegas</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/IT+financial+management/default.aspx">IT financial management</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/Andy+Isherwood/default.aspx">Andy Isherwood</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/ITFM/default.aspx">ITFM</category></item><item><title>Advanced analytics reduces downtime costs – isolation</title><link>http://www.communities.hp.com/online/blogs/mbsmreality/archive/2009/05/08/advanced-analytics-reduces-downtime-costs-isolation.aspx</link><pubDate>Fri, 08 May 2009 15:40:00 GMT</pubDate><guid isPermaLink="false">964d1d0f-bea0-4201-a2aa-8aa369a35a46:89512</guid><dc:creator>Michael_Procopio</dc:creator><slash:comments>0</slash:comments><wfw:commentRss xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://www.communities.hp.com/online/blogs/mbsmreality/rsscomments.aspx?PostID=89512</wfw:commentRss><comments>http://www.communities.hp.com/online/blogs/mbsmreality/archive/2009/05/08/advanced-analytics-reduces-downtime-costs-isolation.aspx#comments</comments><description>&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;by Michael Procopio, Product Manager, BAC&amp;nbsp; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;&lt;img align="left" width="114" src="http://lh6.ggpht.com/__Vfb7MLSLnY/ShwF9NNJJ8I/AAAAAAAAAhk/V5Bb1N06Cnc/s400/Mjp-crop_1689.JPG" height="145" style="display:inline;margin:9px;" alt="" /&gt; In the world of advanced analytics, two areas that are of interest to the IT management world are:&amp;nbsp; detection of a problem and isolation of a problem. Previously I wrote &lt;/span&gt;&lt;a href="http://www.communities.hp.com/online/blogs/mbsmreality/archive/2009/04/15/advanced-analytics-reduces-downtime-costs-detection.aspx"&gt;&lt;span style="font-size:small;color:#669966;font-family:Bookman Old Style;"&gt;Advanced analytics reduces downtime costs &amp;ndash; detection&lt;/span&gt;&lt;/a&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;; in this post I&amp;rsquo;ll cover isolation.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;In the previous post, I covered how advanced analytics finds an anomaly, potentially before a threshold is crossed. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;Problem Isolation is the process of determining which component in the infrastructure is causing the &lt;/span&gt;&lt;a href="http://en.wikipedia.org/wiki/ITIL#Problem_Management"&gt;&lt;span style="font-size:small;color:#669966;font-family:Bookman Old Style;"&gt;problem&lt;/span&gt;&lt;/a&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;* or &lt;/span&gt;&lt;a href="http://en.wikipedia.org/wiki/ITIL#Incident_Management"&gt;&lt;span style="font-size:small;color:#669966;font-family:Bookman Old Style;"&gt;incident&lt;/span&gt;&lt;/a&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;* that we found. We will presume we are monitoring the service that is having the issue.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;If one &lt;strong&gt;had no management tools&lt;/strong&gt; (amazingly I have spoken to customers in this situation) the method of trying to find a problem is to login to each system, router, switch and potentially application (ex: Oracle) look at the items with whatever tools are available (ex: Windows Perfmon)and hopefully you find it. If you are interested in advanced analytics, this is probably not your situation.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;The more typical case is &lt;strong&gt;you have multiple management tools&lt;/strong&gt;, network, system, virtualization, database and perhaps others. So if&amp;nbsp; you know the domain the problem exists in you have a good place to start. I&amp;rsquo;ve listened to podcasts / read reports which bring up few problems with this: (if you know of any good IT podcasts please send them along) &lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;~80% of problems are sent to the network team with only ~20% being network issues &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;~60% of problems take &amp;gt;10 experts to resolve &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;~80% of the time to restore service is spent isolating the problem &lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;Here is an analogy I use with my &lt;strong&gt;non IT friends &lt;/strong&gt;on why this area is needed. You are monitoring the speed of a car going across the country (pick your favorite country). You are separately monitoring the infrastructure, all:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;roads &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;bridges &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;ferries to take cars across the water &lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;What &lt;strong&gt;you don&amp;rsquo;t know is where the car&lt;/strong&gt; is (old car, no GPS). You are getting many alerts from the roads, bridges and ferries. Which one is affecting the car? Since you don&amp;rsquo;t know what road the car is on you don&amp;rsquo;t know if any given alert is the one affecting your car.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;This is where the &lt;/span&gt;&lt;a href="https://h10078.www1.hp.com/cda/hpms/display/main/hpms_content.jsp?zn=bto&amp;amp;cp=1-11%5E28601_4000_100__"&gt;&lt;span style="font-size:small;color:#669966;font-family:Bookman Old Style;"&gt;CDMB&lt;/span&gt;&lt;/a&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt; comes into the isolation process. The CMDB has the route the car is taking or, in our case, the items in the IT infrastructure that make up the service that has the problem.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;Part one of the isolation process is to restrict what we are looking at to the relevant IT items. This greatly reduces the computational power required. For example, one customer I recently visited told me he has 2000+ servers. If we can reduce that to a few app servers and a few database servers (isn&amp;rsquo;t SOA wonderful for we operations types) that is a factor of ~200 reduction.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;Part two of the isolation is the heavy math from &lt;/span&gt;&lt;a href="http://www.hpl.hp.com/"&gt;&lt;span style="font-size:small;color:#669966;font-family:Bookman Old Style;"&gt;HP Labs&lt;/span&gt;&lt;/a&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;, with more patent filings.&amp;nbsp; It is a form of &lt;/span&gt;&lt;a href="http://en.wikipedia.org/wiki/Regression_analysis"&gt;&lt;span style="font-size:small;color:#669966;font-family:Bookman Old Style;"&gt;regression analysis&lt;/span&gt;&lt;/a&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;, where application or end user response time monitoring is the dependent variable and all the infrastructure metrics are independent variables. In plain terms, if end user response gets worse find the infrastructure metrics that get worse. When end user response gets better find the metrics that get better. The more closely an infrastructure metric tracks the end user response the more likely it is to be the cause.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;Again, while the math is interesting, pictures work better for me. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;a href="http://www.communities.hp.com/online/cfs-file.ashx/__key/CommunityServer.Blogs.Components.WeblogFiles/mbsmreality/PI_2D00_correlation.jpg"&gt;&lt;img border="0" src="http://www.communities.hp.com/online/resized-image.ashx/__size/550x0/__key/CommunityServer.Blogs.Components.WeblogFiles/mbsmreality/PI_2D00_correlation.jpg" alt="" /&gt;&lt;/a&gt; &lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;The thick grey line is the end user response, the red-purple line is the most closely correlated metric -- in this case a database metric. Just so you don&amp;rsquo;t have to strain your eyes we provide a table like this (from a different problem) showing the weighted correlations score.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;&amp;nbsp;&lt;a href="http://www.communities.hp.com/online/cfs-file.ashx/__key/CommunityServer.Blogs.Components.WeblogFiles/mbsmreality/pi_2D00_suspects.jpg"&gt;&lt;img border="0" src="http://www.communities.hp.com/online/resized-image.ashx/__size/550x0/__key/CommunityServer.Blogs.Components.WeblogFiles/mbsmreality/pi_2D00_suspects.jpg" alt="" /&gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;Isolation part 3 is to include non-time series data. In the screen capture below you see planned changes and incident details (think alerts) on the timeline. Unplanned changes can also be displayed. Changes are pulled from the CMDB and incidents can come from any management system that can send alerts. And since we know that most problems occur from changes that is an important component. Finally tickets from the helpdesk are included on the timeline, for the case where users are doing the monitoring.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;&amp;nbsp;&lt;a href="http://www.communities.hp.com/online/cfs-file.ashx/__key/CommunityServer.Blogs.Components.WeblogFiles/mbsmreality/pi_2D00_change.jpg"&gt;&lt;img border="0" src="http://www.communities.hp.com/online/resized-image.ashx/__size/550x0/__key/CommunityServer.Blogs.Components.WeblogFiles/mbsmreality/pi_2D00_change.jpg" alt="" /&gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;All together this automates a number of things the operations teams already do and some math help isolating problems.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Bookman Old Style;"&gt;*Incident and problem are ITIL terms. There may be many incidents that are symptoms of an underlying problem.&lt;/span&gt; &lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&lt;img border="0" width="16" src="http://lh6.ggpht.com/_xn2gmPb9TfM/Sb_fZkjAxpI/AAAAAAAAD3E/_9xpsQgFfTg/s128/twitter-16x16.png" height="16" alt="" /&gt; &lt;a href="http://twitter.com/home/?status=%20Liked%20"&gt;tweet this!&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size:small;font-family:Arial Black;"&gt;Related Items&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://h10078.www1.hp.com/cda/hpdc/navigation.do?action=downloadPDF&amp;amp;caid=24533&amp;amp;cp=54_4000_100&amp;amp;zn=bto&amp;amp;filename=4AA1-6949ENW.pdf"&gt;Finding the needle in a million haystacks: best practices for IT problem isolation white paper (0.15MB, PDF)&lt;/a&gt; &lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;a href="http://www.hpl.hp.com/techreports/2006/HPL-2006-160R1.pdf"&gt;Achieving Scalable Automated Diagnosis of Distributed Systems Performance Problems&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;a href="http://www.hpl.hp.com/research/slic/publications.html"&gt;HP Labs whitepapers&lt;/a&gt; (heavy math)&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;a href="https://h10078.www1.hp.com/cda/hpms/display/main/hpms_content.jsp?zn=bto&amp;amp;cp=1-11-15-25^924_4000_100__"&gt;Problem Isolation web page&lt;/a&gt; &lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Since I asked for podcasts here are some I listen too:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="http://www.networkworld.com/podcasts/"&gt;Network World Podcasts&lt;/a&gt; &amp;ndash; this points to their podcast page with many podcasts. &lt;/li&gt;
&lt;li&gt;&lt;a href="http://rss.conversationsnetwork.org/series/technometria.xml"&gt;Technometria with Phil Windley&lt;/a&gt; &lt;/li&gt;
&lt;li&gt;&lt;a href="http://rss.conversationsnetwork.org/series/ieee.xml"&gt;IEEE Spectrum Radio&lt;/a&gt; &lt;/li&gt;
&lt;/ul&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://www.communities.hp.com/online/aggbug.aspx?PostID=89512" width="1" height="1"&gt;</description><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/BSM/default.aspx">BSM</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/infrastructure+management/default.aspx">infrastructure management</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/Business+Service+Management/default.aspx">Business Service Management</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/application+performance+management/default.aspx">application performance management</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/Michael+Procopio/default.aspx">Michael Procopio</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/Problem+Isolation/default.aspx">Problem Isolation</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/advanced+analytics/default.aspx">advanced analytics</category></item><item><title>Advanced analytics reduces downtime costs - detection</title><link>http://www.communities.hp.com/online/blogs/mbsmreality/archive/2009/04/15/advanced-analytics-reduces-downtime-costs-detection.aspx</link><pubDate>Wed, 15 Apr 2009 18:01:00 GMT</pubDate><guid isPermaLink="false">964d1d0f-bea0-4201-a2aa-8aa369a35a46:88984</guid><dc:creator>Michael_Procopio</dc:creator><slash:comments>0</slash:comments><wfw:commentRss xmlns:wfw="http://wellformedweb.org/CommentAPI/">http://www.communities.hp.com/online/blogs/mbsmreality/rsscomments.aspx?PostID=88984</wfw:commentRss><comments>http://www.communities.hp.com/online/blogs/mbsmreality/archive/2009/04/15/advanced-analytics-reduces-downtime-costs-detection.aspx#comments</comments><description>&lt;p style="MARGIN:0in 0in 10pt;" class="MsoNormal"&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt;by Michael Procopio&amp;nbsp;&lt;/span&gt;&lt;/p&gt;
&lt;p style="MARGIN:0in 0in 10pt;" class="MsoNormal"&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt;In the world of advanced analytics, two areas that are of interest to the IT management world are: detection of a problem and isolation of a problem. In this post I&amp;#39;ll cover detection.&lt;/span&gt;&lt;/p&gt;
&lt;p style="MARGIN:0in 0in 10pt;" class="MsoNormal"&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt;Problem detection, typically called anomaly detection, in the analytics circles started in a very basic way. Take a metric, say CPU utilization, set a threshold for it and anytime the threshold is crossed, we have an anomaly. &lt;/span&gt;&lt;/p&gt;
&lt;p style="MARGIN:0in 0in 10pt;" class="MsoNormal"&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt;This, of course, has many problems:&lt;/span&gt;&lt;/p&gt;
&lt;p style="MARGIN:0in 0in 0pt 0.5in;TEXT-INDENT:-0.25in;mso-list:l0 level1 lfo1;" class="MsoListParagraphCxSpFirst"&gt;&lt;span style="font-family:book antiqua,palatino;"&gt;&lt;span style="font-size:small;"&gt;&lt;span style="FONT-FAMILY:Symbol;mso-fareast-font-family:Symbol;mso-bidi-font-family:Symbol;"&gt;&lt;span style="mso-list:Ignore;"&gt;&amp;middot;&lt;span style="FONT:7pt &amp;#39;Times New Roman&amp;#39;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;How do I know where to set a threshold&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p style="MARGIN:0in 0in 0pt 0.5in;TEXT-INDENT:-0.25in;mso-list:l0 level1 lfo1;" class="MsoListParagraphCxSpMiddle"&gt;&lt;span style="font-family:book antiqua,palatino;"&gt;&lt;span style="font-size:small;"&gt;&lt;span style="FONT-FAMILY:Symbol;mso-fareast-font-family:Symbol;mso-bidi-font-family:Symbol;"&gt;&lt;span style="mso-list:Ignore;"&gt;&amp;middot;&lt;span style="FONT:7pt &amp;#39;Times New Roman&amp;#39;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;The right level may be different at different times&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p style="MARGIN:0in 0in 10pt 0.5in;TEXT-INDENT:-0.25in;mso-list:l0 level1 lfo1;" class="MsoListParagraphCxSpLast"&gt;&lt;span style="font-family:book antiqua,palatino;"&gt;&lt;span style="font-size:small;"&gt;&lt;span style="FONT-FAMILY:Symbol;mso-fareast-font-family:Symbol;mso-bidi-font-family:Symbol;"&gt;&lt;span style="mso-list:Ignore;"&gt;&amp;middot;&lt;span style="FONT:7pt &amp;#39;Times New Roman&amp;#39;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;If there is a one sample spike above the threshold is that really an anomaly I care about (for some it is but not for most in my experience)&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p style="MARGIN:0in 0in 10pt;" class="MsoNormal"&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt;The next step in setting thresholds was using &lt;/span&gt;&lt;a href="http://en.wikipedia.org/wiki/Standard_deviation"&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt;standard deviation&lt;/span&gt;&lt;/a&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt; (STD). I will create a sleeve of upper and lower bounds that cover a major percentage of the situations (+/- 1 STD covers 68.2%) I have measured and use that. This has some of the same problems as above. However, let&amp;rsquo;s focus on the time period problem.&lt;/span&gt;&lt;/p&gt;
&lt;p style="MARGIN:0in 0in 10pt;" class="MsoNormal"&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt;The next step is to set thresholds by time of day. With this added capability I can set a reasonable threshold for the typical 10am and 2pm peak traffic periods separately and alerts still come if there is unusual behavior at 8am. This quickly leads to &amp;ldquo;my Mondays are busier than most of my other days&amp;rdquo;. To avoid false&amp;nbsp;alerts, this leads us to time of day and day of week where we keep the standard deviation for each of the 168 hours of the week.&lt;/span&gt;&lt;/p&gt;
&lt;p style="MARGIN:0in 0in 10pt;" class="MsoNormal"&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt;The next problem, the end of quarter booking and shipping madness or it is black Friday (largest shopping day of the year) and we realize we need to add in a seasonally adjusted set of thresholds as well. And something that seasonality can&amp;#39;t take into account are macro events such as a weak economy affecting purchasing.&lt;/span&gt;&lt;/p&gt;
&lt;p style="MARGIN:0in 0in 10pt;" class="MsoNormal"&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt;Of course, none of these will take into account the spikes mentioned above. How to solve all these problems -- hmmm.&lt;/span&gt;&lt;/p&gt;
&lt;p style="MARGIN:0in 0in 0pt;LINE-HEIGHT:normal;mso-layout-grid-align:none;" class="MsoNormal"&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt;There is a completely different approach, for which &lt;/span&gt;&lt;a href="http://hplabs.com/"&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt;HP Labs&lt;/span&gt;&lt;/a&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt; has a patent filing, which uses more sophisticated machine learning. Like the other approaches, it breaks time up into segments, which, in the paper &lt;/span&gt;&lt;a href="http://www.hpl.hp.com/techreports/2006/HPL-2006-160R1.pdf"&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt;Achieving Scalable Automated Diagnosis of Distributed Systems Performance Problems&lt;/span&gt;&lt;/a&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt;, are called epochs. Unlike the other approaches, it does not simply compare now to a predetermined set of threshold levels.&lt;/span&gt;&lt;/p&gt;
&lt;p style="MARGIN:0in 0in 0pt;LINE-HEIGHT:normal;mso-layout-grid-align:none;" class="MsoNormal"&gt;&lt;span style="mso-spacerun:yes;"&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt;&lt;/span&gt;&lt;/span&gt;&amp;nbsp;&lt;/p&gt;
&lt;p style="MARGIN:0in 0in 10pt;" class="MsoNormal"&gt;&lt;span style="font-size:x-small;font-family:Verdana;"&gt;&lt;span style="font-family:verdana,geneva;"&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt;This method compares &amp;lsquo;now&amp;rsquo; to recent epochs as well as previous learning and makes a determination of what behavior is good and bad. While a spike is bad, if the epoch is behaving well overall it is considered to have good behavior. There is a lot of math behind this but I find looking a picture much more obvious&lt;/span&gt;.&lt;/span&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;img border="0" src="http://lh6.ggpht.com/__Vfb7MLSLnY/ShwHKW_sadI/AAAAAAAAAhs/LsWDVKmyJbc/pi-anomaly-detection.jpg" style="max-width:550px;" alt="" /&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p style="MARGIN:0in 0in 10pt;" class="MsoNormal"&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt;Notice the gray outline the looks like a city skyline. This is what the algorithm has determine is &amp;lsquo;normal&amp;rsquo; or good behavior. The hatch lines on the right show where it found an anomaly.&lt;/span&gt;&lt;/p&gt;
&lt;p style="MARGIN:0in 0in 10pt;" class="MsoNormal"&gt;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt;These advanced algorithms are implemented in &lt;a href="https://h10078.www1.hp.com/cda/hpms/display/main/hpms_content.jsp?zn=bto&amp;amp;cp=1-11-15-25^924_4000_100__"&gt;HP Problem Isolation&lt;/a&gt;. In the next post, I&amp;rsquo;ll discuss how analytics are&amp;nbsp;used to find the source of the problem.&lt;/span&gt;&lt;/p&gt;
&lt;p style="MARGIN:0in 0in 10pt;" class="MsoNormal"&gt;&lt;span style="font-size:small;font-family:Book Antiqua;"&gt;&lt;img border="0" width="16" src="http://lh6.ggpht.com/_xn2gmPb9TfM/Sb_fZkjAxpI/AAAAAAAAD3E/_9xpsQgFfTg/s128/twitter-16x16.png" height="16" alt="" /&gt;&amp;nbsp;&lt;span style="font-family:arial,helvetica,sans-serif;"&gt;&lt;a target="_blank" href="http://twitter.com/home/?status=%20I&amp;#39;m%20reading%20Advanced%20analytics%20reduces%20downtime%20costs%20(HP-PI)%20http://bit.ly/Z2wA9"&gt;t&lt;span style="font-size:x-small;"&gt;weet this!&lt;/span&gt;&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p style="MARGIN:0in 0in 10pt;" class="MsoNormal"&gt;&lt;span&gt;Related Items&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;div&gt;&lt;span style="font-family:arial,helvetica,sans-serif;"&gt;&lt;a target="_blank" href="https://h10078.www1.hp.com/cda/hpms/display/main/hpms_content.jsp?zn=bto&amp;amp;cp=1-11-15-25^924_4000_100__"&gt;Problem Isolation&lt;/a&gt; web page&lt;/span&gt;&lt;/div&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;div&gt;&lt;a href="https://h10078.www1.hp.com/cda/hpdc/navigation.do?action=downloadPDF&amp;amp;caid=24533&amp;amp;cp=54_4000_100&amp;amp;zn=bto&amp;amp;filename=4AA1-6949ENW.pdf"&gt;Finding the needle in a million haystacks: best practices for IT problem isolation white paper (0.15MB, PDF)&lt;/a&gt;&lt;/div&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;span style="font-family:Arial Black;"&gt;&lt;/span&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family:Arial Black;"&gt;&lt;/span&gt;&amp;nbsp;&lt;span style="font-size:small;font-family:book antiqua,palatino;"&gt;&lt;/span&gt;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://www.communities.hp.com/online/aggbug.aspx?PostID=88984" width="1" height="1"&gt;</description><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/Automated+Business_2F00_IT+Service+Management/default.aspx">Automated Business/IT Service Management</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/Michael+Procopio/default.aspx">Michael Procopio</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/Problem+Isolation/default.aspx">Problem Isolation</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/advanced+analytics/default.aspx">advanced analytics</category><category domain="http://www.communities.hp.com/online/blogs/mbsmreality/archive/tags/anomaly+detection/default.aspx">anomaly detection</category></item></channel></rss>