<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://www.communities.hp.com/online/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"><channel><title>Research on Security and Identity Management (by Marco Casassa Mont) : Unstructured Data</title><link>http://www.communities.hp.com/online/blogs/mcm/archive/tags/Unstructured+Data/default.aspx</link><description>Tags: Unstructured Data</description><dc:language>en</dc:language><generator>CommunityServer 2008.5 SP1 (Build: 31106.3070)</generator><item><title>Part II: Risk Management for Unstructured Data in Enterprises</title><link>http://www.communities.hp.com/online/blogs/mcm/archive/2008/09/04/part-ii-risk-management-for-unstructured-data-in-enterprises.aspx</link><pubDate>Thu, 04 Sep 2008 13:09:00 GMT</pubDate><guid isPermaLink="false">964d1d0f-bea0-4201-a2aa-8aa369a35a46:84600</guid><dc:creator>marcocasassamont</dc:creator><slash:comments>2</slash:comments><comments>http://www.communities.hp.com/online/blogs/mcm/archive/2008/09/04/part-ii-risk-management-for-unstructured-data-in-enterprises.aspx#comments</comments><description>&lt;p&gt;In a recent post published on the &lt;a href="http://sgciam.wordpress.com/2008/09/04/risk-management-and-information/"&gt;Netweaver Identity Manager Weblog&lt;/a&gt;, the author&amp;nbsp; has made a few comments about my post on &lt;a href="http://www.communities.hp.com/online/blogs/mcm/archive/2008/09/02/risk-management-for-unstructured-data-in-enterprises.aspx"&gt;&amp;quot;Risk Management for Unstructured Data in Enterprises&amp;quot;&lt;/a&gt; (well, actually the published URL to my post is apparently broken ...). &lt;/p&gt;
&lt;p&gt;Thanks for this input, in particular about three main points that I (tried to) summarise as it follows:&lt;/p&gt;
&lt;p&gt;1) Meaning of unstructured data (or the fact that unstructured data does not exist by definition ...)&lt;/p&gt;
&lt;p&gt;2) Narrowness of perception of approaches and incompleteness of my list of required solutions&lt;/p&gt;
&lt;p&gt;3) Availability of comprehensive methodology for implementing enterprise wide risk management&lt;/p&gt;
&lt;p&gt;About point 1), this looks pretty much a philosophical discussion. No doubt that, at the end, we talk about information that has some sort of structure (well, an email has a header, a body with some texts and attachments; a document is made of paragraphs or lines of text; ...). However, the (maybe over-hyped) &amp;quot;unstructured data&amp;quot; term is currently used to (a) identify specific types of information and (b) contrast it against classic &amp;quot;structured data&amp;quot; (e.g. information stored in RDBMS repositories, etc.).&amp;nbsp; I think I will stick with this terminology ...&lt;/p&gt;
&lt;p&gt;Back to the key point, recent reports (including the Ponemon Institute&amp;#39;s survey on &amp;quot;Governance of Unstructured Data&amp;quot; and other market and research reports) indeed highlight that the management of unstructured data in enterprises is a raising concern for enterprises, both in terms of governance and risk management. I think this is what really matters - independently from the terminology.&lt;/p&gt;
&lt;p&gt;No doubt that classification of data is an important point, especially if you ever manage to &amp;quot;find&amp;quot; where this &amp;quot;unstructured data&amp;quot; is, within a complex enterprise environment ... I would say that, given the particular nature of &amp;quot;unstructured data&amp;quot;, a preliminary &amp;quot;data discovery&amp;quot; phase might be required, indeed followed by a classification and assessment of its value (considering though, that the value of some of this information might also come from aggregations and correlations ...).&lt;/p&gt;
&lt;p&gt;About point 2), by no means my post was meant to provide a definitive or comprehensive assessment and answer to the problem of information risk management or, more specifically, on &amp;quot;unstructured&amp;quot; information risk management. It was just a statement of some &amp;quot;desirable&amp;quot; properties and capabilities that I would like to see (and I know it would be of some help to customers ...).&lt;/p&gt;
&lt;p&gt;I am well aware of the complexity of the overall &amp;nbsp;(security) &amp;quot;enterprise risk assessment and management&amp;quot; problem, its extent and the fact that, when assessing and managing (security) risks, many factors are involved, including business goals, IT, other assets, people, processes, awareness/education, etc.&amp;nbsp;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;(Security) risk assessment and management techniques/methodologies/frameworks and standards/etc. are indeed out there (e.g. ISO 27005/2700x, CoBIT, etc.). These &amp;quot;standards&amp;quot; provide guidelines and criteria to be carefully refined, grounded and contextualized in various &amp;quot;operational&amp;quot; realities, along with some good, common sense ...&lt;/p&gt;
&lt;p&gt;So, no doubt that there are already &amp;quot;comprehensive methodology for implementing enterprise wide risk management&amp;quot;, at least from a consulting perspective, but this was not my main point. &lt;/p&gt;
&lt;p&gt;My main point was not so focused on these methodologies but rather on the need to better understand and possibly improve the process of exploring, explaining and predicting the consequences and impacts of strategic (policy) choices and decisions in enterprise contexts and environments, in particular when dealing with security matters. &lt;/p&gt;
&lt;p&gt;An approach that we are currently exploring is based on modeling and simulation techniques in the security field, coupled with economic theory and social science. Please have a look at the &lt;a href="http://www.hpl.hp.com/techreports/2008/HPL-2008-84.html"&gt;HPL Technical Report on &amp;quot;Identity Analytics&amp;quot;&lt;/a&gt; that I mentioned a few times - to see what I mean, in more details (at least from an &amp;quot;IdM perspective&amp;quot;). &lt;/p&gt;
&lt;p&gt;Specifically, one of my R&amp;amp;D interests is in &amp;quot;(semi-) automation&amp;quot; tools and solutions in this space that can indeed help and support professional and consulting services in their risk assessment &amp;amp; management activities. This includes providing decision support and &amp;quot;what-if analysis&amp;quot;, involving modeling and simulation, providing trade-off analysis, etc. &lt;/p&gt;
&lt;p&gt;Given the complexity of this space, I deliberately focused on the aspect of &amp;quot;management of unstructured data&amp;quot; and the IdM perspective, well conscious this is just a part of the overall problem and space. &lt;/p&gt;
&lt;p&gt;I hope I clarified this point.&lt;/p&gt;
&lt;p&gt;About point 3), no doubt about this, as I mentioned above.&lt;/p&gt;
&lt;p&gt;However the statement that &amp;quot;comprehensive methodology for implementing enterprise wide risk management is done&amp;quot; sounds (at least to me) sounds a little bit abstract to me ... &lt;/p&gt;
&lt;p&gt;It would be of some interest to the readers of this blog if this statement could be elaborated (specifically in the space of IdM and information management) along with providing some recommendations/input/directions (hopefully beyond having to hire a consulting company ...:-)).&lt;/p&gt;
&lt;p class="MsoNormal" style="MARGIN:0pt;"&gt;&lt;font face="Times New Roman" size="3"&gt;--- NOTE:&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;use this &lt;/font&gt;&lt;a href="http://research-on-identitymanagement.blogspot.com/"&gt;&lt;font face="Times New Roman" size="3"&gt;mirror blog&lt;/font&gt;&lt;/a&gt;&lt;font face="Times New Roman" size="3"&gt; if you prefer posting on an external blog site &lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp;&lt;/span&gt;---&lt;/font&gt;&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://www.communities.hp.com/online/aggbug.aspx?PostID=84600" width="1" height="1"&gt;</description><category domain="http://www.communities.hp.com/online/blogs/mcm/archive/tags/risk+management/default.aspx">risk management</category><category domain="http://www.communities.hp.com/online/blogs/mcm/archive/tags/Security+Analytics/default.aspx">Security Analytics</category><category domain="http://www.communities.hp.com/online/blogs/mcm/archive/tags/Identity+Analytics/default.aspx">Identity Analytics</category><category domain="http://www.communities.hp.com/online/blogs/mcm/archive/tags/Unstructured+Data/default.aspx">Unstructured Data</category></item><item><title>Risk Management for Unstructured Data in Enterprises</title><link>http://www.communities.hp.com/online/blogs/mcm/archive/2008/09/02/risk-management-for-unstructured-data-in-enterprises.aspx</link><pubDate>Mon, 01 Sep 2008 16:21:00 GMT</pubDate><guid isPermaLink="false">964d1d0f-bea0-4201-a2aa-8aa369a35a46:84551</guid><dc:creator>marcocasassamont</dc:creator><slash:comments>0</slash:comments><comments>http://www.communities.hp.com/online/blogs/mcm/archive/2008/09/02/risk-management-for-unstructured-data-in-enterprises.aspx#comments</comments><description>&lt;p&gt;In the context of the HP Labs&amp;#39; Security and &lt;a href="http://www.hpl.hp.com/personal/Marco_Casassa_Mont/Projects/IdentityAnalytics/IdentityAnalytics.htm"&gt;Identity Analytics&lt;/a&gt; project I have been investigating the implications of &amp;quot;unstructured data&amp;quot; (i.e. emails, documents, multimedia files, pages in data sharing sites, messages exchanged with Instant Messaging tools, blog posts, data mash-ups, etc.) within organizations, along with how to explain and predict involved risks and explore the consequences of related security (policy) choices.&lt;/p&gt;
&lt;p&gt;Is &amp;quot;unstructured data&amp;quot; really a problem for organizations? If so, where is this problem? Well, the content of unstructured data (and/or an aggregation of it) can be confidential as it might include personal, financial and business-critical information. Because of the nature of unstructured data (and associated, emerging tools to handle and share it), there are many ways this data could leak and/or be misused, ranging from accidental disclosures to aggregations of information posted in public areas.&lt;/p&gt;
&lt;p&gt;The threat landscape (including threats to data confidentiality, integrity and availability) is potentially broad as many contextual elements, IT components, processes and behavioral aspects are involved.&lt;/p&gt;
&lt;p&gt;Most of the current approaches (I am aware of), that mitigate some of the involved risks, are based on traditional IT security and identity &amp;quot;control points&amp;quot; (such as access control, interception points, complex document lifecycle management tools, etc.), addressing &amp;quot;point problems&amp;quot;.&lt;/p&gt;
&lt;p&gt;I believe this is not enough. Solutions are required to help organizations (and decision makers) to: (1) fully understand the nature of the problem, based on their specific context and environment; (2) have a picture of their overall risk exposure; (3) make informed decisions on which approaches to follow, explain and predict the consequences and define appropriate policies; (3) explore trade-offs.&lt;/p&gt;
&lt;p&gt;So far I have found no comprehensive approach/solution providing these features. Is anybody aware of any? &lt;/p&gt;
&lt;p&gt;--- NOTE:&amp;nbsp; use this &lt;a href="http://research-on-identitymanagement.blogspot.com/"&gt;mirror blog&lt;/a&gt; if you prefer posting on an external blog site &amp;nbsp;---&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://www.communities.hp.com/online/aggbug.aspx?PostID=84551" width="1" height="1"&gt;</description><category domain="http://www.communities.hp.com/online/blogs/mcm/archive/tags/risk+management/default.aspx">risk management</category><category domain="http://www.communities.hp.com/online/blogs/mcm/archive/tags/Identity+Analytics/default.aspx">Identity Analytics</category><category domain="http://www.communities.hp.com/online/blogs/mcm/archive/tags/Unstructured+Data/default.aspx">Unstructured Data</category></item><item><title>Part III: Identity Analytics and Unstructured Data Analysis</title><link>http://www.communities.hp.com/online/blogs/mcm/archive/2008/07/25/part-iii-identity-analytics-and-unstructured-data-analysis.aspx</link><pubDate>Fri, 25 Jul 2008 13:11:00 GMT</pubDate><guid isPermaLink="false">964d1d0f-bea0-4201-a2aa-8aa369a35a46:84009</guid><dc:creator>marcocasassamont</dc:creator><slash:comments>2</slash:comments><comments>http://www.communities.hp.com/online/blogs/mcm/archive/2008/07/25/part-iii-identity-analytics-and-unstructured-data-analysis.aspx#comments</comments><description>&lt;p&gt;In previous posts of mine (&lt;a href="http://www.communities.hp.com/online/blogs/mcm/archive/2008/07/13/on-identity-analytics-part-ii.aspx"&gt;here&lt;/a&gt; and &lt;a href="http://www.communities.hp.com/online/blogs/mcm/archive/2008/07/09/on-identity-analytics-new-hp-labs-technical-report.aspx"&gt;here&lt;/a&gt;) I introduced our vision of Identity Analytics and the focus and purposes of our R&amp;amp;D activities.&lt;/p&gt;
&lt;p&gt;I received a few emails and queries asking to clarify the link between Identity Analytics and Unstructured Data, considering that this was mentioned in the &amp;quot;&lt;a href="http://www.hpl.hp.com/techreports/2008/HPL-2008-84.html"&gt;On Identity Analytics: Setting the Context&lt;/a&gt;&amp;quot; HPL Technical Report.&lt;/p&gt;
&lt;p&gt;We believe that &amp;quot;Unstructured Data&amp;quot; is a possible, fertile and rich &amp;quot;case study&amp;quot;/scenario where to explore the concept of Identity Analytics, the applicability of our approach and potential limitations..&lt;/p&gt;
&lt;p&gt;The adoption of new &amp;quot;web 2.0&amp;quot; collaborative tools within organizations (TWiki, Sharepoint, IM, etc.) and social networks (Facebook, LinkedIn, del.icio.us, etc.) provides users with better ways to collaborate, create and share contents. At the same time this poses new threats and security risks, due to the nature of unstructured data, the fact that confidentiality issues could emerge from aggregated, simpler pieces of information and the difficulty to retain control on this data. This is where traditional Identity management solutions can show their limitations and where decision makers need to better understand the implications of their choices and/or the impact of defining new policies.&lt;/p&gt;
&lt;p&gt;Our R&amp;amp;D work in Identity Analytics really aims, in this context, to explore how modeling and simulation can help to explain and predict the impact of some of these decisions on the organizations (e.g. in terms of risks, reputation, costs, etc.) and explore options and &amp;quot;trade-offs&amp;quot; by providing &amp;quot;what-if&amp;quot; analysis.&lt;/p&gt;
&lt;p&gt;Of course the &amp;quot;unstructured data&amp;quot; scenario is just one of the various scenarios we are exploring. I would be interested in hearing from you about other areas you think the &amp;quot;Identity Analytics&amp;quot; approach could provide help and/or address (decision support) issues you might have.&lt;/p&gt;
&lt;p&gt;--- NOTE:&amp;nbsp; use this &lt;a href="http://research-on-identitymanagement.blogspot.com/"&gt;mirror blog&lt;/a&gt; if you prefer posting on an external blog site &amp;nbsp;---&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;&lt;img src="http://www.communities.hp.com/online/aggbug.aspx?PostID=84009" width="1" height="1"&gt;</description><category domain="http://www.communities.hp.com/online/blogs/mcm/archive/tags/Identity+Analytics/default.aspx">Identity Analytics</category><category domain="http://www.communities.hp.com/online/blogs/mcm/archive/tags/Unstructured+Data/default.aspx">Unstructured Data</category></item></channel></rss>