DSstar Logo Providing News & Information For Data Intensive
Storage Solutions For The Enterprise

Previous Article  |  Table of Contents  |  Next Article

Features - Enterprise Data Insights:

INFORMATION LIFECYCLE MANAGEMENT: UBIQUITOUS DATA MANAGEMENT
by Sr. Analyst Nancy Marrone and Analyst Steve Kenniston, Enterprise Storage Group

Information Lifecycle Management (ILM) is quickly becoming the buzzword of the storage industry early in 2003. ILM is a combination of technology and methodology that helps end users manage data from the moment it is created to the time at which it is no longer needed. As new regulatory rules are erected, and the number of disaster recovery implementations increase, ILM maintains a pivotal role in helping IT professionals adhere to new standards while incurring minimum management headache. ILM is not a technology; it is a combination of processes and technologies that determine how data flows through an environment. Many companies are implementing a combination of these technologies and processes to enable a successful data lifecycle management practice.

The Process

ILM is a process that utilizes a number of technologies as well as business methodologies; these include:

  • Assessment
  • Socialization
  • Classification
  • Automation
  • Review

In the assessment phase of ILM, storage administrators can take advantage of SRM technologies. Storage Resource Management (SRM) solutions help IT administrators figure out what data resides on the storage assets in their environment. Most SRM tools have the capability to generate reports for IT, which outlines data usage patterns. Once IT understands what data they have and where it currently lives in the environment, they can begin the next steps of the assessment phase of the ILM process.

In the next step, IT generates reports from the SRM tools, presents them to the company's department heads, and explains the breakdown of storage assets utilization and costs involved. This process is known as the socialization phase of ILM. Once IT meets with the department heads and they collaborate to understand data usage patterns, department heads must determine how this data is used, and how critical it is to the business at any given point in time. The ability to prioritize data based on business requirements (i.e.: mission critical, business sensitive and departmental) will allow IT to determine where data should live through its lifecycle and assist them in creating policies to migrate data to the proper storage "class" over time. IT must work with these department heads to set up a classification schema for the company. Data classification is also a process. (See Classify, Classify, Classify at www.enterprisestoragegroup.com .) Data can be classified by:

  • Data type
  • Data "organization"
  • Data age
  • Data "value"

IT will use all data collected at this point to establish policies to automate the data migration through the environment with the minimum amount of hands on data management.

SRM solutions should be employed throughout the ILM process, not merely for an initial assessment. SRM solutions can monitor the storage environment constantly, revealing where excess capacity, duplicate files, "unnecessary files" or aged files exist. This information is very important in the ILM process, for it is essential in understanding what data should be migrated, archived or purged.

Once data has been classified, and IT and the department heads have agreed on a plan for where their data should live at any point in its life cycle, policies must be established to determine where (on which storage resources) the data should be located. Automated Data Migration (ADM) tools can automate the migration of data from one storage class to another based user defined policies. ADM solutions are essentially a combination of intelligent SRM and HSM solutions. HSM was designed to automate data migration for archival purposes; it was access driven. In contrast, ADM solutions enable data migration across various storage resources based on a combination of user-defined criteria; it is data value driven. Quality ADM tools assist IT in assigning value to the storage assets within the environment. Administrators can set up criteria around the data based on type, ownership, last access, or age. Data from a specific type of application may be placed on primary storage, but later may be migrated to secondary storage or archive if it has not been accessed in a certain period of time. Another policy option allows data from one application to be mirrored or copied to secondary storage as soon as it is written. Significant cost savings can be realized when data is assigned to storage resources according to the data's value. ADM solutions also have the ability to "recover" data to a higher-level storage resource if that data has been migrated to secondary storage and is accessed repeatedly. These solutions provide IT with the ability to optimize storage resources based on the value of the data.

Other data management tools are required, in conjunction with ADM solutions, to fulfill disaster recovery (DR) and business continuance (BC) objectives. Department heads must inform IT about what data is essential to the continuity of business operations during the process of classifying the data. To protect data that must remain readily available processes such as mirroring and replication to remote arrays or sites should be incorporated. Disaster recovery has been an important topic over the past two years, resulting in a number of assessments; however, there have been few implementations. It is often considered cost prohibitive to implement full-blown DR/BC plans due to the quantity of data IT believes needs to be replicated; however, several of the assessments discussed here can help IT determine the real quantity of data needing replication for DR/BC. DR/BC costs can be greatly reduced with the proper classification of data.

The classification and automatic migration of data helps IT with the process of backing up data. Backup initially consisted of moving data to some form of media that could be transported and housed in a reasonable footprint and stored for long periods of time (10 years, this is now considered archiving). Backup has evolved in practice into data migration to some form of storage medium for quick recovery. In the past, backup to tape has been the only option for offloading aged data from the primary resources. Now, with ADM and disk-to-disk back up solutions, long-term retention of data does not necessarily have to be on tape. Again, by assessing and classifying data, IT better understands what data needs to be on tape at given points in time, and what should be retained on secondary storage resources. SRM solutions can eliminate issues with backing up redundant and unnecessary files, open up the backup window, and greatly reduce the server and storage resources required.

The act of archiving data is evolving as well. Today, IT must determine how to avoid redundancy; nobody wants to have 4000 copies of the same file backed up to tape, wasting valuable floor space. Thus, how can IT find a necessary file without scouring through a bunch of archive files, especially when they know the file was active 60 days ago? SRM and ADM tools can ensure that IT is properly archiving the right data at the appropriate time, avoiding redundancy.

Archiving, by definition, is important in the compliance world. Regulatory requirements will determine the amount of time data is archived as well as the requirements surrounding archiving; these include: how long the data needs to be kept, how quickly the data needs to be recovered, and how it must be destroyed. Regulatory requirements will also determine the medium which data is stored on, such as "WORM" like media (Write Once Read Many).

Solving the Problem

ILM solve key problems such as ineffective storage utilization, the costs of managing storage, and the ability to manage storage growth surrounding data replication, disaster recovery / business continuance and backup. Our belief is that unless IT understands the real value of data within the environment, it remains difficult to determine how to best assign storage resources for its placement.

Once the data has been assigned a value, the storage resource should also be assigned a value. By properly placing data (the proper data value on the proper value array), IT can more effectively distribute data across multiple resources, which should lead to better utilization and cost savings. Automating the data management and migration process can ensure that the resources are being optimized and can also reduce the number of staff and man-hours required to manage the storage. By using SRM tools, IT no longer has to backup the same .mpg file that they have 245 copies of already. This process saves time (backup), disk space (secondary storage) and tape space. ADM solutions enable policy driven data placement, ensuring only the highest value data utilize primary resources; this reduces waste and allows IT to develop budgets based upon the importance of the data to business requirements.

ILM needs to be a part of the overall IT strategy as it relates to DR/BC and regulatory requirements. The ability to track the location of data within the company will help IT to successfully handle situations from audits to true disasters.

The Bottom Line

ILM is as significant a business process as CRM and ERP, considering an effective ILM implementation can significantly streamline costs and management efficiencies. Actually, IT organizations can use ILM processes to more effectively implement CRM and ERP solutions, making sure that critical data has resource priorities and is always available. ILM is an ever growing and evolving process In order to realize the benefits of the ILM process, IT must continuously review the usage patterns of the storage resources and ensure adherence to policies and procedures. By taking advantage of the new SRM / ADM tools, monitoring the process becomes easy.

The time has come for IT to start solving the foreseeable business challenges in order get ahead of and control the explosive growth of data that will continue to take place. New advances in ATA and SATA disk will play an important role in helping IT with ILM, giving them the ability to stage backups and snapshots inexpensively. Software innovations around SRM and ADM have also increased the ability to identify data, classify data, and move data to the proper location over time. Once IT can begin showing executive management exactly how their assets are being used, they will be in a position to properly assign charge backs to the various groups in an effort turn themselves into a profit (or at least break even) center. Now is the time to start planning.

Authors

Sr. Analyst Nancy Marrone and Analyst Steve Kenniston

For more information, see www.enterprisestoragegroup.com .

 
DSstar Sponsors
LSI Logic Auspex PowerQuest MicroStrategy
Sun Microsystems Computer Associates Clementine Sony
NEC IBM Hitachi Data Systems Hewlett Packard
Quantum Overland Storage Educom adic
MCE McDATA    

Top of Page


Previous Article  |  Table of Contents  |  Next Article