
Features - Enterprise Data Insights:
INFORMATION LIFECYCLE MANAGEMENT: UBIQUITOUS DATA MANAGEMENT
by Sr. Analyst Nancy Marrone and Analyst Steve Kenniston, Enterprise Storage
Group
Information Lifecycle Management (ILM) is quickly becoming the buzzword of the
storage industry early in 2003. ILM is a combination of technology and
methodology that helps end users manage data from the moment it is created to
the time at which it is no longer needed. As new regulatory rules are erected,
and the number of disaster recovery implementations increase, ILM maintains a
pivotal role in helping IT professionals adhere to new standards while
incurring minimum management headache. ILM is not a technology; it is a
combination of processes and technologies that determine how data flows
through an environment. Many companies are implementing a combination of these
technologies and processes to enable a successful data lifecycle management
practice.
The Process
ILM is a process that utilizes a number of technologies as well as business
methodologies; these include:
- Assessment
- Socialization
- Classification
- Automation
- Review
In the assessment phase of ILM, storage administrators can take advantage of
SRM technologies. Storage Resource Management (SRM) solutions help IT
administrators figure out what data resides on the storage assets in their
environment. Most SRM tools have the capability to generate reports for IT,
which outlines data usage patterns. Once IT understands what data they have
and where it currently lives in the environment, they can begin the next steps
of the assessment phase of the ILM process.
In the next step, IT generates reports from the SRM tools, presents them to
the company's department heads, and explains the breakdown of storage assets
utilization and costs involved. This process is known as the socialization
phase of ILM. Once IT meets with the department heads and they collaborate to
understand data usage patterns, department heads must determine how this data
is used, and how critical it is to the business at any given point in time.
The ability to prioritize data based on business requirements (i.e.: mission
critical, business sensitive and departmental) will allow IT to determine
where data should live through its lifecycle and assist them in creating
policies to migrate data to the proper storage "class" over time. IT must work
with these department heads to set up a classification schema for the company.
Data classification is also a process. (See Classify, Classify, Classify at
www.enterprisestoragegroup.com
.) Data can be classified by:
- Data type
- Data "organization"
- Data age
- Data "value"
IT will use all data collected at this point to establish policies to automate
the data migration through the environment with the minimum amount of hands on
data management.
SRM solutions should be employed throughout the ILM process, not merely for an
initial assessment. SRM solutions can monitor the storage environment
constantly, revealing where excess capacity, duplicate files, "unnecessary
files" or aged files exist. This information is very important in the ILM
process, for it is essential in understanding what data should be migrated,
archived or purged.
Once data has been classified, and IT and the department heads have agreed on
a plan for where their data should live at any point in its life cycle,
policies must be established to determine where (on which storage resources)
the data should be located. Automated Data Migration (ADM) tools can automate
the migration of data from one storage class to another based user defined
policies. ADM solutions are essentially a combination of intelligent SRM and
HSM solutions. HSM was designed to automate data migration for archival
purposes; it was access driven. In contrast, ADM solutions enable data
migration across various storage resources based on a combination of
user-defined criteria; it is data value driven. Quality ADM tools assist IT in
assigning value to the storage assets within the environment. Administrators
can set up criteria around the data based on type, ownership, last access, or
age. Data from a specific type of application may be placed on primary
storage, but later may be migrated to secondary storage or archive if it has
not been accessed in a certain period of time. Another policy option allows
data from one application to be mirrored or copied to secondary storage as
soon as it is written. Significant cost savings can be realized when data is
assigned to storage resources according to the data's value. ADM solutions
also have the ability to "recover" data to a higher-level storage resource if
that data has been migrated to secondary storage and is accessed repeatedly.
These solutions provide IT with the ability to optimize storage resources
based on the value of the data.
Other data management tools are required, in conjunction with ADM solutions,
to fulfill disaster recovery (DR) and business continuance (BC) objectives.
Department heads must inform IT about what data is essential to the continuity
of business operations during the process of classifying the data. To protect
data that must remain readily available processes such as mirroring and
replication to remote arrays or sites should be incorporated. Disaster
recovery has been an important topic over the past two years, resulting in a
number of assessments; however, there have been few implementations. It is
often considered cost prohibitive to implement full-blown DR/BC plans due to
the quantity of data IT believes needs to be replicated; however, several of
the assessments discussed here can help IT determine the real quantity of data
needing replication for DR/BC. DR/BC costs can be greatly reduced with the
proper classification of data.
The classification and automatic migration of data helps IT with the process
of backing up data. Backup initially consisted of moving data to some form of
media that could be transported and housed in a reasonable footprint and
stored for long periods of time (10 years, this is now considered archiving).
Backup has evolved in practice into data migration to some form of storage
medium for quick recovery. In the past, backup to tape has been the only
option for offloading aged data from the primary resources. Now, with ADM and
disk-to-disk back up solutions, long-term retention of data does not
necessarily have to be on tape. Again, by assessing and classifying data, IT
better understands what data needs to be on tape at given points in time, and
what should be retained on secondary storage resources. SRM solutions can
eliminate issues with backing up redundant and unnecessary files, open up the
backup window, and greatly reduce the server and storage resources
required.
The act of archiving data is evolving as well. Today, IT must determine how to
avoid redundancy; nobody wants to have 4000 copies of the same file backed up
to tape, wasting valuable floor space. Thus, how can IT find a necessary file
without scouring through a bunch of archive files, especially when they know
the file was active 60 days ago? SRM and ADM tools can ensure that IT is
properly archiving the right data at the appropriate time, avoiding
redundancy.
Archiving, by definition, is important in the compliance world. Regulatory
requirements will determine the amount of time data is archived as well as the
requirements surrounding archiving; these include: how long the data needs to
be kept, how quickly the data needs to be recovered, and how it must be
destroyed. Regulatory requirements will also determine the medium which data
is stored on, such as "WORM" like media (Write Once Read Many).
Solving the Problem
ILM solve key problems such as ineffective storage utilization, the costs of
managing storage, and the ability to manage storage growth surrounding data
replication, disaster recovery / business continuance and backup. Our belief
is that unless IT understands the real value of data within the environment,
it remains difficult to determine how to best assign storage resources for its
placement.
Once the data has been assigned a value, the storage resource should also be
assigned a value. By properly placing data (the proper data value on the
proper value array), IT can more effectively distribute data across multiple
resources, which should lead to better utilization and cost savings.
Automating the data management and migration process can ensure that the
resources are being optimized and can also reduce the number of staff and
man-hours required to manage the storage. By using SRM tools, IT no longer has
to backup the same .mpg file that they have 245 copies of already. This
process saves time (backup), disk space (secondary storage) and tape space.
ADM solutions enable policy driven data placement, ensuring only the highest
value data utilize primary resources; this reduces waste and allows IT to
develop budgets based upon the importance of the data to business
requirements.
ILM needs to be a part of the overall IT strategy as it relates to DR/BC and
regulatory requirements. The ability to track the location of data within the
company will help IT to successfully handle situations from audits to true
disasters.
The Bottom Line
ILM is as significant a business process as CRM and ERP, considering an
effective ILM implementation can significantly streamline costs and management
efficiencies. Actually, IT organizations can use ILM processes to more
effectively implement CRM and ERP solutions, making sure that critical data
has resource priorities and is always available. ILM is an ever growing and
evolving process In order to realize the benefits of the ILM process, IT must
continuously review the usage patterns of the storage resources and ensure
adherence to policies and procedures. By taking advantage of the new SRM / ADM
tools, monitoring the process becomes easy.
The time has come for IT to start solving the foreseeable business challenges
in order get ahead of and control the explosive growth of data that will
continue to take place. New advances in ATA and SATA disk will play an
important role in helping IT with ILM, giving them the ability to stage
backups and snapshots inexpensively. Software innovations around SRM and ADM
have also increased the ability to identify data, classify data, and move data
to the proper location over time. Once IT can begin showing executive
management exactly how their assets are being used, they will be in a position
to properly assign charge backs to the various groups in an effort turn
themselves into a profit (or at least break even) center. Now is the time to
start planning.
Authors
Sr. Analyst Nancy Marrone and Analyst Steve Kenniston
For more information, see www.enterprisestoragegroup.com
.
|