THE CORPORATE INFORMATION OR KNOWLEDGE FACTORY?
by Joseph M. Firestone, Ph.D.
W. H. Inmon's vision of the IT future is an information ecosystem
"with different components, each serving a community directly while working
in concert with other components to produce a cohesive, balanced information
environment. Like nature's ecosystem, an information ecosystem must be
adaptable, changing as the inhabitants and participants within its aegis
change. Over time, the balance between different components and their
relationship to each other changes as well, as the environment changes.
Sometimes the effect will appear on seemingly unrelated parts (sometimes
disastrously!). Adaptability, change, and balance, are the hallmarks of the
components of a healthy information ecosystem." [1]
Further, "the corporate information factory (CIF) is the physical embodiment
of the notion of an information ecosystem." [2] In other words, the CIF "is an
architecture for the information ecosystem, consisting of the following
architectural components:
An applications environment An integration and transformation layer (I &
T
layer) A data warehouse with current and historical detailed data A data
mart(s) An operational data store (ODS) An Internet and Intranet A metadata
repository" [3] Both the information ecosystem and the CIF are, according to
Inmon, evolving to meet the pressures for business efficiency in three key
areas: business operations, business intelligence, and business management.
While IT has successfully delivered capabilities in the area of business
operations without the CIF, it is because of the increasing need for greater
business intelligence and greater business management capability that
corporations are developing the information ecosystem and the CIF. [4]
In short the goals are better business operations, improved business
intelligence, and enhanced business management. The means are the information
ecosystem and the CIF. But left unanswered is the question of why the
information ecosystem and the CIF are the appropriate means. The means can be
questioned in two ways. First, we can disagree about the make-up of the CIF.
That is, it is easy to postulate additional components for the CIF such as:
Component Transaction Servers [5], Active Information Managers [6], Business
Process Engines [7], and other Application Servers [8]. But second, beyond
this and more important, why should we assume that an information ecosystem,
and an associated CIF are appropriate objectives for fulfilling our goals? To
see whether this is true, we need to examine what information is, and how it
is related to other outcomes of our attempts to understand the business
environment, namely: data, knowledge, and wisdom.
What are the Differences Among Data, Information, Knowledge, and Wisdom?
To begin with, organizational data, information, knowledge, and wisdom, all
emerge from the social process of an organization, and are not private. In
defining them, we are not trying to formulate definitions that will elucidate
the nature of personal data, information, knowledge, or wisdom. Instead, to
use a word that used to be more popular in discourse than it is at present, we
are trying to specify inter-subjective constructs and to provide metrics for
them.
A datum is the value of an observable, measurable or calculable attribute.
Data is more than one such attribute value. Is a datum (or is data)
information? Yes, information is provided by a datum, or by data, but only
because data is always specified in some conceptual context. At a minimum, the
context must include the class to which the attribute belongs, the object that
is a member of that class, some ideas about object operations or behavior, and
relationships to other objects and classes.
Data alone and in the abstract therefore, does not provide information.
Rather, information, in general terms, is data plus conceptual commitments and
interpretations. Information is data extracted, filtered or formatted in some
way (but keep in mind that data is always extracted filtered, or formatted in
some way).
Knowledge is a subset of information. But it is a subset that has been
extracted, filtered, or formatted in a very special way. More specifically,
the information we call knowledge is information that has been subjected to,
and passed tests of validation. Common sense knowledge is information that has
been validated by common sense experience. Scientific knowledge is information
(hypotheses and theories) validated by the rules and tests applied to it by
some scientific community.
Organizational knowledge, in terms of this framework, is information
validated by the rules and tests of the organization seeking knowledge. The
quality of its knowledge then, will be largely dependent on the tendency of
its validation rules and tests to produce knowledge that improves
organizational performance or in Inmon's terms business operations, business
intelligence, and business management (the organization's version of objective
knowledge).
Wisdom, lastly, has a more active component than data, information, or
knowledge. It is the application of knowledge expressed in principles to
arrive at prudent, sagacious decisions about conflictful situations. [9]
From the viewpoint of the definition given of organizational knowledge, what
is an organization doing when it validates information to produce knowledge?
It seems reasonable to propose that the validation process is an essential
aspect of the broader organizational learning process, and that validation is
a form of learning. So, though knowledge is a product and not a process
derived from learning, knowledge validation (validation of information to
admit it into the knowledge base) is certainly closely tied to learning, and
depending on the definition of organizational learning, may be viewed as
derived from it.
The Corporate Distributed Knowledge Management System and The Corporate
Knowledge Factory
It follows directly from the distinctions among data, information, knowledge
and wisdom, that if our goals are to improve business operations, business
intelligence, and business management, we must do this not through an
information ecosystem and a corporate information factory, but with analogous
constructs focused on knowledge, rather than just information. Why? Because:
We need correct information to improve business operations, intelligence,
and management; incorrect information will lead to worse performance, not
improvement Information that has been validated as correct, is more likely to
be correct than unvalidated information or validated information that has not
survived testing Therefore we need a special kind of corporate information
ecosystem called a corporate knowledge ecosystem or a Corporate Distributed
Knowledge Management System (CDKMS) to improve performance And we also need a
special kind of information architecture called a Corporate Knowledge Factory
(CKF), to express the IT architecture supporting this system Let's get to some
definitions.
The Knowledge Management System (KMS) [10] is the on-going, persistent
interaction among agents within a system that produces, maintains, and
enhances the system's knowledge base. This definition is meant to apply to any
intelligent, adaptive system composed of interacting agents, including a
corporation. So, the Corporate Knowledge Management System (CKMS) is just the
KMS of a corporation. An agent is a purposive, self-directed object. Knowledge
base will be defined in the next section. In saying that a CKMS produces
knowledge I am saying that it (a) gathers information and (b) compares
conceptual formulations describing and evaluating its experience, with its
goals, objectives, expectations or past formulations of descriptions, or
evaluations. Further, this comparison is conducted with reference to
validation criteria. Through use of such criteria, CKMSs and other intelligent
systems distinguish competing descriptions and evaluations in terms of
closeness to the truth, closeness to the legitimate, and closeness to the
beautiful. [11]
In saying that a CKMS maintains knowledge I am also saying that it continues
to evaluate its knowledge base against new information by subjecting the
knowledge base to continuous testing against its validation criteria. We are
also saying that to maintain its knowledge, a more complex system must ensure
both the continued dissemination of its currently validated knowledge base,
and continued socialization of intelligent agents in the use and content of
its knowledge base.
Finally, in saying that a CKMS enhances its knowledge base, I am saying that
it adds new propositions and new models to its knowledge base, and also
simplifies and increases the explanatory and predictive power of its older
propositions and models. That is, one of the functions of the CKMS is to
provide for the growth of knowledge.
A CKMS's knowledge base is: the set of remembered data; validated
propositions and models (along with metadata related to their testing);
refuted propositions and models (along with metadata related to their
refutation); metamodels; and software used for manipulating these, pertaining
to the CKMS and produced by it. [12] A corporate knowledge management system,
in this view, requires an at least partly automated knowledge base to begin
operation. But it enhances its own knowledge base with the passage of time
because it is a self-correcting system, and subjects its knowledge base to
testing against experience.
The Corporate Knowledge Management Process (CKMP) is an on-going persistent
interaction among human-based agents who aim at integrating all of the various
agents, components, and activities of the CKMS into a planned, directed
process producing, maintaining and enhancing its knowledge base. [13]
Corporate Knowledge Management (CKM) is the human activity within the CKMP
aimed at creating and maintaining this integration, and its associated
planned, directed process. The Corporate Distributed Knowledge Management
System (CDKMS) [14] manages the integration of distributed IT components into
a functioning whole supporting the activities of producing, maintaining, and
enhancing its knowledge base. A CDKMS, in this view, is an information systems
application. It is a special kind of information ecosystem, in Inmon's sense.
It is distinct from the CKMS, which transcends information systems, and covers
all types of corporate activity involving the knowledge base. The CDKMS
requires a knowledge base to begin operation. But it enhances its own
knowledge base with the passage of time because it is a self-correcting
system, subject to testing against experience. The CDKMS must not only manage
data, but all of the information, components, objects, object models, process
models, use case models, object interaction models, and dynamic models, used
to process data and to interpret it to produce a business knowledge base. It
is because of its role in managing and processing distributed data, objects,
and models to produce a knowledge base that the term Corporate Distributed
Knowledge Management System is so appropriate.
The Corporate Distributed Knowledge Management Architecture (CDKMA), which
can also be called the Corporate Knowledge Factory (CKF) [15], is the IT
architecture needed to implement the CDKMS. It is different from the CIF in
essential ways I'll cover shortly. How are the CDKMS and the CKF different
from the information ecosystem and the CIF? The difference is that the CDKMS
and the CKF are sub-types of the information ecosystem and the CIF, focused on
the objectives of producing, maintaining, and enhancing knowledge, and further
on the necessity of validating information as the essential aspect of
knowledge production, maintenance and enhancement. I cannot over-emphasize the
importance of this difference between the respective information management
system/architecture and knowledge system management/architecture constructs.
There is nothing in Inmon's conception that orients IT toward Knowledge
Management (KM), only toward information management and data management.
Without an orientation toward KM however, there is no tight coupling between
the IT remedies -- the information ecosystem, and the CIF -- and the goals of
improving business operations, intelligence, and management. An information
ecosystem, however balanced and adaptable, is oriented toward producing,
maintaining, and enhancing information, and information, again, is just data
filtered and formatted in such a way as to give it structure. It has no
necessary relationship to reality or the ability to influence it in accordance
with corporate goals. But knowledge has such a relationship, and insofar as
CKM with the assistance of the CDKMS and the CKF is successful in improving
relevant knowledge production, maintenance, and enhancement, it must also
provide the capability to improve business operations intelligence, and
management.
The CIF and The CKF
As well as being different in their goal orientation the CIF and the CKF are
different in their components. The differences from the CIF are substantial.
They are:
The CKF emphasizes most of the components of the CIF, but not the ODS
which is optional in the CKF In addition, central to the CKF is the Active
Knowledge Manager [17], a server providing: process control services, an
active in-memory object model, and connectivity services that may include an
Object request Broker (ORB). In addition, persistent storage for the AKM is
provided by an OODBMS In addition, the CKF emphasizes a number of application
servers as essential to the architecture. These include: A Data Mining
Server(s) devoted to Knowledge Discovery in Databases (KDD), including
pre-processing data for data mining, data mining, and validation [18] A
Component Transaction Server (CTS) devoted to monitoring and managing
transactions among system components A Business Process Engine(s) [19] devoted
to supporting scalable performance in the CDKMS by breaking the chain of
serial requests for data using in-memory business state management, business
state synchronization, transactional multi-threading, proactive operations
including intelligent software agents, and component management [20]. Most or
all of these additions could be added to the CIF to make it equivalent to the
CKF, and certainly there is nothing in the CIF concept that would make such
additions inconsistent with its conceptual foundation. But this is to beg the
question of comparing the orientations provided by the CIF with that of the
CKF, and determining whether the CIF or the CKF should be preferred as an
architectural concept.
The CIF does not naturally orient the architect to KDD, because it does not
emphasize the centrality of knowledge discovery. It also does not naturally
orient the architect to object-orientation and therefore to the other added
components either, because "information management" has long been more
associated with data processing and data management than it has been with
abstract modeling, general systems approaches to knowledge development, and
advanced data analysis. On the other hand, when we begin to think in terms of
the CKF about KDD-related manipulation and validation; we naturally think
about analytical model repositories, tree structures, graph theory, neural
networks, genetic algorithms, complex adaptive system simulations, and other
matters that are handled well by O-O based components added to the
architecture, and not as well by more traditional database components.
In short, for all of the reasons stated above, if you want to improve
business operations, intelligence and management, the proper orienting
concepts are not the information ecosystem and the CIF, but instead the CDKMS
and the CKF. These are more closely related to the business goals that Inmon
wanted to support. In addition, they are suggestive of a wider and
increasingly necessary set of O-O -based components for IT solutions than are
the information ecosystem and the CIF. So let's replace the CIF with the CKF
and get on with constructing corporate distributed knowledge management
solutions.
Brief No. One
References
[1] See W. H. Inmon, Claudia Imhoff, and Ryan Sousa, Corporate Information
Factory (New York, NY: John Wiley & Sons, 1998), Pp. 2-3
[2] Ibid. P. 8
[3] Ibid. P. 13
[4] Pp. 5-11
[5] Some examples are Sybase's Jaguar CTS, and Microsoft's MTS.
[6] Later on, I call this component the Active Knowledge Manager (AKM) and
define it.
[7] Defined later.
[8] For example, data mining servers, commodity trading servers, etc.
[9] I read Gene Bellinger's views on data, information, knowledge, and wisdom
at
www.radix.net/~crbnblu/musings/kmgmt/kmgmt.htm, before writing my
own differing account of these four concepts. His views are certainly worth
keeping in mind when considering mine
[10] For more detail see my "Basic Concepts of Knowledge Management," White
Paper at www.dkms.com/KMBASIC.html.
[11] I'm referring to the view that validation criteria can be applied in
arriving at ethical and aesthetic knowledge, as in arriving at factual
knowledge. See Nicholas Rescher's Objectivity: The Obligations of Impersonal
Reason (Notre Dame, IN: University of Notre Dame Press, 1997), Chs. 9-11, and
E. W. Hall, Our Knowledge of Fact and Value (Chapel Hill, NC: University of
North Carolina Press, 1961).
[12] For more detail see my "Basic Concepts . . ." op. cit.
[13] For more detail, see Ibid.
[14] I introduced the DKMS concept in two previous White Papers
"Object-Oriented Data Warehouse," and "Distributed Knowledge Management
Systems: The Next Wave in DSS." Both are available at
www.dkms.com/White_Papers.htm.
[15] I've used the more general term Distributed Knowledge Management (DKM)
Architecture in an earlier context. But the CKF is just the DKM in corporate
contexts. See my "Architectural Evolution in Data Warehousing," White Paper
No. Eleven, July 1, 1998, at www.dkms.com White_Papers.htm.
[16] Ibid.
[17] The ideas for the AKM owe much to the following White Papers. Template
Software, "Integration Solutions for the Real-Time Enterprise: EIT -
Enterprise Integration Template," Dulles, VA, White Paper May 8, 1998. See
also www.template.com.
Persistence Software, "The PowerTier Server: A
Technical Overview" at
www.persistence.com/products/tech_overview.html,
and John Rymer, "Business Process Engines, A New Category of Server Software,
Will Burst the Barriers in Distributed Application Performance Engines,"
Emeryville, CA, Upstream Consulting White Paper, April 7, 1998 at
www.persistence.com/products/wp_rymer.html. Two other products that
could be used to develop the AKM component are DAMAN's InfoManager (inquire at
www.damanconsulting.com),
and Ibex's DAWN workflow product along with
its ITASCA active database (at www.ibex.ch)
[18] A recent white paper of mine "Knowledge Management Metrics Development: A
Technical Approach," treats KDD as a use case in the DKMS. See it at
www.dkms.com/White_Papers.htm.
[19] See John Rymer, "Business Process Engines . . ." op. cit.
[20] This is a very close paraphrase of John Rymer's statement in Ibid. P. 2.
It is not quoted because I omitted bullet points and added the clarification
on the use of agents.
Biography
Joseph M. Firestone, Ph.D.
CEO, Chief Scientist
Executive Information Systems Inc (EIS)
703-461-8823, eisai@home.com
Joseph M. Firestone, Ph.D. is CEO and Chief Scientist of Executive Information Systems (EIS)
Inc. Joe has varied experience in consulting, management, information
technology, decision support, and social systems analysis. Currently, he
focuses on product, methodology, architecture, and solutions development in
Enterprise Information and knowledge Portals, where he performs Knowledge and
knowledge management audits, training, and facilitative systems planning,
requirements capture, analysis, and design. Joe was the first to define and
specify the Enterprise Knowledge Portal Concept. He is widely published in the
areas of Decision Support (especially Enterprise Information and Knowledge
Portals, Data Warehouses/Data Marts, and Data Mining), and Knowledge
Management, and has recently completed a full-length industry report entitled
"Approaching Enterprise
Information Portals." Joe is a founding member of the Knowledge Management
Consortium International (KMCI), Editor of the new KMCI Journal, Chairperson
of the KMCI’s Artificial Knowledge Management Systems SIG, a member of its
Executive Committee, its Metaprise Project, and the KMCI Institute Governing
Council. Joe is a frequent speaker at national conferences on KM and Portals.
He is also developer of the Web site www.dkms.com, one of the most widely visited
Web sites in the Portal and KM fields. DKMS.com has now reached a visitation
rate of 83,000 visits annually.
Executive Information Systems Inc
The Executive Information Systems (EIS) Enterprise Knowledge Portal (EKP) is
the only portal solution that provides the assurance that enterprise decision
making will be based on validated knowledge. EIS’s EKP lets enterprises avoid
the risk involved in Enterprise Information Portals which claim to offer
increases in competitive advantage, ROI, speed of innovation, productivity,
effectiveness and profitability, but have as a central vulnerability the fact
that they are only capable of managing data and information, not
knowledge.
Enterprises using EIP-based solutions when they could be using EKP-based ones,
are gambling that unvalidated information can produce promised EIP benefits.
The central value proposition of the EIS EKP is that it replaces gambling on
unvalidated information with knowledge-based decision making. That is why it
is much more likely to achieve the promised benefits of EIP-based solutions
than its EIP competitors.
For more information, see
www.dkms.com
|