APPENDIX C: GLOSSARY OF DATA MANAGEMENT TERMS
Aggregate Area. A delineation of two or more
waste sites, waste
area
groups, operable units, or any other basis for aggregation deemed appropriate
by
the parties to provide for more streamlined investigations, studies, or
remedial
actions.
Analysis. The separation of an intellectual or
substantial whole
into its
constituent parts for individual study. The stated findings of such separation
or determination.
Application. The data manipulation and
processing operations
that are
related to specific requirements of an information system. Software or
program
specific to the solution of a specific business problem or process.
Application System. A collection of
applications that uses the
services
provided by the human-computer interface, communications facility, and data
management system to perform the processing necessary to meet the requirements
of the information system.
Architecture. The manner in which hardware,
software, or data
are
structured. Architecture typically describes how the system or program is
constructed, how its components fit together, and now the protocols and
interfaces are used for communication and cooperation among modules or
components of the system.
Archive. To save data for possible later
use.
Business Rule. A statement reflecting a policy,
operating
standard of
practice or some other form of constraint, which governs the correctness and
usefulness of enterprise information. A business rule statement must always
be
true within the context to which it applies, never sometimes true or partially
true.
Change Control. (1) The process of managing
changes to
individual
components within each version of information that may include source code,
version control, documentation, and system configuration. (2) The process of
controlling, authorizing, and tracking changes to manual or automated
data.
Client/Server. A computing environment in which
applications on
intelligent
workstations work transparently with data and applications on other processors
across a local or wide area network.
Complex (EM). U.S. Department of Energy (DOE)
Environmental
Restoration and
Waste Management (EM) sites and operations.
Computer System. A collection of hardware
managed as a single
unit by
software such as an operating system, which also can provide common services
such as access control, interprocess communications, and a graphical user
interface.
Computer-aided Software Engineering (CASE). The
concept of using
automated
interactive graphical design techniques, design checking rules, encyclopedias
of
design elements, and simulation to drive or facilitate the analysis, design,
documentation, construction, and maintenance, of computer systems. Many CASE
tools incorporate a code generation module, which generates application code
automatically from consistent design specifications.
Configuration Management. (1) The environment
that establishes
and
administers the process for uniform change control ensuring stability,
consistency, reliability, quality, and traceability of any/all forms of change
within any enterprise. (2) The system for applying technical and
administrative
direction and surveillance to the identification and documentation of
functional
and physical characteristics of a product, including control of changes and
reporting of change implementation status.
Custodian. An individual designated to manage
data and/or
databases through
data's life cycle. Custodians generally direct the day-to-day, physical
handling of data within business activities.
Data. (1) A representation of facts, concepts,
or instructions
in a
formalized manner suitable for communication, interpretation, or processing by
humans or by automatic means. (2) Symbols, written or stored representing
certain things, ideas, or values and conveying meaningful information in
particular contexts; the encoded representation of information.
Data Accuracy. (1) The closeness of a measured
value to the
accepted true
value. (2) The degree of agreement of a measurement with an accepted
reference
or true value. Accuracy is a measure of the bias in a system.
Data Administration. (1) The function of
controlling the
acquisition,
analysis, storage, retrieval, and distribution of data. Synonymous with Data
Management. (2) The function of developing and administering the policies,
procedures, practices, and plans for the definition, organization, protection,
and efficient use of data as a corporate resource. Functional deliverables of
data administration include providing information about business information
requirements and developing logical (or subject) data models. Data
administration relies heavily, but not exclusively, on the capabilities of a
data directory system.
Data Analysis. The study of the definition and
characteristics
of data, and
the relationships among data elements.
Data Architecture. (1) The conceptual
arrangement of all data
into logical
groupings representing a company-wide perspective. (2) The design and
structure
of an enterprise's data or data systems.
Data Base. See Database.
Data Custodian. An individual or organization
designated to
manage data
and/or databases through data's life cycle. Custodians perform functions as
designated by the trustee. Restoration and Remediation custodial
responsibilities include entering and validating data, assessing and
maintaining
data quality, developing working level plans and procedures for managing data,
monitoring data usage, and controlling access to data.
Data Dictionary. (1) A description of the
characteristics of
data. The
system tables that contain descriptions of the database objects and how they
are
structured. (2) An inventory that describes, defines, and lists all of the
data
elements stored in a database.
Data Driven. The orientation of an information
engineering
approach to
information resource management. This approach results in optimum
reliability,
shareability and usefulness of information across all enterprise activities.
It
maximizes the reusability of all system components.
Data Element. The logical definition of a unit
of information,
apart from
its actual use (that is, physical rendering) within any given program, file,
database report, screen, and so on; the smallest unit of physical data about
which attributes are defined; the lowest level of addressable data in which
data
values are physically stored.
Data Flow. A representation of the passage of
data or
relationships among
business processes, data stores, data flows, and external entities.
Data Flow Diagram. A graphical representation
of business
processes, data
stores, data flows, and external entities.
Data Integrity. The ability to preserve the
completeness,
currency, and
accuracy of the data without unintentional changes; the ability to produce
results that are correct to a predefined level and to maintain data
availability; conformance of data values to a specified set of rules.
Data Item. An individual piece of data in a
data record; the
smallest named
unit into which data is divided in an organization. Also see Data
Element.
Data Life Cycle. The time span from the genesis
of the data
until it is no
longer maintained. This time span includes the creation/collection of data,
the
use and sharing of data, maintenance of data, and the archival or deletion of
data.
Data Management. See Data Administration.
Data Model. (1) A representation of a
collection of data
entities and the
relationship among the entities. (2) A description of the organization of
data
reflecting the information structure of an enterprise.
Data Name. A name (term) given to units of data
(for example,
item,
element, aggregate, record, set) to uniquely identifying that unit of
data.
Data Owner. The owner of data is generally a
company or
customer. The
owner uses data as a resource to support the missions, objectives, and goals
of
a company. The owner is responsible for strategic planning and structuring
and
chartering organizations to perform the business functions that generate data
as
part of their processes.
Data Protection. The implementation of
appropriate
administrative,
technical or physical means to guard against unauthorized, intentional, or
accidental disclosure, modification, or destruction of data. Also see Data
Security.
Data Quality. (1) The totality of features and
characteristics
of data that
bears on its ability to satisfy a given purpose. The characteristics of major
importance are accuracy, precision, completeness, representativeness, and
comparability. (2) An estimation of acceptability or suitability for a given
purpose of an object, item, or tangible or intangible thing.
Data Quality Objectives. Qualitative and
quantitative statements
that
specify the type, amount, and quality of the data required to support
decisions.
(for example, Hanford Federal Facility Agreement and Consent Order
[Tri-Party Agreement] decisions during remedial response activities.)
Data ResourcesData Resources. A concept that
implies data has
the same
characteristics as other more familiar resources, such as manpower, real
property, and machinery. Data resources have such characteristics as cost and
value, similar to the better understood resources. Data resources provide the
information on the basis of which corporate business activities are planned,
coordinated, executed, and evaluated.
Data Security. (1) The concept that access to
units of data be
limited to
users with proper authority and a "need to know." (2) The
protection
of information and data so that unauthorized persons or systems cannot read or
modify them and authorized persons or systems are not denied access to them.
Data Set. A collection of related data
associated with a
specific project
that may be used in supporting a decision pertaining to the project.
Data Sharing. The ability of multiple users to
access common
data. This
ability helps reduce unwanted duplication, inconsistencies, and storage
requirements, and speeds up processing.
Data Standards. The definition of how facts are
to be referred
to, how they
are to be represented, what they will mean, and the rules governing their
informational use throughout the enterprise.
Data Steward. A data steward is any person who
generates,
maintains, or
uses and supports data. The data steward is responsible for maintaining the
data with proper regard to the rights and requirements of the company and
others.
Data Stewardship Program. A concept that
defines an individual's
and
company's responsibility to manage data resources in compliance with
established
standards, policies, and procedures. The Program provides a consistent
approach
for effective data management.
Data Store. A point of data collection or
storage; data in a
state of rest;
a time-delayed data flow.
Data Subject Area. A broad category of
information or data
centered on a
business subject or resource, such as customer, product, or employee.
Data Traceability. The ability to trace the
source of data or a
measured
value.
Data Trustee. An individual responsible for
administering data
in the best
interests of the owner or highest authority (for example, Westinghouse Hanford
Company, DOE, the U.S. Environmental Protection Agency). A data trustee is
responsible for determining policies, procedures, quality requirements, and
constraints for data in his or her area of business.
Data Validation. (1) The process of determining
that data
collected are
appropriate, correct, and complies with applicable standards, rules, and
conventions. (2) The process whereby data are accepted or rejected based on a
set of criteria. This aspect of quality assurance involves establishing
specified criteria for data validation. (3) A systematic review conducted to
confirm the degree of truth in an analytical measurement. The process
includes
the review of all pertinent analysis and QA/QC (Quality Assurance/Quality
Control) data compared to recognized standards or criteria. The review is
conducted to ensure that data are adequate for their intended use(s).
Data Value Standard. A basis for identifying
the unit of
measure, coded or
abbreviated name, or identifier and other values that must have stable content
and format across application system boundaries. It must have the same
meaning
to all functions or systems that access it.
Data Verification. (1) The process of ensuring
that data
accuracy and
correctness are maintained as data is processed. (2) The process of
determining
if a transfer of information has been accomplished accurately.
Database. (1) A collection of data logically
organized to meet
the
information and time requirements of a universe of users. (2) A collection of
interrelated, largely unique data items or records, in one or more computer
files, that may be processed by many different application programs.
Database Administration. A process that
includes performing
physical
database design and development, monitoring performance, backup, security,
recovery, technology, establishing and enforcing database conventions and
providing operational support to users.
Database Management System (DBMS). A
computerized system
consisting of
numerous components, which have as their collective purpose the
implementation,
processing, management, and protection of databases.
Derived Data. Data values derived from the
values of other data
by a
specified algorithm.
Distributed Information System. An information
system the data
objects
and/or processes of which are distributed across two or more computer systems.
Enterprise. (1) A government agency, a
contractor with a
government agency,
or a company in the public or private sector. (2) A term used in place of
corporation, company, government agency, government program and firm.
Generally, an enterprise applies to any kind of organization with autonomous
control over its own approach to information resource management.
Enterprise Model Set. A hierarchial model that
describes the
business in
terms of processes, data, organizations, locations, events, and goals.
Entity. Something of interest to the business
(for example, a
person,
place, thing, or event) about which knowledge workers provide or require
facts.
Entity Relationship Model. A structured
representation of the
kinds of
information required for the enterprise and/or a business area to operate.
This
model reflects entities and the relationships among them.
File. (1) A collection of related data that is
stored and
retrieved by an
assigned name. Synonymous with data set. (2) A collection of rows (or
records)
that have associated columns (or files). The logical equivalent of a table.
File Server. A network node, usually consisting
of fixed-disk
storage and a
Central Process Unit that stores programs and data shared by users on local
area
networks (LAN). File servers offer operating system-level file and/or print
capabilities, while database servers provide database intelligence such as
transaction processing, indexing, logging, and security.
Foreign key. A column (or combination of
columns) whose values
are required
to match a primary key in a table.
Glossary Standard. A standard that documents
terms agreed upon
by all
Hanford contractors. These terms are document to preserve agreement that may
be
lost if not recorded for easy access.
Hanford Local Area Network (HLAN). The primary
LAN used at the
Hanford Site
(see Local Area Network).
Hardware. All or part of the physical
components of an
information system
or computer environment.
Information. (1) The communication of knowledge
or intelligence.
The
meaning assigned to data by means of known conventions or processes. (2)
Knowledge that is derived, calculated, extracted, summarized or assimilated
from
data.
Information System. The collection (set) of
information needed
to support
the products, services, procedures used by knowledge workers in a business
function or process; a system that organizes the storage and manipulation of
information. For an information system or process to exist people and
procedures are necessary; a computer or other information technology is often
a
part of the total system or process, but is not necessary for it to
exist.
Interface. (1) The identifiable point of
connection or
coordination between
two or more defined entities, which must be properly coordinated for
successful
operations. Entities may be either physical items or organizations. (2)
Assurance that current and correct information is transmitted from one
organization or system to another.
Local Area Network (LAN). (1) Two or more
computing units
connected for
local resource sharing. (2) A network in which communications are limited to
a
moderate-size area such as a single office building, warehouse, or campus, and
that does not extend across public rights-of-way. A local area network
enables
personal computers to have access to common data and peripherals. Local area
networks typically consist of personal computers with adapter cards, file
servers, printers, gateways to departmental or corporate computers, and a
network operating system. Database servers are a new technology that also
will
become key local area network components.
Logical Data Model. A representation of data
entities and
relationships,
including attributes and primary keys. A logical data model can be used to
represent data use throughout an organization or can conceptually represent a
single database structure.
Metadata. Data that define and describe other
data, that is, the
description of the data resource, its characteristics, location, use and so
on.
Metadata are used to identify, describe, and define user data.
Network. A computer communications system
linking a series of
computer
elements. A system of interconnected computing devices that can communicate
and
share resources. Networks may be private (for one user) or shared (for many
users).
Normalization. A process of altering the format
of data to
improve the
performance of the data in user operations, especially updating.
Operable Unit. A group of waste sites based on
similar waste
characteristics, waste volumes, physical proximity, site characterization and
remediation strategies, type and numbers of facilities, and presence of
priority
waste units.
Pedigree. Documented traceability to the origin
of the
data.
Physical Data Model. A representation of how
the logical data
entities will
be implemented on the specific hardware and software used to support the
information system.
Primary Key. A single attribute that uniquely
identifies a
particular
entity instance and chosen to be the standard way to access the entity.
Process. (1) A predetermined course of events
defined by its
purpose or by
its effect, achieved under given conditions. (2) An active component of an
information system.
Protocol Standard. A standard that identifies a
method,
regulation,
agreement, or external standard adopted for compliance.
Record. (1) A group of related data elements
treated as a unit.
(2) A
named and usually ordered collection of zero or more data items and data
aggregates that represent the occurrence of a set of data values that describe
the attributes of a particular entity.
Relational database. (1) A database organized
and accessed
according to
relationships between data items. (2) A data structure perceived by its users
as a collection of tables. A relational database consists of tables, rows,
and
columns. Most minicomputers and mainframes today have relational database
systems available for business use. Typical examples are DB2 a trademark of
International Business Machines, Inc. (IBM), and RDB a trademark of Digital
Equipment Corporation. Relational databases differ from nonrelational
databases
in that there are no system dependencies stored within the data (for example,
hierarchical databases are not relational because they contain pointers to
other
data). Structured Query Language (SQL) Server is a relational Database
Management System.
Sampling and Analysis Plan (SAP). Document
describing field
activities to
be conducted during Remedial Investigation (RI) or Remedial Field
Investigation
(RFI). The SAP is composed of the field sampling plan and the QA
project.
Software. (1) A compilable piece of code. (2)
All or part of
the programs,
procedures, rules, and associated documentation of an information system.
Strategic Data Planning. A method of defining
data requirements
from
strategic business plans, developing global logical data models, establishing
data policies, assessing opportunities, and relating information planning
strategies to business requirements.
System. (1) An organized combination of the
services (machines,
materials,
and people) to accomplish a function. (2) A collection of the electronic data
processing jobs and application programs that interact to perform a business
function.
Validation (of data), Validated Data. See Data
Validation.
Verification. To determine or test the truth or
accuracy by
comparison or
reference.
View. An alternative way of looking at the data
in one or more
tables. A
view is usually created as a subset of columns from one or more tables.
Well-Defined Points. Features sharply
identified as discrete
points.
Points not well defined (poorly defined) may have features that do not have a
well-defined center, such as roads that intersect at shallow angles, or soil
boundaries. The selection of well-defined points is made through agreement by
the contracting parties.
Workstation. A computer from which a person
uses word
processing,
spreadsheet, database, and other types of applications to accomplish work,
taking advantage of resources shared on the local area network.