APPENDIX C: GLOSSARY OF DATA MANAGEMENT TERMS


Aggregate Area.
A delineation of two or more waste sites, waste area groups, operable units, or any other basis for aggregation deemed appropriate by the parties to provide for more streamlined investigations, studies, or remedial actions.
Analysis.
The separation of an intellectual or substantial whole into its constituent parts for individual study. The stated findings of such separation or determination.
Application.
The data manipulation and processing operations that are related to specific requirements of an information system. Software or program specific to the solution of a specific business problem or process.
Application System.
A collection of applications that uses the services provided by the human-computer interface, communications facility, and data management system to perform the processing necessary to meet the requirements of the information system.
Architecture.
The manner in which hardware, software, or data are structured. Architecture typically describes how the system or program is constructed, how its components fit together, and now the protocols and interfaces are used for communication and cooperation among modules or components of the system.
Archive.
To save data for possible later use.
Business Rule.
A statement reflecting a policy, operating standard of practice or some other form of constraint, which governs the correctness and usefulness of enterprise information. A business rule statement must always be true within the context to which it applies, never sometimes true or partially true.
Change Control.
(1) The process of managing changes to individual components within each version of information that may include source code, version control, documentation, and system configuration. (2) The process of controlling, authorizing, and tracking changes to manual or automated data.
Client/Server.
A computing environment in which applications on intelligent workstations work transparently with data and applications on other processors across a local or wide area network.
Complex (EM).
U.S. Department of Energy (DOE) Environmental Restoration and Waste Management (EM) sites and operations.
Computer System.
A collection of hardware managed as a single unit by software such as an operating system, which also can provide common services such as access control, interprocess communications, and a graphical user interface.
Computer-aided Software Engineering (CASE).
The concept of using automated interactive graphical design techniques, design checking rules, encyclopedias of design elements, and simulation to drive or facilitate the analysis, design, documentation, construction, and maintenance, of computer systems. Many CASE tools incorporate a code generation module, which generates application code automatically from consistent design specifications.
Configuration Management.
(1) The environment that establishes and administers the process for uniform change control ensuring stability, consistency, reliability, quality, and traceability of any/all forms of change within any enterprise. (2) The system for applying technical and administrative direction and surveillance to the identification and documentation of functional and physical characteristics of a product, including control of changes and reporting of change implementation status.
Custodian.
An individual designated to manage data and/or databases through data's life cycle. Custodians generally direct the day-to-day, physical handling of data within business activities.
Data.
(1) A representation of facts, concepts, or instructions in a formalized manner suitable for communication, interpretation, or processing by humans or by automatic means. (2) Symbols, written or stored representing certain things, ideas, or values and conveying meaningful information in particular contexts; the encoded representation of information.
Data Accuracy.
(1) The closeness of a measured value to the accepted true value. (2) The degree of agreement of a measurement with an accepted reference or true value. Accuracy is a measure of the bias in a system.
Data Administration.
(1) The function of controlling the acquisition, analysis, storage, retrieval, and distribution of data. Synonymous with Data Management. (2) The function of developing and administering the policies, procedures, practices, and plans for the definition, organization, protection, and efficient use of data as a corporate resource. Functional deliverables of data administration include providing information about business information requirements and developing logical (or subject) data models. Data administration relies heavily, but not exclusively, on the capabilities of a data directory system.
Data Analysis.
The study of the definition and characteristics of data, and the relationships among data elements.
Data Architecture.
(1) The conceptual arrangement of all data into logical groupings representing a company-wide perspective. (2) The design and structure of an enterprise's data or data systems.
Data Base.
See Database.
Data Custodian.
An individual or organization designated to manage data and/or databases through data's life cycle. Custodians perform functions as designated by the trustee. Restoration and Remediation custodial responsibilities include entering and validating data, assessing and maintaining data quality, developing working level plans and procedures for managing data, monitoring data usage, and controlling access to data.
Data Dictionary.
(1) A description of the characteristics of data. The system tables that contain descriptions of the database objects and how they are structured. (2) An inventory that describes, defines, and lists all of the data elements stored in a database.
Data Driven.
The orientation of an information engineering approach to information resource management. This approach results in optimum reliability, shareability and usefulness of information across all enterprise activities. It maximizes the reusability of all system components.
Data Element.
The logical definition of a unit of information, apart from its actual use (that is, physical rendering) within any given program, file, database report, screen, and so on; the smallest unit of physical data about which attributes are defined; the lowest level of addressable data in which data values are physically stored.
Data Flow.
A representation of the passage of data or relationships among business processes, data stores, data flows, and external entities.
Data Flow Diagram.
A graphical representation of business processes, data stores, data flows, and external entities.
Data Integrity.
The ability to preserve the completeness, currency, and accuracy of the data without unintentional changes; the ability to produce results that are correct to a predefined level and to maintain data availability; conformance of data values to a specified set of rules.
Data Item.
An individual piece of data in a data record; the smallest named unit into which data is divided in an organization. Also see Data Element.
Data Life Cycle.
The time span from the genesis of the data until it is no longer maintained. This time span includes the creation/collection of data, the use and sharing of data, maintenance of data, and the archival or deletion of data.
Data Management.
See Data Administration.
Data Model.
(1) A representation of a collection of data entities and the relationship among the entities. (2) A description of the organization of data reflecting the information structure of an enterprise.
Data Name.
A name (term) given to units of data (for example, item, element, aggregate, record, set) to uniquely identifying that unit of data.
Data Owner.
The owner of data is generally a company or customer. The owner uses data as a resource to support the missions, objectives, and goals of a company. The owner is responsible for strategic planning and structuring and chartering organizations to perform the business functions that generate data as part of their processes.
Data Protection.
The implementation of appropriate administrative, technical or physical means to guard against unauthorized, intentional, or accidental disclosure, modification, or destruction of data. Also see Data Security.
Data Quality.
(1) The totality of features and characteristics of data that bears on its ability to satisfy a given purpose. The characteristics of major importance are accuracy, precision, completeness, representativeness, and comparability. (2) An estimation of acceptability or suitability for a given purpose of an object, item, or tangible or intangible thing.
Data Quality Objectives.
Qualitative and quantitative statements that specify the type, amount, and quality of the data required to support decisions. (for example, Hanford Federal Facility Agreement and Consent Order [Tri-Party Agreement] decisions during remedial response activities.)
Data ResourcesData Resources.
A concept that implies data has the same characteristics as other more familiar resources, such as manpower, real property, and machinery. Data resources have such characteristics as cost and value, similar to the better understood resources. Data resources provide the information on the basis of which corporate business activities are planned, coordinated, executed, and evaluated.
Data Security.
(1) The concept that access to units of data be limited to users with proper authority and a "need to know." (2) The protection of information and data so that unauthorized persons or systems cannot read or modify them and authorized persons or systems are not denied access to them.
Data Set.
A collection of related data associated with a specific project that may be used in supporting a decision pertaining to the project.
Data Sharing.
The ability of multiple users to access common data. This ability helps reduce unwanted duplication, inconsistencies, and storage requirements, and speeds up processing.
Data Standards.
The definition of how facts are to be referred to, how they are to be represented, what they will mean, and the rules governing their informational use throughout the enterprise.
Data Steward.
A data steward is any person who generates, maintains, or uses and supports data. The data steward is responsible for maintaining the data with proper regard to the rights and requirements of the company and others.
Data Stewardship Program.
A concept that defines an individual's and company's responsibility to manage data resources in compliance with established standards, policies, and procedures. The Program provides a consistent approach for effective data management.
Data Store.
A point of data collection or storage; data in a state of rest; a time-delayed data flow.
Data Subject Area.
A broad category of information or data centered on a business subject or resource, such as customer, product, or employee.
Data Traceability.
The ability to trace the source of data or a measured value.
Data Trustee.
An individual responsible for administering data in the best interests of the owner or highest authority (for example, Westinghouse Hanford Company, DOE, the U.S. Environmental Protection Agency). A data trustee is responsible for determining policies, procedures, quality requirements, and constraints for data in his or her area of business.
Data Validation.
(1) The process of determining that data collected are appropriate, correct, and complies with applicable standards, rules, and conventions. (2) The process whereby data are accepted or rejected based on a set of criteria. This aspect of quality assurance involves establishing specified criteria for data validation. (3) A systematic review conducted to confirm the degree of truth in an analytical measurement. The process includes the review of all pertinent analysis and QA/QC (Quality Assurance/Quality Control) data compared to recognized standards or criteria. The review is conducted to ensure that data are adequate for their intended use(s).
Data Value Standard.
A basis for identifying the unit of measure, coded or abbreviated name, or identifier and other values that must have stable content and format across application system boundaries. It must have the same meaning to all functions or systems that access it.
Data Verification.
(1) The process of ensuring that data accuracy and correctness are maintained as data is processed. (2) The process of determining if a transfer of information has been accomplished accurately.
Database.
(1) A collection of data logically organized to meet the information and time requirements of a universe of users. (2) A collection of interrelated, largely unique data items or records, in one or more computer files, that may be processed by many different application programs.
Database Administration.
A process that includes performing physical database design and development, monitoring performance, backup, security, recovery, technology, establishing and enforcing database conventions and providing operational support to users.
Database Management System (DBMS).
A computerized system consisting of numerous components, which have as their collective purpose the implementation, processing, management, and protection of databases.
Derived Data.
Data values derived from the values of other data by a specified algorithm.
Distributed Information System.
An information system the data objects and/or processes of which are distributed across two or more computer systems.
Enterprise.
(1) A government agency, a contractor with a government agency, or a company in the public or private sector. (2) A term used in place of corporation, company, government agency, government program and firm. Generally, an enterprise applies to any kind of organization with autonomous control over its own approach to information resource management.
Enterprise Model Set.
A hierarchial model that describes the business in terms of processes, data, organizations, locations, events, and goals.
Entity.
Something of interest to the business (for example, a person, place, thing, or event) about which knowledge workers provide or require facts.
Entity Relationship Model.
A structured representation of the kinds of information required for the enterprise and/or a business area to operate. This model reflects entities and the relationships among them.
File.
(1) A collection of related data that is stored and retrieved by an assigned name. Synonymous with data set. (2) A collection of rows (or records) that have associated columns (or files). The logical equivalent of a table.
File Server.
A network node, usually consisting of fixed-disk storage and a Central Process Unit that stores programs and data shared by users on local area networks (LAN). File servers offer operating system-level file and/or print capabilities, while database servers provide database intelligence such as transaction processing, indexing, logging, and security.
Foreign key.
A column (or combination of columns) whose values are required to match a primary key in a table.
Glossary Standard.
A standard that documents terms agreed upon by all Hanford contractors. These terms are document to preserve agreement that may be lost if not recorded for easy access.
Hanford Local Area Network (HLAN).
The primary LAN used at the Hanford Site (see Local Area Network).
Hardware.
All or part of the physical components of an information system or computer environment.
Information.
(1) The communication of knowledge or intelligence. The meaning assigned to data by means of known conventions or processes. (2) Knowledge that is derived, calculated, extracted, summarized or assimilated from data.
Information System.
The collection (set) of information needed to support the products, services, procedures used by knowledge workers in a business function or process; a system that organizes the storage and manipulation of information. For an information system or process to exist people and procedures are necessary; a computer or other information technology is often a part of the total system or process, but is not necessary for it to exist.
Interface.
(1) The identifiable point of connection or coordination between two or more defined entities, which must be properly coordinated for successful operations. Entities may be either physical items or organizations. (2) Assurance that current and correct information is transmitted from one organization or system to another.
Local Area Network (LAN).
(1) Two or more computing units connected for local resource sharing. (2) A network in which communications are limited to a moderate-size area such as a single office building, warehouse, or campus, and that does not extend across public rights-of-way. A local area network enables personal computers to have access to common data and peripherals. Local area networks typically consist of personal computers with adapter cards, file servers, printers, gateways to departmental or corporate computers, and a network operating system. Database servers are a new technology that also will become key local area network components.
Logical Data Model.
A representation of data entities and relationships, including attributes and primary keys. A logical data model can be used to represent data use throughout an organization or can conceptually represent a single database structure.
Metadata.
Data that define and describe other data, that is, the description of the data resource, its characteristics, location, use and so on. Metadata are used to identify, describe, and define user data.
Network.
A computer communications system linking a series of computer elements. A system of interconnected computing devices that can communicate and share resources. Networks may be private (for one user) or shared (for many users).
Normalization.
A process of altering the format of data to improve the performance of the data in user operations, especially updating.
Operable Unit.
A group of waste sites based on similar waste characteristics, waste volumes, physical proximity, site characterization and remediation strategies, type and numbers of facilities, and presence of priority waste units.
Pedigree.
Documented traceability to the origin of the data.
Physical Data Model.
A representation of how the logical data entities will be implemented on the specific hardware and software used to support the information system.
Primary Key.
A single attribute that uniquely identifies a particular entity instance and chosen to be the standard way to access the entity.
Process.
(1) A predetermined course of events defined by its purpose or by its effect, achieved under given conditions. (2) An active component of an information system.
Protocol Standard.
A standard that identifies a method, regulation, agreement, or external standard adopted for compliance.
Record.
(1) A group of related data elements treated as a unit. (2) A named and usually ordered collection of zero or more data items and data aggregates that represent the occurrence of a set of data values that describe the attributes of a particular entity.
Relational database.
(1) A database organized and accessed according to relationships between data items. (2) A data structure perceived by its users as a collection of tables. A relational database consists of tables, rows, and columns. Most minicomputers and mainframes today have relational database systems available for business use. Typical examples are DB2 a trademark of International Business Machines, Inc. (IBM), and RDB a trademark of Digital Equipment Corporation. Relational databases differ from nonrelational databases in that there are no system dependencies stored within the data (for example, hierarchical databases are not relational because they contain pointers to other data). Structured Query Language (SQL) Server is a relational Database Management System.
Sampling and Analysis Plan (SAP).
Document describing field activities to be conducted during Remedial Investigation (RI) or Remedial Field Investigation (RFI). The SAP is composed of the field sampling plan and the QA project.
Software.
(1) A compilable piece of code. (2) All or part of the programs, procedures, rules, and associated documentation of an information system.
Strategic Data Planning.
A method of defining data requirements from strategic business plans, developing global logical data models, establishing data policies, assessing opportunities, and relating information planning strategies to business requirements.
System.
(1) An organized combination of the services (machines, materials, and people) to accomplish a function. (2) A collection of the electronic data processing jobs and application programs that interact to perform a business function.
Validation (of data), Validated Data.
See Data Validation.
Verification.
To determine or test the truth or accuracy by comparison or reference.
View.
An alternative way of looking at the data in one or more tables. A view is usually created as a subset of columns from one or more tables.
Well-Defined Points.
Features sharply identified as discrete points. Points not well defined (poorly defined) may have features that do not have a well-defined center, such as roads that intersect at shallow angles, or soil boundaries. The selection of well-defined points is made through agreement by the contracting parties.
Workstation.
A computer from which a person uses word processing, spreadsheet, database, and other types of applications to accomplish work, taking advantage of resources shared on the local area network.