Prominent Databases

MS-Access: This is a small scale or desktop database, part of MS-Office.
AS/400: By IBM.
DB2: By IBM.
Oracle: This has been the market leader for some years.
Lotus Notes: By Lotus owned by IBM. This provides groupware services with database facilities.
IMS: By IBM.

Data Organisation

Data is organised hierarchically (do not confuse this with the hierarchical data model, see below).

Fields

Data is organised using the basic unit of the field. A field holds a single data entry, such as a name, a line of an address, a date of birth (DoB), etc. These form the columns of the table (see below), although they are usually not referred to as columns..

Records (Row, Tuple)

A record is a collection of fields. For example, a the details of a customer may form a record, each detail being a field containing information about the customer’s name, address, phone number, etc. Records form rows in a database table. A record is also sometimes called a tuple in a relational database system.

Table

The records for a particular type of data are held together in a table. Often the tables are each held in a separate file, so sometimes the term file is used instead.

Database

The database is a collection of tables (files). In some systems the term database can be used to mean a collection of related files, so it is possible to create more than one database on the same system. e.g. one for sales and marketing data and a separate one for accounts.

top of page

Storing And Retrieving Records

As stated above a record is a collection of fields, forming a row in a table. New rows are added sequentially. As a result this means that the data is not in alphabetic, or any other order.

The Key Field

In order to make finding a record easier, the records are each assigned a unique number, called the key.

Example Record for an Employee

Key	First Name	Family Name	DoB	Start Date
3551	John	Smith	11/05/1960	01/11/1990

The above example employee record shows a single row in a table. The key field is unique for this entry.

Usually the key field is assigned automatically by the database system as an incriminated number when a new record is created. It is normally the case that it is up to the designer of the database to make sure that the key field is included in each table.

Links Between Tables

In order to allow cross-referencing between tables the keys can be used. Often this is achieved through a bridging table or simply by including the cross-reference key as a part of the record entry. For example, if we want to cross-reference the payroll records with the employee record for the employee, John Smith, we can include the key field reference in the payroll record as an additional field. Alternatively we could build a bridging table, which is simply a list of two or more keys, listing matching keys as a single row (record) in the table.

Index

An index is a list if keys that relates the record location to a physical location on the hard drive. Using the Indexed Sequential Access Method (ISAM), the index is used to identify the cylinder and the track where the record is located. This makes disc access many times faster both for reading records and updating records.

top of page

The Design Of Databases

In designing a database consideration needs to be given to the nature of the data, what it will be used for and what database system is being used.

Data Storage Models

There are three storage models for the logical structure of data.

Hierarchical Data Model

The data is organised in a tree-like structure. The principle record is called the root and subsequent layers are referred to as children. Each record is divided into segments. The segments are connected in one-to many relationships. The hierarchical data model is used by earlier databases, such as Information Management System (IMS) by IBM. This type of data model is good for high through put systems, such as those used by banks for transaction processing.

Network Data Model

This is similar to the hierarchical model, but allows for the representation of many-to-many relationships.

Relational Data Model

Data is treated as existing in tables (of two dimensions). The majority of database systems today work on this basis, giving them the name Relational Database Management Systems (RDBMS).

Object Oriented Data Model

There is talk of object databases but the tendency is to simply convert between the objects used by client software and the relational tables of an RDBMS. True object oriented databases are slowly appearing in the market place, e.g. Objectivity.

Database Design

One of the key tools in the design of database systems is the entity relationship model (ERM). The ERM describes the relationships between different aspects of the data, being able to represent one-to-one, one-to-may and many-to-many relationships.

Normalisation

The normalisation of data involves the streamlining of data, removing redundant data as far as is possible to increase the efficiency of data processing by the database.

top of page

Relational Database Management Systems (RDBMS)

RDBMS refers to the software system for managing the data. The relational aspect refers to being able to make relationships between different data records. Sometimes the term Database Management System (DBMS) is used, this is a more general term and omits the relational nature of modern database systems.

RDBMS have three major components:

Data Definition Language: This is the language used to store the data, defining its structure.
Data Manipulation Language: This is the language used to write and read records. The most widely used language is Structured Query Language (SQL,), sometimes stated in spoken form as, “sequel”.
Data Dictionary: This defines the structure of the data and the relationships between different types of data. It is very useful from a design point of view and can also be used in the production of reports.

top of page

Data Mining

Data mining allows for the statistical analysis of large quantities of data from multiple perspectives. Sometimes the term on-line analytical processing (OLAP) is used.

top of page

Data Warehousing

A data warehouse is a database system that stores both current and historical data, allowing for analysis of data for reporting. Due to the historical data held, the quantities of data tens to be very large. When combined with data mining techniques, valuable information can often be uncovered.

Matthew Martin

top of page

�

Managing Data

Contents

Prominent Databases

Data Organisation

Fields

Records (Row, Tuple)

Table

Database

Storing And Retrieving Records

The Key Field

Example Record for an Employee

Links Between Tables

Index

The Design Of Databases

Data Storage Models

Hierarchical Data Model

Network Data Model

Relational Data Model

Object Oriented Data Model

Database Design

Normalisation

Relational Database Management Systems (RDBMS)

Data Mining

Data Warehousing