What is database in an organization?

What is Data?

Before we get into the concept of a database, we should first understand what data is. Put simply, data are pieces of information or facts related to the object being considered. For example, examples of data relating to an individual would be the person’s name, age, height, weight, ethnicity, hair color, and birthdate. Data is not limited to facts themselves, as pictures, images, and files are also considered data.

There are a few key terms that would be useful to help one understand data more, particularly the relation between data and databases.

Fields: Within a database, a field contains the most detailed information about events, people, objects, and transactions.

Record: A record is a collection of related fields.

Table: A table is a collection of related records with a unique table name

Database: A database is a collection of related tables.

What is the Role of Databases in an Enterprise?

Enterprises typically make use of both internal databases and external databases. Internal databases typically include operational databases and data warehouses. The former, operational databases, refer to databases that are actively used in the operations of the business, such as accounting, sales, finance, and HR.

Data warehouses contain data collected from several sources, and the data contained within are generally not used for routine business activities. Instead, data warehouses are usually used for business intelligence purposes. External databases refer to databases external to an organization and are generally accessed over the Internet and are owned by other organizations. An example of an external database is the SEC database.

Components of a Database

The five major components of a database are:

1. Hardware

Hardware refers to the physical, electronic devices such as computers and hard disks that offer the interface between computers and real-world systems.

3. Software

Software is a set of programs used to manage and control the database and includes the database software, operating system, network software used to share the data with other users, and the applications used to access the data.

3. Data

Data are raw facts and information that need to be organized and processed to make it more meaningful. Database dictionaries are used to centralize, document, control, and coordinate the use of data within an organization. A database is a repository of information about a database (also called metadata).

4. Procedures

Procedures refer to the instructions used in a database management system and encompass everything from instructions to setup and install, login and logout, manage the day-to-day operations, take backups of data, and generate reports.

5. Database Access Language

Database Access Language is a language used to write commands to access, update, and delete data stored in a database. Users can write commands using Database Access Language before submitting them to the database for execution. Through utilizing the language, users can create new databases, tables, insert data, and delete data.

What is database in an organization?

What is a Database Management System (DBMS)?

A Database Management System (DBMS) is a well-known term in data analysis. It refers to a collection of programs that enable users to access databases and manipulate, maintain, report, and relate data. A DBMS is often used to reduce data redundancy, share data in a controlled way, and reduce data integrity problems. DBMS is not an information system but is simply software.

The relational model, which saves data in table formats, is the most widely used DBMS. The relational DBMS organizes information into rows, columns, and tables, making it easier to find relevant information. Relational databases are popular because they are easy to extend, and new data categories can be added after the original database is created without large amounts of modification.

The Structured Query Language (SQL) is considered the standard user and application program interface for a relational database, and all relational DBMS software supports SQL. Examples include FileMaker Pro, Microsoft Access, Microsoft SQL Server, MySQL, and Oracle.

More Resources

To keep learning and developing your knowledge of business intelligence and data analysis, we highly recommend the additional resources below:

- [Instructor] The purpose of a database is to organize your data, and to make it available in convenient forms. My SQL is a relational database. A relational database is organized in two dimensional tables comprised of rows and columns. A relational database has tables. It could have one table, or many tables. In this example the database has three tables, customer, item, and sale. A table has rows and columns. In this example, the customer table has three rows and six columns. The columns are named id, name, address, city, state, and zip. A row is like an individual record in a table. In fact, some will use the term record interchangeably with row. In this example, the highlighted row is the record for Mary Smith in the customer table. A column is like a field. And again, some will use the term field interchangeably with column. In this example, the highlighted column is named address and contains an address for each row in the table. Each row in a table has a unique key. The…

Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.

  • Exercise File: Subscribe to access. Ex_Files_MySQL_EssT.zip

Watch courses on your mobile device without an internet connection. Download courses using your iOS or Android LinkedIn Learning app.

What is database in an organization?
  • A data model in which data is organized in a top-down, or inverted tree structure with lower level records subordinate to higher-level records
  • A lower level record is called a child and a higher level record is called a parent
  • Each parent record can have many child records, but each child record can have only one parent record.
  • The top of the tree structure consists of a single node that does not have any parent and is called the root node or root record
  • Relationships formed must be such that only one-to-many or one-to-one relationships exist between a parent and a child.
  • Hierarchical databases are used for very fast, specific-purpose data storage and retrieval. They are used mainly on mainframes.
  • Found in large legacy systems requiring intensive high volume transactions (TPS): banks, insurance companies
  • A hierarchical database stores data according to a set and rigid design. The only way to access data is through the hierarchy – which can be very fast if you know exactly what you want the database to do.
  • Hierarchical databases are constructed for specific purposes where the requirements are not expected to change.

Example: a bank that wants to record information about its customers.

The top-level records may hold information about customers.

The next level down could include records with information about accounts.  For instance, a customer can have a savings account, a current account, and a loan account.

The next level down may include records that stored information about transactions in each account.

Advantages

  • Ease with which data can be stored and retrieved in structured, routine types of transactions.
  • Ease with which data can be extracted for reporting purposes.
  • Routine types of transaction processing are fast and efficient.
  • Accessing or updating data is very fast because the relationships have been predefined

Disadvantages

  • Hierarchical one-to-many relationships must be specified in advance, and are not flexible. Cannot easily handle ad hoc requests for information.
  • Since the structure must be defined in advance, hierarchical databases are quite rigid. There is only one parent per child and no relationships among the child records.
  • Adding new fields to database records requires that the entire database be redefined.
  • A great deal of redundancy because it does not handle many to many relationships

Network Database Model

What is database in an organization?
  • A network database is similar to a hierarchical DBMS, but each child record can have more than one parent record.
  • In-network database terminology, a child record is known as a member.
  • A member can be reached through more than one parent, which is called an owner.
  • Basic Structure:
    • Set: A relationship is called a set. Each set is composed of at least two record types: an owner (parent) record and a member (child) record.
    • A set represents a 1:M relationship between the owner and the member

READ:

Victorian Era Technology and Scientific Discoveries

Advantages

  • More flexible than the hierarchical model because different relationships may be established between different branches of data.
  • Ability to provide sophisticated logical relationships among the records

Disadvantages

  • Network many-to-many relationships must be specified in advance
  • User is limited to retrieving data that can be accessed using the established links between records.
  • Cannot easily handle ad hoc requests for information.
  • It requires that the structure be defined in advance.
  • There are limits to the number of links that can be made among records.

Relational Databases

What is database in an organization?
  • Represents the database as a collection of simple two-dimensional tables called relations.
  • Each individual fact or type of information i.e. entity is stored in its own table.
  • The rows of a relation are referred to as tuples and the column of a relation are referred to as attributes
  • All related tables must have a key field that uniquely identifies each row.
  • Data in different relations is connected through the use of a key field.

Advantages

  • Flexible in that it can handle ad hoc information requests.
  • Easy for programmers to work with.
  • End-users can use this model with little effort or training.
  • Easier to maintain than the hierarchical and network models.
  • Entries can be easily added, deleted, or modified.

Disadvantages

  • A relational DBMS requires much more computer memory and processing time than the earlier models.
  • It allows only text and numerical information to be stored in the database.
  • Cannot process large amounts of business transactions as quickly and efficiently as the hierarchical and network models.
  • Some searches can be time-consuming.

Object-oriented Databases

  • Earlier data models were designed to store text-based data (which includes numbers).
  • Object-oriented databases can store other types of data such as audio, video graphics, pictures that can be combined with text into a multimedia format.
  • For e.g. a hierarchical or network database would contain only numeric and text data about a student – identification number, name, address, etc. By contrast, an object-oriented database might also contain the student’s photograph, a short piece of video, etc.
  • Instead of tables, an object-oriented DBMS stores data in objects.
  • An object can store attributes and methods.
  • Attributes are data in the form of text, sound, video, and pictures.
  • Methods are instructions for actions that can be performed on the object or its attributes.
  • Objects can be placed in a hierarchy so that other objects lower in the hierarchy (subclass objects) can obtain (inherit) attributes from objects higher in the hierarchy (superclass objects). This is known as inheritance.
  • The figure shows four objects in an object-oriented DBMS.
  • The superclass object Employee provides the same set of attributes to each of the subclasses – OfficeWorker, Manager, and ProductionWorker.
  • In other words, every Manager would have a name, address, salary, SS# (as would every OfficeWorker and ProductionWorker).
  • Objects are divided into three parts: the object name, the attributes, and the methods.

READ:

Economics Unit 2 – Business Organization

Advantages

  • Can be used to store complex data types
  • No Primary Keys: The user of an RDBMS has to worry about uniquely identifying tuples by their values and making sure that no two tuples have the same primary key values to avoid error conditions. In an OODBMS, the unique identification of objects is done behind the scenes via OIDs and is completely invisible to the user. Thus there is no limitation on the values that can be stored in an object.
  • A query language is not necessary for accessing data from an OODBMS since interaction with the database is done by transparently accessing objects.
  • Objects in an OODBMS can store an arbitrary number of atomic types as well as other objects.

Disadvantages

  • Lack of OODM standards
  • Steep learning curve
  • High system overhead slows transactions
  • Language Dependence: An OODBMS is typically tied to a specific language via a specific Application Programming Interface (API).  This means that data in an OODBMS is typically only accessible from a specific language using a specific API, which is typically not the case with an RDBMS.
  • Lack of Ad-Hoc Queries: In an RDBMS, the relational nature of the data allows one to construct ad-hoc queries where new tables are created from joining existing tables then querying them. Since it is currently not possible to duplicate the semantics of joining two tables by “joining” two classes then there is a loss of flexibility with an OODBMS. Thus the queries that can be performed on the data in an OODBMS are highly dependent on the design of the system.