Sei sulla pagina 1di 7

Teradata Architecture: Teradata is node Architecture. A Node is collection of hardware and software components.

Teradata system contains one or more nodes. A node is term for a processing unit under the control of a single operating system. There are two types of Teradata Systems. SMP (Symmetric Multiprocessing)--An SMP Teradata system has a single node that contains multiple CPUs sharing a memory pool. MPP (Massively Parallel Processing)-- Multiple SMP nodes working together comprise a larger, MPP implementation of Teradata. The nodes are connected to the BYNET, which allows multiple virtual processors on multiple nodes to communicate with each other.

A node is a basic building block of a Teradata system, and contains a large number of hardware and software components.

What is Shared Nothing Architecture? The Teradata vprocs (which are the PEs and AMPs) share the components of the nodes (memory and cpu). The main component of the "shared-nothing" architecture is that each AMP manages its own dedicated portion of the system's disk space (called the vdisk) and this space is not shared with other AMPs. Each AMP uses system resources independently of the other AMPs so they can all work in parallel for high system performance overall. BYNET :( BanYan NETwork) BYNET is a communication layer between PE and AMP. It acts as message communication layer between PE and AMP. The BYNET hardware and software handle the communication between the vprocs and the nodes.

Hardware: The nodes of an MPP system are connected with the BYNET hardware, consisting of BYNET boards and cables. Software: The BYNET software is installed on every node. This BYNET driver is an interface between the PDE software and the BYNET hardware. SMP systems do not contain BYNET hardware. The PDE and BYNET software emulate BYNET activity in a single-node environment. The SMP implementation is sometimes called "board less BYNET."

The BYNET hardware can carry the following types of messages between nodes:

Broadcast message to all nodes Point-to-point message from one node to another node

Teradata Parallel Database Extension: The Parallel Database Extensions (PDE) software layer was added to the operating system to support the parallel software environment. Parallel Database Extension(PDE) is a software interface layer that lies between the operating system and Teradata database. PDE support the parallelism that gives Teradata database is speed and linear scalability. The operating system can be windows, linux, mp-ras.PDE provides Teradata database with ability to --Run in parallel environment. --Execute Vprocs

Trusted Parallel Application (TPA)


A Trusted Parallel Application (TPA) uses PDE to implement virtual processors (vprocs). The Teradata Database is classified as a TPA. The four components of the Teradata TPA are: PE. AMP. Chanel Driver. Teradata gateway.

Parsing Engine: A Parsing Engine (PE) is a vproc that manages the dialogue between a client application and the Teradata Database, once a valid session has been established. Each PE can support a maximum of 120 sessions. The PE handles an incoming request in the following manner: PE has following components. --Session control --The Session Control component verifies the request for session authorization (user names and passwords), and either allows or disallows the request. --Parser
o o o

Interprets the SQL statement received from the application. Verifies SQL requests for the proper syntax and evaluates them semantically. Consults the Data Dictionary to ensure that all objects exist and that the user has authority to access them.

--Optimizer The Optimizer develops the least expensive plan (in terms of time) to return the requested response set. Which are then passed to the Dispatcher. --Generator It generates the steps provided by the optimizer. --Dispatcher. The Dispatcher controls the sequence in which the steps are executed and passes the steps on to the BYNET for execution by the AMPs. After the AMPs process the steps, the PE receives their responses over the BYNET.

AMP (Access Module Processor): The AMP is a vprocs that controls its portion of the data on the system. The AMPs work in parallel, each AMP managing the data rows stored on its vdisk. AMPs are involved in data distribution and data access in different ways. AMP Responsible for

Lock databases and tables Create, modify, or delete definitions of tables Join tables Insert, delete, or modify rows within tables Sort, aggregate, or format data Retrieve information from definitions and rows from tables.

Channel Driver: Channel Driver software is the means of communication between an application and the PEs assigned to channel-attached clients. There is one Channel Driver per node. Teradata Gateway: Teradata Gateway software is the means of communication between an application and the PEs assigned to network-attached clients. There is one Teradata Gateway per node.

Potrebbero piacerti anche