Architecture
[Figure: Node1 to Node4 communicate over the BYNET interconnect. Each Vdisk is shared with no one else. More nodes support more work, more users, and more data.]
13 2012 WIPRO LTD | WWW.WIPRO.COM
Linear Scalability (Contd.)
[Figure: total work accomplished, contrasting the linear scalability of Teradata with the non-linear scalability of traditional transaction processing systems.]
[Figure: node components: Host Adapter, Ethernet Adapter, Gateway S/W, Parser Engines, BYNET.]
Parallel Processing
Linear Scalability - manageable growth via modularity
Experienced Optimizer - CBO
Load Utilities: TPT, FastLoad, MultiLoad, TPump, FastExport
Active Data Warehousing
Easy Database Administration
Designed to run the world's largest enterprise data warehouse databases
Executes on UNIX, Linux and Windows operating systems
Runs on single or multiple nodes
Provides Network and Mainframe connectivity
Supports Industry standard access language (SQL)
Fault tolerant at all levels of hardware and software
Teradata has data warehousing features embedded into the core of the
database.
Year  Events
1979  Teradata Corp founded in Los Angeles, California.
      Development begins on a massively parallel computer.
1982  YNET technology is patented.
1984  Teradata markets the first database computer, the DBC/1012.
      First system purchased by Wells Fargo Bank of California.
1987  First public offering of stock.
1989  Teradata and NCR partner on the next generation of the DBC.
1991  NCR Corporation is acquired by AT&T.
1992  Teradata is merged into NCR.
1996  AT&T spins off NCR Corp with the Teradata product.
1997  Teradata database becomes the industry leader in data warehousing.
http://teradata.com/enterprise-data-warehousing/
7225
Hashing Algorithm
[Figure: hash map with hexadecimal indices; each entry holds an AMP#.]
Hashing
PI Value: defined by the user; used for data distribution and access.
[Figure: the Parsing Engine passes the PI value to the Hashing Algorithm.]
[Figure: Client, Parsing Engine, Dispatcher, and a RET step sent to four AMPs on Node-2.]
Row Distribution
Explanation
1) First, we lock a distinct RETAIL."pseudo table" for read on a RowHash to
prevent global deadlock for RETAIL.CONTRACT.
2) Next, we lock RETAIL.CONTRACT for read.
3) We do an all-AMPs RETRIEVE step from RETAIL.CONTRACT
by way of an all-rows scan with no residual conditions into Spool 1
(group_amps), which is built locally on the AMPs. The size of Spool 1 is
estimated with high confidence to be 15,000 rows (1,320,000
bytes). The estimated time for this step is 0.21 seconds.
4) Finally, we send out an END TRANSACTION step to all AMPs involved
in processing the request.
-> The contents of Spool 1 are sent back to the user as the result of
statement 1. The total estimated time is 0.21 seconds.
Explanation
1) First, we do a single-AMP RETRIEVE step from
RETAIL.CLIENT by way of the unique primary
index "RETAIL.CLIENT.C_CUSTKEY = 993" with no
residual conditions. The estimated time for this step is 0.01
seconds.
-> The row is sent directly back to the user as the result of
statement 1. The total estimated time is 0.01 seconds.
EXPLAIN SELECT
l_orderkey,l_partkey,l_linenumber,l_linestatus
FROM item
WHERE l_orderkey = 54528;
Explanation
1) First, we do a single-AMP RETRIEVE step from
RETAIL.item by way of the primary index
"RETAIL.item.L_ORDERKEY = 54528" with no residual conditions
into Spool 1 (one-amp), which is built locally
on that AMP. The size of Spool 1 is estimated with high
confidence to be 4 rows (120 bytes). The estimated time for this
step is 0.02 seconds.
-> The contents of Spool 1 are sent back to the user as the result of
statement 1. The total estimated time is 0.02 seconds.
PE
The DSW (Destination Selection Word) part of the row hash is fed to the
hash map, which identifies the target AMP number.
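The path from a primary index value to an AMP can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `zlib.crc32` stands in for Teradata's own hashing algorithm, the DSW is taken to be the high-order 16 bits of a 32-bit row hash, and `NUM_AMPS`, `HASH_MAP`, `row_hash` and `target_amp` are hypothetical names, not Teradata interfaces.

```python
import zlib

NUM_AMPS = 4            # assumption: a small system with four AMPs
NUM_BUCKETS = 1 << 16   # assumption: the DSW is 16 bits wide

# Hypothetical hash map: bucket number -> AMP number, assigned round-robin.
HASH_MAP = [bucket % NUM_AMPS for bucket in range(NUM_BUCKETS)]


def row_hash(pi_value) -> int:
    """Stand-in 32-bit row hash (CRC-32), not Teradata's real algorithm."""
    return zlib.crc32(str(pi_value).encode("utf-8")) & 0xFFFFFFFF


def target_amp(pi_value) -> int:
    """Use the high-order 16 bits of the row hash to index the hash map."""
    dsw = row_hash(pi_value) >> 16
    return HASH_MAP[dsw]
```

Because the mapping depends only on the PI value, the same value is always sent to the same AMP, which is what makes a primary index lookup a single-AMP operation.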
Master Index
Table Id + Row Hash
Query Parallelism
Within-a-Step Parallelism
Multi-Step Parallelism
5. Create your own tables, one with a UPI and the other with a NUPI
You are a DBA for a Teradata system that has 200 GB of Perm
Space. You create a User MKRT with 60 GB of Perm Space.
User MKRT creates a User SALES with 40 GB of Perm
Space. How much Perm Space is User MKRT left with?
A. 20 GB
B. 60 GB
C. 100 GB
D. 160 GB
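The bookkeeping behind questions of this kind can be sketched as a toy model: Perm Space given to a new user is deducted from its immediate owner and returned to that owner when the user is dropped. The class, the user names and the sizes below are hypothetical and deliberately differ from the quiz figures; Spool limits are not modelled.

```python
class Owner:
    """Toy model of a user or database that holds Perm Space, in GB."""

    def __init__(self, name, perm_gb, parent=None):
        self.name = name
        self.perm_gb = perm_gb
        self.parent = parent

    def create_user(self, name, perm_gb):
        """Carve the new user's Perm Space out of this owner's allocation."""
        if perm_gb > self.perm_gb:
            raise ValueError(f"{self.name} has only {self.perm_gb} GB left")
        self.perm_gb -= perm_gb
        return Owner(name, perm_gb, parent=self)

    def drop_user(self, child):
        """Return the dropped user's Perm Space to its immediate owner."""
        if child.parent is not self:
            raise ValueError(f"{child.name} is not owned by {self.name}")
        self.perm_gb += child.perm_gb
        child.parent = None


root = Owner("ROOT", 500)            # hypothetical system with 500 GB
user_a = root.create_user("A", 100)  # ROOT: 400 GB, A: 100 GB
user_b = user_a.create_user("B", 30) # A: 70 GB, B: 30 GB
user_a.drop_user(user_b)             # A: 100 GB again
```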
A. 140 GB
B. 40 GB
C. 100 GB
D. 200 GB
User B is dropped. How much Perm and Spool does User A have now?
Teradata Database
Teradata User
Difference between Database and User
Perm Space and its use
Spool and Temp Space and their uses
ANSI Mode:
Transactions are always implicit in ANSI session mode.
A transaction begins when:
The first SQL statement in a session executes
The first SQL statement following the close of a transaction executes
COMMIT or ROLLBACK/ABORT statements close a transaction.
If a transaction includes a DDL statement, it must be the last statement
in the transaction.
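The ANSI mode rules above can be modelled as a small state machine. This is a sketch for illustration, not Teradata code: `AnsiSession` is a hypothetical class, statements are classified only by their first keyword, and the DDL keyword set is an assumption.

```python
class AnsiSession:
    """Toy model of implicit transactions in ANSI session mode."""

    CLOSERS = {"COMMIT", "ROLLBACK", "ABORT"}
    DDL = {"CREATE", "ALTER", "DROP"}   # assumption: a partial DDL list

    def __init__(self):
        self.in_transaction = False
        self.ddl_seen = False

    def execute(self, statement: str) -> str:
        keyword = statement.strip().rstrip(";").split()[0].upper()
        if keyword in self.CLOSERS:
            # COMMIT or ROLLBACK/ABORT closes the current transaction.
            self.in_transaction = False
            self.ddl_seen = False
            return "closed"
        if self.ddl_seen:
            # A DDL statement must be the last one in its transaction.
            raise RuntimeError("DDL must be the last statement in a transaction")
        opened = not self.in_transaction
        self.in_transaction = True      # any other statement opens or continues
        if keyword in self.DDL:
            self.ddl_seen = True
        return "opened" if opened else "continued"
```

In this model the first statement after a COMMIT opens a new transaction without any explicit BEGIN, which is what "implicit" means here.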
The Down AMP Recovery Journal (DARJ) keeps track of all changes that
would have been written to the failed AMP.
When the AMP comes back online, the DARJ brings the AMP up to date by
applying the missed transactions.
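The catch-up behaviour can be sketched as a journal that buffers writes while an AMP is down and replays them in order when it returns. This is a heavily simplified illustration; `Amp` and `RecoveryJournal` are hypothetical classes and say nothing about how the journal is actually stored.

```python
class Amp:
    """Toy AMP that holds rows in a dictionary."""

    def __init__(self):
        self.rows = {}
        self.online = True


class RecoveryJournal:
    """Buffers changes aimed at a down AMP and replays them later."""

    def __init__(self):
        self.pending = []

    def write(self, amp, key, value):
        if amp.online:
            amp.rows[key] = value              # normal path: apply directly
        else:
            self.pending.append((key, value))  # AMP is down: journal the change

    def catch_up(self, amp):
        """Bring the AMP back online and apply missed changes in order."""
        amp.online = True
        for key, value in self.pending:
            amp.rows[key] = value
        self.pending.clear()
```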
[Figure: CLIQUE-1 and CLIQUE-2 connected over the BYNET.]
Locking prevents multiple users who are trying to change the same
data at the same time from violating the data's integrity. This
concurrency control is implemented by locking the desired data.
You are a DBA for a Teradata system and you need to protect
your system against the failure of an entire node. Which
protection feature would you choose?
A. Clique
B. Fallback Cluster
C. RAID
D. Database Locks
Index Wizard
Statistics Wizard
Visual Explain
All of these are covered in the Teradata SQL Tuning module.
Temporary Tables
Derived Tables: require Spool Space
Volatile Tables: require Spool Space
Global Temporary Tables: require Temp Space
Secondary Indexes
USI
NUSI
Value Order NUSI
Join Index
Hash Index
Data Warehouse
A data warehouse is a central, enterprise-wide database that
contains information obtained from operational systems and is
designed around decision support systems (DSS).
Active Data Warehouse
A. OLTP
B. DSS
C. OLDB
D. RDDB
Summary
References
Thank You