Oracle Optimizer

RBO vs. CBO in Oracle Optimizer
Overview
[Diagram: Query -> Parse -> Query rewrite -> Optimizer: query optimization (RBO / CBO) -> QEP generation -> Query execution -> Result]
Understanding RBO
- Released with Oracle 6.
- Uses an ordered list of access methods and join methods, ranked by the relative cost of each operation.
- Has very limited input for determining access paths.
- Will be removed from the Oracle database server.
- Normally chooses the path from right to left in the FROM clause.
- If a hint (other than the RULE hint) is supplied, the statement runs under the CBO.
- On some complex queries, it outperforms the CBO.
RBO ranking
1. Single row by ROWID
2. Single row by cluster join
3. Single row by hash cluster key with unique key
4. Single row by unique index
5. Cluster join
6. Hash cluster key
7. Indexed cluster key
8. Composite key
9. Single-column non-unique index
10. Bounded range search on indexed columns
11. Unbounded range search on indexed columns
12. Sort-merge join
13. MAX or MIN of indexed columns
14. ORDER BY on indexed columns
15. Full table scan
RBO: Determining Access path

- First, the RBO evaluates the join predicates and the filtering predicates.
- It assigns a score to each predicate, working from the last predicate to the first.
- It picks the predicate with the lowest score, then evaluates the next.
- If there are ties, it chooses arbitrarily based on:
  - the order of the tables in the FROM clause
  - the age of an index
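The selection rule above can be sketched in a few lines of Python. This is only an illustration, not Oracle's implementation: the `Candidate` type, the table names, and the exact tie-break (prefer the table that appears later in the FROM clause, since the RBO reads it right to left) are assumptions made for the example. The ranks come from the RBO ranking list.

```python
from typing import NamedTuple

class Candidate(NamedTuple):
    """One possible access path (hypothetical structure for illustration)."""
    table: str          # table the access path applies to
    access_path: str    # descriptive label only
    rank: int           # position in the RBO ranking list (1 = best)
    from_position: int  # 0-based position of the table in the FROM clause

def pick_access_path(candidates: list[Candidate]) -> Candidate:
    # The lowest rank wins. On a tie, this sketch prefers the table that
    # appears later in the FROM clause (assumed right-to-left preference).
    return min(candidates, key=lambda c: (c.rank, -c.from_position))

candidates = [
    Candidate("emp", "full table scan", 15, 0),
    Candidate("emp", "single-column non-unique index", 9, 0),
    Candidate("dept", "single-column non-unique index", 9, 1),
]
best = pick_access_path(candidates)
print(best.table, "via", best.access_path)
```

Note that nothing in this rule looks at the data itself, which is the root of the weaknesses listed next.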
Not so good things about RBO...
- RBO has a small number of possible access methods (it does not recognize IOTs, bitmap indexes, hash joins, and so on).
- It processes the tables in the order in which they appear in the query (this can be good, but most of the time it is not).
- It always ranks execution plans by relative cost in the list, regardless of the data stored in the table. An index scan is always ranked better than a table scan, which is not always true.
- Coding for the RBO has been halted. All new features require the CBO.
- RBO uses poor information to break frequently occurring ties.
Understanding CBO
- It uses all available information: dictionary, statistics, histograms, and supplied parameter settings.
- The CBO uses Oracle's internal formulas to calculate the COST of a SQL statement.
- It is constantly improving from version to version.
- It examines all possible access methods (up to a default limit of 80,000 permutations).
- It evaluates the cost of each access plan, then simply chooses the lowest one.
Available information to CBO

Database statistics (partial list):
- DBA_TABLES (num_rows, blocks, empty_blocks, avg_space, chain_cnt, avg_row_len, last_analyzed, sample_size, avg_space_freelist_blocks)
- DBA_TAB_COLUMNS or DBA_TAB_COL_STATISTICS (num_distinct, low_value, high_value, density, num_nulls, num_buckets)
- DBA_INDEXES (blevel, leaf_blocks, distinct_keys, clustering_factor, num_rows, avg_leaf_blocks_per_key)
What if there are no statistics ...

If there are no statistics, or bind variables are used, then the CBO may use default statistics.

Default statistics for Oracle 7.3.3 and above:

Statistic                                       Default
Selectivity for relations on indexed columns    .009
Selectivity for = on indexed columns            .004
Multiblock read factor                          8
Remote table average row length                 100
Number of blocks                                100
Scan cost                                       13
Index levels                                    1
Number of leaf blocks per key                   1

Initialization parameters that influence CBO cost computation (partial list):
- db_file_multiblock_read_count
- hash_multiblock_io_count
- hash_area_size
- sort_area_size
- bitmap_merge_area_size

Parameters affecting Cost computation

- sort_multiblock_read_count
- optimizer_index_caching
- optimizer_index_cost_adj
- optimizer_percent_parallel
- optimizer_mode (CHOOSE defaults to ALL_ROWS)
  - FIRST_ROWS applies a heuristic bias to the cost model to promote the use of indexes and nested loops.
CBO: Cost of a SQL statement

- Cost is the estimated number of I/O, CPU, and network operations that a statement requires.
- CPU cost (parse) has little impact on SQL tuning in most cases; I/O has the most.
- There is a difference between logical and physical I/O.
- Cost is affected by some parameters: db_file_multiblock_read_count, sort_area_size, hash_area_size, hash_multiblock_io_count, bitmap_merge_area_size
Cost of Accessing Data

Table scan cost depends on:
- Number of blocks below the HWM (high-water mark)
- Multiblock read factor (default 8)
- Number of extents, which is also taken into consideration

Example, for 103 blocks and a multiblock read factor of 8:
Cost: 103 / 8 = 12.875 => 13
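The arithmetic in this example can be reproduced with a short Python sketch. It assumes that 103 is the number of blocks below the HWM and that the result is rounded up; the function name is hypothetical, and the real optimizer also considers extents, which this sketch ignores.

```python
import math

def full_scan_cost(blocks_below_hwm: int, multiblock_read_factor: int = 8) -> int:
    """Simplified full table scan cost: blocks divided by the read factor, rounded up."""
    return math.ceil(blocks_below_hwm / multiblock_read_factor)

print(full_scan_cost(103))  # 103 / 8 = 12.875, rounded up to 13
```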
Cost of Accessing Data

Index scan costs are based on:
- Index access
  - number of levels in the B*-tree
  - number of leaf blocks to examine
- Subsequent table lookup (optional)
  - number of blocks accessed
- The cost of a fast full scan depends on db_file_multiblock_read_count
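As a rough illustration of how these inputs might combine, the sketch below uses a commonly cited approximation for an index range scan: B*-tree levels, plus the fraction of leaf blocks examined, plus the fraction of table blocks visited (estimated from the clustering factor). The formula and the function name are assumptions for illustration; the slides do not give the exact formula, and the real computation varies by Oracle version.

```python
import math

def index_range_scan_cost(blevel: int, leaf_blocks: int, clustering_factor: int,
                          selectivity: float, table_lookup: bool = True) -> int:
    """Approximate index range scan cost (assumed formula, not Oracle's exact one)."""
    # Index access: walk down the B*-tree, then read the matching leaf blocks.
    index_cost = blevel + math.ceil(leaf_blocks * selectivity)
    # Optional table lookup: blocks visited, estimated from the clustering factor.
    table_cost = math.ceil(clustering_factor * selectivity) if table_lookup else 0
    return index_cost + table_cost

# Hypothetical statistics taken from DBA_INDEXES-style columns.
print(index_range_scan_cost(blevel=2, leaf_blocks=1000,
                            clustering_factor=5000, selectivity=0.125))
```

With a high clustering factor the table lookup dominates the total, which is one reason an index scan is not always cheaper than a full table scan.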
Cost of Sorting
Data may need to be sorted for:

- ORDER BY
- Aggregation
- Join operations

Sorts are typically CPU intensive, and can become I/O bound if the sort cannot fit in memory. Cost depends on sort_area_size and the number of rows.
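A very rough way to reason about whether a sort stays in memory is to compare the data volume with sort_area_size. The sketch below is a simplification under stated assumptions: it ignores per-row overhead and multi-pass behaviour, and the function name and sample values are hypothetical.

```python
def sort_fits_in_memory(num_rows: int, avg_row_len: int, sort_area_size: int) -> bool:
    """True if the raw data to sort is no larger than the sort area (simplified)."""
    return num_rows * avg_row_len <= sort_area_size

# Sample values only; avg_row_len would come from table statistics.
print(sort_fits_in_memory(num_rows=500, avg_row_len=100, sort_area_size=65536))
print(sort_fits_in_memory(num_rows=10_000, avg_row_len=100, sort_area_size=65536))
```

When the check fails, the sort spills to temporary segments and the I/O component of the cost grows.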
Cost of a Join (SM)

- Rows from row source 1 are sorted.
- Rows from row source 2 are then sorted by the same sort key.
- Sorted rows from both sides are then merged.
- Cost: sorting, reading tables, I/O for temporary segments.

[Diagram: Row source 1 -> Sort and Row source 2 -> Sort, both feeding MERGE]
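The three steps of a sort-merge join can be sketched in Python as follows. This is an in-memory illustration only; rows are plain tuples, the `key` function and sample data are hypothetical, and no temporary segments are modelled.

```python
def sort_merge_join(left, right, key):
    """Sort both row sources on the join key, then merge matching rows."""
    left_sorted = sorted(left, key=key)
    right_sorted = sorted(right, key=key)
    out, i, j = [], 0, 0
    while i < len(left_sorted) and j < len(right_sorted):
        lk, rk = key(left_sorted[i]), key(right_sorted[j])
        if lk < rk:
            i += 1
        elif lk > rk:
            j += 1
        else:
            # Find the run of right-hand rows sharing this key.
            j_end = j
            while j_end < len(right_sorted) and key(right_sorted[j_end]) == lk:
                j_end += 1
            # Pair every left-hand row with that key against the whole run.
            while i < len(left_sorted) and key(left_sorted[i]) == lk:
                out.extend((left_sorted[i], r) for r in right_sorted[j:j_end])
                i += 1
            j = j_end
    return out

emps = [(20, "Bob"), (10, "Ann"), (20, "Cy")]
depts = [(10, "Sales"), (20, "Ops"), (30, "HR")]
joined = sort_merge_join(emps, depts, key=lambda row: row[0])
print(joined)
```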
Cost of a Join (NL)

- Row source 1 is scanned (the outer, or driving, table).
- Each row returned drives a lookup in row source 2 (the inner table).
- Joining rows are then returned.
- Cost: reading the driving table plus the accesses on the inner table.
- Performance is very dependent on an index on the inner table.

[Diagram: Nested Loop. Outer loop: Access A (Full). Inner loop: check for a match using Access B (ROWID) through index access]
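A nested loops join with an indexed inner table can be sketched like this. A Python dictionary stands in for the index on the inner table, which is an assumption made for illustration; the function names and sample rows are hypothetical.

```python
from collections import defaultdict

def build_index(rows, key):
    """Stand-in for an index on the inner table: key value -> matching rows."""
    index = defaultdict(list)
    for row in rows:
        index[key(row)].append(row)
    return index

def nested_loops_join(outer_rows, inner_index, outer_key):
    """Scan the driving table once; probe the inner index for each outer row."""
    for outer in outer_rows:                                   # outer loop
        for inner in inner_index.get(outer_key(outer), []):    # inner loop
            yield outer, inner

depts = [(10, "Sales"), (30, "HR")]              # driving table
emps = [(10, "Ann"), (20, "Bob"), (10, "Cy")]    # inner table
emp_index = build_index(emps, key=lambda row: row[0])
result = list(nested_loops_join(depts, emp_index, outer_key=lambda row: row[0]))
print(result)
```

Without the index, each outer row would force a full scan of the inner table, which is why the inner-table index matters so much for this join method.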
Cost of a Join (HJ)

- In theory, it is the most efficient join method.
- The smaller row source is used to build a hash table and a bitmap.
- The second row source is hashed and checked against the hash table.
- The bitmap is used as a quick lookup to check whether rows are in the hash table.
- It requires a single pass over each row source, and is more efficient than sorting and merging.

[Diagram: Row source 1 (build input) and Row source 2 (probe); hash table and bitmap filter in MEMORY; DISK; output rows]
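The build and probe phases can be sketched in Python as below. The bitmap here is a simple integer bit mask over hash buckets, which is an assumed simplification of the bitmap filter mentioned above; spilling to disk is not modelled, and all names and sample rows are hypothetical.

```python
def hash_join(build_rows, probe_rows, build_key, probe_key, bitmap_bits=64):
    """Build a hash table on the smaller input, then probe it with the larger one."""
    hash_table = {}
    bitmap = 0
    for row in build_rows:                        # build phase (single pass)
        k = build_key(row)
        hash_table.setdefault(k, []).append(row)
        bitmap |= 1 << (hash(k) % bitmap_bits)    # mark the bucket as occupied
    out = []
    for row in probe_rows:                        # probe phase (single pass)
        k = probe_key(row)
        if not bitmap & (1 << (hash(k) % bitmap_bits)):
            continue                              # bucket empty: cannot match
        for match in hash_table.get(k, ()):
            out.append((match, row))
    return out

depts = [(10, "Sales"), (20, "Ops")]                          # build input
emps = [(10, "Ann"), (30, "Zed"), (20, "Bob"), (10, "Cy")]    # probe input
joined = hash_join(depts, emps, build_key=lambda r: r[0], probe_key=lambda r: r[0])
print(joined)
```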
Join Order Evaluation

- The initial permutation is generated by sorting the tables in ascending order of their computed cardinality, as derived from the WHERE clause predicates.
- For each permutation, the cost is computed and the permutation with the lowest cost is kept.
- At any time, the optimizer keeps the current permutation and the best permutation so far.
- The search defaults to a limit of 80,000 permutations.
- Most of the time the order will not matter, but sometimes it does (possibly when the permutation limit is reached?).
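The search described here can be sketched as a bounded loop over permutations. This is an illustration under stated assumptions: the cost function is a toy (sum of running row-count products), the table names and cardinalities are invented, and only the 80,000 default limit comes from the slides.

```python
from itertools import islice, permutations

CARDINALITIES = {"emp": 1000, "dept": 10, "loc": 5}   # hypothetical row counts

def toy_cost(order):
    """Toy cost: sum of the running product of cardinalities (no selectivity)."""
    running, cost = 1, 0
    for table in order:
        running *= CARDINALITIES[table]
        cost += running
    return cost

def best_join_order(cardinalities, cost_fn, max_permutations=80_000):
    # Initial permutation: tables in ascending order of cardinality.
    initial = sorted(cardinalities, key=cardinalities.get)
    best_order, best_cost = None, float("inf")
    # permutations() yields the initial order first; islice enforces the limit.
    for order in islice(permutations(initial), max_permutations):
        cost = cost_fn(order)
        if cost < best_cost:            # keep the best permutation so far
            best_order, best_cost = order, cost
    return best_order, best_cost

order, cost = best_join_order(CARDINALITIES, toy_cost)
print(order, cost)
```

Because the number of permutations grows factorially with the number of tables, a query joining many tables can hit the limit before every order has been costed.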
Common CBO problem

Problem                         Share
The skewness problem            30%
Analyzing with wrong data       25%
Mixing optimizer in joins       20%
Choosing inferior index         20%
Joining too many tables         < 5%
Incorrect INIT.ORA settings     < 5%
Using HINT
Why use hints ...

- Sometimes the optimizer may not give an optimal plan.
- A hint is supplied as a directive and may be ignored.
- Developers know more about the data.
- Hints can be used to influence:
  - The optimization approach (RULE, ALL_ROWS, FIRST_ROWS)
  - The access path for a table (FULL, INDEX, HASH, ...)
  - The join order and method (ORDERED, LEADING, STAR, USE_NL, USE_MERGE, USE_HASH, ...)
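For readers unfamiliar with the syntax, a hint is written as a specially formatted comment, `/*+ ... */`, placed directly after the SELECT keyword. The Python sketch below only assembles such a statement as a string; the helper name, the `emp` and `dept` tables, and their columns are hypothetical, and nothing is sent to a database.

```python
def with_hint(select_body: str, hint: str) -> str:
    """Place an optimizer hint comment directly after the SELECT keyword."""
    return f"SELECT /*+ {hint} */ {select_body}"

# ORDERED keeps the FROM clause order; USE_NL(d) asks for a nested loops
# join with dept (alias d) as the inner table.
sql = with_hint(
    "e.ename, d.dname FROM emp e, dept d WHERE e.deptno = d.deptno",
    "ORDERED USE_NL(d)",
)
print(sql)
```

Since a malformed hint is treated as an ordinary comment, a typo in the hint text is silently ignored rather than reported as an error.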
Hint: Optimization approach

- CHOOSE defaults to ALL_ROWS.
- The BEST plan may mean:
  - Use minimal resources to process all rows affected by the statement ===> ALL_ROWS (prefers sort-merge joins, SMJ).
  - Return the first row of a statement as quickly as possible ===> FIRST_ROWS (prefers nested loops joins, NLJ).
- The optimizer has limited information available to determine the BEST plan.
Choosing which optimization

- The approach can be set at system level, session level, or statement level.
- FIRST_ROWS returns the first rows as quickly as possible.
- ALL_ROWS is for optimal throughput in batch-oriented applications.
- RULE in some cases outperforms the CBO on complex queries.
- FIRST_ROWS(n) or FIRST_ROWS_n is new in Oracle 9i.
Hint: Access Paths

- FULL
- ROWID
- CLUSTER
- INDEX
- INDEX_ASC
- INDEX_DESC
- INDEX_JOIN
- INDEX_FFS
- NO_INDEX
- AND_EQUAL
Hint: Query Transformation

- USE_CONCAT
- NO_EXPAND
- REWRITE
- MERGE
- NO_MERGE
- STAR_TRANSFORMATION
- FACT
- NO_FACT
HINT: Join Orders and Join Operation

- ORDERED
- STAR
- USE_NL
- USE_MERGE
- USE_HASH
- DRIVING_SITE
- LEADING
- HASH_AJ, MERGE_AJ, NL_AJ
- HASH_SJ, MERGE_SJ, NL_SJ
HINT: Parallel execution and others

Parallel execution:
- PARALLEL
- NOPARALLEL
- PARALLEL_INDEX
- NOPARALLEL_INDEX

Others:
- APPEND
- NOAPPEND
- CACHE
- NOCACHE
- PUSH_PRED
- NO_PUSH_PRED
- PUSH_SUBQ
- ORDERED_PREDICATES
Hint and View

- Using a hint inside a view is not recommended, because the view can be used in different contexts.
- Hints behave differently in mergeable and non-mergeable views.
Hint and Parallel query

- Parallel query performs direct I/O, bypassing the buffer cache.
- It uses the ROWID hint, and the ROWID hint forces a checkpoint on the object.
- Be cautious when setting the parallel degree, because parallel query uses both producer and consumer processes.
