The Partitioning option lets you split a large data set into smaller subsets
that can be processed in parallel to improve session performance.
Partitioning Terminology
Partition: A partition is a subset of the data that is processed in a single
thread.
Number of partitions: We can divide the data set into smaller subsets by
increasing the number of partitions. When we add partitions, we increase the
number of processing threads, which can improve session performance.
Partition Point: This is the boundary between two stages; partition points
divide the pipeline into stages. A partition point is always associated with a
transformation.
Partition Type: This is the algorithm for distributing data among partitions,
and it is always associated with a partition point. The partition type controls
how the Integration Service distributes data among partitions at partition
points.
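The relationship between partitions and threads can be illustrated outside
Informatica with a small Python sketch. This is not Integration Service code:
the function names, the sample rows, and the simple round-robin split are all
assumptions chosen for illustration, and Python threads stand in for the
session's processing threads only conceptually.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_partitions(rows, num_partitions):
    """Distribute rows across partitions in round-robin order (illustrative)."""
    partitions = [[] for _ in range(num_partitions)]
    for index, row in enumerate(rows):
        partitions[index % num_partitions].append(row)
    return partitions


def process_partition(partition):
    """Stand-in for the work a single thread performs on its partition."""
    return sum(row["amount"] for row in partition)


def run_session(rows, num_partitions):
    """Process each partition in its own thread and collect per-partition totals."""
    partitions = split_into_partitions(rows, num_partitions)
    with ThreadPoolExecutor(max_workers=num_partitions) as pool:
        return list(pool.map(process_partition, partitions))


# Hypothetical sample data: six rows with amounts 10, 20, ..., 60.
rows = [{"id": i, "amount": i * 10} for i in range(1, 7)]
totals = run_session(rows, num_partitions=3)
print(totals)
```

Each partition is handled by exactly one worker, so raising num_partitions
raises the number of threads working at the same time, which mirrors the
definitions above.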
We can open the session partitioning user interface from the menu
Mapping -> Partitions.
Select any transformation in the mapping; the "Delete Partition Point" or
"Edit Partition Point" button then lets you modify its partition point.
Pass-through Partition
In this example, a pass-through partition at the Source Qualifier
transformation is used to split the source data into three data sets that are
processed in parallel. The image below shows how to set up a pass-through
partition for three different sales regions.
Once the partition is set up at the Source Qualifier, an additional Source
Filter option becomes available to restrict the data that each partition reads.
Be sure to write the filter conditions so that no row is processed by more than
one partition; otherwise the data will be duplicated.
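One way to sanity-check a set of filter conditions before entering them is to
run equivalent predicates over a sample of the source rows. The sketch below
does this in Python. The REGION column, its three values, and the partition
names are hypothetical, since the original conditions are not shown in the
text.

```python
# Hypothetical source filters, one per partition, written as Python predicates.
# The REGION column and its values are assumptions made for illustration.
partition_filters = {
    "partition_1": lambda row: row["REGION"] == "AMERICAS",
    "partition_2": lambda row: row["REGION"] == "EMEA",
    "partition_3": lambda row: row["REGION"] == "APAC",
}


def check_filters(rows, filters):
    """Return (unmatched, duplicated): rows hit by no filter, rows hit by several."""
    unmatched, duplicated = [], []
    for row in rows:
        hits = [name for name, matches in filters.items() if matches(row)]
        if not hits:
            unmatched.append(row)
        elif len(hits) > 1:
            duplicated.append(row)
    return unmatched, duplicated


sample_rows = [
    {"ORDER_ID": 1, "REGION": "AMERICAS"},
    {"ORDER_ID": 2, "REGION": "EMEA"},
    {"ORDER_ID": 3, "REGION": "APAC"},
]
unmatched, duplicated = check_filters(sample_rows, partition_filters)
print("unmatched:", unmatched)
print("duplicated:", duplicated)
```

Both lists should be empty for a correct set of filters. A non-empty
duplicated list means some rows would be loaded more than once, and a
non-empty unmatched list means some rows would be skipped by every partition.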
Now enter the start and end values of the range for each partition, as shown
below.
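Separately from the dialog itself, the effect of start and end values can be
sketched in Python as a lookup from a key to the partition whose range
contains it. The ranges below are hypothetical, and the sketch assumes the
start value is inclusive and the end value is exclusive; confirm the boundary
behaviour against the product documentation before relying on it.

```python
# Hypothetical (start, end) ranges, one per partition.
# Assumption: start is inclusive and end is exclusive.
key_ranges = [(1, 1000), (1000, 2000), (2000, 3000)]


def find_partition(key, ranges):
    """Return the index of the partition whose range contains key, or None."""
    for index, (start, end) in enumerate(ranges):
        if start <= key < end:
            return index
    return None


def ranges_overlap(ranges):
    """Return True if any two ranges share a key (which would duplicate rows)."""
    ordered = sorted(ranges)
    return any(prev_end > next_start
               for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:]))


for key in (1, 999, 1000, 2999, 3000):
    print(key, "->", find_partition(key, key_ranges))
```

Under these assumptions a boundary value such as 1000 belongs to exactly one
partition, and a key outside every range is assigned to no partition at all.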