Page History

Versions compared: the version saved on Oct 18, 2023 (changes made by eric revollo) compared with the version saved on Jun 06, 2024 (changes made by eric revollo).
To identify duplicate data...

Right-click on a model-version in the tree, or in the diagram for the model-version, and choose Model > Identify duplicate data.

...

Specify the tables to be included or excluded using SQL wildcards and click Next.
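As a rough illustration of how SQL wildcards select names, the sketch below translates a LIKE-style pattern (`%` for any run of characters, `_` for exactly one character) into a regular expression and filters a list of table names. The function names, the sample table names, and the case-sensitive matching are assumptions made for this example; they are not taken from the product, whose matching rules may differ.

```python
import re

def like_to_regex(pattern):
    """Translate a SQL LIKE-style pattern into a compiled regular expression."""
    parts = []
    for ch in pattern:
        if ch == "%":
            parts.append(".*")   # % matches any run of characters, including none
        elif ch == "_":
            parts.append(".")    # _ matches exactly one character
        else:
            parts.append(re.escape(ch))
    return re.compile("".join(parts), re.DOTALL)

def select_tables(tables, include="%", exclude=None):
    """Keep names matching `include` and not matching `exclude` (hypothetical helper)."""
    inc = like_to_regex(include)
    exc = like_to_regex(exclude) if exclude else None
    return [
        t for t in tables
        if inc.fullmatch(t) and not (exc and exc.fullmatch(t))
    ]

# Hypothetical table names, for illustration only.
tables = ["CUSTOMER", "CUSTOMER_ADDRESS", "ORDER_LINE", "TMP_LOAD"]

print(select_tables(tables, include="CUST%"))
print(select_tables(tables, include="%", exclude="TMP_%"))
```

Note that `_` is itself a wildcard, so a pattern such as `TMP_%` also matches a name like `TMPXLOAD`.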

...

The tables matching the criteria in the last step are displayed. Move any tables that should not be included to the Exclude tables list and click Next.

...

Specify the columns to be included or excluded as joining columns using SQL wildcards and click Next.

...

Specify which data types a column must have to be included and click Next.

...

...

- Matching criteria sets which attributes should be checked for duplicate data
- The Sample size section sets which data values are used for the check

...

...

- to sample a set number of distinct values for each attribute from the source database

...

...

- to use the values from the Top 10 most frequent values profiling metric
- Equality criteria sets how attributes should be checked for duplicate data
- Click Confirm.
The duplicate data relationships found are displayed.
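To make the idea of sample-based checking concrete, here is a minimal sketch of one way duplicate data between two attributes could be detected: take the most frequent values of one attribute and test how many of them also occur in another. This is not the product's algorithm, which this page does not describe. The function names, the column data, and the `threshold` parameter are all hypothetical.

```python
from collections import Counter

def top_values(values, n=10):
    """Return the n most frequent non-null values (a stand-in for a Top 10 metric)."""
    counts = Counter(v for v in values if v is not None)
    return [value for value, _ in counts.most_common(n)]

def overlap_ratio(sample, other_values):
    """Fraction of the sampled values that also occur in the other attribute."""
    if not sample:
        return 0.0
    other = set(other_values)
    return sum(1 for v in sample if v in other) / len(sample)

def find_candidates(columns, threshold=1.0):
    """Return ordered attribute pairs whose sampled values overlap at or above threshold."""
    names = sorted(columns)
    found = []
    for a in names:
        sample = top_values(columns[a])
        for b in names:
            if a != b and overlap_ratio(sample, columns[b]) >= threshold:
                found.append((a, b))
    return found

# Hypothetical attribute data, for illustration only.
columns = {
    "CUSTOMER.CUSTOMER_ID": [1, 2, 3, 3, 4],
    "ORDERS.CUSTOMER_ID": [3, 3, 1, 4, 4, 2],
    "ORDERS.ORDER_ID": [100, 101, 102, 103, 104, 105],
}

for pair in find_candidates(columns):
    print(pair)
```

With exact equality and a threshold of 1.0, only the two `CUSTOMER_ID` attributes are paired (in both directions). A lower threshold would tolerate some sampled values having no match.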

...

- Check or uncheck the boxes in the Create column of the table to choose which relationships should be created.
- Check Assign foreign key join type to assign a relationship type to the created relationships so they can be easily distinguished in the diagram.
- Check Profile created duplicate data relations to run profiling on the newly created relationships.
Click Finish to create the relationships. Any diagrams open for the model-version are refreshed to show the new relationships.