Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

To identify duplicate data...

  1. RrightRight-click on a model-version in the tree, or in the diagram for the model-version, and choose Model > Identify duplicate data.

...

  1. Image Added  Image Added
  2. Specify the tables to be included or excluded using SQL wild cards and click click Next.

...

  1. Image Added
  2. The tables matching the criteria in the last step are displayed. Move any specific tables tables not to be included to the Exclude tables list and click Next.

...

  1. Image Added
  2. Specify the the columns to be included or excluded as joining columns using SQL wild cards and click Next.

...

  1. Image Added
  2. Specify which which data types columns it must have to be included and click Next.

...

  1. Image Added
  2. Enter the criteria for identifying duplicate data.

...

  1. Image Added
    • Matching criteria sets which attributes should be checked for duplicate data
    • The Sample size section is used to select what data should be used

...

    • Select Distinct value sample size

...

    • to sample a set number of distinct values for each attribute from the source database

...

    • Select Use top 10 most frequent values

...

    • to use the values from the Top 10 most frequent values profiling metric
    • Equality criteria sets how attributes should be checked for duplicate data
    • Click Confirm.
  1. The duplicate data relationships found is displayed.

...

  1. Image Added
    • Check or un-check the boxes in the Create column of the table to choose which relationships should be created.
    • Check Assign foreign key join type to assign a relationship type to the created relationships so they can be easily distinguished in the diagram.
    • Check Profile created duplicate data relations to run profiling on the newly created relationships.
  2. Click Click Finish to create the relationships. Any diagrams open for the model-version is refreshed to show the new relationships.