Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • Property Section Azure Data Lake Gen2 Storage Authentication
    • Azure Data Lake Gen2 Storage Account: Azure Data Lake Gen2 Storage Account Name, .
      The token used to read the storage account name in the scripts is $WSL_SRCCFG_azureStorageAccountName$
    • Azure Data Lake Gen2 Storage Account Access Key(Account Key): Azure Data Lake Gen2 Storage Account Access Key also called Account Key, .
      The token used to read the access key is the environment variable: WSL_SRCCFG_azureStorageAccountAccessKey
    • Azure Data Lake Gen2 Storage Account SAS Token: Azure Data Lake Gen2 Storage Account Shared Access Signature (SAS) Token, .
      The token used to read environment variable: WSL_SRCCFG_azureSASToken
  • Property Section Azure Data Lake Gen2 Storage Settings
    • Azure Data Lake Gen2 Storage File System: Azure Data Lake Gen2 Storage File System name, .
      The token used to read the storage file system name in the scripts is $WSL_SRCCFG_azureStorageFileSystem$
    • Azure Data Lake Gen2 Storage File System Directory:  Azure Azure Data Lake Gen2 Storage Directory name where blob exists, .
      The token used to read the directory name in the browse script is $WSL_SRCCFG_azureStorageFileSystemDirectory$
    • File Download Path: Local directory where the file needs to be downloaded for data profiling from the sourceAzure Data Lake Gen2 Storage. For Example Eg: C:\\Source\\Subfolder
      or C:/Source/Subfolder/ The token used to read path name in the browse script is $WSL_SRCCFG_fileDownloadPath$      
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             
  • Property Section Azure Data Lake Gen2 Storage File Filter Options
    • Field Headings/Labels: Indicates whether the first line of the source file contains a heading/label for each field, which is not regarded as data so it should not be loaded. The token used to read field header boolean value in the script is $WSL_SRCCFG_azureDataLakeGen2FirstLineHeader$
    • File Filter Name: Indicates source file name. Provide Azure Blob filename pattern. The file list filters with file extensions, and file name patterns.
      • *.*
      • *.<File Extension>
      • <File Name>.<File Extension>
      • <File Name Start>*

        The Token used to read File Filter Name in the scripts is $WSL_SRCCFG_azureDataLakeGen2FileFilterName$

...

    • Field Delimiter: This is a character that separates the fields within each record of the source file. The field delimiter identifies the end of each field. For Example, comma ( , ),pipe( | ). The token used to reader field delimiter in the script is $WSL_SRCCFG_azureDataLakeGen2FieldDelimiter$
    • Field Enclosure Delimiter: This is a character that delimits BOTH start and end of field value i.e. encapsulates value. A double quote is a common enclosure delimiter. The token used to reader read the field enclosure delimiter in the script is is $WSL_SRCCFG_azureDataLakeGen2FieldEnclosureDelimiter$
    • Record Delimiter: This is to identify how each line/record in the source file is ended/terminated/delineated. Default is '\n' The token used to read the record delimiter value in the script is $WSL_SRCCFG_azureDataLakeGen2RecordDelimiter$          
    • Row Limit for Data Profiling: Number of records to scan for Data Profiling. Data profiling is used to get the column names and data types from the source file. By default, 100 records will be scanned. The token used to read the record delimiter value in the script is $WSL_SRCCFG_azureDataLakeGen2RowLimit$

...