Join files (2) block

❗️ This block will be deprecated in future releases.

The two_join block merges two source files into a single output file based on specified join conditions and matching headers. It acts as a transformation block in a Connect+ dataflow, enabling you to combine data from two separate files using SQL-style join operations before passing the merged output to downstream blocks.

When to use this block

Use this block when your dataflow needs to combine data from two separate files, for example, to merge bill-level and line-level transaction files before ingestion.
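To make the join types concrete, the following Python sketch shows how bill-level and line-level rows could be combined on a shared BillNumber column. It only illustrates standard SQL join semantics and is not the block's actual implementation. The sample data, the Amount and ItemCode columns, and the helper names read_rows and join_rows are assumptions made for this example.

```python
import csv
import io

# Hypothetical sample files; only BillNumber comes from the examples in this page.
BILL_LEVEL = "BillNumber,Amount\nB1,100\nB2,250\n"
LINE_LEVEL = "BillNumber,ItemCode\nB1,SKU-1\nB1,SKU-2\nB3,SKU-9\n"


def read_rows(text, delimiter=","):
    """Parse delimited text with a header row into a list of dicts."""
    return list(csv.DictReader(io.StringIO(text), delimiter=delimiter))


def join_rows(left, right, left_key, right_key, join_type="LEFT_OUTER_JOIN"):
    """Join two row lists on a single key column using SQL-style semantics."""
    left_cols = list(left[0]) if left else [left_key]
    # The right-hand key column is dropped so the key appears once in the output.
    # A right column that shares a name with a left column overwrites it.
    right_cols = [c for c in (right[0] if right else []) if c != right_key]

    # Index the right-hand rows by key so each left row finds its matches quickly.
    right_index = {}
    for row in right:
        right_index.setdefault(row[right_key], []).append(row)

    keep_unmatched_left = join_type in ("LEFT_OUTER_JOIN", "OUTER_JOIN")
    keep_unmatched_right = join_type in ("RIGHT_OUTER_JOIN", "OUTER_JOIN")

    merged, matched_keys = [], set()
    for lrow in left:
        matches = right_index.get(lrow[left_key], [])
        if matches:
            matched_keys.add(lrow[left_key])
            for rrow in matches:
                merged.append({**lrow, **{c: rrow[c] for c in right_cols}})
        elif keep_unmatched_left:
            # No partner on the right: keep the left row, blank the right columns.
            merged.append({**lrow, **{c: "" for c in right_cols}})

    if keep_unmatched_right:
        for key, rrows in right_index.items():
            if key in matched_keys:
                continue
            for rrow in rrows:
                # No partner on the left: keep the key, blank the other left columns.
                blank_left = {c: "" for c in left_cols}
                blank_left[left_key] = key
                merged.append({**blank_left, **{c: rrow[c] for c in right_cols}})
    return merged


bills = read_rows(BILL_LEVEL)
lines = read_rows(LINE_LEVEL)
for join_type in ("LEFT_OUTER_JOIN", "INNER_JOIN", "RIGHT_OUTER_JOIN", "OUTER_JOIN"):
    rows = join_rows(bills, lines, "BillNumber", "BillNumber", join_type)
    print(join_type, len(rows))
```

In this sample, bill B2 has no line items and line item SKU-9 refers to a bill that is missing from the bill-level file, so the four join types return different row sets.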

Prerequisites

Before configuring this block, make sure you have:

  • Two source files with a common column to join on
  • Regex patterns that uniquely identify each file
  • Column header names used for joining in each file
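
Before entering the patterns, it can help to check locally that each one picks out only its own file. The sketch below uses Python's re module with the example patterns from this page. It assumes the pattern must match the whole filename, which may differ from how Connect+ evaluates the regex. The sample filenames and the classify helper are hypothetical.

```python
import re

# Example patterns from this page, keyed by the file they should identify.
PATTERNS = {
    "File 1": r"BILL_LEVEL_MERGE.*.csv",
    "File 2": r"LINE_LEVEL_MERGE.*.csv",
}


def classify(filename, patterns):
    """Return the labels of every pattern that matches the whole filename."""
    return [
        label
        for label, pattern in patterns.items()
        if re.fullmatch(pattern, filename)
    ]


# Hypothetical filenames used only to exercise the patterns.
samples = [
    "BILL_LEVEL_MERGE_20240101.csv",
    "LINE_LEVEL_MERGE_20240101.csv",
    "README.txt",
]
for name in samples:
    labels = classify(name, PATTERNS)
    if len(labels) == 1:
        print(f"{name}: {labels[0]}")
    else:
        # Zero matches means the file is ignored; two matches means the patterns overlap.
        print(f"{name}: not uniquely identified ({labels})")
```

Note that an unescaped dot in a regex matches any character, so these patterns would also accept a name such as BILL_LEVEL_MERGE_2024xcsv. Writing the dot as `\.` is stricter, if the regex flavor used by the block supports it.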

Configuration fields

| Field name | Required | Description |
| --- | --- | --- |
| Block name | Yes | A name for the block instance. The name must be alphanumeric. There is no character limit. |
| Files 1-2 Join Type | Yes | The type of SQL join used to merge the two files. Select `LEFT_OUTER_JOIN`, `OUTER_JOIN`, `INNER_JOIN`, or `RIGHT_OUTER_JOIN` from the dropdown. Default value: `LEFT_OUTER_JOIN`. |
| All Files Delimiter | Yes | The delimiter used to separate fields in the source files. For example, a comma (`,`). |
| File 1 Regex | Yes | The filename pattern used to identify File 1. For example, `BILL_LEVEL_MERGE.*.csv`. |
| File 1 Headers | Yes | The column header name in File 1 used for the join comparison. To match on multiple headers, enter the header names separated by a comma. For example, `BillNumber`. |
| File 2 Regex | Yes | The filename pattern used to identify File 2. For example, `LINE_LEVEL_MERGE.*.csv`. |
| File 2 Headers | Yes | The column header name in File 2 used for the join comparison. To match on multiple headers, enter the header names separated by a comma. For example, `BillNumber`. |
| Output Filename | Yes | The name of the merged output file. For example, `merged_output.csv`. |
| File Join Use Case | No | The use case for the file join. Select `NONE` or `TRANSACTION_LINE_ITEM` from the dropdown. Default value: `NONE`. |
| Use Alphabetical Sort | No | Determines whether files are sorted in alphanumerical order before merging. Select `true` or `false` from the dropdown. Default value: `false`. |
| Merge based on common Name | No | Determines whether files are identified and merged based on a common name pattern. Select `true` or `false` from the dropdown. Default value: `false`. |
| Merge based on common Name Template File One | No | The filename template for File 1 using the `<common>` tag to identify the common part of the filename. Select `TRANSACTION_LINE_ITEM` or `NONE` from the dropdown. For example, `TktDocument_<common>.dat`. |
| Merge based on common Name Template File Two | No | The filename template for File 2 using the `<common>` tag to identify the common part of the filename. For example, `TktDocument_<common>.dat`. |
| File One is Headerless | No | Indicates whether File 1 does not contain column headers. Select `true` or `false` from the dropdown. Default value: `false`. |
| File two is Headerless | No | Indicates whether File 2 does not contain column headers. Select `true` or `false` from the dropdown. Default value: `false`. |
| File One Mention Header names (for use in mapping and expressions) | No | The column header names for File 1, separated by commas. Required if File 1 is headerless. For example, `BillNumber,Amount,Date`. |
| File Two Mention Header names (for use in mapping and expressions) | No | The column header names for File 2, separated by commas. Required if File 2 is headerless. For example, `BillNumber,ItemCode,Quantity`. |
| Match All Regex | No | Determines whether only the files matching the regex pattern are retained on the server and non-matching files are deleted. Select `true` or `false` from the dropdown. Default value: `false`. |
| All Unique Match | No | Determines whether duplicate files are identified and deleted. Select `true` or `false` from the dropdown. Default value: `false`. |
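
The common-name templates can be read as "the part of the filename marked by `<common>` must be identical in both files". The Python sketch below shows one way such pairing could work. It is an assumption about the behavior, not the block's confirmed logic. The second template, TktCoupon_<common>.dat, the sample filenames, and the helper names are hypothetical and chosen only so that the two files differ.

```python
import re

COMMON_TAG = "<common>"


def template_to_regex(template):
    """Turn a filename template into a regex that captures the <common> part."""
    before, tag, after = template.partition(COMMON_TAG)
    if not tag:
        raise ValueError(f"template has no {COMMON_TAG} tag: {template!r}")
    # Everything around the tag is matched literally.
    return re.compile(re.escape(before) + r"(?P<common>.+)" + re.escape(after))


def pair_by_common_name(filenames, template_one, template_two):
    """Pair File 1 and File 2 candidates that share the same <common> value."""
    regex_one = template_to_regex(template_one)
    regex_two = template_to_regex(template_two)
    ones, twos = {}, {}
    for name in filenames:
        match = regex_one.fullmatch(name)
        if match:
            ones[match.group("common")] = name
            continue
        match = regex_two.fullmatch(name)
        if match:
            twos[match.group("common")] = name
    # Only common values present on both sides form a mergeable pair.
    return {common: (ones[common], twos[common]) for common in ones if common in twos}


# Hypothetical incoming files: the 20240102 document has no matching partner.
incoming = [
    "TktDocument_20240101.dat",
    "TktCoupon_20240101.dat",
    "TktDocument_20240102.dat",
]
pairs = pair_by_common_name(
    incoming, "TktDocument_<common>.dat", "TktCoupon_<common>.dat"
)
for common, (file_one, file_two) in pairs.items():
    print(common, file_one, file_two)
```

Under this reading, a file whose common part has no counterpart on the other side is left unpaired and would not be merged.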