What is difference between Unique Rows and Unique Rows(Hashset)?
Both steps removes duplicate rows and leaves only unique row occurrences.
For Unique Rows, input stream should be sorted; otherwise, only consecutive double rows are evaluated correctly.
The Unique Rows (HashSet) step tracks duplicates in memory and does not require a sorted input to process duplicate rows.
is there any other difference other than this.suppose i want second value instead of first can we do that by using unique row(Hashset)?
In this scenario Unique Rows and Unique Rows(Hashset) doesn’t work, because its name itself indicate unique not the duplicates.
If you require second value then sort the values and use “Add value fields changing sequence” step to
get the second value.