What are you trying to do?
The same exercises as before: 1. identifying mismatched pairs, 2. working out the correct value for 'field X' in each mismatched pair, 3. filling an upload sheet with differently named headers so it can be sent for remediation.
Why are you joining these two CSVs?
The "Report" is made up of these two CSVs: reportdateA = the first 70k rows, reportdatea = the next 69k or so rows.
So I could do the UID and column deletes in each file separately, but the lookup for 'field X' has to be done in a combined sheet.
I also need to apply some other filters: ignore a row when field X for a particular role has a date earlier than today, and flag a row when certain columns = 'yes' together with some other conditions.
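For anyone following along outside the spreadsheet, the two filters described above could be sketched roughly like this in plain Python. The column names (`uid`, `role`, `field_x`, `active`), the role value `owner`, and the ISO date format are all assumptions made up for illustration, and the "other conditions" are left out because they aren't spelled out here.

```python
from datetime import date

# Hypothetical sample rows; none of these column names come from the real report.
rows = [
    {"uid": "A1", "role": "owner", "field_x": "2020-01-01", "active": "yes"},
    {"uid": "A2", "role": "owner", "field_x": "2030-01-01", "active": "yes"},
    {"uid": "A3", "role": "viewer", "field_x": "2020-01-01", "active": "no"},
]


def keep_row(row, today, role="owner"):
    """Return False when field_x for the given role is a date before today."""
    if row["role"] == role and date.fromisoformat(row["field_x"]) < today:
        return False
    return True


def filter_and_flag(rows, today, role="owner"):
    """Drop the rows to ignore, then add a boolean 'flag' column to the rest."""
    result = []
    for row in rows:
        if not keep_row(row, today, role):
            continue
        # Assumed rule: flag when the 'active' column equals 'yes'.
        flagged = row["active"].strip().lower() == "yes"
        result.append({**row, "flag": flagged})
    return result


# In real use you would pass date.today(); a fixed date keeps the example repeatable.
result = filter_and_flag(rows, date(2024, 6, 1))
```

Passing `today` in as an argument, rather than calling `date.today()` inside the function, makes the filter easy to re-run against an older report.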
What are you trying to do?
Why are you joining these two CSVs?
It's sometimes better to do all the clean-up on each table first (removing columns etc.) and only then join them.
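As a rough illustration of "clean up first, then combine", here is a small Python sketch. It assumes the two files have identical headers, so combining them is a simple append, which fits the description of them as the first and second halves of one report. The column names and the in-memory sample data are hypothetical stand-ins for the real CSVs.

```python
import csv
import io

# Hypothetical stand-ins for the two exports; real files would be opened
# with open(path, newline="") instead of io.StringIO.
part_a = io.StringIO("uid,field_x,notes\nA1,10,foo\nA2,20,bar\n")
part_b = io.StringIO("uid,field_x,notes\nB1,30,baz\n")

KEEP = ["uid", "field_x"]  # assumed list of columns worth keeping


def load_trimmed(handle, keep):
    """Read a CSV and keep only the listed columns from each row."""
    return [{col: row[col] for col in keep} for row in csv.DictReader(handle)]


# Trim each half on its own, then append the second half to the first.
combined = load_trimmed(part_a, KEEP) + load_trimmed(part_b, KEEP)
```

Dropping the unwanted columns before combining means the combined table only ever holds the columns the later lookup needs.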
You can always group a table by the relevant columns and sum a helper column (a column of 1s) to get a count of instances. You can then join that result back to the step before the group-by, so every original row carries the number of times its key appears.
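The same group, count, and join-back idea can be sketched in plain Python. This is only an illustration under assumptions: `pair_key` is a made-up column name, and `Counter` stands in for summing a helper column of 1s.

```python
from collections import Counter

# Hypothetical rows; 'pair_key' is an assumed grouping column.
rows = [
    {"uid": "A1", "pair_key": "P1"},
    {"uid": "A2", "pair_key": "P1"},
    {"uid": "A3", "pair_key": "P2"},
]


def add_group_count(rows, key_cols):
    """Count rows per key, then attach that count to every original row."""
    # Group step: equivalent to summing a helper column of 1s per key.
    counts = Counter(tuple(r[c] for c in key_cols) for r in rows)
    # Join-back step: look up each row's key in the grouped counts.
    return [{**r, "count": counts[tuple(r[c] for c in key_cols)]} for r in rows]


counted = add_group_count(rows, ["pair_key"])
```

With a count on every row, you can filter on it afterwards, for example to pick out keys that appear once or more than twice.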