After rewriting the NLJ operator, it has regression on the workload that the right side is very small (NLJ's left input has 1000 rows, right input has 1 rows) the inner-most join() function is ...