this about halves the time spent on level_order_it and drastically reduces the time spent in children_it