Following is the equivalent position from a different perspective in the related clip:
One video frame before this there is a separation between the trail elbow and the torso. One video frame after this Hogan's hands are below his waist and the club head is much further from the target than his hands (the club is at an apx 45 deg angle to vertical from this perspective).
Frames 2 & 3 of my post show the club head is not further from the target than the hands and there is no elbow separation from the torso. This means that your second photo is less than 1 video frame (apx 0.3 seconds) from frames 2 & 3 of my post.