Video Captioning with Commonsense Knowledge Anchors