Scientific knowledge extraction from massive text data