Question from the Natural Language Processing - Fundamentals test

What is special about the BERT model?