Transformer-XL: Attentive Language Models