Information from parts of words (Subword Models) and Transformer architectures Slide Suggested Readings: Achieving Open Vocabulary Neural Machine Translation with Hybrid Word-Character Models Revisiting Character-Based Neural Machine Translation with Capacity and Compression