Handling Long-Term Dependencies And Rare Words In Low-Resource Language Modelling