Improving the predictor: Hierarchical N-Gram