ERROR: No abbreviations files found in ...nonbreaking_prefixes

在运行LSTM sentiment analysis 代码中遇到如下报错信息:

WARNING: No known abbreviations for language 'en', attempting fall-back to English version...
ERROR: No abbreviations files found in .../nonbreaking_prefixes
Tokenizing.. Done

原因是tokenizer.perl中的nonbreaking_prefixes.en没有找到。在代码所在路径中创建文件夹 nonbreaking_prefixes ,然后从
https://github.com/moses-smt/mosesdecoder/tree/master/scripts/share/nonbreaking_prefixes 中下载所需文件到该新建文件夹即可。

    原文作者:4v3r9
    原文地址: https://www.jianshu.com/p/1f27b10771f2
    本文转自网络文章,转载此文章仅为分享知识,如有侵权,请联系博主进行删除。
点赞