波特干扰器
-
导入
PorterStemmer
并初始化from nltk.stem import PorterStemmer from nltk.tokenize import word_tokenize ps = PorterStemmer()
-
干一个单词列表
example_words = ["python","pythoner","pythoning","pythoned","pythonly"] for w in example_words: print(ps.stem(w))
结果:
python python python python pythonli
-
在对其进行标记后判断一个句子。
new_text = "It is important to by very pythonly while you are pythoning with python. All pythoners have pythoned poorly at least once." word_tokens = word_tokenize(new_text) for w in word_tokens: print(ps.stem(w)) # Passing word tokens into stem method of Porter Stemmer
结果:
It is import to by veri pythonli while you are python with python . all python have python poorli at least onc .