How to Count the Words in a Text File with Python

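This article walks through an example of counting the lines, blank lines, sentences, and words in a text file with Python. It is shared here for reference. The implementation is as follows: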

# count lines, sentences, and words of a text file
import sys

# set all the counters to zero
lines, blanklines, sentences, words = 0, 0, 0, 0
print('-' * 50)
try:
    # use a text file you have, or google for this one ...
    filename = 'gettysburgaddress.txt'
    textf = open(filename, 'r')
except IOError:
    print('cannot open file %s for reading' % filename)
    # exit with a non-zero status to signal the failure
    sys.exit(1)
# reads one line at a time
for line in textf:
    print(line, end='')  # test
    lines += 1
    if line.startswith('\n'):
        blanklines += 1
    else:
        # assume that each sentence ends with . or ! or ?
        # so simply count these characters
        sentences += line.count('.') + line.count('!') + line.count('?')
        # create a list of words
        # use None to split at any whitespace regardless of length
        # so for instance double space counts as one space
        tempwords = line.split(None)
        print(tempwords)  # test
        # word total count
        words += len(tempwords)
textf.close()
print('-' * 50)
print("lines      : ", lines)
print("blank lines: ", blanklines)
print("sentences  : ", sentences)
print("words      : ", words)
# optional console wait for keypress (msvcrt exists only on Windows)
try:
    from msvcrt import getch
    getch()
except ImportError:
    pass
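
The same counting logic can also be sketched as a small reusable function that works on a string instead of a file, which makes it easy to try out without a text file on disk. This is only a minimal sketch under a few assumptions: the names `count_text_stats` and `sample` are made up for illustration and do not appear in the script above, and a line containing only whitespace is treated as blank here, which is slightly more lenient than the `startswith('\n')` check.

```python
def count_text_stats(text):
    """Return (lines, blank_lines, sentences, words) for a string of text.

    Hypothetical helper for illustration; it mirrors the script's rules:
    every '.', '!' or '?' counts as one sentence end, and words are
    whatever str.split() yields when splitting on runs of whitespace.
    """
    lines = blank_lines = sentences = words = 0
    for line in text.splitlines():
        lines += 1
        if not line.strip():
            # assumption: whitespace-only lines also count as blank
            blank_lines += 1
            continue
        sentences += sum(line.count(mark) for mark in ".!?")
        # split() with no argument behaves like split(None):
        # consecutive whitespace is treated as a single separator
        words += len(line.split())
    return lines, blank_lines, sentences, words


# made-up sample text with a blank line and some double spaces
sample = "Four score and seven years ago.\n\nIs this  a test?  Yes!\n"
print(count_text_stats(sample))  # (3, 1, 3, 11)
```

To use it on a file, the contents could be read first, for example with `open(filename).read()`, and passed to the function. Note that the sentence count is still a rough estimate: abbreviations such as "e.g." or numbers such as "3.14" each add extra counts.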

I hope this article is helpful for your Python programming.
