我正在尝试从IPython笔记本中运行mrjob示例。
有没有办法从IPython笔记本运行mrjob?
from mrjob.job import MRJob
class MRWordFrequencyCount(MRJob):
def mapper(self, _, line):
yield "chars", len(line)
yield "words", len(line.split())
yield "lines", 1
def reducer(self, key, values):
yield key, sum(values)
然后使用代码运行它
mr_job = MRWordFrequencyCount(args=["testfile.txt"])
with mr_job.make_runner() as runner:
runner.run()
for line in runner.stream_output():
key, value = mr_job.parse_output_line(line)
print key, value
同时出现以下错误:
TypeError: <module '__main__' (built-in)> is a built-in class
有没有办法从IPython笔记本运行mrjob?