这是一个创建于 2363 天前的主题,其中的信息可能已经有所发展或是发生改变。
Traceback (most recent call last):
File "China_hownet_journal_end.py", line 296, in <module>
china_hownet.run()
File "China_hownet_journal_end.py", line 281, in run
url_list = self.parse_content_html(html3str)
File "China_hownet_journal_end.py", line 212, in parse_content_html
html = etree.HTML(html3str)
File "lxml.etree.pyx", line 2945, in lxml.etree.HTML (src/lxml/lxml.etree.c:62546)
File "parser.pxi", line 1617, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:93194)
File "parser.pxi", line 1488, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:91938)
File "parser.pxi", line 969, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:88328)
File "parser.pxi", line 577, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:84385)
File "parser.pxi", line 676, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:85488)
File "parser.pxi", line 625, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:84945)
lxml.etree.XMLSyntaxError: line 1046: htmlParseEntityRef: expecting ';'