When I crawl pages containing Japanese text with Scrapy, the exported JSON is full of \uXXXX escape sequences instead of readable characters. To prevent this, add the following to settings.py.
FEED_EXPORT_ENCODING='utf-8'
Reference: https://doc.scrapy.org/en/latest/topics/feed-exports.html#feed-export-encoding
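
To see what the setting changes, here is a minimal sketch that reproduces the same effect with Python's standard json module. It is only an analogy, not Scrapy's own exporter code, and the item and its "title" field are made up for illustration. By default json.dumps escapes every non-ASCII character as \uXXXX, and passing ensure_ascii=False keeps the characters as they are, which is the kind of output you get from a feed once FEED_EXPORT_ENCODING is 'utf-8'.

```python
import json

# Hypothetical scraped item; the field name and value are illustrative only.
item = {"title": "日本語"}

# Default behaviour: non-ASCII characters are written as \uXXXX escapes.
escaped = json.dumps(item)

# With ensure_ascii=False the characters are kept as-is, so the text
# stays readable when the result is saved as UTF-8.
readable = json.dumps(item, ensure_ascii=False)

print(escaped)
print(readable)
```

Both strings are valid JSON and decode to the same data, so the escaped form is not wrong, just hard to read.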