Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

feature request - clean for char(160) \xa0 #198

Open
dornech opened this issue Sep 20, 2020 · 1 comment
Open

feature request - clean for char(160) \xa0 #198

dornech opened this issue Sep 20, 2020 · 1 comment

Comments

@dornech
Copy link

dornech commented Sep 20, 2020

Hi there, many webpages use non-breaking space in textelements, however for subsequent processes this is sometimes troublesome. What's about an option for get() to clean a returned string value, i. e. to replace \xa0 with a normal space automatically?

@Gallaecio
Copy link
Member

I see your point, however I’m not sure it’s worth it doing at the Parsel level. I think it makes sense for post-processing to happen out of Parsel, at a later stage (e.g. using https://github.com/scrapy/itemloaders).

# for free to join this conversation on GitHub. Already have an account? # to comment
Projects
None yet
Development

No branches or pull requests

2 participants