Skip to content

feat: Customizing selector can sometimes cause web crawlers to fail #154

New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

kukuze
Copy link

@kukuze kukuze commented Feb 29, 2024

When crawling a webpage, if a custom selector is selected, but the initial page does not have a suitable selection object, it will cause the crawler to fail. If a custom selector is not used, the truly valuable page (which can filter out a lot of useless information through the selector) will have a lot more useless information, such as the "homepage". So this submission can be used when a custom selector is available, and when it is not possible, use "body" as the CSS selector.

@kukuze kukuze closed this Feb 29, 2024
@kukuze kukuze reopened this Feb 29, 2024
@kukuze kukuze changed the title feat: Customizing CSS selectors can sometimes cause web crawlers to fail feat: Customizing selector can sometimes cause web crawlers to fail Feb 29, 2024
@marcelovicentegc marcelovicentegc added the enhancement New feature or request label Feb 29, 2024
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants