Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Anti-spam measures #19

Open
Minoru opened this issue Nov 4, 2021 · 0 comments
Open

Anti-spam measures #19

Minoru opened this issue Nov 4, 2021 · 0 comments

Comments

@Minoru
Copy link
Owner

Minoru commented Nov 4, 2021

Inspired by what fediverse.network does to detect spam instances:

  1. group instances by "topmost registrable domain", e.g. foo.example.com and bar.example.com get grouped by "example.com";
  2. every day, check only one instance of the group. Or, in other words, the check interval for an instance would be number_of_instances_in_a_group*day.

As a result, a spammer that created a million instances won't affect a spider's load at all, as only one of the million instances will be checked every day. It still affects the databse size though, and adds overhead in detecting groups.

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant