DataMedics
Member
So I'm working on a plugin (for XenForo and other forum platforms as well) that will help moderate SPAM posts automatically. To do that, I need to compile a large, diverse data set of SPAM to train my AI on. I've got one honeypot site of my own that already collects a few thousand spam posts a day, but I need more varied SPAM so the model learns to recognize all types, not just what my honeypot attracts.
Any forum owners out there willing to share the SPAM from your forum database? You have to have been using the "Hide from public view" option (not the "Permanently delete" option).
Even better would be an export of both approved posts and deleted posts (no user info, just the text of the posts) so I can do a "good posts" vs "bad posts" training run.
Please DM me if you're willing to help out. In exchange, I'll give you beta access when I launch the plugin and a good chunk of free API usage once it goes into production.
FYI, I'm planning to make it a freemium type of plugin: some free usage for smaller, less busy sites, with a cap on API usage so that busy sites need to pay.
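To make the "good posts" vs "bad posts" idea concrete, here's a rough sketch of that kind of training run. It's only an illustration, not the actual model behind the plugin: it assumes a plain naive Bayes classifier written with the Python standard library, and the sample posts and the function names (`tokenize`, `train`, `classify`) are made up for the example.

```python
import math
import re
from collections import Counter

def tokenize(text):
    # Lowercase the post and split it into simple word tokens.
    return re.findall(r"[a-z0-9']+", text.lower())

def train(labeled_posts):
    # labeled_posts: iterable of (text, label) pairs, label is "good" or "bad".
    word_counts = {"good": Counter(), "bad": Counter()}
    post_counts = Counter()
    for text, label in labeled_posts:
        post_counts[label] += 1
        word_counts[label].update(tokenize(text))
    return word_counts, post_counts

def classify(text, word_counts, post_counts):
    # Assumes at least one training post exists for each label.
    vocab = set(word_counts["good"]) | set(word_counts["bad"])
    total_posts = sum(post_counts.values())
    scores = {}
    for label in ("good", "bad"):
        total_words = sum(word_counts[label].values())
        # Start from the log prior for this label.
        score = math.log(post_counts[label] / total_posts)
        for word in tokenize(text):
            # Add-one (Laplace) smoothing so unseen words don't zero out the score.
            count = word_counts[label][word] + 1
            score += math.log(count / (total_words + len(vocab)))
        scores[label] = score
    return max(scores, key=scores.get)

# Hypothetical sample data: post text only, no user info.
sample_posts = [
    ("Cheap pills online, buy now, best price", "bad"),
    ("Buy cheap watches online, click here now", "bad"),
    ("Has anyone upgraded their forum to the latest release yet?", "good"),
    ("Thanks, clearing the cache fixed the template issue", "good"),
]

word_counts, post_counts = train(sample_posts)
print(classify("buy cheap pills now", word_counts, post_counts))
print(classify("the template cache issue", word_counts, post_counts))
```

With only four posts this is obviously a toy, which is the point of the request: a classifier like this is only as good as the variety of approved and deleted posts it gets trained on.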