Mark.B
Well-known member
Ooh good. More spammers!Because you're actually getting visitors FROM other countries - especially China.
Ooh good. More spammers!Because you're actually getting visitors FROM other countries - especially China.
Baidu is apparently running amok I started to notice it a couple of days ago.
Code:<Directory /...path/to/..> Order allow,deny Allow from all Deny from 119.63.196. </Directory>
And gone are the buggers...
User-agent: Baiduspider
Disallow: /
Great input Floris, thank you for passing that information on.Seeing how they're not following robots.txt at all, .. after some sneaky testing for a week, ..
Why would you want to block Baidu? Yes, really, I'm asking that question.
Mark Zuckerberg took a visit to the Baidu headquarters a short while ago. This tells me that Facebook is interested in acquiring Baidu down the line. This in turn tells me that Facebook's choice of search engines is Baidu... So, if you want more traffic to your site, Baidu's interested in seeing what you've got. And if successful, they'll direct more 'human' to your site, just like google before them.
These evil little things do not follow robots.txt How would I go about getting rid of them?
I guess via IP addresses in your .htaccess file.
Guys, what is the deal with blocking IP ranges? for example I see the Baidu range being 119.63.196.xxx but if I ban at 119.63.196. will that cause non baidu computers to be blocked? The reason I ask is because the Baidu spider is in Japan, and I too live in Japan and my server hosts some sites based in Japan so I don't want to block them. Though if 119.63.196. is the organisation IP then I'm happy to block it
# Disallow all others
User-agent: *
Disallow: /
We use essential cookies to make this site work, and optional cookies to enhance your experience.