Got the greatly reduced results after using the above statement:grep "Googlebot" logfile.txt | grep -v " 200 " | grep -v " 301 " | grep -v " 302 "
that greps for googlebot and excludes lines with 200, 301, 302
Got the
Got the greatly reduced results after using the above statement:
Everything is either a 303, 304, 403, or 404.
Thanks
grep "Googlebot" logfile.txt | wc -l
grep "Googlebot" logfile.txt | grep " 403 " | wc -l
grep "Googlebot" logfile.txt | grep " 404 " | wc -l
Run those commands separately. It will spit out how many log entries total, and how many for 403 and 404.
Thanks for the info.Years ago when I migrated from vb3 to xf1, my site's revenue dropped. Learned that while I had many good advertisers, the pool wasn't large enough back then to withstand a change. Since then, revenue has dropped every time I make a significant change.
For me the revenue does return, and have since learned what to not do during major changes. I've also reduced costs and learned to diversify the revenue/advertising sources. It makes more money today than it did in the past, even with current events.
I think the suggestions being offered are worthwhile, but for many advertisers any degree of change will disrupt confidence in your site's ability to deliver on their advertising investment.
That surprises me, @Masetrix. The current robots.txt from XF.com looks like this:@briansol
These entries are not a good idea. Users should set their privacy themselves in their settings and excluding postings (goto) is also not good.
Since I've removed these things from robots.txt, the number of hits has increased again, Google indexes more again and my "excluded" content in the GSC is less.
Have a look here:
Unfortunately, I only found out all of this by working with the AMP add-on from @mazzly ...Traffic Down Since VB to XF Migration
Sorry...I'm not exactly where to find all these: * Settings -> Crawl Stats...can find this no problem. What part of the Crawl Stats would be helpful? We've been talking "By Response" area earlier...not sure what other part to screenshot. * Search Results (not sure where to find this)...and...xenforo.com
User-agent: *
Disallow: /community/whats-new/
Disallow: /community/account/
Disallow: /community/attachments/
Disallow: /community/goto/
Disallow: /community/posts/
Disallow: /community/login/
Disallow: /community/admin.php
Allow: /
Sitemap: https://xenforo.com/community/sitemap.xml
Hey Silmarillion. Who were you asking this question? I'm the thread OP...wasn't sure if you were asking me or someone else.What does your current robots.txt look like? Would you share this with us?
Hi Alfuzzy, my question was actually addressed to Masetrix. But of course you are also very welcome to share your file if you like.Hey Silmarillion. Who were you asking this question? I'm the thread OP...wasn't sure if you were asking me or someone else.
Thanks
If "goto" references have ever been used and you block them in Robots.txt later, you will find many new errors in the GSC after a short time. It is better to use "nofollow" or "noindex" tags here.That surprises me, @Masetrix. The current robots.txt from XF.com looks like this:
Code:User-agent: * Disallow: /community/whats-new/ Disallow: /community/account/ Disallow: /community/attachments/ Disallow: /community/goto/ Disallow: /community/posts/ Disallow: /community/login/ Disallow: /community/admin.php Allow: / Sitemap: https://xenforo.com/community/sitemap.xml
What does your current robots.txt look like? Would you share this with us?
Here's my robots.txt. If anyone has suggestions (what to add to it or what to delete from it)...please post. ThanksHi Alfuzzy, my question was actually addressed to Masetrix. But of course you are also very welcome to share your file if you like.
User-agent: *
Disallow: /forums/whats-new/
Disallow: /forums/posts/
Disallow: /forums/tags/
Disallow: /forums/members/
Disallow: /forums/member.php
Disallow: /forums/calendar.php
Disallow: /forums/account/
Disallow: /forums/attachments/
Disallow: /forums/goto/
Disallow: /login/
Disallow: /forums/members/
Disallow: /forums/admin.php
Allow: /
User-agent: Mediapartners-Google*
Disallow:
Sitemap: http://{yourdomain.com}/forums/sitemap.xml
Oh so the forum is now residing in the root directory for your domain? If so, yes: delete the /forums part for all those entries so it looks like this:Hello djbaxter...thanks for the help.
Yes there are some lines in the robots.txt for both XF and WP. The WP lines are probably some legacy stuff from years ago. As far as the crawl delay...I think I read somewhere that crawlers (at least Google crawler)...ignores any crawl delay in robots file. But if it makes sense to remove the crawl delay line and also simplify things...it can definitely be removed.
In the robots example posted above...I'm assuming the file structure on this website may be different than my site due to the "forums" directory included in the file paths.
View attachment 249587
Do I need to include the "forums" file path in my robot's (my site doesn't have a specific "forums" sub-directory).
Thanks
User-agent: *
Disallow: /whats-new/
Disallow: /posts/
Disallow: /tags/
Disallow: /members/
Disallow: /member.php
Disallow: /calendar.php
Disallow: /account/
Disallow: /attachments/
Disallow: /goto/
Disallow: /login/
Disallow: /members/
Disallow: /admin.php
Allow: /
User-agent: Mediapartners-Google*
Disallow:
Sitemap: http://{yourdomain.com}/forums/sitemap.xml
We use essential cookies to make this site work, and optional cookies to enhance your experience.