The Yandex index robot is redirected when requesting this site so the robots.txt file will not be used. What's the fix?
We have a website:www.example.com which redirects to https://www.example.com.
Therefore when you type www.example.com/robots.txt you are redirected to https://www.example.com/robots.txt.
Another point to note:
The main mirror in Yandex will be decided by the crawler, it won't allow us to set www.example.com or set https://www.example.com as the main mirror.
Please note: This error that I am getting
The Yandex index robot is redirected when requesting this site so the robots.txt file will not be used is from this page :
https://webmaster.yandex.com/robots.xml
Update:
We updated our Robots.txt to the following after the answers.
User-agent: Yandex
Sitemap: https://www.example.com/sitemap.xml
Disallow: /some_pages/
Host: https://www.example.com
User-agent: *
Sitemap: https://www.example.com/sitemap.xml
Disallow: /some_pages/
You can set a main mirror in Yandex.
I think you need something like this:
Host: myhost.ru # uses
User-agent: *
Disallow: /cgi-bin
User-agent: Yandex
Disallow: /cgi-bin
Host: www.myhost.ru # is not used
or even:
# domen.myhost.com is the main mirror
# www.domen.myhost.com, is the correct use of
# the host Host directive:User-Agent: *
Disallow:Host: domen.myhost.com
The Host directive should contain:
An HTTPS protocol indication if the mirror is only available via a protected channel (Host: https://myhost.com).
One concrete domain name that corresponds to RFC 952 and is not an IP-address.
The port number, if necessary (Host: myhost.com:8080).
More information : https://yandex.com/support/webmaster/controlling-robot/robots-txt.xml
Comments
Post a Comment