<div style="display:inline;float:right;margin-left:1em"><g:plusone href="https://www.searchenginegenie.com/blog-seo/mattcutts-discusses-pr-sculpting/"></g:plusone></div>
<div style="display:inline;float:right;margin-left:1em"><g:plusone href="https://www.searchenginegenie.com/blog-seo/mattcutts-discusses-pr-sculpting/"></g:plusone></div>
{"id":674,"date":"2009-03-27T00:35:00","date_gmt":"2009-03-27T04:35:00","guid":{"rendered":"http:\/\/www.searchenginegenie.com\/blog-seo\/mattcutts-discusses-pr-sculpting\/"},"modified":"2012-09-20T01:54:24","modified_gmt":"2012-09-20T05:54:24","slug":"mattcutts-discusses-pr-sculpting","status":"publish","type":"post","link":"https:\/\/www.searchenginegenie.com\/blog-seo\/mattcutts-discusses-pr-sculpting\/","title":{"rendered":"Mattcutts discusses PR sculpting:"},"content":{"rendered":"<p><object height=\"344\" width=\"425\"><param name=\"movie\" value=\"http:\/\/www.youtube.com\/v\/nM2VDkXPt0I&amp;hl=en&amp;fs=1\"><param name=\"allowFullScreen\" value=\"true\"><param name=\"allowscriptaccess\" value=\"always\"><embed src=\"http:\/\/www.youtube.com\/v\/nM2VDkXPt0I&amp;hl=en&amp;fs=1\" type=\"application\/x-shockwave-flash\" allowscriptaccess=\"always\" allowfullscreen=\"true\" height=\"344\" width=\"425\"><\/embed><\/object><\/p>\n<p>Matt Cutts, talks about the best ways to stop Google from crawling your content, and how to remove content from the Google index once we&#8217;ve crawled it.<\/p>\n<p>Sebastine explains pretty well on that topic:<\/p>\n<p>As for password protected contents, are you sure that you don&#8217;t index those based on 3rd party signals like ODP listings or strong inbound links?<\/p>\n<p>You totally forgot to mention the neat X-Robots-Tag that allows outputting REP tags like &#8220;noindex&#8221; even for non-HTML resources like PDFs or videos in the HTTP header. That&#8217;s an invention Google can be very proud of. \ud83d\ude42<\/p>\n<p>@Ian M<br \/>Actually, Google experiments with Noindex: in robots.txt, but that&#8217;s &#8220;improvable&#8221;.<\/p>\n<p>@Google<\/p>\n<p>Currently Google interprets Noindex: in robots.txt as (Disallow: + Noindex:). I think that&#8217;s completely wrong, because:<\/p>\n<p>1. It&#8217;s not compliant to the Robots Exclusion Standard.<\/p>\n<p>2. It confuses Webmasters because &#8220;noindex&#8221; in robots.txt means something completely different than &#8220;noindex&#8221; in meta tags or HTTP headers.<\/p>\n<p>3. Mixing crawler directives and indexer directives this way is a plain weak point that will produce misunderstandings resulting in traffic losses for Webmasters and less compelling contents available to searchers. All indexer directives (noindex,nofollow,noarchive,noodp, unavailable_after etc.) do require crawling when put elsewhere. I do Webmaster support for ages and I assure you that Webmasters will not get it. If nobody understands it and adapts it, it&#8217;s as useless as Yahoo&#8217;s robots-nocontent class name that only 500 sites on the whole Web make use of.<\/p>\n<p>4. The REP&#8217;s &#8220;noindex&#8221; tag has an implicit &#8220;follow&#8221; that Google ignores in robots.txt for technical reasons (it&#8217;s impossible to follow links from uncrawled pages). When I put a robots meta tag with a &#8220;noindex&#8221; value, then Google rightly follows my links, passes PageRank and anchor text to those, and just doesn&#8217;t list the URL on the SERPs. When I do the same in robots.txt Google behaves totally different, for no apparent reason. (Of course there&#8217;s a reason but I want to keep this statement simple.)<\/p>\n<p>Having said all that, I appreciate it very much that Google works on robots.txt evolvements. Kudos to Google! However, please don&#8217;t assign semantics of crawler directives to established indexer directives, that doesn&#8217;t work out. I see the PageRank problem, and I think I know a better procedure to solve that. If you&#8217;re interested, please read my &#8220;RFC&#8221; linked above. \ud83d\ude09<\/p>\n<p>@all<\/p>\n<p>Do not make use of experimental robots.txt directives unless you really know what you do, and that includes monitoring Google&#8217;s experiment very closely. If you&#8217;ve the programming skills, then better make use of X-Robots-Tags to steer indexing respectively deindexing of your resources on site level. X-Robots-Tags work with HTML contents as well as with all other content types.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Matt Cutts, talks about the best ways to stop Google from crawling your content, and how to remove content from the Google index once we&#8217;ve crawled it. Sebastine explains pretty well on that topic: As for password protected contents, are you sure that you don&#8217;t index those based on 3rd party signals like ODP listings [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[14],"tags":[],"class_list":["post-674","post","type-post","status-publish","format-standard","hentry","category-mattcutts-video-transcript"],"_links":{"self":[{"href":"https:\/\/www.searchenginegenie.com\/blog-seo\/wp-json\/wp\/v2\/posts\/674","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.searchenginegenie.com\/blog-seo\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.searchenginegenie.com\/blog-seo\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.searchenginegenie.com\/blog-seo\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.searchenginegenie.com\/blog-seo\/wp-json\/wp\/v2\/comments?post=674"}],"version-history":[{"count":1,"href":"https:\/\/www.searchenginegenie.com\/blog-seo\/wp-json\/wp\/v2\/posts\/674\/revisions"}],"predecessor-version":[{"id":1150,"href":"https:\/\/www.searchenginegenie.com\/blog-seo\/wp-json\/wp\/v2\/posts\/674\/revisions\/1150"}],"wp:attachment":[{"href":"https:\/\/www.searchenginegenie.com\/blog-seo\/wp-json\/wp\/v2\/media?parent=674"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.searchenginegenie.com\/blog-seo\/wp-json\/wp\/v2\/categories?post=674"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.searchenginegenie.com\/blog-seo\/wp-json\/wp\/v2\/tags?post=674"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}