Archive SEO | SEO BLOG

SEO

Search engines page cache limit – search engine optimization tip – march 18th 2005

All search engines have a cache limit for pages they crawler, The html of a page is considered as the size of the page the search engine crawler,

For yahoo bot ( yahoo slurp ) cache limit is 500 KiloBytes, Maximum size of a page yahoo robot crawls is 500 KB,

For Google robot ( Googlebot ) cache limit is Unknown, Before couple of months the cache limit was only 101 kb, Pages above that size was partially indexed, Anything above the 101 kb limit was ignored, Now it has changed and googlebot is known to index files of size more than 400 kb, So it is unknown exactly what the new cache limit is for googlebot,

SEO No Comments

Is Robot exclusion bad for a site – Does excluding robots from certain sections of your site bad – seo tip march 17th 2005

Some website owners are worried about using robots exclusion on their sites because they think blocking a crawler from certain pages or certain sections are bad, From search engine genie point of view we recommend to use robots exclusion freely, Make sure you use proper syntax a wrong syntax might confuse the search engine crawler and might get the site dropped from the index,

Excluding robots from certain sections of the site is very good in various ways,

1. You can prevent search engine crawlers from eating up unnecessary bandwidth of your site, For some hosting bandwidth is costly and it is important bandwidth is saved by excluding crawlers from certain sections,

2. Excluding robots from comments section if you run a blog is very good, It prevents robots from seeing or giving credit to any unnecessary links, There might be some bad links left by spammers and those links can be prevented from crawling by excluding those pages from the crawlers, Linking to bad neighbourhood is not good,

3. Excluding robots also helps in preventing sensitive information of your site not getting exposed in search results, There are sites who dont prevent the crawlers properly and those pages contain sensitive data link credit card details, important database etc, So for precautionary measures it is best to exclude the robot from certain areas,

4. Excluding robots also helps in preventing duplicate contents of your site from being crawled, If you have a dynamic site and there are 2 versions of the dynamic pages one search engine friendly and other dynamic with query string it is better to block one version and block the other one that way search engines don’t have to worry about duplicate content on your site,

There are still more benefits from excluding pages / sections from crawlers so do it safely with proper syntax, Use if well if you have big ecommerce dynamic site, Dynamic sites are known to render lots of pages to the crawlers by error, Hope this tip helps,

SEO Blog Team,

SEO No Comments

Is it bad to duplicate keywords and content in meta description tags? – search engine optimization tip march 16 2005

Meta description tags play a very small role in search engine ranking these days, It doesn’t matter whether the meta description tags are duplicated across your pages, Just make sure your important keywords/ phrases are present in those meta description tags,

It is common for sites to have similar meta description tag, Especially in ecommerce sites it is common to have duplicated meta tags, There is nothing wrong in it, It doesn’t hurt to have duplicated meta description, Also you should remember it doesn’t help much, So it is better to just optimize them and leave it as it is,

SEO Blog Team,

SEO No Comments

Potential reasons for Sites Crawled But Not Indexed – search engine optimization tip – march 15th 2005

For a more detailed list of search engine crawlers contact us and we will send it to you for free,

Many of us have noticed sometimes search engines keep visiting the site regularly but dont index the site nor show it in the site: command,

There are various reasons for this to happen,

1. The domain is an expired domain, if the domain expires and not registered for a certain period of time google imposes a expired domain penalty on that domain, That domain is left to suffer for certain number of months, in that period googlebot keeps visiting the site but they don’t index it and they don’t show the site in site: command too,

2. An other reason is the domain is a new domain and if the domain is a new domain sometimes the crawler regularly visits the site and it doesn’t show up in the index for a long time, There is nothing wrong with this, probably google index is taking longer time to expand, you just have to wait till google updates its index,

3. An other possible reason is the site is banned from the search engines for any particular onpage factor, In that case search engines periodically checks to see whether the onpage spam tactics is removed and as soon as they see the spam being removed they might reinclude the site into the index, So for people who are complaining that their site was previously indexed and listed in google but suddenly it disappeared from the index and googlebot keeps visiting the site it is good to look at your onpage work and see if there is any spam tactics like hidden text, cloaking, keyword stuffing etc,

4. An other important reason could be that the site was permanently banned from the search engines, Even here search engine crawlers visit the site following existing links to the site but they don’t index the site because the site is banned, This is common with Yahoo slurp yahoo’s robot, yahoo slurp is known to visit the site and don’t index the site if the site is banned,

SEO No Comments

List of top search engine user-agents – search engine optimization tip march 14th 2005

List of top search engine user-agents

We get lot of mails from people who want to know the names of the leading user-agents, We will be pleased to give the information, Identifying the useragents is a very important criteria in search engine optimization, A regular visit by search engine robots like Googlebot, yahoo slurp etc is a good sign,

Here is the list of top search engines crawlers,

Googlebot/2.*ooglebot@googlebot.com)
Yahoo – Yahoo Slurp
Msn – Msnbot
Lycos – Lycos_Spider_(T-Rex)/3.
Teoma- Mozilla/2.0 (compatible; Ask Jeeves/Teoma)

For a more detailed list of search engine crawlers contact us and we will send it to you for free,

SEO No Comments

Does search engines crawl Javascript links ? – search engine optimization tip – march 13th 2005

Scripts are complex codings which difficult for some browsers and crawlers to read, Some tough javascripts are read only by advanced browsers, search engine crawlers are not advanced browsers some of the browsers that search engines use find difficult to index high graphics, So it is best to avoid too much javascript,

Prevent creating menus in javascript or in any scripting language, Create menus in simple html or other crawler understandable language, Javascript menus wont be crawled by search engines, Best is to avoid using clickable menus and other important inner page links in javascript,

if you cannot avoid using javascript menus just add the links to a sitemap and attach the sitemap to the homepage or any other important page,

SEO BLog Team,

SEO No Comments

Does outgoing links affect your site – seo tip march 12th 2005

Outbound links are links going out from a site to an other site, Whole internet/WWW was built upon links, Inbound links and Outbound links make up the web, Outbound links are good to maintain the quality of a site, search engines like links, they like outbound links too, If you link out to a collection of quality sites then definitely there is a small boost to that page,

Jon M. Kleinberg proposed that hubs are a collection of quality links, More information on hubs and authorities in this paper, http://www.cs.cornell.edu/home/kleinber/auth.pdf

Outbound links are good for usability too, For certain references it is important to give the source so that people are guided in the correct way, Especially non commercial sites need to link out freely for people to find relevant information if they are found elsewhere on the web,

SEO Blog Team,

link building, SEO No Comments

Do tracking parameters dilute ranking of page? – search engine optimization tip march 10th 2005

tracking parameters dilute ranking of page?

many will be having question whether tracking URLs like the one below will dilute pagerank/link popularity/ranking,

www.myURL.com/page.htm?trackingid=theirURL

Nope they dont dilute much, if you think they are diluting the link better option is to 301 redirect them to the main URL, that way all link popularity is passed to the main URL and no duplicate pages are formed,

SEO No Comments

Do search engines index flash?? – search engine optimization tip march 9th 2005

Flash is not good for search engines, Most of the search engines have difficulty in parsing code from the complex DHTML coding of flash, Google has been recently reported on following links from flash .SWF files, We have seen google read text within a flash file and follow links from a flash file,

But yahoo don’t read flash they are not sophisticated to do so, Similarly MSN, gigablast and lot of other search engines don’t follow flash, Best bet is to avoid flash sites if you are planning to do search engine optimization,

Flash has always been a hindrance for search engines better avoid designing full sites with flash,

SEO BLOG TEAM,

SEO No Comments

What is duplicate content for search engines – search engine optimization tip March 8th 2005

Various search engines have various thresholds on duplicate content issues, Some search engines like yahoo, exalead are unable to detect duplicate contents across sites, they seem to detect within a site but are not able to detect across sites, Best is to make the pages atleast 5 to 7% different from other pages of the site,

Google is the best search engine on detecting dupe contents, They strip away the main template of the site and take the remaining part into their algorithm consideration, We recommend making the page atleast 8 to 15% different from other pages to avoid dupe content penalty for a particular page, Remember to give proper file names if you cant create too unique pages, File names are indexed by search engines and good 5 or 6 word file names add upto unique contents,

Overall 10% is the best bet to make pages different,

SEO BLog Team,

SEO No Comments

SEO

Search engines page cache limit – search engine optimization tip – march 18th 2005

Is Robot exclusion bad for a site – Does excluding robots from certain sections of your site bad – seo tip march 17th 2005

Is it bad to duplicate keywords and content in meta description tags? – search engine optimization tip march 16 2005

Potential reasons for Sites Crawled But Not Indexed – search engine optimization tip – march 15th 2005

List of top search engine user-agents – search engine optimization tip march 14th 2005

Does search engines crawl Javascript links ? – search engine optimization tip – march 13th 2005

Does outgoing links affect your site – seo tip march 12th 2005

Do tracking parameters dilute ranking of page? – search engine optimization tip march 10th 2005

Do search engines index flash?? – search engine optimization tip march 9th 2005

What is duplicate content for search engines – search engine optimization tip March 8th 2005

Blogroll

Categories