Word Order in Title Tag

Not so far ago, software that is declared to be site content analyzers pays attention to the position of keyword(s) in title of the page. Best mark is given to pages, where keyword(s) holds first position in title. For instance, if our keyword is “apple”, then such title “apple that tastes perfectly” will receive better mark than “perfectly tasting apple”.

Such attention of software developers to position of a keyword in page title made me interested in this subject, therefore I have tried to check it in practice. I have a site that contains index page, categories of products and product pages. Primarily, the titles were “site name, keyword” on the index page and “category name, keyword” on category page. I have changed places having put keywords before other words.

As a result, after cache update, site index dropped 3 positions down, category pages dropped 10 positions down. SERPs in this field stayed steady for three months, inlucing update time, so I hardly believe this drop concerns something other than title changes.

Now I have put everything back, but still there are no results. I shall inform you about changes, if they occur immediately.

Maybe you have any opinions concerning subject of this post?

Usage of Specific Tags in Text

Oftenly masters, who try to optimize a page are getting too enthusiastic with the process and make mistakes, being unaware of that they are doing wrong things.

A good example is a site that one of my partners have asked me to take a look at. Ordinary site about traveller’s bags and suitcases. I felt satisfied with code, links, navigation, scripts, but the decoration of text made me surprised and jolly.

Each product page had a title with a name of model, h1 tag with the same information, couple of links - to testimonials, reviews, cart and support, description of a model (in bold text) and “other models of category” with a list of models in bold text again.
Read more…

Optimization of Unfinished Project

Sometimes, you come to a decision to make a site, find keywords, some initial information, but general site’s content is going to come later - for instance in two months. It is obvious, that you’d better spend these two months making some optimization in advance. I’ve heard opinions that such optimization is worthless and even harmful for the future of site, but due to my own experience it helps to make site displayed in SERPs earlier, especially if you make everything wisely, therefore we shall discuss optimization of unfinished projects.

What are the first aims of a search engine, when it starts spidering a site? Inbound links and outbound links are considered too, but what about the index page itself. Search engine is interested in getting the subject and contents of a page, for proper ranking of it’s content in SERPs. Therefore we find out that for creation of initial, let it be “alpha” version of a site, we need some information and strategy to get startet.
Read more…

Aged, Used, Expired Domains

Due to sandbox problem, various penalties and filters, some of my friends have tried to use aged, used and expired domains. Results are surprisingly contradictory.

Expired domains appeared to be bad idea in 95% of cases, because usually Google filters expired domains out for a couple of months, especially if such domains were listed in dmoz. There were only few cases, when domain name has been listed in dmoz.

Used domains are good if you make everything wise. For instance you have a lot of information on some subject. And you manage to rebuy the domain that perfectly fits your needs. It is a wise decision to buy hosting account, where it was primarily hosted and replace old information step by step - not do delete everything and set up absolutely different site. Google tracks changes of IP address, so moving to other host can cause Page Rank loss and temporary penalty. Immediate change of all content of site can cause the same penalty, unless it was a dynamic site that changed daily.

Aged domains are same thing to used, but they are good because of their age - they are usually bypassing sandbox and a couple of other penalties and filters. Again, the only condition of building site on such domain is wise use.

This information is taken from my own experience and thoughts shared by friends. If you disagree or have something to add - waiting for your comments.

Page Length: How to Avoid Long or Bad Indexing

You may know that page length affects indexing time to time. Not so far ago, Google was reading up to 100th kilobyte of page and didn’t read it after 100th kilobyte for the purpose that the surfer unlikely will read it up to 100th kilobyte and further. Of course this is about text content only. Images, graphic elements and invisible elements are excluded from calculation of page length.

Remember, that even though ivisible elements are usually excluded from calculation of page length, too heavy page is usually read worst. For instance, I’ve seen lots of pages overstuffed with things that can be moved outside: CSS (which can occupy half of total page length), Ad blocks, which can be moved to include PHP files and so on.

I usually make pages that are not longer one screen, especially those targeted on the majority of surfers around. It can seem unbelievably by many surfers don’t know what scrolling is, so it is better to put everything in screen. This site building strategy is also good for search engines, because short pages can be devoted to keywords, which could be in a mess on a single page. For instance if I’d had a page telling about apple, apple pies and apple trees, I’d divide it into three pages - about apples, about apple pies and about apple tries. This way I’d increase relevancy of each single page and facilitate indexing because they’ve become three times shorter each.

PageRank Explained

Appraisal of link importance.
Term “Link Popularity” is a bit incorrect. It would be much more close to what it means if it would be called “Link Topology”, because this method considers relationship of links along with quantity. However, as a result of analysis we receive “importance” of a page. This is not what “relevancy” is. Relevancy shows how contents of your page correspond to a particular search query. “Importance” shows value of page, regardless of it’s contents. Any inbound link states that this page has some value and it increases it’s “importance” this way. The more rating it has, the more “important” it is.
Not all links are making equal contribution in page’s rating. Some of linking pages can be more important than others and so on, thus outbound link from such page is more important.
So, “Important page is a page that has links from important pages. Exclusive circle? Yes, it’s rather easy to understand subconsciously. For instance a link from NASA will be more important than a link from your cousin’s Kate homepage – not because NASA loves you more, but because there are thousands of sites linking to NASA and just a couple of them linking to Kate’s.
How the “importance” is measured.
Though it is easy to understand on instance of relationship between two pages, measuring of importance of milliards of related pages seems hopelessly complicated. Indeed this is really complicated, but not hopelessly – everything’s almost easy. Such measures demand lots calculations, but fortunately we shan’t invent anything new. We can just take ready formulae from scientific sources.
Larry Page and Sergey Brin, the founders of Google and first developers of it’s algorithm have published “The Anatomy of Large Hypertext Search Engine”. You can download it from http://www-db.stanford.edu/pub/papers/google.pdf in PDF format. The document describes the Page Rank technology – method of appraisal of page important, measured proceeding from pages linking to the appreciated page.

So, the Page Rank formula. It looks complicated, but it just looks so. In practice, you will need just a little knowledge of algebra (I don’t know whether algebra is studied in such volumes in schools of Her Majesty’s Land and US, but in my country math is studied since 7 y.o; algerbra and senior math since 10 y.o.)

For instance, there are page A, which has inbound links from other pages. Let’s call them T1, T2, T3, and so on up to Tn.
No math yet, we’ll just give names to things that we are going to speak about. Imagine that A is your homepage and T1-Tn – other pages, which contain hyperlinks pointing your page. For instance, T2 can be a homepage of your cousin Kate (if this helps in understanding ;) )
PageRank of page A is calculated using the following formula:

PR(A) = (1-d)+d [PR(T1)/C(T1)+PR(T2)/C(T2)+PR(T3)/C(T3)+…+PR(Tn)/C(Tn)]
In case if it looks complicated for you, let’s divide it in three groups:
PR (A) means PageRank of a page A – value we are trying to calculate. This expression just defines the problem – all calculations will be on the other side of “=”.
( 1-d) + d – fade ratio. Don’t pay attention to it. Page and Brin recommend to measure it equally to 0,85. so we will set it 0,85 and forget about it. Though it is important if you create a search engine, our calculations allow taking ready value. We are just going to calculate expression in brackets, multiply it by 0,85 and add 0,15 to the result, as it is mentioned in formula.

Now let’s get back to the expression in brackets and write it as follows:
[ PR(T1)/C(T1)+
PR(T2)/C(T2)+
PR(T3)/C(T3)+…+
PR(Tn)/C(Tn)]
It’s easy to see that T1, T2 and T3 are that pages, which link to A. I hope it’s easy to make calculations with these simple formulas you have received after dividing. Obvious difficulty is just in quantity of calculations.
PR means Page Rank of T1, T2… Tn pages. The only novelty that appears in this formula is C – quantity of hyperlinks on the given page. C(T2) is common quantity of outbound links on T2 page, e.g. links of such kind:
http://www.av.com
This link is an inbound link for page, where it points.
Having united these three components, which we have previously divided, we can define sequence of actions applying this formula to any particular page.
Create a list of all pages, which link to this page.
Define the following values for each page:
PageRank, outbound links,
Divide PageRank by outbound links (e.g. if PageRank is 6 and there are three outbound links the result will be 6:3=2. )
Make sum of such results for each inbound link.
Apply fade ratio to the result.

Free Hosts and 3rd Level Domains

You may have noticed that SERPs are full of 3rd level domains for the last five or six month. There are a lot of questions coming to me by e-mail, which concern this subject. The majority of webmasters ask how Google defines that a domain is a free hosting and how to imitate it. Other ask why are they dropping out from SERPs, while domains, where subdomains are based on are still in SERPs.

First. There are not special definition of a free host. Any unsandboxed domain can run a lot of subdomains that will hold good positions in SERPs, untill Google spots them to be harmful and wipes them out. That is because of blogs and guestbooks spam Google is filtering out 3rd level domains. If use gray and white methods wisely, you can hold good positions and never drop out.

Concerning subdomains that have been dropped out. The problem is that in my opinion, Google is filtering such domains and their subdomains manually, by somebody’s abuse or smth. like this.
However, they see that domain is good itself - it gives free space to people that create homepages on it, or something like that. That is why they do not ban, but filter out all subdomains. They get sandboxed or lowered in SERPs.

If you have something to tell about - share your experience of managing subdomain based projects.

Getting Out from Sandbox (Opinion)

Recently, I have discussed sandbox problem with one of my friends, who has lots of sites and deals mostly with global SEO, than with separate sites. He says that one of useful methods of getting out from sandbox is to get rid of all inrelevant pages and update your site with as much fresh content as you have.

I have tried this method on one of my sites, that was sandboxed and stood lower than 100th page in SERPs. It seems to have already helped (two weeks) - some pages are already on first page, some pages are in first 10 pages of SERPs. Here is what I’ve made:

I referred to sites description and keywords and picked out a couple of most important. I left all pages relevant these keywords and keyword phrases having deleted the rest. I mean pages that doesn’t correspond general keywords. Usually if you build an ordinary site and care SEO only afterwards, site appears to be half-filled with various dust (that is why my latest sites are thought down to the minutest details: even each sentence and each HTML tag is inspected to correspond my aim.)

Сlean out everything unnecessary. Even “contacts” page can be deleted: you can put your e-mail, address and/or other contacts on your index page, where they’ll be accessible and easy to find. You can target “Contacts” link in your navigation to index page with anhcor on exact row. It just seems that there’s nothing to delete. In fact you can do much. After deleting of all dust, inspect your text content. Try to keep it short and keyword rich. Throw out everything unnecessary, because it is a commercial website, not a novel, where you must give some glamourous idioms and pretentious locutions. Plain text is perspective for SEO purpose and understandable for foringers, which may also visit your website. Probably all of you know that Russian greeting sounds “Zdravsvujte”, but I think that probably nobody will understand it’s synonim “Priverstvuju vas, dobro pozhalovat”. Care everybody, who may visit your site.

After completing your cleaning work, start adding new content. It is much better to add a bit, but daily, than to hold a week and then bring down 100 pages. Regularity is very important.

Using this strategy, you can make good results in recovering a site from sandbox. Good luck!

//Max

Broken Links Software

I have already mentioned how broken links and inaccessible site directories can corrupt SEO. Recently I have found good software to find broken links on your site:

You can check your site for broken links online at: http://www.dead-links.com

Or download useful broken links check software: http://home.snafu.de/tilman/xenulink.html

Keep your site clean and cute :)

//Max

Sandbox Amnesty

Have anybody had his sites recovering from sandbox in June or July? If yes, please, share the following details: age of domain, quantity and PR of inbound links, some other circumstances.

Analyzing summary from forums and private discussions I have concluded that mass sandbox amnesty took place in May. Then a lot of sites were sandboxed in about 5-10 of June, since Burbon update.

Since then, sites are recovering from sandbox in solitary instances. It seems that Google is giving mass sandbox amnesty time to time. What do you think?

Strange Indexed Pages Quantity

Recently I have noticed that quantity of my “indexed pages” in Google has overcome real quantity of pages on my site. The real quantity is 150 pages, while Google shows bout 300. Checking and counting indexed pages through “site:” search, I found actual quantity of indexed pages and number of pages displayed by Google differ. It is more likely to be a bug, but I think that it can be some kind of concealment, just like they’ve did it with backward links.

Dedicated or Shared IP?

There’s much controversy about usage of dedicated and shared IPs in SEO. During forum debates I have seen a lot of opinions concerning this question, so I’d like to give you some excerpts of discussion to think about:

- Google dislikes shared IPs; one should park domains on dedicated IPs.
- Google don’t care if you have shared or dedicated IP for each domain
- You need do have IPs from different networks to make safe linking between such domains
- If you have your domain on shared IP and there is a cheater or spammer having his domain on the same IP, you can get banned along with this cheating site
- If such cheating site will be banned, you won’t be banned too, but lowered in SERPs
- You will experience no changes, when site from same IP will be banned

You can see that there are much opinions and guessing about this problem I have concluded that:

If you have a couple of domains with sites, that are sometimes similar in code, names of images and affiliate links - you must avoid putting these sites on domains parked on same IP, or IPs from same network. In case if sites are completely unlike and you don’t link between them - it is not necessary to host them on different IPs.

IPs from the same network means that IPs vary in last digit(s): for instance xxx.xxx.xxx.1 and xxx.xxx.xxx.2 - Google will definitely ban or penalize you for cross linking of sites hosted on domains parked on close IPs. IPs from various networks must vary in at least third group of digits.

How do you find out whether your domain is parked on dedicated or shared IP and what sites are hosted on this IP along with yours? Try http://whois.sc/IP adress

Picture Optimization

I have noticed that pictures that have proper titles and links are giving about 15% of all target traffic that comes to one of my site. Did anybody tried to do the same? I mean get target traffic by optimizing pictures?

Usually I give a name that corresponds to general site title and specific item of pic. For instance in the site is about cars and there is a Lamborghini Diablo on a picture, I’d call it lamborghini_diablo_cars.jpg, or cars_lamborghini_diablo.jpg - depending on which keyword it must be stressed.

What do you think about optimization of pictures? Do you use pictures optimization on your sites?

//Max

Indexing How-to.

Normally, a small site with proper structure, correct linking, clean code and a couple of good backlinks gets indexed in very short time and you start finding it in SERPs. But what about large sites and doorways?

Some webmasters are building sites with over 100k pages with several levels of linking. If you have troubles with indexing of such sites, then use these tips to get your huge site indexed quicker:

- You need some links to force GoogleBot read your site, so buy, or exchange links.
- Don’t point all links to the index page of your site, but put several of them to the index page and put at least one link per each subcategory. This means that if, for instance, you have four levels of linking - each category of each level must have an inbound link.
- If you haven’t got enough links to feed all your subcategories, put them to second level only and when it will get properly indexed, change links to feed thrid level. However, this way is not recommended.
- In case if your site stays unindexed, refer to some site analyzing software that will spot broken links, inaccessible areas of your site and other mistakes related to linking.
- Check your .htaccess and robots.txt files for consistency with current project.

It is also possible to create site’s categories and subcategories on subdomains of general domain and link there. It’s up to you what strategy to choose, but remember - there’s nothing impossible in SEO, so even very huge site can be forced to be indexed.

//Max

Publishing Borrowed Content

Sometimes you need to put somebody’s content on your site. This may be some unchangeable documents, FAQs, excerpts from articles, pdf documents, etc. However, you risk to get penalized for duplicate content. I have got such problem and this how I guess it can be solved:

I have a couple of downloadable archives with software of side publisher, but I had to put support documentation of this side publisher on download and support pages. Of course it is nonsense to rewrite long documents, which sometimes contain over 100 pages. I wrote short preview passages for each document with a link to the original document. Then I forbid indexing of full documents. That’s how it works on my site over a year and I have no problems with duplicate content penalty.

Sincerely yours, Max.

Link Popularity, Relevancy and Link Text

I want to discuss relationship of Link Popularity, relevancy and text links. As you surely know, link popularity is counted exactly by number of links pointing certain page, but these are just basics. Method of building certain value of Link Popularity that I have already described was working on Google few years ago, but it had changed and that is the point of interest - what changes occur and how can we improve our link popularity strategies?

Certainly, when relevancy became one of the most important criteria of appreciation of each page, it has also affected Link Popularity calculation. Due to my little investigation, a page that has ten inbound links with same keyword in link text will have better link popularity than a page that has the same ten inbound links, but with different keywords inside this link.

However, there is some limit of quantity of same inbound links, and you risk to get your site filtered or banned for spamming if you cross this limit. I tried to guess what can I do in this situation and the only thought I had is to put synonims and associated words after each thenth ibound link to my sites. For instance if I’d had a site about gourmet food, I’d put 10 links with “gourmet”, ten links with “delicious” and ten links with “tasty” keywords. This will dilute keyword density, but prevent my site from loosing link popularity, because Google knows that
these keywords are synonyms.

The other way I think about is that Google considers two phrases to be same when they are identical by symbols. If you add some extra words like adding “gourmet food” to “gourmet” - you will lose relevancy of your general keyword “gourmet”, but popularity is lost because of dilusion by something meanful and I thought what will happen if I add something senseless to my keyword? For instance if I add “+” to “gourmet”, then “gourmet +” will be different to “gourmet”, but at the same time it won’t gain any additional meaning. I consider the same tactic, when choosing domain names. Google likes, when domain is corresponding site title and most frequent keywords on site, but the majority of good domains are already taken. At the same time if you buy a domain that contains extra words, it is loosing relevancy. That is why I oftenly buy domains like “gourmet1″ or “gourmet-1″ that are containing my general keyword and a meaningless symbol that won’t affect relevancy and popularity.

These are techniques I use, but I’m sure you have much interesting to tell too. Waiting for your opinions.

Domain or subdomain? Sandbox question.

Recently I have received an e-mail with the follwing question:

Is the whole domain, or just a page gets sandboxed?

Due to my research in this field whole domain and subdomains are sanboxed too, unless this is a “hilltop” domain like geocities.com, when single subdomain can get sandboxed, while other subdomains, domain, and domain’s children pages experience no affection of this filter.

Please, give your opinions concerning sandbox. I think that it is affecting pages, which gain a lot of backlinks in short period of time, but some of my friends have different opinions concerning this question. Well, what do you think?