How to Make Sure Your Site Gets Properly Crawled and Indexed by Robots

Search engines have robots that come to your site and grab everything there is to grab. But because competition is so fierce, there is no way to get in the search engines, unless you pay for ads or hire an SEO (Search Engine Optimization) consultant, right?

Wrong!

Even if you pay big money, if your site is not properly seen by the robots used by search engines for indexing, chances are many of your pages will never make it.

In this article I will discuss the importance of having your website structured properly and the importance of using old-fashioned hyperlinks versus modern Flash menus, scripts and extensions, and I will provide you with a very simple and free tool that will allow you to see your site in much the same way most indexing robots do. But first, let's define some of the concepts.

What is a www robot?

A robot is a computer program that automatically reads web pages and goes through every link that it finds.

The first robot was developed at MIT and launched in 1993. It was named the World Wide Web Wanderer and its initial purpose was of a purely scientific nature: its mission was to measure the growth of the web. The index generated from the experiment's results proved to be an awesome tool and effectively became one of the first search engines. Most of the online stuff we can't live without today was born as a side effect of some scientific experiment.

What is a search engine?

Generically, a search engine is a program that searches through a database. In the popular sense, as applied to the web, a search engine is considered to be a system that has a user search form, which can search through a repository of web pages gathered by a robot.

What is a bot? What is a spider? What is a crawler?

Bot is just a shorter, cooler (for some) version of the word robot. Spiders and crawlers are robots; only the names sound more interesting in the press and within metro-geek circles. For reasons of consistency, I will use the term robot throughout this article when referring to spiders, crawlers and bots.

Are there other... things that crawl out there?

Oh yeah, but these things are way beyond the scope of this article. Well, for the conspiracy theory aficionados, let's see... we have worms - self-replicating programs, webants (or ants) - distributed cooperating robots, autonomous agents, intelligent agents and many other bots and beasties.

How do robots work?

As with all other things technical, I believe that the only way you will utilize a technology to its full potential and to your best advantage is if and when you understand how that technology works. When I say how it works, I don't mean intricate technical details, but fundamental processes, big picture stuff.

Generally, robots are nothing but stripped-down versions of web browsers, programmed to automatically browse and record information about web pages. There are some very specialized robots out there, some that look only for blogs, some that index nothing but images. Many (such as Google's Googlebot) see a page much as one of the first popular browsers, Lynx, would. Lynx was initially a pure text browser, so on today's internet Lynx is extremely robust and fast. Basically, if you can program, you can take Lynx, modify it and make a robot.

So how do these things actually work? They get a list of websites, and literally start "browsing" them. They come to your site and then start reading the pages and following every link, while storing different information, such as page titles, the actual text of the page, etc.

Based on the above, what would happen if instead of your beloved Internet Explorer, Firefox, Opera or whatever browser you are attached to, you go dig on the internet and download a version of the venerable Lynx browser?

I'll tell you what would happen, and some will probably accuse me of giving away one of the secrets the SEO corporate community does not want you to know:

You will be able to see your site very close to the way a robot sees it. You will be able to look for errors in your pages and track down navigation errors that might block a robot from seeing portions of your site.

In plain English, let's say you built a great looking site. There is an index page, the first page one sees when entering your site. On that page you have the most incredible Flash navigation system, with a huge button pointing to your products and services and the rest of the site. If Lynx goes to your index page and does not see a standard link, it will not be able to see the rest of your site. Chances are extremely high that a lot of indexing robots will not see the rest of your site either.

You will then understand why your very large site, which has one of the most intricate and functional Flash-based navigation systems on the planet, never makes it high into the search engines, even after all your efforts of manually submitting it everywhere. It's simply because you forgot to add basic hyperlinks. It's because when you submit a site - even manually - all that really happens is you telling the search engine "hey, Mr. Search Engine, whenever you think you can find some time, please send your trusty robot to my site".

Folks, robots usually can't use a navigation menu made in Flash, JavaScript and the like, and will not be able to get to your pages. It's as simple as that.

How do I get Lynx?

Lynx first started life as a UNIX application, written at the University of Kansas as part of their campus-wide information system. It then became a Gopher application (a pre-web search tool), then a web browser. The official page for Lynx is however, if you are not a Linux geek, used to playing with binary distribution files and used to compiling your own apps (don't worry about what I just said), you might want to find a version that someone else already made usable for your computer. For example, if you are a PC user running Windows, you might want to check links to "Win32 compiled versions". At the time of writing, one such site is (called a distribution site) where you can download a version that will install onto Windows machines in a fashion that will be familiar to non-geeks.

After you install the browser, you might want to read the documentation. To get you going and to alleviate your beginner frustrations, I'll tell you that you must press the G key (as in "go"), then type the complete URL of the site you want to browse (starting with " then hit Enter. Use the arrows to navigate.

Bottom line, use Lynx to verify that every page of your site is accessible and let the robots do all the work for you. You'll save yourself a lot of aggravation and maybe some money that you would waste on advertising your otherwise non-indexable site.

---
Andrei co-owns Bsleek - a company that specializes in web design, hosting, promotional items, printing, tradeshow displays, logos, CD presentations, SEO and more. Andrei has amassed extensive technical knowledge and experience through his career as the CIO for a major travel management company and through his past careers in military research, data acquisition and airspace engineering. He also consults for Trinity Investigations, a New York based PI firm.
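
To make the point about Flash menus and basic hyperlinks concrete, here is a minimal Python sketch of the link-following step of a robot. It is an illustration only, not how any real search engine robot is written. The names LinkCollector and visible_links and the two sample pages are hypothetical, made up for this example, and www.example.com is a placeholder address. The sketch assumes a very simple robot that only follows standard hyperlinks and does not run scripts or Flash.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects the targets of plain <a href="..."> hyperlinks, and nothing else."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        # Only standard anchor tags count; Flash objects and scripts are ignored.
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Turn relative links into absolute ones, as a browser would.
                self.links.append(urljoin(self.base_url, value))


def visible_links(html, base_url):
    """Return the links a simple, hyperlink-only robot could follow from this page."""
    collector = LinkCollector(base_url)
    collector.feed(html)
    collector.close()
    return collector.links


# Hypothetical index page whose only navigation is a Flash movie and a script.
FLASH_ONLY_PAGE = """
<html><body>
  <object data="menu.swf" type="application/x-shockwave-flash"></object>
  <div onclick="window.location='products.html'">Products</div>
</body></html>
"""

# The same hypothetical page with two basic hyperlinks added.
PAGE_WITH_LINKS = FLASH_ONLY_PAGE.replace(
    "</body>",
    '<a href="products.html">Products</a> <a href="/services.html">Services</a></body>',
)

if __name__ == "__main__":
    base = "http://www.example.com/"
    print(visible_links(FLASH_ONLY_PAGE, base))   # no links found
    print(visible_links(PAGE_WITH_LINKS, base))   # both pages are now reachable
```

With the Flash-only page the sketch finds nothing to follow, so the rest of the site stays invisible to it. Once the two plain hyperlinks are added, both pages show up, which is the same difference you would notice when browsing the page in Lynx.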