Professional Documents
Culture Documents
By
Satyabrata Garanayak
Introduction
There are millions of websites on the World Wide
Web (WWW). How we can search a particular
web page out of billion of pages ???
Answer is !!!
Search Engine
Users
Web Browser
Search Engine
Language Translation
Update Information
Quick search
Relevant result
Offers excellent spell checking, easy access to
dictionary definitions, integration of stock quotes,
street maps and more.
Option to find more web pages, i.e. similar pages
Each search engine has its own catalog or database of
collected web pages, so you will get different
results/hits by using different search engines.
Continue
Three major components of crawler based search engine are1.The Crawler (Spider)
The crawler/spider visits a web page, reads it, and then follows links to
other pages within the site. The spider will return to the site on a regular
basis, such as every month or every fifteen days, to look for changes.
2.The Index
Everything the spider finds goes into the second part of the search
engine, the index. The index will contain a copy of every web page that
the spider finds. If a web page changes, then the index is updated with
new information.
3.The Search Engine Software
This is the software program that accepts the user-entered query,
interprets it, and shifts through the millions of pages recorded in the
index to find matches and ranks them in order of what it believes is most
relevant and presents them in a customizable manner to the user.
Continue
World Wide
Web
Crawler
Downloader
Indexer
Database
Search
Interface
(Web
Browser)
Query Engine
(Search Engine)
Continue
Crawler-based search engines are constantly
searching the Internet for new web pages and
updating their database of information with
these new or altered pages.
Examples of crawler-based search engines are:
DMOZ Directory
Yahoo! Directory