You are on page 1of 2

There are different techniques to develop a search engine that finds the information rather than actually

to matching millions pages. Search should default to finds with in section not the entire page. Today
popular search engine are word scanners, not usually searching for section of information, the exact
name should be requested.

1. Paradigm shift:
Searching for actual information requires: a new way of thinking about
searching text; and new ways to group text within a page. it is too early stage of
development to attempt hunting information by looking at all text on a page;
advanced techniques are needed to split a page into related sections. Just as
grouping letters into words is a major step above searching for strings of
characters/letters, grouping words into sections or paragraphs is the next major
break-through in search-engine technology

Forget search engines today.

Forget the way popular search-engines work today (2006): they are
primitive compared to techniques developed 25 years ago at NASA: almost no
search-engines today can pinpoint information the easy way, but rather obsess
on matching millions of related pages: it appears to be "search-engine envy":
with "my results are bigger than yours." Judging search-performance should not
be based on "how many millions of pages were matched"

Many search-engines display ads on every page: it is in the interest of


those ads to extend & prolong a search to display many pages of ads, rather
than to pinpoint information.

Search with in section:

Plan the search engine to look with in section of text rather than searching the
entire page of matching words. Often related words, that pinpoint a topic, usually occur within one
sentence of each other. If paragraphs are too difficult to determine, allow a search-bracket of n-words
(such as 30 words) to confine the search to logically related text

Search literal words:

When a lot of information gather it is critical to differentiate between a/an/the.


Assuming to ignore some words just empowers a bias that insults the intelligence of
potential users. By chance have a rare option to ignore a list of words that includes
"a/an/the/of/in" but, by default, search for every word specified. Let users learn to
except restrictive words; implicit skip of some words is as limiting as implicit declaration
of misspelled variable-names in computer software: don't do it (If someone misspells
"off" as "of" then what happens? See? Understand the danger of implicitly ignoring
words.)
Search literal characters.

If able technically, expect to search for literal strings such as "Project XRAY-10/NOVA" where the
dash ('-') and slash ('/') are critical to the search: in practice, the searched text can have those characters
converted to spaces when they are not in the search-phrase requested by a user; however, if pre-storing
the searched words, then both forms can be indexed/stored (both "XRAY 10 NOVA" and literal name
"XRAY-10/NOVA" can be indexed).

You might also like