
Search Strategy

Presented by

Dr. Angelina P. Resurreccion


Director of Libraries, Rizal Technological University
Lecturer : UE, BU, PUP

July 16, 2008


Jose Rizal University, Shaw Blvd, Mandaluyong City
Sponsored by OCLC
Search Strategy in Using the
Internet
Search engines:
Google
Yahoo
MSN
Other websites

Web Harvesting
Purpose
Digitize
information
The internet…
The Internet or the Web is the largest
online source of information available. It is
spread over millions of web pages.
The problems posed by such a wealth of
information are:
how to find the right information on the World
Wide Web quickly and easily, and
how to preserve it.
*World Wide Web, also called WWW and
the Web, is only a part of the Internet.
What is Web harvesting?

Web harvesting is the act of collecting data
from Web sites, typically for data analysis.
It is an implementation of a Web crawler that
uses human expertise or machine
guidance to direct the crawler
to URLs which compose
a specialized collection
or set of knowledge.
Web harvesting
is also known as Web farming,
Web mining and
Web scraping, and is sometimes
also called
crawling or spidering.
Purpose of Web Harvesting

To capture
materials in danger
of disappearing.
To capture a
particular event, or
moment in time.
To build a
collection of
similar or related
materials.
Approaches to Web
Harvesting
Subject-based crawl
Event-based crawl
Format-based crawl

**A web crawler (also known as a web spider or
web robot) is a program or automated script
that browses the World Wide Web in a
methodical, automated manner.
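The crawler definition and the subject-based approach above can be illustrated with a minimal Python sketch. It is not taken from the presentation: the page contents, the example.org addresses, the `crawl` function and its `fetch`, `keyword` and `max_pages` parameters are all assumptions made for illustration. The pages are held in memory so that the sketch runs without an Internet connection.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkParser(HTMLParser):
    """Collects the href value of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seed_url, fetch, keyword, max_pages=50):
    """Breadth-first, subject-based crawl (illustrative only).

    fetch is any function that takes a URL and returns the page's HTML,
    or None if the page cannot be retrieved.
    Returns the URLs of visited pages whose HTML mentions the keyword.
    """
    queue = deque([seed_url])
    seen = {seed_url}
    harvested = []
    visited = 0
    while queue and visited < max_pages:
        url = queue.popleft()
        html = fetch(url)
        visited += 1
        if html is None:
            continue
        # Subject filter: keep the page only if it mentions the keyword.
        if keyword.lower() in html.lower():
            harvested.append(url)
        # Follow every link on the page, resolving relative addresses.
        parser = LinkParser()
        parser.feed(html)
        for href in parser.links:
            link = urljoin(url, href)
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return harvested


# Hypothetical three-page site, kept in a dictionary instead of the Web.
PAGES = {
    "http://example.org/": '<a href="/a.html">History</a> <a href="/b.html">Sports</a>',
    "http://example.org/a.html": "Jose Rizal wrote Noli Me Tangere. <a href='/'>Home</a>",
    "http://example.org/b.html": "Basketball scores.",
}

result = crawl("http://example.org/", PAGES.get, "rizal")
print(result)
```

In a real harvest, `fetch` would download pages from the Web, and the keyword test would normally be replaced by the human expertise or machine guidance mentioned in the definition of Web harvesting.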
Can you use Search Engines?
A search engine
is an information retrieval
system designed to help find
information stored on a computer
system.
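One common way to build such an information retrieval system is an inverted index, which records for each word the documents that contain it. The Python sketch below is a simplified illustration under that assumption and does not describe how Google, Yahoo or MSN work internally. The document names and texts in `DOCS` are hypothetical.

```python
from collections import defaultdict


def build_index(documents):
    """Map each lowercase word to the set of document names containing it."""
    index = defaultdict(set)
    for name, text in documents.items():
        for word in text.lower().split():
            # Remove punctuation attached to the word before indexing it.
            index[word.strip(".,;:!?")].add(name)
    return index


def search(index, query):
    """Return, in sorted order, the documents containing every query word."""
    words = query.lower().split()
    if not words:
        return []
    matches = set(index.get(words[0], set()))
    for word in words[1:]:
        matches &= index.get(word, set())
    return sorted(matches)


# Hypothetical collection of three short documents.
DOCS = {
    "doc1": "Rizal Technological University library",
    "doc2": "Jose Rizal University, Mandaluyong City",
    "doc3": "University of the East",
}

index = build_index(DOCS)
print(search(index, "rizal university"))
print(search(index, "technological"))
```

Adding more words to a query narrows the result, because a document is returned only if it contains all of them.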
Results of Subject Search for Rizal
Technological University (Google)
Other Sites with Free
Access

Dictionaries and Thesauri
Encyclopedias
Almanacs
Citation Manuals
Books
Serial Articles
Specialized fields
Dictionaries and Thesauri
American Heritage Dictionary
4th Ed. 2000. Includes over 90,000 entries, 70,000
audio word pronunciations, 900 full-page color
illustrations, language notes, and word-root
appendixes. http://www.bartleby.com/61/
Dictionary.com Free online English dictionary,
thesaurus and reference guide, crossword puzzles
and other word games, online translator and Word of
the Day. http://dictionary.reference.com/
Roget’s II: The New Thesaurus, Third Edition. 1995.
Containing 35,000 synonyms in an easy-to-use format,
this thesaurus features succinct word definitions and
an innovative hyperlinked category index. By the
Editors of the American Heritage® Dictionary.
http://www.bartleby.com/62/
Encyclopedias
Wikipedia, the free encyclopedia. English-language version
of Wikipedia, the online encyclopedia built and edited by users.
Find encyclopedia entries on almost any topic.
Columbia Encyclopedia, 6th ed. 2001. Contains over
50,000 articles, 40,000 bibliographic citations and 80,000 cross-
references. http://www.bartleby.com/65/
The Encyclopedia of World History. Renowned
historian Peter N. Stearns’s comprehensive chronology of more
than 20,000 entries that span the millennia from prehistoric
times to the year 2000. http://www.bartleby.com/67/
Encarta Encyclopedia. An abridged version of the
award-winning Encarta Encyclopedia on CD-ROM, as
well as a dictionary and atlas service.
http://encarta.msn.com/artcenter_0/Encyclopedia_Articles.html#tcsel
Almanacs
Infoplease. Information Please has been providing
authoritative answers to all kinds of factual questions
since 1938: first as a popular radio quiz show, then
starting in 1947 as an annual almanac, and since 1998
on the Internet. http://www.infoplease.com/
World Factbook The U.S. government's complete
geographical handbook, featuring 267 full-color maps
and flags of all nations. Each country profile tracks such
demographics as population, ethnicity and literacy
rates, as well as political, geographical and economic
data. http://education.yahoo.com/reference/factbook
http://www.nap.edu/about/availpdf.phtml

This site allows you to read more than 3000 books for free.
ILOSearch offers searching through 27,000+ documents
http://www.informedlibrarian.com/
Free Online Access to scientific publications
www.inasp.info
http://www.escholarlypub.com/cwb/oaw.htm
This annotated webliography presents a wide range of electronic
resources related to the open access movement that were freely
available on the Internet as of April 2005.
http://www.planetpdf.com/free_pdf_ebooks.asp?CurrentPage=1
The aim of the Directory of Open Access Journals is to increase the visibility and
ease of use of open access scientific and scholarly journals thereby promoting their
increased usage and impact.
The Directory aims to be comprehensive and cover all open access scientific and
scholarly journals that use a quality control system to guarantee the content.
In short, a one-stop shop for users of Open Access Journals.

http://www.doaj.org/home
http://aera-cr.asu.edu/links.html
www.lesley.edu/faculty/kholmes/libguides/edjournals.html

http://highwire.stanford.edu/lists/freeart.dtl
http://www.freefulltext.com/
OPEN ACCESS ELECTRONIC JOURNALS
Aardvarknet.info (Asian Resources for Librarians) provides direct links to various free Asian
online databases and fulltext electronic journals. Searchable and browsable. Subjects include
library and information science, medical sciences, social sciences, agriculture, education,
science and technology.
Advances in Theoretical and Mathematical Physics is a bimonthly publication of the
International Press, publishing papers on all areas in which theoretical physics and
mathematics interact with each other.
AGORA (Access to Global Online Research in Agriculture). It is an initiative to provide free
access to 474 journals from major scientific publishers in the fields of food, agriculture,
environmental science and related social sciences.
Bioline International. Bioline International is a not-for-profit electronic publishing service
committed to providing access to quality research journals published in developing countries.
Publishes in the following areas: health (tropical medicine, infectious diseases, epidemiology,
emerging new diseases), biodiversity, the environment, conservation and international
development. Features 30 peer-reviewed journals from Brazil, Cuba, India, Indonesia, Kenya,
South Africa, Uganda, Zimbabwe. Many journals are available free of charge.
BioMed Central. All original research articles published by BioMed Central are made freely
and permanently accessible online immediately upon publication. Approximately 130 titles are
available full text. Subject areas include biology and biomedicine.
BUBL Information Services. A national (UK oriented) information service for the higher
education community. Links to over 10,000 quality Internet resources covering all subject
areas, and sub-divided by type. Search or browse. Excellent for tracking down UK institutions.
CiteSeer is a scientific literature digital library and search engine that focuses primarily on
the literature in computer and information science. CiteSeer indexes the full-text of the
entire articles and citations. Full boolean, phrase and proximity search is supported.
D-Lib Magazine is a solely electronic publication with a primary focus on digital library
research and development, including but not limited to new technologies, applications,
and contextual social and economic issues.
Directory of Open Access Journals (DOAJ) is a database of the University of Lund
(Sweden) which provides access to quality controlled scientific Open Access Journals
covering several subject areas. Open access journals use a funding model that does not
charge the readers. Approximately 1,670 titles are available in full text. Currently 414
journals are searchable at article level. As of today 76,402 articles are included in the
DOAJ service.
EdResearch Online is produced by ACER Cunningham Library. 70% of the documents are
linked to freely available full text documents.
Electronic Information for Libraries (eIFL). Led by the Open Society Institute/Soros
foundation network. It provides links to different full-text journals on the Internet.
Electronic Journal Miner. Search for e-journals at this site using keywords, or browse
e-journals by title or by LC subject headings; you can limit searches to (1) free publications,
(2) peer-reviewed publications. Hosted by the Colorado Alliance of Research Libraries.
The database currently contains 6,960 titles.
FindArticles is a vast archive of published articles that you can search for free. Constantly
updated, it contains articles dating back to 1998 from more than 300 magazines and
journals.
FreeFulltext.com provides direct links to over 7000 scholarly periodicals which allow some
or all of their online content to be viewed by ANYONE with Internet access for free
(though some may require free registration). The issue(s) which are available for free are
indicated for each title on the alphabetical periodical lists. Date coverage is from 1997 to
2003. Journals are arranged by title.
Free Medical Journals is dedicated to the promotion of free access to medical
journals over the Internet. It lists a large number of medical journals that currently
provide free full-text access on the WWW.
Google Scholar. It enables you to search specifically for scholarly literature,
including peer-reviewed papers, theses, books, preprints, abstracts and technical
reports from all broad areas of research. Use Google Scholar to find articles from
a wide variety of academic publishers, professional societies, preprint
repositories and universities, as well as scholarly articles available across the
web.
Hartford Institute for Religion Research. It provides research-based information on the
social scientific study of religion.
HighWire Press, Internet Imprint of the Stanford University Libraries. Over 15
million articles from over 4,500 PubMed journals, including 939,111
free full text articles and from 868 HighWire-hosted journals. Covers
several subject areas.
HINARI (Health InterNetwork Access to Research Initiative). It provides free or
very low cost online access to 2300 major journals in biomedical and related
social sciences to local, non-profit and academic institutions in the health sector
in 113 developing countries.
INASP Guide to Open Access Resources. Led by International Network for the
Availability of Scientific Publications (INASP). Over 17,000 full-text journals in all
disciplines from major publishers.
OAIster is a project of the University of Michigan Digital Library Production Service
which aims to create a collection of freely available, previously difficult-to-
access, academically-oriented digital resources that are easily searchable by
anyone. It has 5,740,091 records from 522 different institutions.
PLoS Biology is a peer-reviewed, open-access journal published by the Public
Library of Science (PLoS), a non-profit organization committed to making
scientific and medical literature a public resource.
PLoS Genetics reflects the full breadth and interdisciplinary nature of genetics and
genomics research by publishing outstanding original contributions in all areas of
biology.
PLoS Pathogens is an open-access, peer-reviewed journal published monthly by the
Public Library of Science (PLoS). Its subjects are bacteria, fungi, parasites,
prions and viruses, which cause a plethora of diseases that have important medical,
agricultural, and economic consequences.
POPLINE (POPulation Information onLINE) provides worldwide coverage of
population, family planning, and related health issues, including family planning
technology and programs, fertility, and population law and policy.
Project Euclid. Cornell University Library is proud to launch Project Euclid, an
initiative to advance effective and affordable scholarly communication in
theoretical and applied mathematics and statistics.
PubChem is a freely accessible database created by NIH in 2004 to provide
information about small organic molecules. It is designed for use as a research
tool and as a starting point that may lead to the development of new medications.
The database connects chemical information with biomedical research and
clinical information, organizing facts in numerous databases into a unified whole.
PubMed Central is a digital archive of life sciences journal literature. PMC currently
provides free and unrestricted access to the full text of 104 life sciences journals,
with more to come.
SCIRUS. A specialist search engine for scientific, technical and medical information
sources. It offers two types of services: Web sources provide information for
which no subscription or online registration is required. Scirus searches the
entire Web and excludes sites with no scientific content. Scirus currently covers
over 200 million science-related Web pages.
Social Science Online Periodicals file on Internet provides access to the full text
articles from over 200 scientific periodicals in the social sciences. It is
permanently being updated and includes: a subject table, a title index and a
section overview.
Social Science Research Network (SSRN) consists of two parts: an Abstract
Database containing abstracts on over 30,900 scholarly working papers and
forthcoming papers and an Electronic Paper Collection currently containing over
15,000 downloadable full text documents.
SourceOECD is the online library of the Organisation for Economic Co-operation
and Development. The University Library is not a subscriber but access to the
search engine and some parts of the website is free.
Treasures of the Internet provides links to primary databases, photo galleries, as
well as stable archives and directories available on the Internet. It has a search
feature and an index for quick access to various databases.
OPEN ACCESS ELECTRONIC BOOKS
Digital Book Index provides links to more than 105,000 title records from
more than 1800 commercial and non-commercial publishers,
universities, and various private sites. About 66,000 of these books,
texts, and documents are available free, while many others are
available at very modest cost.
Google Print gives access to full text of books.
MERLOT is a free and open resource designed primarily for faculty and
students of higher education. Subjects include arts, business,
education, mathematics, statistics, social science, science and
technology.
Online Books Page is a website that facilitates access to books that are
freely readable over the Internet. Listing over 20,000 free books on the
Web.
Project Gutenberg is the first and largest single collection of free
electronic books, or eBooks.
Questia is the first online library that provides access to the world's largest
online collection of books and journal articles in the humanities and
social sciences, plus magazine and newspaper articles. Includes 2,500
books and 9,000 articles.
Refdesk.com - "The single best source for facts on the Net." Since
1995, Refdesk is a free and family-friendly web site that indexes and
reviews quality, credible, and current web-based resources.

lii.org – Librarian’s Index to the Internet. Provides a well-organized
point of access for reliable, trustworthy, librarian-selected Internet
resources.

LibrarySpot.com - a free virtual library resource center for
educators and students, librarians and their patrons, families,
businesses and just about anyone exploring the Web for valuable
research information.
Other useful sites
UNESCO: Free & Open Source Software Portal
http://portal.unesco.org/ci/en/ev.php-URL_ID=12034&URL_DO=DO_TOPIC&URL_SECTION=201.html
Open Source Windows (list of open source software for Windows)
http://www.opensourcewindows.org/
Guest, Ken. (2005, September 05) Alternatives to Windows Software.
http://www.linux.ie/newusers/alternatives.php
Lund University Libraries: Directory of open access journals
http://www.doaj.org/
Public Library of Science: PLoS Biology -a peer-reviewed, open-access
journal. http://biology.plosjournals.org/perlserv/?request=index-html
Thank you !!!
