
A PROJECT REPORT ON

***************Title of the Project***************

Submitted to ******************** University in partial fulfillment of the
requirements for the award of the degree of

***********Course Name*************

Done By

Mr./Miss *********************************************

**************Institute of Management and Computer Science


CERTIFICATE

This is to certify that Mr./Miss ****************** bearing Roll

No. ********************** has developed a software project

titled **************** for ********************* Software

Solutions in partial fulfillment of the requirements for the award of the Degree of

**************

Head of the Department Principal

******College Name******

External
ACKNOWLEDGEMENT

At the very outset I express my gratitude to the almighty Lord for showering his grace and
blessings upon me to complete this project.

Although my name appears on the cover of this report, many people contributed in one form
or another to this project's development. I could not have done this project without the
assistance and support of each of the following, and I thank you all.

I wish to place on record my deep sense of gratitude to my project guide, Mr. ******, *****
Software Solutions, for his constant motivation and valuable help throughout the project work. I
express my gratitude to Mr. ******, Director of ******* Institute of Management & Computer
Sciences, for his valuable suggestions and advice throughout the ***** course. I also extend my
thanks to the other faculty members for their cooperation during my course.

Finally, I would like to thank my friends for their cooperation in completing this project.
ABSTRACT
Conventional spatial queries, such as range search and nearest neighbour retrieval, involve only
conditions on objects’ geometric properties. Today, many modern applications call for novel
forms of queries that aim to find objects satisfying both a spatial predicate, and a predicate on
their associated texts. For example, instead of considering all the restaurants, a nearest neighbour
query would instead ask for the restaurant that is the closest among those whose menus contain
“steak, spaghetti, brandy” all at the same time. Currently, the best solution to such queries is
based on the IR2-tree, which, as shown in this paper, has a few deficiencies that seriously
impact its efficiency. Motivated by this, we develop a new access method called the spatial
inverted index that extends the conventional inverted index to cope with multidimensional data,
and comes with algorithms that can answer nearest neighbour queries with keywords in real
time. As verified by experiments, the proposed techniques outperform the IR2-tree in query
response time significantly, often by orders of magnitude.
INDEX
CHAPTERS

1 INTRODUCTION

1.1 PROBLEM STATEMENT


1.2 EXISTING SYSTEM
1.3 MOTIVATION
1.4 PROPOSED SYSTEM
1.5 SCOPE
1.6 OBJECTIVE

2 LITERATURE SURVEY

2.1 INTRODUCTION
2.2 DEFINITIONS
2.3 CHARACTERISTICS
2.4 PROBLEMS ASSOCIATED
2.5 TECHNIQUES AND ALGORITHMS APPLICABLE
2.6 OTHER SUGGESTED SOLUTIONS
2.7 APPLICATIONS

3 SOFTWARE REQUIREMENTS SPECIFICATION


3.1 OVERALL DESCRIPTION
3.2 OPERATING ENVIRONMENT
3.3 FUNCTIONAL REQUIREMENTS
3.4 NON-FUNCTIONAL REQUIREMENTS
3.5 COMMUNICATION INTERFACES

4. SYSTEM DESIGN

4.1 USE CASE DIAGRAM

All diagrams with brief explanations.

5. TESTING

5.1 TEST CASES

5.2 TEST RESULTS

6. OUTPUT SCREENS (NOT MORE THAN 10 PAGES; AT MOST TWO SCREENS PER
PAGE)

7. CONCLUSION AND FUTURE SCOPE

APPENDIX

REFERENCES
LIST OF TABLES

Operational Feasibility Table

Data Base Tables

User Login Table

CurLsr Table

CurNu Table

Positive Test Cases Table

Negative Test Cases Table

LIST OF FIGURES

Fig 3.1 Structure of Data Mining

Fig 3.2 DFD Diagrams

Fig 3.3 JVM

Fig 3.4 Java Compiler

Fig 3.5 Java IDE

Fig 3.6 TCP/IP OSI

Fig 3.7 Web Archive

Fig 4.1 DFD Diagrams

Fig 4.1.1 (a) Context Level DFD


Fig 4.1.2 (b) Login DFD Diagram

Fig 4.1.3 (c) User Data Flow

Fig 4.1.4 (d) 1st level DFD

Fig 4.1.5 (e) 2nd Level DFD

Fig 4.2.1 (a) Use Admin Activities

Fig 4.2.2 (b) Admin Data Flow

Fig 4.2.3 (c) 1st level DFD

Fig 4.2.4 (d) 2nd Level DFD

Fig 4.2.5 E-R Diagram

Fig 4.3.1 (a) Class Diagram

Fig 4.3.2 (b) Use Case Diagram

Fig 4.3.3 (c) Sequence Diagrams

Fig 4.3.4 (d) User Login Sequence

Fig 4.3.5 (e) User Sequence to View Location

Fig 4.3.6 Collaboration Diagrams

Fig 4.3.7 Activity Diagram

Fig 4.3.8 Statechart Diagram

Fig 4.3.9 Component Diagram

Fig 4.3.10 Deployment Diagram

Fig 5.1 Test Methods


INTRODUCTION
What is Data Mining?

Fig 3.1 Structure of Data Mining

Generally, data mining (sometimes called data or knowledge discovery) is the process of
analyzing data from different perspectives and summarizing it into useful information -
information that can be used to increase revenue, cut costs, or both. Data mining software is one
of a number of analytical tools for analyzing data. It allows users to analyze data from many
different dimensions or angles, categorize it, and summarize the relationships identified.
Technically, data mining is the process of finding correlations or patterns among dozens of fields
in large relational databases.

How Does Data Mining Work?

While large-scale information technology has been evolving separate transaction and analytical
systems, data mining provides the link between the two. Data mining software analyzes
relationships and patterns in stored transaction data based on open-ended user queries. Several
types of analytical software are available: statistical, machine learning, and neural
networks. Generally, any of four types of relationships are sought:

 Classes: Stored data is used to locate data in predetermined groups. For example, a
restaurant chain could mine customer purchase data to determine when customers visit
and what they typically order. This information could be used to increase traffic by
having daily specials.

 Clusters: Data items are grouped according to logical relationships or consumer
preferences. For example, data can be mined to identify market segments or consumer
affinities.

 Associations: Data can be mined to identify associations. The beer-diaper example is an
example of associative mining.

 Sequential patterns: Data is mined to anticipate behavior patterns and trends. For
example, an outdoor equipment retailer could predict the likelihood of a backpack being
purchased based on a consumer's purchase of sleeping bags and hiking shoes.

Data mining consists of five major elements:

1) Extract, transform, and load transaction data onto the data warehouse system.
2) Store and manage the data in a multidimensional database system.
3) Provide data access to business analysts and information technology professionals.
4) Analyze the data by application software.
5) Present the data in a useful format, such as a graph or table.

Different levels of analysis are available:

 Artificial neural networks: Non-linear predictive models that learn through training and
resemble biological neural networks in structure.
 Genetic algorithms: Optimization techniques that use processes such as genetic
combination, mutation, and natural selection in a design based on the concepts of natural
evolution.

 Decision trees: Tree-shaped structures that represent sets of decisions. These decisions
generate rules for the classification of a dataset. Specific decision tree methods include
Classification and Regression Trees (CART) and Chi Square Automatic Interaction
Detection (CHAID). CART and CHAID are decision tree techniques used for
classification of a dataset. They provide a set of rules that you can apply to a new
(unclassified) dataset to predict which records will have a given outcome. CART
segments a dataset by creating 2-way splits while CHAID segments using chi-square
tests to create multi-way splits. CART typically requires less data preparation than
CHAID.

 Nearest neighbor method: A technique that classifies each record in a dataset based on a
combination of the classes of the k record(s) most similar to it in a historical dataset
(where k ≥ 1). Sometimes called the k-nearest neighbor technique.

 Rule induction: The extraction of useful if-then rules from data based on statistical
significance.

 Data visualization: The visual interpretation of complex relationships in
multidimensional data. Graphics tools are used to illustrate data relationships.
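The nearest neighbor method above can be sketched in a few lines of Java. The Euclidean distance metric, the majority-vote rule, and the sample data in the test are illustrative assumptions; the description above does not fix a particular metric or voting scheme.

```java
import java.util.*;

// A minimal k-nearest-neighbor classifier sketch. Euclidean distance and a
// majority vote over the k closest historical records are assumed here.
public class KnnSketch {
    // A historical record: a feature vector plus its known class label.
    record Sample(double[] features, String label) {}

    // Euclidean distance between two feature vectors of equal length.
    static double euclidean(double[] a, double[] b) {
        double sum = 0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return Math.sqrt(sum);
    }

    // Classify a query point by majority vote among its k nearest samples.
    static String classify(List<Sample> history, double[] query, int k) {
        List<Sample> sorted = new ArrayList<>(history);
        Comparator<Sample> byDistance =
                Comparator.comparingDouble(s -> euclidean(s.features(), query));
        sorted.sort(byDistance);
        Map<String, Integer> votes = new HashMap<>();
        for (Sample s : sorted.subList(0, Math.min(k, sorted.size()))) {
            votes.merge(s.label(), 1, Integer::sum);
        }
        return Collections.max(votes.entrySet(), Map.Entry.comparingByValue()).getKey();
    }
}
```

A larger k smooths out noisy labels at the cost of blurring class boundaries.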

Characteristics of Data Mining:

 Large quantities of data: The volume of data is so great that it has to be analyzed by
automated techniques, e.g. satellite information, credit card transactions, etc.
 Noisy, incomplete data: Imprecise data is a characteristic of all data collection.
 Complex data structure: Conventional statistical analysis is not possible.
 Heterogeneous data stored in legacy systems.

Benefits of Data Mining:

1) It is one of the most effective services available today. With the help of data
mining, one can discover precious information about customers and their behavior for
a specific set of products, and evaluate, analyze, store, mine and load data related to
them.
2) An analytical CRM model and strategic business decisions can be made with the
help of data mining, as it helps in providing a complete synopsis of customers.
3) Countless organizations have installed data mining projects, and it has helped
them make unprecedented improvements in their marketing strategies (campaigns).
4) Data mining is generally used by organizations with a solid customer focus. Owing to its
flexible applicability, it is used heavily in applications to foresee crucial data, including
industry analysis and consumer buying behavior.
5) Fast-paced, prompt access to data along with economical processing techniques have
made data mining one of the most suitable services a company can seek.

Advantages of Data Mining:

1. Marketing / Retail:

Data mining helps marketing companies build models based on historical data to predict who
will respond to new marketing campaigns such as direct mail, online marketing campaigns,
etc. Through the results, marketers can take an appropriate approach to selling profitable
products to targeted customers.
Data mining brings many of the same benefits to retail companies as it does to marketing.
Through market basket analysis, a store can arrange its products so that customers can
conveniently buy frequently purchased products together. In addition, it also helps retail
companies offer discounts on particular products that will attract more customers.

2. Finance / Banking

Data mining gives financial institutions information about loans and credit
reporting. By building a model from historical customer data, a bank or financial institution
can distinguish good loans from bad ones. In addition, data mining helps banks detect
fraudulent credit card transactions to protect credit card owners.

3. Manufacturing

By applying data mining to operational engineering data, manufacturers can detect faulty
equipment and determine optimal control parameters. For example, semiconductor
manufacturers face the challenge that, even when the conditions of the manufacturing
environments at different wafer production plants are similar, the quality of the wafers is
not the same, and some, for unknown reasons, even have defects. Data mining has been
applied to determine the ranges of

control parameters that lead to the production of golden wafers. Those optimal control
parameters are then used to manufacture wafers of the desired quality.

4. Governments

Data mining helps government agencies by digging into and analyzing records of financial
transactions to build patterns that can detect money laundering or criminal activities.

5. Law enforcement:
Data mining can aid law enforcement in identifying criminal suspects as well as apprehending
these criminals by examining trends in location, crime type, habit, and other patterns of
behavior.

6. Researchers:

Data mining can assist researchers by speeding up their data analysis process, thus
allowing them more time to work on other projects.

1.1 PROBLEM STATEMENT:


 The existing solution fails to provide real-time answers on difficult inputs.

 The real nearest neighbour may lie quite far from the query point, while all the closer
neighbours are missing at least one of the query keywords.

1.2 EXISTING SYSTEM

 Spatial queries with keywords have not been extensively explored. In the past years, the
community has sparked enthusiasm in studying keyword search in relational databases.
 It is only recently that attention was diverted to multidimensional data. The best method
to date for nearest neighbour search with keywords is due to Felipe et al. They nicely
integrate two well-known concepts: the R-tree, a popular spatial index, and the signature file,
an effective method for keyword-based document retrieval. By doing so they develop a
structure called the IR2-tree, which has the strengths of both R-trees and signature files.
 Like R-trees, the IR2-tree preserves objects’ spatial proximity, which is the key to
solving spatial queries efficiently. On the other hand, like signature files, the IR2-tree is
able to filter out a considerable portion of the objects that do not contain all the query
keywords, thus significantly reducing the number of objects to be examined.

1.3 MOTIVATION
Integration tests are designed to test integrated software components to determine whether
they actually run as one program. Testing is event driven and is more concerned with the
basic outcome of screens or fields. Integration tests demonstrate that although the
components were individually satisfactory, as shown by successful unit testing, the
combination of components is correct and consistent. Integration testing is specifically
aimed at exposing the problems that arise from the combination of components.

DISADVANTAGES OF EXISTING SYSTEM:

 The existing solution fails to provide real-time answers on difficult inputs.

 The real nearest neighbour may lie quite far from the query point, while all the closer
neighbours are missing at least one of the query keywords.

1.4 PROPOSED SYSTEM

 In this paper, we design a variant of inverted index that is optimized for multidimensional
points, and is thus named the spatial inverted index (SI-index). This access method
successfully incorporates point coordinates into a conventional inverted index with small
extra space, owing to a delicate compact storage scheme.
 Meanwhile, an SI-index preserves the spatial locality of data points, and comes with an
R-tree built on every inverted list at little space overhead. As a result, it offers two
competing ways for query processing.
 We can (sequentially) merge multiple lists very much like merging traditional inverted
lists by ids. Alternatively, we can also leverage the R-trees to browse the points of all
relevant lists in ascending order of their distances to the query point. As demonstrated by
experiments, the SI-index significantly outperforms the IR2-tree in query efficiency,
often by orders of magnitude.
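The sequential merge by ids can be illustrated with a small sketch. This is not the SI-index's actual implementation (which also stores coordinates compactly and carries an R-tree per list); it only shows how sorted id lists for different keywords are intersected in linear time:

```java
import java.util.*;

// Illustrative sketch of merging inverted lists by id: each keyword maps to
// a list of point ids sorted ascending, and the answer to a multi-keyword
// query is the intersection of those lists. (Hypothetical data layout.)
public class InvertedListMerge {
    // Intersect two ascending id lists in a single linear pass.
    static List<Integer> intersect(List<Integer> a, List<Integer> b) {
        List<Integer> out = new ArrayList<>();
        int i = 0, j = 0;
        while (i < a.size() && j < b.size()) {
            int x = a.get(i), y = b.get(j);
            if (x == y) { out.add(x); i++; j++; }
            else if (x < y) i++;   // advance the list with the smaller head
            else j++;
        }
        return out;
    }

    // Merge any number of keyword lists by repeated pairwise intersection.
    // Assumes at least one list is supplied.
    static List<Integer> intersectAll(List<List<Integer>> lists) {
        List<Integer> result = lists.get(0);
        for (int k = 1; k < lists.size(); k++) {
            result = intersect(result, lists.get(k));
        }
        return result;
    }
}
```

Because each list is consumed once, the cost is linear in the total length of the lists involved.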

1.5 SCOPE
Integration tests are designed to test integrated software components to determine whether
they actually run as one program. Testing is event driven and is more concerned with the
basic outcome of screens or fields. Integration tests demonstrate that although the
components were individually satisfactory, as shown by successful unit testing, the
combination of components is correct and consistent. Integration testing is specifically
aimed at exposing the problems that arise from the combination of components.

ADVANTAGES OF PROPOSED SYSTEM:

 Distance browsing is easy with R-trees. In fact, the best-first algorithm is designed
exactly to output data points in ascending order of their distances.

 It is straightforward to extend our compression scheme to any dimensional space.
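Distance browsing itself can be shown with a simplified sketch. The real best-first algorithm traverses R-tree nodes with a priority queue; the version below queues raw points directly, which is an illustrative simplification, not the indexed algorithm:

```java
import java.util.*;

// Simplified distance browsing: emit point ids in ascending order of their
// distance to the query point. Best-first search over an R-tree queues tree
// nodes as well; queueing raw points here illustrates the same ordering idea.
public class DistanceBrowse {
    record Point(int id, double x, double y) {}

    static List<Integer> browse(List<Point> points, double qx, double qy) {
        Comparator<Point> byDistance =
                Comparator.comparingDouble(p -> Math.hypot(p.x() - qx, p.y() - qy));
        PriorityQueue<Point> queue = new PriorityQueue<>(byDistance);
        queue.addAll(points);
        List<Integer> order = new ArrayList<>();
        while (!queue.isEmpty()) order.add(queue.poll().id()); // nearest first
        return order;
    }
}
```

For a k-nearest-neighbour query, one simply stops after polling k points instead of draining the queue.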

MODULES:
 System Model
 Map View
 Distance Search
 Neighbor Search

MODULES DESCRIPTION

System Model
 In this module a user has to register first; only then can he/she access the database.
 In this module, any of the above-mentioned persons has to log in, which they do by
giving their email id and password.
 In this module the Admin registers a location along with its famous places. He also
measures the distance of the corresponding place from the corresponding source place by
using the spatial distance of Google Maps.
 This means that the user can give as the key whatever the city/location is famous for.
This results in a list of menu items being displayed.

Map View:

 The user can see a view of their locality via Google Maps (such as map view or satellite
view).
 As our goal is to combine keyword search with the existing location-finding services on
facilities such as hospitals, restaurants, hotels, etc., we will focus on dimensionality 2, but
our technique can be extended to arbitrary dimensionalities with no technical obstacle.
 Note that the list of each word maintains a sorted order of point ids, which provides
considerable convenience in query processing by allowing an efficient merge step. For
example, assume that we want to find the points that have words c and d. This is
essentially to compute the intersection of the two words’ inverted lists.

Distance Search:

 The user can measure the distance and calculate the time it takes to reach the
destination by giving a speed. A chart is prepared using these values. This is done
using Google Maps.
 Traditional nearest neighbor search returns the data point closest to a query point.
 We consider that the data set does not fit in memory, and needs to be indexed by efficient
access methods in order to minimize the number of I/Os in answering a query.
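A self-contained sketch of the distance/time computation is given below. The module relies on Google Maps, which cannot be reproduced here, so the great-circle (haversine) formula stands in as an assumed substitute for illustration only:

```java
// Sketch of the Distance Search module's core arithmetic. The great-circle
// (haversine) distance is an assumption standing in for the Google Maps
// spatial distance used by the actual module.
public class DistanceSearch {
    static final double EARTH_RADIUS_KM = 6371.0;

    // Great-circle distance in kilometers between two (lat, lon) points.
    static double haversineKm(double lat1, double lon1, double lat2, double lon2) {
        double dLat = Math.toRadians(lat2 - lat1);
        double dLon = Math.toRadians(lon2 - lon1);
        double a = Math.sin(dLat / 2) * Math.sin(dLat / 2)
                 + Math.cos(Math.toRadians(lat1)) * Math.cos(Math.toRadians(lat2))
                   * Math.sin(dLon / 2) * Math.sin(dLon / 2);
        return 2 * EARTH_RADIUS_KM * Math.asin(Math.sqrt(a));
    }

    // Travel time in hours at a user-supplied average speed in km/h,
    // as described above (distance and speed give the expected time).
    static double travelTimeHours(double distanceKm, double speedKmph) {
        return distanceKm / speedKmph;
    }
}
```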
Neighbor Search

 In this module we implement our neighbor search. The other problem with this
search algorithm is that the indexing information has to be replicated in the broadcast
cycle to enable scanning twice.
 The first scan is for deciding the search range, and the second scan is for retrieving k
objects based on the search range.
 Therefore, we propose the Nearest Neighbor query approach to improve the
preceding on-air query algorithm.

 The system attempts to verify the validity of k objects by processing results obtained
from several peers.

1.6 OBJECTIVE
Conventional spatial queries, such as range search and nearest neighbour retrieval, involve only
conditions on objects’ geometric properties. Today, many modern applications call for novel
forms of queries that aim to find objects satisfying both a spatial predicate, and a predicate on
their associated texts. For example, instead of considering all the restaurants, a nearest
neighbour query would instead ask for the restaurant that is the closest among those whose
menus contain “steak, spaghetti, brandy” all at the same time. Currently, the best solution to
such queries is based on the IR2-tree, which, as shown in this paper, has a few deficiencies
that seriously impact its efficiency. Motivated by this, we develop a new access method called
the spatial inverted index that extends the conventional inverted index to cope with
multidimensional data, and comes with algorithms that can answer nearest neighbour queries
with keywords in real time.
LITERATURE SURVEY

1. “DBXplorer: A System for Keyword-Based Search over Relational Databases”

AUTHORS: S. Agrawal, S. Chaudhuri, and G. Das


Internet search engines have popularized the keyword-based search paradigm. While traditional
database management systems offer powerful query languages, they do not allow keyword-based
search. In this paper, we discuss DBXplorer, a system that enables keyword-based searches in
relational databases. DBXplorer has been implemented using a commercial relational database
and Web server and allows users to interact via a browser front-end. We outline the challenges
and discuss the implementation of our system, including results of extensive experimental
evaluation.

2. “Keyword Searching and Browsing in Databases Using BANKS”

AUTHORS: G. Bhalotia, A. Hulgeri, C. Nakhe, S. Chakrabarti
With the growth of the Web, there has been a rapid increase in the number of users who need to
access online databases without having a detailed knowledge of the schema or of query
languages; even relatively simple query languages designed for non-experts are too complicated
for them. We describe BANKS, a system which enables keyword-based search on relational
databases, together with data and schema browsing. BANKS enables users to extract information
in a simple manner without any knowledge of the schema or any need for writing complex
queries. A user can get information by typing a few keywords, following hyperlinks, and
interacting with controls on the displayed results. BANKS models tuples as nodes in a graph,
connected by links induced by foreign key and other relationships. Answers to a query are
modeled as rooted trees connecting tuples that match individual keywords in the query. Answers
are ranked using a notion of proximity coupled with a notion of prestige of nodes based on
inlinks, similar to techniques developed for Web search. We present an efficient heuristic
algorithm for finding and ranking query results.

3. “Spatial Keyword Querying”

AUTHORS: X. Cao, L. Chen, G. Cong, C.S. Jensen, Q. Qu, A. Skovsgaard, D. Wu, and M.L.
Yiu
The web is increasingly being used by mobile users. In addition, it is increasingly becoming
possible to accurately geo-position mobile users and web content. This development gives
prominence to spatial web data management. Specifically, a spatial keyword query takes a user
location and user-supplied keywords as arguments and returns web objects that are spatially and
textually relevant to these arguments. This paper reviews recent results by the authors that aim to
achieve spatial keyword querying functionality that is easy to use, relevant to users, and can be
supported efficiently. The paper covers different kinds of functionality as well as the ideas
underlying their definition.

4. “Retrieving Top-k Prestige-Based Relevant Spatial Web Objects”

AUTHORS: X. Cao, G. Cong, and C.S. Jensen
The location-aware keyword query returns ranked objects that are near a query location and that
have textual descriptions that match query keywords. This query occurs inherently in many types
of mobile and traditional web services and applications, e.g., Yellow Pages and Maps services.
Previous work considers the potential results of such a query as being independent when ranking
them. However, a relevant result object with nearby objects that are also relevant to the query is
likely to be preferable over a relevant object without relevant nearby objects.
The paper proposes the concept of prestige-based relevance to capture both the textual relevance
of an object to a query and the effects of nearby objects. Based on this, a new type of query, the
Location-aware top-k Prestige-based Text retrieval (LkPT) query, is proposed that retrieves the
top-k spatial web objects ranked according to both prestige-based relevance and location
proximity.

We propose two algorithms that compute LkPT queries. Empirical studies with real-world spatial
data demonstrate that LkPT queries are more effective in retrieving web objects than a previous
approach that does not consider the effects of nearby objects; and they show that the proposed
algorithms are scalable and outperform a baseline approach significantly.

5. “Efficient Query Processing in Geographic Web Search Engines”

AUTHORS: Y.-Y. Chen, T. Suel, and A. Markowitz
Geographic web search engines allow users to constrain and order search results in an intuitive
manner by focusing a query on a particular geographic region. Geographic search technology,
also called local search, has recently received significant interest from major search engine
companies. Academic research in this area has focused primarily on techniques for extracting
geographic knowledge from the web. In this paper, we study the problem of efficient query
processing in scalable geographic search engines. Query processing is a major bottleneck in
standard web search engines, and the main reason for the thousands of machines used by the
major engines. Geographic search engine query processing is different in that it requires a
combination of text and spatial data processing techniques. We propose several algorithms for
efficient query processing in geographic search engines, integrate them into an existing web
search query processor, and evaluate them on large sets of real data and query traces.
3. SOFTWARE REQUIREMENTS SPECIFICATION

3.1 OVERALL DESCRIPTION


Java Technology

Java technology is both a programming language and a platform.

The Java Programming Language


The Java programming language is a high-level language that can be characterized by all
of the following buzzwords:

 Simple
 Architecture neutral
 Object oriented
 Portable
 Distributed
 High performance
 Interpreted
 Multithreaded
 Robust
 Dynamic
 Secure
With most programming languages, you either compile or interpret a program so that you
can run it on your computer. The Java programming language is unusual in that a program is
both compiled and interpreted. With the compiler, first you translate a program into an
intermediate language called Java byte codes, the platform-independent codes interpreted by
the interpreter on the Java platform. The interpreter parses and runs each Java byte code
instruction on the computer. Compilation happens just once; interpretation occurs each time the
program is executed. The following figure illustrates how this works.

Fig 3.3 Java Virtual Machine

You can think of Java byte codes as the machine code instructions for the Java Virtual
Machine (Java VM). Every Java interpreter, whether it’s a development tool or a Web browser
that can run applets, is an implementation of the Java VM. Java byte codes help make “write
once, run anywhere” possible. You can compile your program into byte codes on any platform
that has a Java compiler. The byte codes can then be run on any implementation of the Java VM.
That means that as long as a computer has a Java VM, the same program written in the Java
programming language can run on Windows 2000, a Solaris workstation, or on an iMac.
Fig 3.4 Java Compiler
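The compile-then-interpret pipeline described above can be seen with a minimal program; the class name and message below are arbitrary:

```java
// Minimal program illustrating the Java pipeline described above:
//   javac Hello.java   -> the compiler produces platform-independent
//                         bytecode in Hello.class (compilation happens once)
//   java Hello         -> the Java VM interprets that bytecode on any
//                         platform (interpretation happens on each run)
public class Hello {
    // Kept as a separate method so the message can be inspected directly.
    static String message() { return "Hello from the Java VM"; }

    public static void main(String[] args) {
        System.out.println(message());
    }
}
```

The same Hello.class file runs unchanged on any platform with a Java VM, which is exactly the “write once, run anywhere” property described above.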

The Java Platform


A platform is the hardware or software environment in which a program runs.
We’ve already mentioned some of the most popular platforms like Windows 2000,
Linux, Solaris, and MacOS. Most platforms can be described as a combination of the
operating system and hardware. The Java platform differs from most other platforms in
that it’s a software-only platform that runs on top of other hardware-based platforms.
The Java platform has two components:
 The Java Virtual Machine (Java VM)
 The Java Application Programming Interface (Java API)
You’ve already been introduced to the Java VM. It’s the base for the Java platform
and is ported onto various hardware-based platforms.
The Java API is a large collection of ready-made software components that provide
many useful capabilities, such as graphical user interface (GUI) widgets. The Java API is
grouped into libraries of related classes and interfaces; these libraries are known as
packages. The next section, What Can Java Technology Do?, highlights what
functionality some of the packages in the Java API provide.
The following figure depicts a program that’s running on the Java platform. As the
figure shows, the Java API and the virtual machine insulate the program from the
hardware.
Native code is code that, after you compile it, runs on a specific
hardware platform. As a platform-independent environment, the Java platform can be a
bit slower than native code. However, smart compilers, well-tuned interpreters, and just-
in-time byte code compilers can bring performance close to that of native code without
threatening portability.
What Can Java Technology Do?
The most common types of programs written in the Java programming language are
applets and applications. If you’ve surfed the Web, you’re probably already familiar with
applets. An applet is a program that adheres to certain conventions that allow it to run
within a Java-enabled browser.
However, the Java programming language is not just for writing cute, entertaining applets
for the Web. The general-purpose, high-level Java programming language is also a
powerful software platform. Using the generous API, you can write many types of
programs.
An application is a standalone program that runs directly on the Java platform. A special
kind of application known as a server serves and supports clients on a network. Examples
of servers are Web servers, proxy servers, mail servers, and print servers. Another
specialized program is a servlet. A servlet can almost be thought of as an applet that runs
on the server side. Java Servlets are a popular choice for building interactive web
applications, replacing the use of CGI scripts. Servlets are similar to applets in that they
are runtime extensions of applications. Instead of working in browsers, though, servlets
run within Java Web servers, configuring or tailoring the server.
How does the API support all these kinds of programs? It does so with packages of
software components that provide a wide range of functionality. Every full
implementation of the Java platform gives you the following features:
 The essentials: Objects, strings, threads, numbers, input and output, data
structures, system properties, date and time, and so on.
 Applets: The set of conventions used by applets.
 Networking: URLs, TCP (Transmission Control Protocol), UDP (User Datagram
Protocol) sockets, and IP (Internet Protocol) addresses.
 Internationalization: Help for writing programs that can be localized for users
worldwide. Programs can automatically adapt to specific locales and be displayed
in the appropriate language.
 Security: Both low level and high level, including electronic signatures, public
and private key management, access control, and certificates.
 Software components: Known as JavaBeans™, these can plug into existing
component architectures.
 Object serialization: Allows lightweight persistence and communication via
Remote Method Invocation (RMI).
 Java Database Connectivity (JDBC™): Provides uniform access to a wide
range of relational databases.
The Java platform also has APIs for 2D and 3D graphics, accessibility, servers,
collaboration, telephony, speech, animation, and more. The following figure depicts what
is included in the Java 2 SDK.

Fig 3.5 Java IDE


How Will Java Technology Change My Life?
We can’t promise you fame, fortune, or even a job if you learn the Java programming
language. Still, it is likely to make your programs better and require less effort than
other languages. We believe that Java technology will help you do the following:
 Get started quickly: Although the Java programming language is a powerful
object-oriented language, it’s easy to learn, especially for programmers already
familiar with C or C++.
 Write less code: Comparisons of program metrics (class counts, method counts,
and so on) suggest that a program written in the Java programming language can
be four times smaller than the same program in C++.
 Write better code: The Java programming language encourages good coding
practices, and its garbage collection helps you avoid memory leaks. Its object
orientation, its JavaBeans component architecture, and its wide-ranging, easily
extendible API let you reuse other people’s tested code and introduce fewer bugs.
 Develop programs more quickly: Your development time may be as much as
twice as fast as when writing the same program in C++. Why? You write fewer
lines of code, and Java is a simpler programming language than C++.
 Avoid platform dependencies with 100% Pure Java: You can keep your
program portable by avoiding the use of libraries written in other languages. The
100% Pure Java™ Product Certification Program has a repository of historical
process manuals, white papers, brochures, and similar materials online.
 Write once, run anywhere: Because 100% Pure Java programs are compiled into
machine-independent byte codes, they run consistently on any Java platform.
 Distribute software more easily: You can upgrade applets easily from a central
server. Applets take advantage of the feature of allowing new classes to be loaded
“on the fly,” without recompiling the entire program.
ODBC
Microsoft Open Database Connectivity (ODBC) is a standard programming interface for
application developers and database systems providers. Before ODBC became a de facto
standard for Windows programs to interface with database systems, programmers had to use
proprietary languages for each database they wanted to connect to. Now, ODBC has made the
choice of the database system almost irrelevant from a coding perspective, which is as it should
be. Application developers have much more important things to worry about than the syntax that
is needed to port their program from one database to another when business needs suddenly
change.
Through the ODBC Administrator in Control Panel, you can specify the particular
database that is associated with a data source that an ODBC application program is written to
use. Think of an ODBC data source as a door with a name on it. Each door will lead you to a
particular database. For example, the data source named Sales Figures might be a SQL Server
database, whereas the Accounts Payable data source could refer to an Access database. The
physical database referred to by a data source can reside anywhere on the LAN.
The ODBC system files are not installed on your system by Windows 95. Rather, they
are installed when you set up a separate database application, such as SQL Server Client or
Visual Basic 4.0. When the ODBC icon is installed in Control Panel, it uses a file called
ODBCINST.DLL. It is also possible to administer your ODBC data sources through a stand-
alone program called ODBCADM.EXE. There is a 16-bit and a 32-bit version of this program
and each maintains a separate list of ODBC data sources.

From a programming perspective, the beauty of ODBC is that the application can be
written to use the same set of function calls to interface with any data source, regardless of the
database vendor. The source code of the application doesn’t change whether it talks to Oracle or
SQL Server. We only mention these two as an example. There are ODBC drivers available for
several dozen popular database systems. Even Excel spreadsheets and plain text files can be
turned into data sources. The operating system uses the Registry information written by ODBC
Administrator to determine which low-level ODBC drivers are needed to talk to the data source
(such as the interface to Oracle or SQL Server). The loading of the ODBC drivers is transparent
to the ODBC application program. In a client/server environment, the ODBC API even handles
many of the network issues for the application programmer.
The advantages of this scheme are so numerous that you are probably thinking there must
be some catch. The only disadvantage of ODBC is that it isn’t as efficient as talking directly to
the native database interface. ODBC has had many detractors who charge that it is too slow.
Microsoft has always claimed that the critical factor in performance is the quality of the driver
software that is used. In our humble opinion, this is true. The availability of good ODBC drivers
has improved a great deal recently. And anyway, the criticism about performance is somewhat
analogous to those who said that compilers would never match the speed of pure assembly
language. Maybe not, but the compiler (or ODBC) gives you the opportunity to write cleaner
programs, which means you finish sooner. Meanwhile, computers get faster every year.

JDBC
In an effort to set an independent database standard API for Java, Sun Microsystems
developed Java Database Connectivity, or JDBC. JDBC offers a generic SQL database access
mechanism that provides a consistent interface to a variety of RDBMSs. This consistent interface
is achieved through the use of “plug-in” database connectivity modules, or drivers. If a database
vendor wishes to have JDBC support, he or she must provide the driver for each platform that the
database and Java run on.
To gain a wider acceptance of JDBC, Sun based JDBC’s framework on ODBC. As you
discovered earlier in this chapter, ODBC has widespread support on a variety of platforms.
Basing JDBC on ODBC will allow vendors to bring JDBC drivers to market much faster than
developing a completely new connectivity solution.
JDBC was announced in March of 1996. It was released for a 90-day public review that
ended June 8, 1996. Because of user input, the final JDBC v1.0 specification was released soon
after.
The remainder of this section will cover enough information about JDBC for you to know what it
is about and how to use it effectively. This is by no means a complete overview of JDBC. That
would fill an entire book.
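Since the section above describes JDBC only at a high level, the following is a minimal sketch of typical JDBC usage. The database name `projectdb`, table `users`, and column `email` are hypothetical, chosen only for illustration; actually executing the query would require a JDBC driver for the target database (e.g. MySQL) on the classpath, so only the URL-building helper runs standalone.

```java
import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.ResultSet;
import java.sql.SQLException;

public class JdbcSketch {

    // Builds a JDBC connection URL for MySQL from host, port, and database name.
    public static String buildMySqlUrl(String host, int port, String database) {
        return "jdbc:mysql://" + host + ":" + port + "/" + database;
    }

    // Illustrative only: a parameterized SELECT over an already-open connection.
    // The table and column names here are hypothetical.
    public static void printUserEmail(Connection conn, int userId) throws SQLException {
        String sql = "SELECT email FROM users WHERE id = ?";
        try (PreparedStatement ps = conn.prepareStatement(sql)) {
            ps.setInt(1, userId);
            try (ResultSet rs = ps.executeQuery()) {
                while (rs.next()) {
                    System.out.println(rs.getString("email"));
                }
            }
        }
    }

    public static void main(String[] args) {
        // With a driver on the classpath, a connection would be obtained via
        // DriverManager.getConnection(url, user, password).
        System.out.println(buildMySqlUrl("localhost", 3306, "projectdb"));
        // prints: jdbc:mysql://localhost:3306/projectdb
    }
}
```

The driver-specific details live entirely in the URL and driver jar; the java.sql calls themselves stay the same across vendors, which is exactly the consistency the section describes.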

JDBC Goals
Few software packages are designed without goals in mind, and JDBC is no exception:
its many goals, in conjunction with early reviewer feedback, drove the development of
the API and finalized the JDBC class library into a solid framework for building
database applications in Java.
The goals that were set for JDBC are important. They give some insight into why
certain classes and functionalities behave the way they do. The design goals for JDBC are
as follows:

1. SQL Level API
The designers felt that their main goal was to define a SQL interface for Java. Although not
the lowest database interface level possible, it is at a low enough level for higher-level tools
and APIs to be created. Conversely, it is at a high enough level for application programmers
to use it confidently. Attaining this goal allows for future tool vendors to “generate” JDBC
code and to hide many of JDBC’s complexities from the end user.
2. SQL Conformance
SQL syntax varies as you move from database vendor to database vendor. In an effort to
support a wide variety of vendors, JDBC will allow any query statement to be passed through
it to the underlying database driver. This allows the connectivity module to handle non-
standard functionality in a manner that is suitable for its users.
3. JDBC must be implementable on top of common database interfaces
The JDBC SQL API must “sit” on top of other common SQL level APIs. This goal
allows JDBC to use existing ODBC level drivers by the use of a software interface. This
interface would translate JDBC calls to ODBC and vice versa.
4. Provide a Java interface that is consistent with the rest of the Java system
Because of Java’s acceptance in the user community thus far, the designers felt that they
should not stray from the current design of the core Java system.
5. Keep it simple
This goal probably appears in all software design goal listings. JDBC is no exception.
Sun felt that the design of JDBC should be very simple, allowing for only one method of
completing a task per mechanism. Allowing duplicate functionality only serves to confuse
the users of the API.
6. Use strong, static typing wherever possible
Strong typing allows more error checking to be done at compile time, so fewer errors
appear at runtime.
7. Keep the common cases simple
Because, more often than not, the usual SQL calls used by the programmer are simple
SELECTs, INSERTs, DELETEs and UPDATEs, these queries should be simple to
perform with JDBC. However, more complex SQL statements should also be possible.
Java has two things: a programming language and a platform. Java is a high-level
programming language that is all of the following:

 Simple
 Object-oriented
 Distributed
 Interpreted
 Robust
 Secure
 Architecture-neutral
 Portable
 High-performance
 Multithreaded
 Dynamic

Java is also unusual in that each Java program is both compiled and interpreted.
With a compiler, you translate a Java program into an intermediate language called Java
bytecodes: platform-independent codes that are interpreted by the Java interpreter, which
parses and runs each bytecode instruction on the computer.

Compilation happens just once; interpretation occurs each time the program is
executed. The figure illustrates how this works.

You can think of Java byte codes as the machine code instructions for the Java
Virtual Machine (Java VM). Every Java interpreter, whether it’s a Java development
tool or a Web browser that can run Java applets, is an implementation of the Java VM.
The Java VM can also be implemented in hardware.
Java bytecodes help make “write once, run anywhere” possible. You can compile
your Java program into bytecodes on any platform that has a Java compiler. The
bytecodes can then be run on any implementation of the Java VM. For example, the same
Java program can run on Windows NT, Solaris, and Macintosh.

Networking

TCP/IP stack
The TCP/IP stack is shorter than the OSI one:

Fig 3.6 TCP/IP and OSI protocol stacks
TCP is a connection-oriented protocol; UDP (User Datagram Protocol) is a
connectionless protocol.
IP datagrams
The IP layer provides a connectionless and unreliable delivery system. It considers
each datagram independently of the others. Any association between datagrams must be
supplied by the higher layers. The IP layer supplies a checksum that includes its own
header. The header includes the source and destination addresses. The IP layer handles
routing through an Internet. It is also responsible for breaking up large datagrams into
smaller ones for transmission and reassembling them at the other end.
UDP
UDP is also connectionless and unreliable. What it adds to IP is a checksum for the
contents of the datagram and port numbers. These are used to support the client/server
model described later.

TCP
TCP supplies logic to give a reliable connection-oriented protocol above IP. It
provides a virtual circuit that two processes can use to communicate.
Internet addresses
In order to use a service, you must be able to find it. The Internet uses an address
scheme for machines so that they can be located. The address is a 32 bit integer which
gives the IP address. This encodes a network ID and more addressing. The network ID
falls into various classes according to the size of the network address.
Network address
Class A uses 8 bits for the network address, with 24 bits left over for other
addressing. Class B uses 16-bit network addressing, class C uses 24-bit network
addressing, and class D addresses are used for multicast.
Subnet address
Internally, the UNIX network is divided into sub networks. Building 11 is currently
on one sub network and uses 10-bit addressing, allowing 1024 different hosts.
Host address
8 bits are finally used for host addresses within our subnet. This places a limit of
256 machines that can be on the subnet.
Total address

The 32 bit address is usually written as 4 integers separated by dots.
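The classful scheme described above can be sketched as a small helper that reports the class of a dotted-quad address from its first octet. This uses the conventional first-octet ranges (A up to 127, B up to 191, C up to 223, D for multicast up to 239, E reserved), which slightly refine the summary in the text:

```java
public class IpClass {

    // Returns the classful network class (A-E) of a dotted-quad IPv4 address,
    // determined by its first octet: A = 0-127, B = 128-191, C = 192-223,
    // D (multicast) = 224-239, E (reserved) = 240-255.
    public static char classOf(String dottedQuad) {
        int first = Integer.parseInt(dottedQuad.split("\\.")[0]);
        if (first < 128) return 'A';
        if (first < 192) return 'B';
        if (first < 224) return 'C';
        if (first < 240) return 'D';
        return 'E';
    }

    public static void main(String[] args) {
        System.out.println(classOf("10.0.0.1"));    // prints: A
        System.out.println(classOf("172.16.5.4"));  // prints: B
        System.out.println(classOf("192.168.1.1")); // prints: C
        System.out.println(classOf("224.0.0.251")); // prints: D
    }
}
```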

Port addresses
A service exists on a host, and is identified by its port. This is a 16 bit number. To
send a message to a server, you send it to the port for that service of the host that it is
running on. This is not location transparency! Certain of these ports are "well known".

Sockets
A socket is a data structure maintained by the system to handle network
connections. A socket is created using the socket call. It returns an integer that is like a
file descriptor. In fact, under Windows, this handle can be used with the ReadFile and
WriteFile functions.
#include <sys/types.h>
#include <sys/socket.h>

/* family: AF_INET for IPv4; type: SOCK_STREAM for TCP or SOCK_DGRAM for UDP;
   protocol: usually 0. Returns a socket descriptor, or -1 on error. */
int socket(int family, int type, int protocol);
Here "family" will be AF_INET for IP communications, protocol will be zero, and
type will depend on whether TCP or UDP is used. Two processes wishing to
communicate over a network create a socket each. These are similar to two ends of a
pipe - but the actual pipe does not yet exist.
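The prototype above is the BSD/POSIX C API; in Java the same idea is wrapped by the Socket and ServerSocket classes. The sketch below creates both ends of the "pipe" in one process: a server thread accepts a connection and echoes one line back to the client. The "echo: " prefix is an arbitrary choice for the demo.

```java
import java.io.BufferedReader;
import java.io.InputStreamReader;
import java.io.PrintWriter;
import java.net.ServerSocket;
import java.net.Socket;

public class SocketPairDemo {

    // Sends one line to a local echo server and returns the echoed reply.
    public static String roundTrip(String message) throws Exception {
        // Bind a listening socket on an ephemeral port chosen by the OS.
        try (ServerSocket server = new ServerSocket(0)) {
            // Server end: accept one connection and echo one line back.
            Thread echo = new Thread(() -> {
                try (Socket s = server.accept();
                     BufferedReader in = new BufferedReader(
                             new InputStreamReader(s.getInputStream()));
                     PrintWriter out = new PrintWriter(s.getOutputStream(), true)) {
                    out.println("echo: " + in.readLine());
                } catch (Exception e) {
                    throw new RuntimeException(e);
                }
            });
            echo.start();

            // Client end: connect to the server's port and exchange one line.
            try (Socket client = new Socket("localhost", server.getLocalPort());
                 PrintWriter out = new PrintWriter(client.getOutputStream(), true);
                 BufferedReader in = new BufferedReader(
                         new InputStreamReader(client.getInputStream()))) {
                out.println(message);
                String reply = in.readLine();
                echo.join();
                return reply;
            }
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println(roundTrip("hello")); // prints: echo: hello
    }
}
```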
JFree Chart
JFreeChart is a free 100% Java chart library that makes it easy for developers to
display professional quality charts in their applications. JFreeChart's extensive feature set
includes:
A consistent and well-documented API, supporting a wide range of chart types;
A flexible design that is easy to extend, and targets both server-side and client-
side applications;
Support for many output types, including Swing components, image files
(including PNG and JPEG), and vector graphics file formats (including PDF, EPS and
SVG);
JFreeChart is "open source" or, more specifically, free software. It is distributed
under the terms of the GNU Lesser General Public Licence (LGPL), which permits use in
proprietary applications.
1. Map Visualizations
Charts showing values that relate to geographical areas. Some examples include:
(a) population density in each state of the United States, (b) income per capita for each
country in Europe, (c) life expectancy in each country of the world. The tasks in this
project include:
Sourcing freely redistributable vector outlines for the countries of the world,
states/provinces in particular countries (USA in particular, but also other areas);
Creating an appropriate dataset interface (plus default implementation), a
renderer, and integrating this with the existing XYPlot class in JFreeChart;
Testing, documenting, testing some more, documenting some more.
2. Time Series Chart Interactivity

Implement a new (to JFreeChart) feature for interactive time series charts --- to display a
separate control that shows a small version of ALL the time series data, with a sliding "view"
rectangle that allows you to select the subset of the time series data to display in the main
chart.
3. Dashboards
There is currently a lot of interest in dashboard displays. Create a flexible dashboard
mechanism that supports a subset of JFreeChart chart types (dials, pies, thermometers, bars,
and lines/time series) that can be delivered easily via both Java Web Start and an applet.

4. Property Editors

The property editor mechanism in JFreeChart only handles a small subset of the
properties that can be set for charts. Extend (or reimplement) this mechanism to provide
greater end-user control over the appearance of the charts.

What is a Java Web Application?
A Java web application generates interactive web pages containing various types of markup
language (HTML, XML, and so on) and dynamic content. It is typically composed of web
components such as JavaServer Pages (JSP), servlets and JavaBeans to modify and temporarily
store data, interact with databases and web services, and render content in response to client
requests.
Because many of the tasks involved in web application development can be repetitive or require
a surplus of boilerplate code, web frameworks can be applied to alleviate the overhead associated
with common activities. For example, many frameworks, such as JavaServer Faces, provide
libraries for templating pages and session management, and often promote code reuse.

3.2 OPERATING ENVIRONMENT

SYSTEM REQUIREMENTS:
HARDWARE REQUIREMENTS:

 System : Pentium IV 2.4 GHz
 Hard Disk : 40 GB
 Floppy Drive : 1.44 MB
 Monitor : 15'' VGA Colour
 Mouse : Logitech
 RAM : 512 MB

SOFTWARE REQUIREMENTS:

 Operating system : Windows XP/7
 Coding Language : Java/J2EE
 IDE : NetBeans 7.4
 Database : MySQL

3.3 FUNCTIONAL REQUIREMENTS
INPUT DESIGN

Input design is a part of overall system design. The main objectives during input
design are as given below:

 To produce a cost-effective method of input.
 To achieve the highest possible level of accuracy.
 To ensure that the input is acceptable to and understood by the user.
INPUT STAGES:

The main input stages can be listed as below:

 Data recording
 Data transcription
 Data conversion
 Data verification
 Data control
 Data transmission
 Data validation
 Data correction

INPUT TYPES:

It is necessary to determine the various types of inputs. Inputs can be categorized as follows:

 External inputs, which are prime inputs for the system.
 Internal inputs, which are user communications with the system.
 Operational inputs, which are computer department communications to the system.
 Interactive inputs, which are entered during a dialogue.

INPUT MEDIA:

At this stage a choice has to be made about the input media. To decide on the input
media, consideration has to be given to:

 Type of input
 Flexibility of format
 Speed
 Accuracy
 Verification methods
 Rejection rates
 Ease of correction
 Storage and handling requirements
 Security
 Easy to use
 Portability
Keeping in view the above description of the input types and input media, it can be said
that most of the inputs are internal and interactive. As input data is directly keyed in by
the user, the keyboard can be considered the most suitable input device.

ERROR AVOIDANCE

At this stage care is to be taken to ensure that input data remains accurate from the stage
at which it is recorded up to the stage at which it is accepted by the system. This can be
achieved only by means of careful control each time the data is handled.

ERROR DETECTION

Even though every effort is made to avoid the occurrence of errors, a small
proportion of errors is always likely to occur. These types of errors can be discovered by
using validations to check the input data.

DATA VALIDATION

Procedures are designed to detect errors in data at a lower level of detail. Data
validations have been included in the system in almost every area where there is a possibility for
the user to commit errors. The system will not accept invalid data. Whenever invalid data is
keyed in, the system immediately prompts the user, who has to key in the data again; the
system accepts the data only if it is correct. Validations have been included where
necessary.

The system is designed to be user friendly. In other words, the system has been
designed to communicate effectively with the user. The system has been designed with
pop-up menus.

FEASIBILITY STUDY:
Preliminary investigation examines project feasibility: the likelihood that the system will be
useful to the organization. A Project Feasibility Study is an exercise that involves documenting
each of the potential solutions to a particular business problem or opportunity. Feasibility Studies
can be undertaken by any type of business, project or team and they are a critical part of the
Project Life Cycle.

The purpose of a Feasibility Study is to identify the likelihood of one or more solutions
meeting the stated business requirements. In other words, if you are unsure whether your solution
will deliver the outcome you want, then a Project Feasibility Study will help gain that clarity.
During the Feasibility Study, a variety of 'assessment' methods are undertaken. The outcome of
the Feasibility Study is a confirmed solution for implementation.

The main objective of the feasibility study is to test the technical, operational and
economic feasibility of adding new modules and debugging the old running system. Any
system is feasible given unlimited resources and infinite time. There are three aspects in the
feasibility study portion of the preliminary investigation:

 Technical Feasibility
 Economic Feasibility
 Operational Feasibility
Technical Feasibility:

 The first step in the feasibility study is for the organization or company to decide
what technologies are suitable for development, considering the existing system.
 This application uses technologies such as Java and Oracle, free software that can be
downloaded from the web.
 Java IDE: the development tool used.

Determining Economic Feasibility

For any system if the expected benefits equal or exceed the expected costs, the system can be
judged to be economically feasible. In economic feasibility, cost benefit analysis is done in
which expected costs and benefits are evaluated. Economic analysis is used for evaluating the
effectiveness of the proposed system.

In economic feasibility, the most important element is cost-benefit analysis. As the name
suggests, it is an analysis of the costs to be incurred in the system and the benefits derivable
from the system.

Operational Feasibility

Not only must an application make economic and technical sense, it must also make operational
sense.

The following issues should be considered when determining the operational feasibility of a project:

Operations Issues

 What tools are needed to support operations?
 What skills will operators need to be trained in?
 What processes need to be created and/or updated?
 What documentation does operations need?

Support Issues

 What documentation will users be given?
 What training will users be given?
 How will change requests be managed?

Very often you will need to improve the existing operations, maintenance, and support
infrastructure to support the operation of the new application that you intend to develop. To
determine what the impact will be you will need to understand both the current operations and
support infrastructure of your organization and the operations and support characteristics of your
new application.
To operate this application (END-TO-END VMS), the user does not need any technical
knowledge of the technologies used to develop it (JSP and Java). The application provides
a rich user interface through which the user can perform operations in a flexible manner.

3.4 NON-FUNCTIONAL REQUIREMENTS

Non-functional requirements describe user-visible aspects of the system that are not
directly related to functionality of the system.

User Interface

A menu interface has been provided to the user to be user friendly.

Performance Constraints

Requests should be processed with minimal delay.

Users should be authenticated before accessing the requested data.

Error Handling and Extreme Conditions

In case of user error, the system should display a meaningful error message to the user,
so that the user can correct the error.

The high-level components in the proposed system should handle exceptions that
occur while connecting to various database servers, I/O exceptions, etc.

Quality Issues

Quality issues mainly refer to developing a reliable, available and robust system. In the
proposed system the developer must be able to guarantee the reliability of transactions so
that they will be processed completely and accurately.

The ability of the system to detect failures and recover from those failures constitutes the
availability of the system.
3.5 COMMUNICATION INTERFACES
What is Java EE?
Java EE (Enterprise Edition) is a widely used platform containing a set of coordinated
technologies that significantly reduce the cost and complexity of developing, deploying, and
managing multi-tier, server-centric applications. Java EE builds upon the Java SE platform and
provides a set of APIs (application programming interfaces) for developing and running portable,
robust, scalable, reliable and secure server-side applications.
Some of the fundamental components of Java EE include:
 Enterprise JavaBeans (EJB): a managed, server-side component architecture used to
encapsulate the business logic of an application. EJB technology enables rapid and
simplified development of distributed, transactional, secure and portable applications
based on Java technology.
 Java Persistence API (JPA): a framework that allows developers to manage data using
object-relational mapping (ORM) in applications built on the Java Platform.

JavaScript and Ajax Development
JavaScript is an object-oriented scripting language primarily used in client-side interfaces for
web applications. Ajax (Asynchronous JavaScript and XML) is a Web 2.0 technique that allows
changes to occur in a web page without the need to perform a page refresh. JavaScript toolkits
can be leveraged to implement Ajax-enabled components and functionality in web pages.

Web Server and Client
A web server is software that processes client requests and sends responses back to the
client. For example, Apache is one of the most widely used web servers. A web server runs
on a physical machine and listens for client requests on a specific port.
A web client is software that communicates with the server. Some of the most widely
used web clients are Firefox, Google Chrome, Safari, etc. When we request something from
a server (through a URL), the web client takes care of creating the request, sending it to the
server, and then parsing the server response and presenting it to the user.

HTML and HTTP
The web server and web client are two separate pieces of software, so there must be some
common language for communication. HTML, which stands for HyperText Markup
Language, is the common language between server and client.
The web server and client also need a common communication protocol; HTTP
(HyperText Transfer Protocol) is that protocol. HTTP runs on top of the TCP/IP
communication protocol.
Some of the important parts of HTTP Request are:
 HTTP Method – action to be performed, usually GET, POST, PUT etc.
 URL – Page to access
 Form Parameters – similar to arguments in a Java method, for example user and password
details from a login page.
Sample HTTP Request:
GET /FirstServletProject/jsps/hello.jsp HTTP/1.1
Host: localhost:8080
Cache-Control: no-cache
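The first line of the sample request above can be split into its three parts (method, URL, version) with a few lines of Java. This is a simplified parser for illustration, not a full HTTP implementation:

```java
public class RequestLine {

    // Splits an HTTP request line ("METHOD URL VERSION") into its three parts.
    public static String[] parse(String requestLine) {
        String[] parts = requestLine.trim().split("\\s+");
        if (parts.length != 3) {
            throw new IllegalArgumentException("Malformed request line: " + requestLine);
        }
        return parts; // [method, url, version]
    }

    public static void main(String[] args) {
        String[] p = parse("GET /FirstServletProject/jsps/hello.jsp HTTP/1.1");
        System.out.println("Method:  " + p[0]); // prints: Method:  GET
        System.out.println("URL:     " + p[1]); // prints: URL:     /FirstServletProject/jsps/hello.jsp
        System.out.println("Version: " + p[2]); // prints: Version: HTTP/1.1
    }
}
```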
Some of the important parts of HTTP Response are:
 Status Code – an integer indicating whether the request was successful or not. Some of the
well-known status codes are 200 for success, 404 for Not Found and 403 for Access
Forbidden.
 Content Type – text, html, image, pdf etc. Also known as MIME type
 Content – actual data that is rendered by client and shown to user.

MIME Type or Content Type: An HTTP response header contains the tag
“Content-Type”. It is also called the MIME type, and the server sends it to the client to
indicate the kind of data it is sending, which helps the client render the data for the user.
Some of the most commonly used MIME types are text/html, text/xml, application/xml, etc.

Understanding URL
URL is acronym of Universal Resource Locator and it’s used to locate the server and resource.
Every resource on the web has it’s own unique address. Let’s see parts of URL with an example.
http://localhost:8080/FirstServletProject/jsps/hello.jsp
http:// – This is the first part of URL and provides the communication protocol to be used in
server-client communication.

localhost – The unique address of the server; most of the time it is the hostname of the
server, which maps to a unique IP address. Sometimes multiple hostnames point to the same
IP address, and the web server's virtual host configuration takes care of sending the request
to the particular server instance.

8080 – This is the port on which the server is listening. It is optional; if we don’t provide it
in the URL, the request goes to the default port of the protocol. Port numbers 0 to 1023 are
reserved for well-known services, for example 80 for HTTP, 443 for HTTPS, 21 for FTP, etc.

FirstServletProject/jsps/hello.jsp – Resource requested from the server. It can be static
(html, pdf) or dynamic (JSP, servlet, PHP etc.).
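The breakdown above can be reproduced with java.net.URI from the standard library, which splits a URL string into exactly these components:

```java
import java.net.URI;

public class UrlParts {

    // Splits a URL into [protocol, host, port, resource] using java.net.URI.
    public static String[] parts(String url) {
        URI uri = URI.create(url);
        return new String[] {
            uri.getScheme(), uri.getHost(),
            String.valueOf(uri.getPort()), uri.getPath()
        };
    }

    public static void main(String[] args) {
        String[] p = parts("http://localhost:8080/FirstServletProject/jsps/hello.jsp");
        System.out.println("protocol: " + p[0]); // prints: protocol: http
        System.out.println("host:     " + p[1]); // prints: host:     localhost
        System.out.println("port:     " + p[2]); // prints: port:     8080
        System.out.println("resource: " + p[3]); // prints: resource: /FirstServletProject/jsps/hello.jsp
    }
}
```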

Why we need Servlet and JSPs?
Web servers are good for static content (HTML pages), but they don’t know how to generate
dynamic content or how to save data into databases, so we need other tools to generate
dynamic content. There are several languages and frameworks for dynamic content, such as
PHP, Python, Ruby on Rails, Java Servlets and JSPs.
Java Servlet and JSPs are server side technologies to extend the capability of web servers by
providing support for dynamic response and data persistence.

Web Container
Tomcat is a web container. When a request is made from a client to the web server, the
server passes the request to the web container, and it is the web container's job to find the
correct resource (servlet or JSP) to handle the request and then use the resource's output to
generate the response and provide it to the web server. The web server then sends the
response back to the client.
When the web container gets a request for a servlet, it creates two objects,
HttpServletRequest and HttpServletResponse. Then it finds the correct servlet based on the
URL and creates a thread for the request. It then invokes the servlet's service() method, and
based on the HTTP method, service() invokes doGet() or doPost(). The servlet methods
generate the dynamic page and write it to the response. Once the servlet thread is complete,
the container converts the response to an HTTP response and sends it back to the client.
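The request/response flow just described can be imitated in miniature with the JDK's built-in com.sun.net.httpserver package. This is not a servlet container, but the handler below plays the role of a servlet's doGet(), generating a dynamic response per request; the /hello path and message text are arbitrary choices for this sketch.

```java
import com.sun.net.httpserver.HttpServer;
import java.io.BufferedReader;
import java.io.InputStreamReader;
import java.io.OutputStream;
import java.net.InetSocketAddress;
import java.net.URI;

public class MiniContainer {

    // Starts a tiny HTTP server on an ephemeral port; the handler generates
    // the response dynamically, like a servlet's doGet().
    public static HttpServer start() throws Exception {
        HttpServer server = HttpServer.create(new InetSocketAddress(0), 0);
        server.createContext("/hello", exchange -> {
            byte[] body = ("Hello from thread " + Thread.currentThread().getName())
                    .getBytes();
            exchange.sendResponseHeaders(200, body.length);
            try (OutputStream os = exchange.getResponseBody()) {
                os.write(body);
            }
        });
        server.start();
        return server;
    }

    // Acts as the web client: sends a GET request and returns the response body.
    public static String fetch(int port) throws Exception {
        URI uri = URI.create("http://localhost:" + port + "/hello");
        try (BufferedReader in = new BufferedReader(
                new InputStreamReader(uri.toURL().openStream()))) {
            return in.readLine();
        }
    }

    // Full round trip: start the server, fetch one response, then stop it.
    public static String demo() throws Exception {
        HttpServer server = start();
        try {
            return fetch(server.getAddress().getPort());
        } finally {
            server.stop(0);
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println(demo());
    }
}
```

A real container such as Tomcat adds exactly the pieces this sketch lacks: servlet lifecycle management, URL-to-servlet mapping from web.xml, and a managed thread pool per request.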
Some of the important work done by the web container includes:
 Communication Support – The container provides an easy way of communication
between the web server and the servlets and JSPs. Because of the container, we don’t need
to build a server socket to listen for requests from the web server, parse the request, and
generate the response. All these important and complex tasks are done by the container,
and all we need to focus on is the business logic of our applications.
 Lifecycle and Resource Management – The container manages the life cycle of a
servlet: it loads servlets into memory, initializes them, invokes their methods, and
destroys them. The container also provides utilities like JNDI for resource pooling and
management.
 Multithreading Support – The container creates a new thread for every request to the
servlet, and when the request is processed the thread dies. Servlets are not re-initialized
for each request, which saves time and memory.
 JSP Support – JSPs don’t look like normal Java classes, so the web container provides
support for them. Every JSP in the application is compiled by the container, converted
to a servlet, and then managed like other servlets.
 Miscellaneous Tasks – The web container manages the resource pool, performs memory
optimizations, runs the garbage collector, provides security configurations, supports
multiple applications and hot deployment, and performs several other tasks behind the
scenes that make our life easier.

Web Application Directory Structure
Java web applications are packaged as Web Archives (WAR), which have a defined
structure. You can export a dynamic web project as a WAR file and unzip it to check the
hierarchy. It will look something like the image below.
Fig 3.7 Web Archive
Deployment Descriptor
The web.xml file is the deployment descriptor of the web application; it contains the
mappings for servlets (prior to Servlet 3.0), welcome pages, security configurations, session
timeout settings, etc.

MySQL:

MySQL, the most popular Open Source SQL database management system, is developed,
distributed, and supported by Oracle Corporation.
The MySQL Web site (http://www.mysql.com/) provides the latest information about MySQL
software.

 MySQL is a database management system.
A database is a structured collection of data. It may be anything from a simple shopping
list to a picture gallery or the vast amounts of information in a corporate network. To add,
access, and process data stored in a computer database, you need a database management
system such as MySQL Server. Since computers are very good at handling large amounts
of data, database management systems play a central role in computing, as standalone
utilities, or as parts of other applications.
 MySQL databases are relational.
A relational database stores data in separate tables rather than putting all the data in one
big storeroom. The database structures are organized into physical files optimized for
speed. The logical model, with objects such as databases, tables, views, rows, and
columns, offers a flexible programming environment. You set up rules governing the
relationships between different data fields, such as one-to-one, one-to-many, unique,
required or optional, and “pointers” between different tables. The database enforces these
rules, so that with a well-designed database, your application never sees inconsistent,
duplicate, orphan, out-of-date, or missing data.
The SQL part of “MySQL” stands for “Structured Query Language”. SQL is the most
common standardized language used to access databases. Depending on your
programming environment, you might enter SQL directly (for example, to generate
reports), embed SQL statements into code written in another language, or use a language-
specific API that hides the SQL syntax.
SQL is defined by the ANSI/ISO SQL Standard. The SQL standard has been evolving
since 1986 and several versions exist. In this manual, “SQL-92” refers to the standard
released in 1992, “SQL:1999” refers to the standard released in 1999, and “SQL:2003”
refers to the current version of the standard. We use the phrase “the SQL standard” to
mean the current version of the SQL Standard at any time.
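As a small, hedged illustration of both the relational rules and direct SQL entry described above (the table and column names are made up for this sketch):

```sql
-- One city has many hotels; the FOREIGN KEY enforces the one-to-many
-- relationship, so the application never sees orphan hotel rows.
CREATE TABLE city (
  city_id INT PRIMARY KEY,
  name    VARCHAR(50) NOT NULL
);

CREATE TABLE hotel (
  hotel_id INT PRIMARY KEY,
  name     VARCHAR(50) NOT NULL,
  city_id  INT NOT NULL,
  FOREIGN KEY (city_id) REFERENCES city (city_id)
);

-- A query of the kind one might enter directly to generate a report:
SELECT c.name AS city, COUNT(*) AS hotels
FROM hotel h JOIN city c ON h.city_id = c.city_id
GROUP BY c.name;
```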

 MySQL software is Open Source.


Open Source means that it is possible for anyone to use and modify the software.
Anybody can download the MySQL software from the Internet and use it without paying
anything. If you wish, you may study the source code and change it to suit your needs.
The MySQL software uses the GPL (GNU General Public License),
http://www.fsf.org/licenses/, to define what you may and may not do with the software in
different situations. If you feel uncomfortable with the GPL or need to embed MySQL
code into a commercial application, you can buy a commercially licensed version from
us. See the MySQL Licensing Overview for more information
(http://www.mysql.com/company/legal/licensing/).
 The MySQL Database Server is very fast, reliable, scalable, and easy to use.
If that is what you are looking for, you should give it a try. MySQL Server can run
comfortably on a desktop or laptop, alongside your other applications, web servers, and
so on, requiring little or no attention. If you dedicate an entire machine to MySQL, you
can adjust the settings to take advantage of all the memory, CPU power, and I/O capacity
available. MySQL can also scale up to clusters of machines, networked together.
You can find a performance comparison of MySQL Server with other database managers
on our benchmark page.

MySQL Server was originally developed to handle large databases much faster than
existing solutions and has been successfully used in highly demanding production
environments for several years. Although under constant development, MySQL Server
today offers a rich and useful set of functions. Its connectivity, speed, and security make
MySQL Server highly suited for accessing databases on the Internet.

 MySQL Server works in client/server or embedded systems.


The MySQL Database Software is a client/server system that consists of a multi-threaded
SQL server that supports different backends, several different client programs and
libraries, administrative tools, and a wide range of application programming interfaces
(APIs).
We also provide MySQL Server as an embedded multi-threaded library that you can link
into your application to get a smaller, faster, easier-to-manage standalone product.

 A large amount of contributed MySQL software is available.


MySQL Server has a practical set of features developed in close cooperation with our
users. It is very likely that your favorite application or language supports the MySQL
Database Server.

The official way to pronounce “MySQL” is “My EssQue Ell” (not “my sequel”), but we do not
mind if you pronounce it as “my sequel” or in some other localized way.
4. SYSTEM DESIGN

4.1 DATA FLOW DIAGRAM:

1. The DFD is also called a bubble chart. It is a simple graphical formalism that can be
used to represent a system in terms of the input data to the system, the various processing
carried out on this data, and the output data generated by the system.
2. The data flow diagram (DFD) is one of the most important modeling tools. It is used to
model the system components: the system processes, the data used by the processes, the
external entities that interact with the system, and the information flows in the system.
3. A DFD shows how information moves through the system and how it is modified by a
series of transformations. It is a graphical technique that depicts information flow and the
transformations that are applied as data moves from input to output.
4. A DFD may be used to represent a system at any level of abstraction and may be
partitioned into levels that represent increasing information flow and functional detail.

DFD Diagrams:

Context Level DFD

[Diagram: the Admin and User actors interact with the System through UI screens; data passes
through a Data Input Stage, Messaging, Data Storage, and a Data Output Stage that produces
Reports.]

Fig 4.1.1 Context Level DFD

Login DFD Diagram:

[Diagram: the user opens the login form and enters a user name and password; the system
validates the data against the Login Master and, if the check succeeds, opens the User Home
Page; otherwise the user re-enters the details.]

Fig 4.1.2 Login DFD

User Data Flow:

1st Level DFD

[Diagram: processes 1.0.0 Open Form, 1.0.1 Enter Login Details, 1.0.2 View Location,
1.0.3 Manage Profiles and 1.0.4 Change Password, validating data against Login Master,
Tbl_Location, Tbl_UserDetails and Tbl_Credentials, with results written to Data Storage.]

Fig 4.1.3 1st Level User Data DFD


2nd Level DFD

[Diagram: processes 2.0.0 View Location, 2.0.1 View City, 2.0.2 View District and
2.0.3 View Hotels, verifying details against Tbl_Location and writing to Data Storage.]

Fig 4.1.4 2nd Level User Data DFD

Admin Data Flow:

1st Level DFD

[Diagram: processes 1.0.0 Open Form, 1.0.1 Enter Login Details, 1.0.2 Add Location,
1.0.3 Manage Profiles and 1.0.4 Change Password, validating data against Login Master,
Tbl_Location, Tbl_UserDetails and Tbl_Credentials, with results written to Data Storage.]

Fig 4.2.1 Admin 1st Level Data DFD

2nd Level DFD

[Diagram: processes 2.0.0 Add Location, 2.0.1 Add City, 2.0.2 Add District and
2.0.3 Add Hotels, verifying details against Tbl_Location and writing to Data Storage.]

Fig 4.2.2 Admin 2nd Level Data DFD

4.2 E-R DIAGRAMS


Fig 4.2.1 E-R Diagram
4.3 UML DIAGRAMS

UML stands for Unified Modeling Language. UML is a standardized general-purpose
modeling language in the field of object-oriented software engineering. The standard is managed,
and was created by, the Object Management Group.
The goal is for UML to become a common language for creating models of object-oriented
computer software. In its current form, UML comprises two major components: a
meta-model and a notation. In the future, some form of method or process may also be added to,
or associated with, UML.
The Unified Modeling Language is a standard language for specifying, visualizing,
constructing and documenting the artifacts of a software system, as well as for business modeling
and other non-software systems.
The UML represents a collection of best engineering practices that have proven
successful in the modeling of large and complex systems.
The UML is a very important part of developing object-oriented software and the
software development process. The UML uses mostly graphical notations to express the design
of software projects.

GOALS:
The Primary goals in the design of the UML are as follows:
1. Provide users a ready-to-use, expressive visual modeling language so that they can
develop and exchange meaningful models.
2. Provide extensibility and specialization mechanisms to extend the core concepts.
3. Be independent of particular programming languages and development processes.
4. Provide a formal basis for understanding the modeling language.
5. Encourage the growth of the OO tools market.
6. Support higher-level development concepts such as collaborations, frameworks, patterns
and components.
7. Integrate best practices.

4.3.1 CLASS DIAGRAM:

In software engineering, a class diagram in the Unified Modeling Language (UML) is a type of
static structure diagram that describes the structure of a system by showing the system's classes,
their attributes, operations (or methods), and the relationships among the classes. It shows which
class holds which information.
[Diagram: two classes. User has attributes keyword and place and operations send() and view();
Server has attribute keyword and operations request() and response().]

Fig 4.3.1(a) Class Diagram
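As a plain-Java sketch of the diagram's two classes (the method bodies are illustrative, since a class diagram specifies only names):

```java
// Plain-Java sketch of the two classes in the class diagram. The diagram
// specifies only attribute and operation names; the bodies below are
// illustrative placeholders.
class Server {
    String keyword;

    // Accept a search request from a user.
    void request(String kw) {
        this.keyword = kw;
    }

    // Produce a response for the last requested keyword.
    String response() {
        return "places matching '" + keyword + "'";
    }
}

public class User {
    String keyword;
    String place;

    // send(): forward the keyword to the server and collect its response.
    String send(Server server) {
        server.request(keyword);
        return server.response();
    }

    // view(): record and return the place chosen from the response.
    String view(String result) {
        place = result;
        return place;
    }

    public static void main(String[] args) {
        User u = new User();
        u.keyword = "restaurant";
        System.out.println(u.send(new Server())); // prints places matching 'restaurant'
    }
}
```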

4.3.2 USE CASE DIAGRAM:

A use case diagram in the Unified Modeling Language (UML) is a type of behavioral
diagram defined by and created from a Use-case analysis. Its purpose is to present a graphical
overview of the functionality provided by a system in terms of actors, their goals (represented as
use cases), and any dependencies between those use cases. The main purpose of a use case
diagram is to show what system functions are performed for which actor. Roles of the actors in
the system can be depicted.
[Diagram: the User actor is connected to the use cases Register, Login, search with keyword,
view place, get nearest place and distance, and Logout.]

Fig 4.3.2(b) Use Case Diagram


4.3.3 SEQUENCE DIAGRAM:

A sequence diagram in Unified Modeling Language (UML) is a kind of interaction


diagram that shows how processes operate with one another and in what order. It is a construct of
a Message Sequence Chart. Sequence diagrams are sometimes called event diagrams, event
scenarios, and timing diagrams.

[Diagram: the User sends the messages Create Account, Login, search with keyword, View inbox,
and Get nearest place and distance to the Server, in that order.]

Fig 4.3.3(c) Sequence Diagram


User Login Sequence

[Diagram: participants User, frmLogin, BAL:clsLogin, DAL:clsConnection and DB, with
messages 1: Enter UserId, Pwd(); 2: InsertData(); 3: Connection(); 4: ActionMethod();
5: Response(); 6: Show Result().]

Fig 4.3.4(b) User Login Sequence


User Sequence to View Location

[Diagram: participants User, frmHome, BAL:clsLocation, DAL:clsConnection and DB, with
messages 1: Login(); 2: AcceptRequests(); 3: Connection(); 4: GetData(); 5: Response();
6: Show Result().]

Fig 4.3.5(c) User Sequence to View Location


4.3.6 Collaboration Diagrams:

User Login

[Diagram: the Admin enters 1: Enter UserId, Pwd() at frmLogin, which sends 2: Check Data()
to BAL:clsLogin; 3: Connected() and 4: ConnectMethod() pass through DAL:clsSqlHelper to
the DB, and 5: Response() returns to frmLogin.]

Fig 4.3.6 Collaboration Diagram


4.3.7 ACTIVITY DIAGRAM:
Activity diagrams are graphical representations of workflows of stepwise activities and
actions with support for choice, iteration and concurrency. In the Unified Modeling Language,
activity diagrams can be used to describe the business and operational step-by-step workflows of
components in a system. An activity diagram shows the overall flow of control.

[Diagram: the User registers, then logs in (returning to the login step on failure), searches with
a keyword, views places, and gets the nearest place and distance.]

Fig 4.3.7 Activity Diagram


4.3.8 State Chart Diagram:

A state chart diagram shows the behavior of classes in response to external stimuli. This diagram
models the dynamic flow of control from state to state within a system.

Basic State chart Diagram Symbols and Notations

States: States represent situations during the life of an object. You can easily illustrate a state in
Smart Draw by using a rectangle with rounded corners.

Transition: A solid arrow represents the path between different states of an object. Label the
transition with the event that triggered it and the action that results from it.

Initial State: A filled circle followed by an arrow represents the object's initial state.

Final State: An arrow pointing to a filled circle nested inside another circle represents the
object's final state.

Synchronization and Splitting of Control: A short heavy bar with two transitions entering it
represents a synchronization of control. A short heavy bar with two transitions leaving it
represents a splitting of control that creates multiple states.
State Chart Diagram:

[Diagram: the flow passes through the states Enter Search Data, Submit, Submit Data and
View Map, ending when the data is fetched successfully.]

Fig 4.3.8 State Chart Diagram


4.3.9 Component Diagram:

[Diagram: components Home, Enter Location, View Map, Map and Google Map Display source,
with OLEDB, Security and Resource as supporting elements.]

4.3.10 Deployment Diagram:
5. System Testing
Testing Methodologies:
Testing is the process of finding differences between the expected behavior specified by the
system models and the observed behavior of the implemented system. From a modeling point of
view, testing is the attempt to falsify the system with respect to the system models. The goal of
testing is to design tests that exercise defects in the system and reveal problems.
The process of executing a program with the intent of finding errors is called testing. During
testing, the program to be tested is executed with a set of test cases, and the output of the program
for the test cases is evaluated to determine whether the program is performing as expected. Testing
forms the first step in determining the errors in a program. The success of testing in revealing
errors depends critically on the test cases.

Strategic Approach to Software Testing:

The software engineering process can be viewed as a spiral. Initially, system engineering
defines the role of software and leads to software requirements analysis, where the information
domain, functions, behavior, performance, constraints and validation criteria for the software are
established. Moving inward along the spiral, we come to design and finally to coding. To
develop computer software, we spiral inward along streamlines that decrease the level of
abstraction at each turn.
A strategy for software testing may also be viewed in the context of the spiral. Unit testing
begins at the vertex of the spiral and concentrates on each unit of the software as implemented in
source code. Testing progresses by moving outward along the spiral to integration testing,
where the focus is on the design and the construction of the software architecture. Taking another
turn outward on the spiral, we encounter validation testing, where requirements established as
part of software requirements analysis are validated against the software that has been
constructed. Finally, we arrive at system testing, where the software and other system elements
are tested as a whole.
[Diagram: unit testing applies to individual modules and components, integration testing to
sub-systems, and system and acceptance testing to the system as a whole.]

Fig 5.1 Test Methods

User Testing:

Different Levels of Testing

Client Needs  ->  Acceptance Testing
Requirements  ->  System Testing
Design        ->  Integration Testing
Code          ->  Unit Testing

Testing is the process of finding differences between the expected behavior specified by the
system models and the observed behavior of the implemented system.

Testing Activities
Different levels of testing are used in the testing process; each level aims to test different
aspects of the system. The basic levels are:
Unit testing
Integration testing
System testing
Acceptance testing

Unit Testing
Unit testing focuses on the building blocks of the software system, that is, objects and
subsystems. There are three motivations for focusing on components. First, unit testing reduces
the complexity of the overall testing activities, allowing us to focus on smaller units of the system.
Second, unit testing makes it easier to pinpoint and correct faults, given that few components are
involved in the test. Third, unit testing allows parallelism in the testing activities; that is, each
component can be tested independently of the others. Hence the goal is to test the internal
logic of the module.
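The idea can be sketched in plain Java: a small, hypothetical validator module is tested in isolation, exercising only its internal logic:

```java
// Hypothetical module under test: a small credential validator. Unit
// tests exercise its internal logic independently of the rest of the
// system (no database, no servlet container).
public class LoginValidator {
    // A password is accepted if it is non-null and at least 6 characters.
    public static boolean isValidPassword(String pwd) {
        return pwd != null && pwd.length() >= 6;
    }

    public static void main(String[] args) {
        // Each check pinpoints a fault in this one component if it fails.
        if (!isValidPassword("secret1")) throw new AssertionError("valid password rejected");
        if (isValidPassword("abc"))      throw new AssertionError("short password accepted");
        if (isValidPassword(null))       throw new AssertionError("null password accepted");
        System.out.println("All unit tests passed");
    }
}
```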

Integration Testing
In integration testing, many tested modules are combined into subsystems, which are
then tested. The goal here is to see whether the modules can be integrated properly, the emphasis
being on testing module interaction.
After structural testing and functional testing we get error-free modules. These modules
are then integrated to get the required results of the system. After checking a module, another
module is tested and integrated with the previous one. After the integration, the test cases are
generated and the results are tested.

System Testing
In system testing, the entire software is tested. The reference document for this process is
the requirements document, and the goal is to see whether the software meets its requirements.
The system was tested for various test cases with various inputs.

Acceptance Testing
Acceptance testing is sometimes performed with realistic data from the client to demonstrate
that the software is working satisfactorily. Testing here focuses on the external behavior of the
system; the internal logic of the program is not emphasized. In acceptance testing, the system is
tested with various inputs.
Types of Testing
1. Black box or functional testing
2. White box testing or structural testing

Black box testing


This method is used when the specified function that a product has been designed to perform is
known. The concept of the black box is used to represent a system whose inner workings are not
available for inspection. In black box testing, the test item is a "black box": its logic is unknown,
and all that is known is what goes in and what comes out, i.e. the input and the output.
Black box testing attempts to find errors in the following categories:

Incorrect or missing functions
Interface errors
Errors in data structures
Performance errors
Initialization and termination errors

As shown in the figure below, in black box testing we do not think about the internal workings;
we ask only:
What is the input to our system?
What is the output for a given input to our system?

[Figure: a box labeled "?" with an Input arrow entering and an Output arrow leaving. The black
box is an imaginary box that hides its internal workings.]
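As a sketch in Java (the search routine here is hypothetical, not this project's code), a black box test checks only inputs against outputs, never the internal logic:

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class BlackBoxDemo {
    // The "black box" under test: by its specification it returns the
    // places whose names contain the keyword (case-insensitive). The
    // tester does not look at how it does so.
    static List<String> search(String keyword, List<String> places) {
        List<String> out = new ArrayList<>();
        for (String p : places) {
            if (p.toLowerCase().contains(keyword.toLowerCase())) {
                out.add(p);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> places = Arrays.asList("Sea View Hotel", "City Cafe", "Seaside Inn");
        // Black box check: known input in, expected output out.
        System.out.println(search("sea", places)); // prints [Sea View Hotel, Seaside Inn]
    }
}
```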

White box testing

White box testing is concerned with testing the implementation of the program. The intent of
structural testing is not to exercise all the inputs or outputs but to exercise the different
programming and data structures used in the program. Thus structural testing aims to achieve test
cases that will force the desired coverage of different structures. Two types of path testing are
statement coverage and branch coverage.

[Figure: a box labeled "INTERNAL WORKING" with an Input arrow entering and an Output
arrow leaving. In the white box testing strategy, the internal workings are visible.]


Test Plan
The testing process starts with a test plan. This plan identifies all the testing-related activities
that must be performed, specifies the schedules, allocates the resources, and specifies guidelines
for testing. During the testing of a unit, the specified test cases are executed and the actual
results are compared with the expected output. The final outputs of the testing phase are the test
report and the error report.

Test Data:
Here all test cases that are used for the system testing are specified. The goal is to test the
different functional requirements specified in Software Requirements Specifications (SRS)
document.

Unit Testing:
Each individual module has been tested against the requirements with some test data.
Test Report:
The modules work properly provided the user enters the required information. All data entry
forms have been tested with the specified test cases and all are working properly.

Error Report:
If the user does not enter data in the specified order, the user is prompted with error
messages. Error handling was done to handle both expected and unexpected errors.

Test cases :

A test case is a set of input data and expected results that exercises a component with the
purpose of causing failures and detecting faults. A test case is an explicit set of instructions
designed to detect a particular class of defect in a software system by bringing about a failure.
A test case can give rise to many tests.

5.1 TEST CASES:
Test cases can be divided into two types: positive test cases and negative test cases. Positive
test cases are conducted by the developer with the intention of obtaining the expected output.
Negative test cases are conducted with the intention of showing that invalid input does not
produce the output.

+VE TEST CASES

S.No | Test case description                     | Actual value                  | Expected value                          | Result
1    | Create the new user registration process  | New user created successfully | Personal info updated into the database | True
2    | Enter the username and password           | Verification of login details | Login successfully                      | True
3    | Enter the source to destination           | Verification of the location  | Display the location                    | True
-VE TEST CASES

S.No | Test case description                     | Actual value                          | Expected value                                         | Result
1    | Create the new user registration process  | New user is not created successfully  | Personal information is not updated into the database  | False
2    | Enter the username and password           | Verification of login details         | Invalid user name and password                         | False
3    | Enter the password                        | Check the signatures                  | View search result                                     | False
4    | Enter the special data                    | Check the location                    | View result                                            | False
6. SCREEN SHOTS
7. CONCLUSION

We have seen plenty of applications calling for a search engine that is able to efficiently
support novel forms of spatial queries that are integrated with keyword search. The existing
solutions to such queries either incur prohibitive space consumption or are unable to give real-time
answers. In this project, we have remedied the situation by developing an access method
called the spatial inverted index (SI-index). Not only is the SI-index fairly space economical, but
it also has the ability to perform keyword-augmented nearest neighbor search in time on the
order of dozens of milliseconds. Furthermore, as the SI-index is based on the conventional
technology of the inverted index, it is readily incorporable into a commercial search engine that
applies massive parallelism, implying its immediate industrial merits.

Our project, Fast Nearest Neighbor Search with Keywords, is extremely effective for finding
the nearest restaurant with the expected menu. It does this using the IR2-tree techniques
(compression, merging and distance browsing) and the GPS system. In the future we can add
features such as displaying the menu card of a hotel once it is selected, and implement this
application for PCs and desktops. The Spatial Inverted Index (SI-index) access method we
developed is fairly space economical and has the ability to perform keyword-augmented nearest
neighbor search in real time. Because it is based on the conventional technology of the inverted
index, it is readily incorporable into a commercial search engine.
REFERENCES

[1] S. Agrawal, S. Chaudhuri, and G. Das, “Dbxplorer: A System for Keyword-Based Search
over Relational Databases,” Proc. Int’l Conf. Data Eng. (ICDE), pp. 5-16, 2002.

[2] N. Beckmann, H. Kriegel, R. Schneider, and B. Seeger, “The R*-tree: An Efficient and
Robust Access Method for Points and Rectangles,” Proc. ACM SIGMOD Int’l Conf.
Management of Data, pp. 322-331, 1990.

[3] G. Bhalotia, A. Hulgeri, C. Nakhe, S. Chakrabarti, and S. Sudarshan, “Keyword Searching
and Browsing in Databases Using Banks,” Proc. Int’l Conf. Data Eng. (ICDE), pp. 431-440,
2002.

[4] X. Cao, L. Chen, G. Cong, C.S. Jensen, Q. Qu, A. Skovsgaard, D. Wu, and M.L. Yiu,
“Spatial Keyword Querying,” Proc. 31st Int’l Conf. Conceptual Modeling (ER), pp. 16-29, 2012.

[5] X. Cao, G. Cong, and C.S. Jensen, “Retrieving Top-k Prestige- Based Relevant Spatial Web
Objects,” Proc. VLDB Endowment, vol. 3, no. 1, pp. 373-384, 2010.

[6] X. Cao, G. Cong, C.S. Jensen, and B.C. Ooi, “Collective Spatial Keyword Querying,” Proc.
ACM SIGMOD Int’l Conf. Management of Data, pp. 373-384, 2011.

[7] B. Chazelle, J. Kilian, R. Rubinfeld, and A. Tal, “The Bloomier Filter: An Efficient Data
Structure for Static Support Lookup Tables,” Proc. Ann. ACM-SIAM Symp. Discrete
Algorithms (SODA), pp. 30-39, 2004.

[8] Y.-Y. Chen, T. Suel, and A. Markowetz, “Efficient Query Processing in Geographic Web
Search Engines,” Proc. ACM SIGMOD Int’l Conf. Management of Data, pp. 277-288, 2006.

[9] E. Chu, A. Baid, X. Chai, A. Doan, and J. Naughton, “Combining Keyword Search and
Forms for Ad Hoc Querying of Databases,” Proc. ACM SIGMOD Int’l Conf. Management of
Data, 2009.

[10] G. Cong, C.S. Jensen, and D. Wu, “Efficient Retrieval of the Top-k Most Relevant Spatial
Web Objects,” PVLDB, vol. 2, no. 1, pp. 337-348, 2009.
[11] C. Faloutsos and S. Christodoulakis, “Signature Files: An Access Method for Documents
and Its Analytical Performance Evaluation,” ACM Trans. Information Systems, vol. 2, no. 4, pp.
267-288, 1984.

[12] I.D. Felipe, V. Hristidis, and N. Rishe, “Keyword Search on Spatial Databases,” Proc. Int’l
Conf. Data Eng. (ICDE), pp. 656-665, 2008.

[13] R. Hariharan, B. Hore, C. Li, and S. Mehrotra, “Processing Spatial- Keyword (SK) Queries
in Geographic Information Retrieval (GIR) Systems,” Proc. Scientific and Statistical Database
Management (SSDBM), 2007.

[14] G.R. Hjaltason and H. Samet, “Distance Browsing in Spatial Databases,” ACM Trans.
Database Systems, vol. 24, no. 2, pp. 265-318, 1999.

[15] V. Hristidis and Y. Papakonstantinou, “Discover: Keyword Search in Relational
Databases,” Proc. Very Large Data Bases (VLDB), pp. 670-681, 2002.

[16] I. Kamel and C. Faloutsos, “Hilbert R-Tree: An Improved R-Tree Using Fractals,” Proc.
Very Large Data Bases (VLDB), pp. 500-509, 1994.

[17] J. Lu, Y. Lu, and G. Cong, “Reverse Spatial and Textual k Nearest Neighbor Search,” Proc.
ACM SIGMOD Int’l Conf. Management of Data, pp. 349-360, 2011.

[18] S. Stiassny, “Mathematical Analysis of Various Superimposed Coding Methods,” Am.
Doc., vol. 11, no. 2, pp. 155-169, 1960.

[19] J.S. Vitter, “Algorithms and Data Structures for External Memory,” Foundation and Trends
in Theoretical Computer Science, vol. 2, no. 4, pp. 305-474, 2006.

[20] D. Zhang, Y.M. Chee, A. Mondal, A.K.H. Tung, and M. Kitsuregawa, “Keyword Search in
Spatial Databases: Towards Searching by Document,” Proc. Int’l Conf. Data Eng. (ICDE), pp.
688-699, 2009.
[21] Y. Zhou, X. Xie, C. Wang, Y. Gong, and W.-Y. Ma, “Hybrid Index Structures for
Location-Based Web Search,” Proc. Conf. Information and Knowledge Management (CIKM),
pp. 155-162, 2005.
