Professional Documents
Culture Documents
James Collins
May 25, 2006
1
Contents
1 Background 3
2 Aim 4
3 Method 5
3.1 Experiment: HTML Compression Using GZIP . . . . . . . . . 5
3.2 Experiment: Image Compression vs Image Removal . . . . . . 5
3.3 User Survey . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
3.4 Proposed Time Line . . . . . . . . . . . . . . . . . . . . . . . 6
5 References 8
2
1 Background
Web browsing forms a vital part of a typical internet user’s online activity.
Since its conception, web browsing has progressed from a small collection
of text-based web pages, to an almost limitless collection of visually appeal-
ing web sites. This transformation has meant that the download size of a
typical web page has drastically increased. However, despite the increasing
availability of high bandwidth broadband connections, over two thirds (69%)
of Australian internet users still are connected via dialup [6]. In addition,
the invention of wireless internet technologies such as GPRS has required
download optimisation techniques to be introduced.
It has been shown that the main contributor to perceived network latency
is limited modem bandwidth [5]. Thus, it can be said that there is relatively
high demand for improving web browsing performance over poor connections.
In order for this to be achieved, the amount of data transferred must be
decreased. Formula 1 (below) illustrates this:
• Removing images (<img ..> tags), and selectively showing them when
requested.
3
Figure 1: A network employing the use of a proxy server
This diagram demonstrates that only the filtered content (and thus less
data) is passed over the low bandwidth connection, whereas the original
content is passed over the high bandwidth connection.
Filtering web content using a proxy server has several advantages over a
client-side solution:
The concept of reducing download times whilst browsing the web is not
only limited to desktop computers; it is also applicable for other devices such
as PDA’s and 3G phones. Previous research has been performed in this area
[1, 3, 2]. The ideas presented in this project will be aimed at not only dialup
users, but users of wireless internet enabled devices such as PDA’s.
2 Aim
This project will involve the development of a customised proxy server, capa-
ble of filtering content with the primary aim of reducing download sizes whilst
browsing the web. This system will be compatible with typical desktop com-
puters, as well as with portable devices such as PDA’s and mobile phones.
Once this system is developed, experiments aimed to measure performance
and user satisfaction will be performed.
4
3 Method
In order to simplify development, an open source proxy such as Squid [8] or
RabbIT [7] will be extended with the specific aim of reducing the download
times of web pages. At this stage the implementation will probably be written
in C or Java. Other languages may be also be considered if appropriate.
The development process will adhere to common software engineering
practices, such as:
• Verification and validation techniques will help ensure the system achieves
its objectives.
Hypothesis #2: Removing images found on web pages will reduce down-
load size by at least 80%.
5
Linux’s wget command will be used to download an existing (image inten-
sive) web site. The total time required and total bytes downloaded will be
recorded. This operation will be performed three times:
• With the proxy server set to remove all images and replace them with
links.
• With the proxy server set to compress all images before being down-
loaded.
These results will then be compared in order to determine the overall effec-
tiveness of each method.
6
Date Milestone
Semester 1
Week 2 Background research begins
Week 3 Project proposal and summary
Week 4 Research previous developments
Week 5 Literature review
Week 6 Proposal talk presented to research group
Week 9 Conduct GZIP compression experiment
Week 10-11 Modify initial proposal if required
Week 12 Revised project proposal
Mid-year break User survey(s) written
Begin development and implementation
Submit survey for UWA ethics approval
Testing and refinement of implementation
Semester 2
Week 1 Survey(s) approved and administered
Weeks 2-3 Other experiments conducted
Week 4 Analysis of survey results
Week 4 Performance testing of implementation
Week 7 Conclusions drawn
Week 9 Draft dissertation due
Week 10 Seminar title and abstract
Week 12 Final dissertation
Week 13 Poster
Seminar
Study break Marked dissertation available for collection
After exams Corrected dissertation
7
5 References
[1] Hassan Artail and Mackram Raydan. Device-aware desktop web page
transformation for rendering on handhelds. Personal Ubiquitous Com-
puting, 9(6):368–380, 2005.
[2] Staffan Bjork, Lars Erik Holmquist, Johan Redstrom, Ivan Bretan, Rolf
Danielsson, Jussi Karlgren, and Kristofer Franzen. West: a web browser
for small terminals. In UIST ’99: Proceedings of the 12th annual ACM
symposium on User interface software and technology, pages 187–196,
New York, NY, USA, 1999. ACM Press.
[4] Peter Deutsch. Rfc 1952: Gzip file format specification version 4.3.
http://www.gzip.org/zlib/rfc1952.pdf.
[5] Li Fan, Pei Cao, Wei Lin, and Quinn Jacobson. Web prefetching be-
tween low-bandwidth clients and proxies: potential and performance. In
SIGMETRICS ’99: Proceedings of the 1999 ACM SIGMETRICS inter-
national conference on Measurement and modeling of computer systems,
pages 178–187, New York, NY, USA, 1999. ACM Press.