
Advanced Parallel Computing for Scientific Applications
Autumn Term 2010

Prof. I. F. Sbalzarini, ETH Zentrum, CAB G34, CH-8092 Zürich
Prof. P. Arbenz, ETH Zentrum, CAB H89, CH-8092 Zürich

Exercise 3
Release: 12 Oct. 2010
Due: 26 Oct. 2010

1 Practice in C/C++
The following two assignments illustrate the effects of caching in matrix operations. C
uses row-major memory layout for storing matrices and higher-dimensional arrays. Hence,
row-wise access of elements is more cache-efficient than column-wise access.
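
As a short illustration of this point, the following sketch (hypothetical helper functions; an n × n matrix A is assumed to be stored row-major in a 1-D array) contrasts the two access patterns, which differ only in the loop order:

// Element (i,j) lives at A[i*n + j] in row-major storage.
double sum_row_wise(const double* A, int n) {
    double s = 0.0;
    for (int i = 0; i < n; ++i)       // outer loop over rows
        for (int j = 0; j < n; ++j)   // consecutive addresses: cache friendly
            s += A[i*n + j];
    return s;
}

double sum_column_wise(const double* A, int n) {
    double s = 0.0;
    for (int j = 0; j < n; ++j)       // outer loop over columns
        for (int i = 0; i < n; ++i)   // stride of n doubles: frequent cache misses
            s += A[i*n + j];
    return s;
}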

Question 1: Matrix multiplication


The file matrixMult.cpp contains a program to measure the execution time of the matrix
multiplication

C = A · B

Each matrix is stored as a 1-D array. The multiplication is performed in the method void
Multiply(...). To compute each element of C, the elements of A are accessed row-wise
and those of B column-wise, which results in many cache misses, especially for large
matrices. Better cache usage can be achieved if the matrix B is transposed and the
multiplication is modified accordingly so that it gives the same result as before. Your
task is to implement the methods void InPlaceTranspose(...) and void
MultiplyEfficient(...).
Compile your code using the default GNU compiler: g++ -o mult matrixMult.cpp
Do you observe better performance in the case of large matrices?
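
One possible shape for the efficient variant is sketched below. This is only a sketch: the actual signatures and storage layout in matrixMult.cpp may differ, and square n × n matrices are assumed here. It takes a matrix Bt that already holds the transpose of B, so both operands are read row-wise in the inner loop:

void MultiplyEfficientSketch(const double* A, const double* Bt,
                             double* C, int n) {
    for (int i = 0; i < n; ++i)
        for (int j = 0; j < n; ++j) {
            double s = 0.0;
            for (int k = 0; k < n; ++k)
                s += A[i*n + k] * Bt[j*n + k];  // row i of A, row j of B^T
            C[i*n + j] = s;
        }
}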

Question 2: Matrix norm


You have to calculate the 1-norm and infinity norm of a matrix A of size m × n given by
\|A\|_1 = \max_{1 \le j \le n} \sum_{i=1}^{m} |a_{ij}|

\|A\|_\infty = \max_{1 \le i \le m} \sum_{j=1}^{n} |a_{ij}|

Implement the above equations in the appropriate methods double Norm_1() and double
Norm_Inf() in the file matrixNorm.cpp. Count the floating point operations in the
calculation and compute the Mflop/s rate for different matrix sizes.
Compile your code using: g++ -o norm matrixNorm.cpp
Which of the above norms is calculated faster and why?
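
For orientation, a free-function sketch of the two norms is given below; the actual member functions in matrixNorm.cpp will access the matrix data differently, and row-major storage of an m × n matrix in a 1-D array is assumed:

#include <cmath>      // std::fabs
#include <algorithm>  // std::max

// 1-norm: maximum absolute column sum. The inner loop strides over
// rows, i.e. it jumps n elements between accesses in row-major storage.
double norm_1_sketch(const double* A, int m, int n) {
    double best = 0.0;
    for (int j = 0; j < n; ++j) {
        double s = 0.0;
        for (int i = 0; i < m; ++i)
            s += std::fabs(A[i*n + j]);
        best = std::max(best, s);
    }
    return best;   // roughly m*n additions (plus m*n absolute values)
}

// Infinity norm: maximum absolute row sum. The inner loop walks
// contiguously through memory.
double norm_inf_sketch(const double* A, int m, int n) {
    double best = 0.0;
    for (int i = 0; i < m; ++i) {
        double s = 0.0;
        for (int j = 0; j < n; ++j)
            s += std::fabs(A[i*n + j]);
        best = std::max(best, s);
    }
    return best;
}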

In each of the above examples, time measurement is done using the method double walltime(...),
which is implemented in the file walltime.h.
Please submit the jobs to the batch queue as follows:

bsub -o <op_file> ./<executable>

2 Introduction to OpenMP
1. OpenMP is an application programming interface that provides a parallel programming
model for shared-memory and distributed-shared-memory multiprocessors.

2. OpenMP is based on the fork/join execution model: an OpenMP program starts
as a single thread (the master), and additional threads are created when the master
reaches a parallel region.

3. There is a standard include file omp.h for C/C++ OpenMP programs.

4. The number of threads is fixed a priori by the programmer using the environment
variable OMP_NUM_THREADS.

5. omp_get_num_threads() and omp_get_thread_num() can be used to get the number
of threads created and the local number assigned to each thread.

6. The directive #pragma omp parallel in the program marks the beginning of a parallel
region.

7. The keywords used for distributing work among threads are for, sections, critical,
etc. (a minimal parallel region is sketched below).
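
A minimal parallel region illustrating points 2 to 6 might look as follows (a sketch only, not the complete answer to Question 3; compile with -fopenmp):

#include <omp.h>
#include <stdio.h>

int main(void) {
    // The team size is taken from OMP_NUM_THREADS at program start.
    #pragma omp parallel
    {
        int tid = omp_get_thread_num();   // local thread number
        int nth = omp_get_num_threads();  // threads in the current team
        printf("Hello from thread %d of %d\n", tid, nth);
    }
    return 0;
}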

Question 3: First OpenMP program


Using the above information, write a simple program that creates n = 2, 4, 6 threads;
each thread should display one of the following messages along with its own thread number.

This is Advanced Parallel Computing tutorial.


This is the first OpenMP program.
This program uses n threads
Hello World

Compile the program using the GNU compiler as follows:

gcc -lgomp -fopenmp -o omp1 omp1.c

Question 4: Work sharing among OpenMP threads


The file vectorAdd.cpp contains the code for serial and parallel execution of the SAXPY
operation, along with time measurement.
a) Compile the program and execute it using, say, 4 threads. Why is there no speedup?
Modify the code in order to achieve appreciable speedup.
b) Write code to calculate the dot product ā · b̄ in parallel (see the sketch below for a hint on both parts).
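
As a hint, a work-shared loop and a reduction could be structured as follows. The helper names and arguments are illustrative; vectorAdd.cpp may organize its data differently. Compile with -fopenmp:

// a) SAXPY y = alpha*x + y, with loop iterations shared among the team.
void saxpy_parallel(int n, double alpha, const double* x, double* y) {
    #pragma omp parallel for
    for (int i = 0; i < n; ++i)
        y[i] = alpha * x[i] + y[i];
}

// b) Dot product: each thread accumulates a private partial sum,
//    which OpenMP combines when the loop ends.
double dot_parallel(int n, const double* a, const double* b) {
    double dot = 0.0;
    #pragma omp parallel for reduction(+:dot)
    for (int i = 0; i < n; ++i)
        dot += a[i] * b[i];
    return dot;
}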

You may submit the jobs to the batch queue as follows:

bsub -n N -o <op_file> ./<executable>


(N is the number of processors)

Do not forget to set the environment variable OMP_NUM_THREADS before execution.
