
Programming Assignment #1

DUE: 5PM Friday, January 30

Turning in the assignment:


- Use the handin program on the CADE machines
- Use the following command:
  handin cs6963 lab1 <prob1file> <prob2file>
- The file <prob1file> should be a gzipped tar file of the CUDA program and output for Problem 1
- The file <prob2file> should be a gzipped tar file of the CUDA program and output for Problem 2

CS6963

How to Develop and Run CUDA programs in Lab 6 (WEB 130)


- Run Visual Studio 2005 (not 2008)
- Select File -> New Project
- In the window, choose CUDA WinApp
- After this, create CUDA programs with .cu extensions

NOTE: Any machine running CUDA 2.0 can be used for this assignment, but this lab has the latest NVIDIA cards in it.


Problem 1: Reduction
In the count 6 example using synchronization, we accumulated all results into out_array[0], thus serializing the accumulation computation on the GPU. Suppose we want to exploit some parallelism in this part, which will likely be particularly important to performance as we scale the number of threads. A common idiom for reduction computations is to use a tree-structured results-gathering phase, where independent threads combine partial results in parallel. The tree below illustrates this for our example, with SIZE=16 and BLOCKSIZE=4:

  out[0]   out[1]   out[2]   out[3]
      \      /          \      /
  out[0] += out[1]   out[2] += out[3]
          \              /
          out[0] += out[2]

Your job is to write this version of the reduction in CUDA. You can start with the sample code, but adjust the problem size to be larger:

#define SIZE 256
#define BLOCKSIZE 32
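The tree-structured phase above can be sketched as follows. This is a minimal illustration, not the sample code itself: the kernel name, the counting phase, and the single-block launch are assumptions, and the essential part is the strided loop that halves the number of active threads each step.

```cuda
#define SIZE 256
#define BLOCKSIZE 32

// Hypothetical kernel: each thread first counts the 6s in its own
// slice of d_in, then the block combines the per-thread counts
// with a tree-structured reduction in shared memory.
__global__ void count6_tree(int *d_in, int *d_out) {
    __shared__ int partial[BLOCKSIZE];
    int tid = threadIdx.x;
    int elems_per_thread = SIZE / BLOCKSIZE;

    // Phase 1: independent per-thread accumulation (no serialization).
    int count = 0;
    for (int i = tid * elems_per_thread; i < (tid + 1) * elems_per_thread; i++)
        if (d_in[i] == 6) count++;
    partial[tid] = count;
    __syncthreads();

    // Phase 2: tree-structured gathering. At each step, the first
    // `stride` threads add a neighbor's partial sum to their own,
    // so the depth is log2(BLOCKSIZE) instead of BLOCKSIZE steps.
    for (int stride = BLOCKSIZE / 2; stride > 0; stride /= 2) {
        if (tid < stride)
            partial[tid] += partial[tid + stride];
        __syncthreads();
    }

    if (tid == 0)
        *d_out = partial[0];  // final result, gathered in parallel
}
```

With BLOCKSIZE=32 the gathering phase takes 5 strided steps (16, 8, 4, 2, 1) rather than 31 serialized additions, which is the parallelism the problem asks you to exploit.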

Problem 2: Computation Partitioning & Data Distribution
Using the matrix addition example from the Lecture 2 class notes, develop a complete CUDA program for integer matrix addition. Initialize the two input matrices using random integers from 0 to 9 (as in the count 6 example). Initialize the ith entry identically in both a[i] and b[i], so that everyone's matrices are initialized the same way.

Your CUDA code should use a data distribution for the input and output matrices that exploits several features of the architecture to improve performance (although the advantages of these will be greater for objects allocated in shared memory and with reuse of data within a block):

- a grid of 16 square blocks of threads (>= # multiprocessors)
- each block executing 64 threads (> warp size)
- each thread accessing 32 elements (better if larger)

Within a grid block, you should use a BLOCK data distribution for the columns, so that each thread operates on a column. This should help reduce memory bank conflicts. Return the output array.
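A skeleton with the partitioning above might look like the following. This is a sketch under stated assumptions, not the required solution: the matrices are flattened into 1-D arrays, each block owns a tile of 32 rows by 64 columns, and the seed value and helper names are illustrative.

```cuda
#include <cstdio>
#include <cstdlib>

#define GRIDSIZE   16   // thread blocks (>= # multiprocessors)
#define BLOCKSIZE  64   // threads per block (> warp size of 32)
#define PER_THREAD 32   // elements per thread
#define N (GRIDSIZE * BLOCKSIZE * PER_THREAD)  // total elements

// Each block owns a tile of PER_THREAD rows x BLOCKSIZE columns.
// Thread t walks down column t of its tile (BLOCK distribution over
// columns), so within each row consecutive threads touch consecutive
// addresses, which helps avoid memory bank conflicts.
__global__ void matrix_add(int *a, int *b, int *c) {
    int tile_base = blockIdx.x * BLOCKSIZE * PER_THREAD;
    int col = threadIdx.x;
    for (int row = 0; row < PER_THREAD; row++) {
        int idx = tile_base + row * BLOCKSIZE + col;
        c[idx] = a[idx] + b[idx];
    }
}

int main(void) {
    int *a, *b, *c;          // host copies
    int *d_a, *d_b, *d_c;    // device copies
    size_t bytes = N * sizeof(int);

    a = (int *)malloc(bytes);
    b = (int *)malloc(bytes);
    c = (int *)malloc(bytes);

    // Deterministic initialization: the ith entry of both a and b is
    // the same random digit 0-9, so everyone's matrices match.
    srand(0);  // fixed seed (illustrative choice)
    for (int i = 0; i < N; i++)
        a[i] = b[i] = rand() % 10;

    cudaMalloc((void **)&d_a, bytes);
    cudaMalloc((void **)&d_b, bytes);
    cudaMalloc((void **)&d_c, bytes);
    cudaMemcpy(d_a, a, bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(d_b, b, bytes, cudaMemcpyHostToDevice);

    matrix_add<<<GRIDSIZE, BLOCKSIZE>>>(d_a, d_b, d_c);

    cudaMemcpy(c, d_c, bytes, cudaMemcpyDeviceToHost);
    printf("c[0] = %d\n", c[0]);

    cudaFree(d_a); cudaFree(d_b); cudaFree(d_c);
    free(a); free(b); free(c);
    return 0;
}
```

Since a[i] == b[i] under this initialization, every output entry c[i] is simply 2*a[i], which gives an easy way to check your partitioning for correctness.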
