
Memory alignment

I understand why memory should be aligned to 4-byte and 8-byte boundaries based on the data width of the bus, but the following statement confuses me: "IoDrive requires that all I/O performed on a device using O_DIRECT must be 512-byte aligned and a multiple of 512 bytes in size." What is the need for aligning the address to 512 bytes?
c operating-system memory-alignment

asked Aug 12 '10 at 17:00 Shishir

3 Answers
Blanket statements blaming DMA for large buffer alignment restrictions are wrong. Hardware DMA transfers are usually aligned on 4- or 8-byte boundaries, since the PCI bus can physically transfer 32 or 64 bits at a time. Beyond this basic alignment, hardware DMA transfers are designed to work with any address provided.

However, the hardware deals with physical addresses, while the OS deals with virtual memory addresses (a protected-mode construct on the x86 CPU). This means that a buffer which is contiguous in process address space may not be contiguous in physical RAM. Unless care is taken to create physically contiguous buffers, the DMA transfer has to be broken up at VM page boundaries (typically 4K, possibly 2M).

As for buffers needing to be aligned to the disk sector size, this is completely untrue; the DMA hardware is completely oblivious to the physical sector size of a hard drive.

Under Linux 2.4, O_DIRECT required 4K alignment; under 2.6 it has been relaxed to 512B. In either case, it was probably a design decision to prevent single-sector updates from crossing VM page boundaries and therefore requiring split DMA transfers. (An arbitrarily aligned 512B buffer has roughly a 1-in-8 chance of crossing a 4K page boundary.) So, while the OS is to blame rather than the hardware, we can see why page-aligned buffers are more efficient.

Edit: Of course, if we're writing large buffers anyway (100KB), then the number of VM page boundaries crossed will be practically the same whether we've aligned to 512B or not. So the main case being optimized by 512B alignment is single-sector transfers.
edited Dec 16 '10 at 20:21, answered Dec 16 '10 at 20:08


Usually, large alignment requirements like that are due to the underlying DMA hardware. Large block transfers can sometimes be made much faster by imposing much stronger alignment restrictions than the ones you have here. On several ARM processors, the first-level translation table has to be aligned on a 16 KB boundary!
answered Aug 12 '10 at 17:02 Carl Norum

How is it made faster by aligning to 512 bytes, if data is transferred 4 bytes per cycle? Shishir Aug 12 '10 at 17:09

@siri, that's the point: it might not be. It might be transferred 8, 16, 32, or even more bytes at a time, like all 512 bytes in a single cycle. DMA hardware can do basically anything; it's all very implementation dependent. Carl Norum Aug 12 '10 at 17:10

@siri: It is made faster by not having the processor involved in the transfer at all (that is what DMA is all about), but DMA hardware sometimes imposes limits above and beyond those implicit in the architecture itself. dmckee Aug 12 '10 at 17:10

+1 @dmckee, that's a good explanation. Carl Norum Aug 12 '10 at 17:12

A much nicer explanation than mine, and you used the magic word "DMA". Matt Joiner Aug 12 '10 at 17:29


If you don't know what you're doing, don't use O_DIRECT.

O_DIRECT means "direct device access". It bypasses all OS caches and hits the disk (or possibly the RAID controller, etc.) directly. Disk accesses are on a per-sector basis.

EDIT: The alignment requirement is for the I/O offset/size; it's not usually a memory-alignment requirement.

EDIT: If you're looking at this page (it appears to be the only hit), it also says that the memory must be page-aligned.
answered Aug 12 '10 at 17:11 tc.

