1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
|
dd(P) dd(P)
NAME
dd - convert and copy a file
SYNOPSIS
dd [operand ...]
DESCRIPTION
The dd utility shall copy the specified input file to
the specified output file with possible conversions
using specific input and output block sizes. It shall
read the input one block at a time, using the specified
input block size; it shall then process the block of
data actually returned, which could be smaller than the
requested block size. It shall apply any conversions
that have been specified and write the resulting data to
the output in blocks of the specified output block size.
If the bs= expr operand is specified and no conversions
other than sync, noerror, or notrunc are requested, the
data returned from each input block shall be written as
a separate output block; if the read returns less than a
full block and the sync conversion is not specified, the
resulting output block shall be the same size as the
input block. If the bs= expr operand is not specified,
or a conversion other than sync, noerror, or notrunc is
requested, the input shall be processed and collected
into full-sized output blocks until the end of the input
is reached.
The processing order shall be as follows:
An input block is read.
If the input block is shorter than the specified input
block size and the sync conversion is specified, null
bytes shall be appended to the input data up to the
specified size. (If either block or unblock is also
specified, <space>s shall be appended instead of null
bytes.) The remaining conversions and output shall
include the pad characters as if they had been read from
the input.
If the bs= expr operand is specified and no conversion
other than sync or noerror is requested, the resulting
data shall be written to the output as a single block,
and the remaining steps are omitted.
If the swab conversion is specified, each pair of input
data bytes shall be swapped. If there is an odd number
of bytes in the input block, the last byte in the input
record shall not be swapped.
Any remaining conversions ( block, unblock, lcase, and
ucase) shall be performed. These conversions shall oper-
ate on the input data independently of the input block-
ing; an input or output fixed-length record may span
block boundaries.
The data resulting from input or conversion or both
shall be aggregated into output blocks of the specified
size. After the end of input is reached, any remaining
output shall be written as a block without padding if
conv= sync is not specified; thus, the final output
block may be shorter than the output block size.
OPTIONS
None.
OPERANDS
All of the operands shall be processed before any input
is read. The following operands shall be supported:
if=file
Specify the input pathname; the default is stan-
dard input.
of=file
Specify the output pathname; the default is stan-
dard output. If the seek= expr conversion is not
also specified, the output file shall be trun-
cated before the copy begins if an explicit of=
file operand is specified, unless conv= notrunc
is specified. If seek= expr is specified, but
conv= notrunc is not, the effect of the copy
shall be to preserve the blocks in the output
file over which dd seeks, but no other portion of
the output file shall be preserved. (If the size
of the seek plus the size of the input file is
less than the previous size of the output file,
the output file shall be shortened by the copy.)
ibs=expr
Specify the input block size, in bytes, by expr
(default is 512).
obs=expr
Specify the output block size, in bytes, by expr
(default is 512).
bs=expr
Set both input and output block sizes to expr
bytes, superseding ibs= and obs=. If no conver-
sion other than sync, noerror, and notrunc is
specified, each input block shall be copied to
the output as a single block without aggregating
short blocks.
cbs=expr
Specify the conversion block size for block and
unblock in bytes by expr (default is zero). If
cbs= is omitted or given a value of zero, using
block or unblock produces unspecified results.
The application shall ensure that this operand is also
specified if the conv= operand is specified with a value
of ascii, ebcdic, or ibm. For a conv= operand with an
ascii value, the input is handled as described for the
unblock value, except that characters are converted to
ASCII before any trailing <space>s are deleted. For
conv= operands with ebcdic or ibm values, the input is
handled as described for the block value except that the
characters are converted to EBCDIC or IBM EBCDIC,
respectively, after any trailing <space>s are added.
skip=n Skip n input blocks (using the specified input
block size) before starting to copy. On seekable
files, the implementation shall read the blocks
or seek past them; on non-seekable files, the
blocks shall be read and the data shall be dis-
carded.
seek=n Skip n blocks (using the specified output block
size) from the beginning of the output file
before copying. On non-seekable files, existing
blocks shall be read and space from the current
end-of-file to the specified offset, if any,
filled with null bytes; on seekable files, the
implementation shall seek to the specified offset
or read the blocks as described for non-seekable
files.
count=n
Copy only n input blocks.
conv=value[,value ...]
Where values are comma-separated symbols from the
following list:
ascii Convert EBCDIC to ASCII; see ASCII to EBCDIC Con-
version .
ebcdic Convert ASCII to EBCDIC; see ASCII to EBCDIC Con-
version .
ibm Convert ASCII to a different EBCDIC set; see
ASCII to IBM EBCDIC Conversion .
The ascii, ebcdic, and ibm values are mutually-exclu-
sive.
block Treat the input as a sequence of <newline>-termi-
nated or end-of-file-terminated variable-length
records independent of the input block bound-
aries. Each record shall be converted to a record
with a fixed length specified by the conversion
block size. Any <newline> shall be removed from
the input line; <space>s shall be appended to
lines that are shorter than their conversion
block size to fill the block. Lines that are
longer than the conversion block size shall be
truncated to the largest number of characters
that fit into that size; the number of truncated
lines shall be reported (see the STDERR section).
The block and unblock values are mutually-exclusive.
unblock
Convert fixed-length records to variable length.
Read a number of bytes equal to the conversion
block size (or the number of bytes remaining in
the input, if less than the conversion block
size), delete all trailing <space>s, and append a
<newline>.
lcase Map uppercase characters specified by the
LC_CTYPE keyword tolower to the corresponding
lowercase character. Characters for which no
mapping is specified shall not be modified by
this conversion.
The lcase and ucase symbols are mutually-exclusive.
ucase Map lowercase characters specified by the
LC_CTYPE keyword toupper to the corresponding
uppercase character. Characters for which no
mapping is specified shall not be modified by
this conversion.
swab Swap every pair of input bytes.
noerror
Do not stop processing on an input error. When an
input error occurs, a diagnostic message shall be
written on standard error, followed by the cur-
rent input and output block counts in the same
format as used at completion (see the STDERR sec-
tion). If the sync conversion is specified, the
missing input shall be replaced with null bytes
and processed normally; otherwise, the input
block shall be omitted from the output.
notrunc
Do not truncate the output file. Preserve blocks
in the output file not explicitly written by this
invocation of the dd utility. (See also the pre-
ceding of= file operand.)
sync Pad every input block to the size of the ibs=
buffer, appending null bytes. (If either block or
unblock is also specified, append <space>s,
rather than null bytes.)
The behavior is unspecified if operands other than conv=
are specified more than once.
For the bs=, cbs=, ibs=, and obs= operands, the applica-
tion shall supply an expression specifying a size in
bytes. The expression, expr, can be:
A positive decimal number
A positive decimal number followed by k, specifying mul-
tiplication by 1024
A positive decimal number followed by b, specifying mul-
tiplication by 512
Two or more positive decimal numbers (with or without k
or b) separated by x, specifying the product of the
indicated values
All of the operands are processed before any input is
read.
The following two tables display the octal number char-
acter values used for the ascii and ebcdic conversions
(first table) and for the ibm conversion (second table).
In both tables, the ASCII values are the row and column
headers and the EBCDIC values are found at their inter-
sections. For example, ASCII 0012 (LF) is the second
row, third column, yielding 0045 in EBCDIC. The inverted
tables (for EBCDIC to ASCII conversion) are not shown,
but are in one-to-one correspondence with these tables.
The differences between the two tables are highlighted
by small boxes drawn around five entries.
Table: ASCII to EBCDIC Conversion
Table: ASCII to IBM EBCDIC Conversion
STDIN
If no if= operand is specified, the standard input shall
be used. See the INPUT FILES section.
INPUT FILES
The input file can be any file type.
ENVIRONMENT VARIABLES
The following environment variables shall affect the
execution of dd:
LANG Provide a default value for the
internationalization variables that are unset or
null. (See the Base Definitions volume of
IEEE Std 1003.1-2001, Section 8.2, International-
ization Variables for the precedence of interna-
tionalization variables used to determine the
values of locale categories.)
LC_ALL If set to a non-empty string value, override the
values of all the other internationalization
variables.
LC_CTYPE
Determine the locale for the interpretation of
sequences of bytes of text data as characters
(for example, single-byte as opposed to multi-
byte characters in arguments and input files),
the classification of characters as uppercase or
lowercase, and the mapping of characters from one
case to the other.
LC_MESSAGES
Determine the locale that should be used to
affect the format and contents of diagnostic mes-
sages written to standard error and informative
messages written to standard output.
NLSPATH
Determine the location of message catalogs for
the processing of LC_MESSAGES .
ASYNCHRONOUS EVENTS
For SIGINT, the dd utility shall interrupt its current
processing, write status information to standard error,
and exit as though terminated by SIGINT. It shall take
the standard action for all other signals; see the ASYN-
CHRONOUS EVENTS section in Utility Description Defaults
.
STDOUT
If no of= operand is specified, the standard output
shall be used. The nature of the output depends on the
operands selected.
STDERR
On completion, dd shall write the number of input and
output blocks to standard error. In the POSIX locale the
following formats shall be used:
"%u+%u records in\n", <number of whole input blocks>,
<number of partial input blocks>
"%u+%u records out\n", <number of whole output blocks>,
<number of partial output blocks>
A partial input block is one for which read() returned
less than the input block size. A partial output block
is one that was written with fewer bytes than specified
by the output block size.
In addition, when there is at least one truncated block,
the number of truncated blocks shall be written to stan-
dard error. In the POSIX locale, the format shall be:
"%u truncated %s\n", <number of truncated blocks>, "record" (if
<number of truncated blocks> is one) "records" (otherwise)
Diagnostic messages may also be written to standard
error.
OUTPUT FILES
If the of= operand is used, the output shall be the same
as described in the STDOUT section.
EXTENDED DESCRIPTION
None.
EXIT STATUS
The following exit values shall be returned:
0 The input file was copied successfully.
>0 An error occurred.
CONSEQUENCES OF ERRORS
If an input error is detected and the noerror conversion
has not been specified, any partial output block shall
be written to the output file, a diagnostic message
shall be written, and the copy operation shall be dis-
continued. If some other error is detected, a diagnostic
message shall be written and the copy operation shall be
discontinued.
The following sections are informative.
APPLICATION USAGE
The input and output block size can be specified to take
advantage of raw physical I/O.
There are many different versions of the EBCDIC code-
sets. The ASCII and EBCDIC conversions specified for the
dd utility perform conversions for the version specified
by the tables.
EXAMPLES
The following command:
dd if=/dev/rmt0h of=/dev/rmt1h
copies from tape drive 0 to tape drive 1, using a common
historical device naming convention.
The following command:
dd ibs=10 skip=1
strips the first 10 bytes from standard input.
This example reads an EBCDIC tape blocked ten 80-byte
EBCDIC card images per block into the ASCII file x:
dd if=/dev/tape of=x ibs=800 cbs=80 conv=ascii,lcase
RATIONALE
The OPTIONS section is listed as "None" because there
are no options recognized by historical dd utilities.
Certainly, many of the operands could have been designed
to use the Utility Syntax Guidelines, which would have
resulted in the classic hyphenated option letters. In
this version of this volume of IEEE Std 1003.1-2001, dd
retains its curious JCL-like syntax due to the large
number of applications that depend on the historical
implementation.
A suggested implementation technique for conv= noerror,
sync is to zero (or <space>-fill, if blocking or
unblocking) the input buffer before each read and to
write the contents of the input buffer to the output
even after an error. In this manner, any data trans-
ferred to the input buffer before the error was detected
is preserved. Another point is that a failed read on a
regular file or a disk generally does not increment the
file offset, and dd must then seek past the block on
which the error occurred; otherwise, the input error
occurs repetitively. When the input is a magnetic tape,
however, the tape normally has passed the block contain-
ing the error when the error is reported, and thus no
seek is necessary.
The default ibs= and obs= sizes are specified as 512
bytes because there are historical (largely portable)
scripts that assume these values. If they were left
unspecified, unusual results could occur if an implemen-
tation chose an odd block size.
Historical implementations of dd used creat() when pro-
cessing of= file. This makes the seek= operand unusable
except on special files. The conv= notrunc feature was
added because more recent BSD-based implementations use
open() (without O_TRUNC) instead of creat(), but they
fail to delete output file contents after the data
copied.
The w multiplier (historically meaning word), is used in
System V to mean 2 and in 4.2 BSD to mean 4. Since word
is inherently non-portable, its use is not supported by
this volume of IEEE Std 1003.1-2001.
Standard EBCDIC does not have the characters '[' and ']'
. The values used in the table are taken from a common
print train that does contain them. Other than those
characters, the print train values are not filled in,
but appear to provide some of the motivation for the
historical choice of translations reflected here.
The Standard EBCDIC table provides a 1:1 translation for
all 256 bytes.
The IBM EBCDIC table does not provide such a transla-
tion. The marked cells in the tables differ in such a
way that:
EBCDIC 0112 ( 'cent' ) and 0152 (broken pipe) do not
appear in the table.
EBCDIC 0137 ( 'not' ) translates to/from ASCII 0236 (
'^' ). In the standard table, EBCDIC 0232 (no graphic)
is used.
EBCDIC 0241 ( '~' ) translates to/from ASCII 0176 ( '~'
). In the standard table, EBCDIC 0137 ( 'not' ) is used.
0255 ( '[' ) and 0275 ( ']' ) appear twice, once in the
same place as for the standard table and once in place
of 0112 ( 'cent' ) and 0241 ( '~' ).
In net result: EBCDIC 0275 ( ']' ) displaced EBCDIC 0241
( '~' ) in cell 0345.
That displaced EBCDIC 0137 ( 'not' ) in cell 0176.
That displaced EBCDIC 0232 (no graphic) in cell
0136.
That replaced EBCDIC 0152 (broken pipe) in cell
0313.
EBCDIC 0255 ( '[' ) replaced EBCDIC 0112 ( 'cent' ).
This translation, however, reflects historical practice
that (ASCII) '~' and 'not' were often mapped to each
other, as were '[' and 'cent' ; and ']' and (EBCDIC) '~'
.
The cbs operand is required if any of the ascii, ebcdic,
or ibm operands are specified. For the ascii operand,
the input is handled as described for the unblock oper-
and except that characters are converted to ASCII before
the trailing <space>s are deleted. For the ebcdic and
ibm operands, the input is handled as described for the
block operand except that the characters are converted
to EBCDIC or IBM EBCDIC after the trailing <space>s are
added.
The block and unblock keywords are from historical BSD
practice.
The consistent use of the word record in standard error
messages matches most historical practice. An earlier
version of System V used block, but this has been
updated in more recent releases.
Early proposals only allowed two numbers separated by x
to be used in a product when specifying bs=, cbs=, ibs=,
and obs= sizes. This was changed to reflect the histori-
cal practice of allowing multiple numbers in the product
as provided by Version 7 and all releases of System V
and BSD.
A change to the swab conversion is required to match
historical practice and is the result of IEEE PASC
Interpretations 1003.2 #03 and #04, submitted for the
ISO POSIX-2:1993 standard.
A change to the handling of SIGINT is required to match
historical practice and is the result of IEEE PASC
Interpretation 1003.2 #06 submitted for the
ISO POSIX-2:1993 standard.
FUTURE DIRECTIONS
None.
SEE ALSO
Utility Description Defaults , sed , tr
COPYRIGHT
Portions of this text are reprinted and reproduced in
electronic form from IEEE Std 1003.1, 2003 Edition,
Standard for Information Technology -- Portable Operat-
ing System Interface (POSIX), The Open Group Base Speci-
fications Issue 6, Copyright (C) 2001-2003 by the Insti-
tute of Electrical and Electronics Engineers, Inc and
The Open Group. In the event of any discrepancy between
this version and the original IEEE and The Open Group
Standard, the original IEEE and The Open Group Standard
is the referee document. The original Standard can be
obtained online at http://www.open-
group.org/unix/online.html .
POSIX 2003 dd(P)
|