1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
|
csplit(P) csplit(P)
NAME
csplit - split files based on context
SYNOPSIS
csplit [-ks][-f prefix][-n number] file arg1 ...argn
DESCRIPTION
The csplit utility shall read the file named by the file
operand, write all or part of that file into other files
as directed by the arg operands, and write the sizes of
the files.
OPTIONS
The csplit utility shall conform to the Base Definitions
volume of IEEE Std 1003.1-2001, Section 12.2, Utility
Syntax Guidelines.
The following options shall be supported:
-f prefix
Name the created files prefix 00, prefix 01, ...,
prefixn. The default is xx00 ... xx n. If the
prefix argument would create a filename exceeding
{NAME_MAX} bytes, an error shall result, csplit
shall exit with a diagnostic message, and no
files shall be created.
-k Leave previously created files intact. By
default, csplit shall remove created files if an
error occurs.
-n number
Use number decimal digits to form filenames for
the file pieces. The default shall be 2.
-s Suppress the output of file size messages.
OPERANDS
The following operands shall be supported:
file The pathname of a text file to be split. If file
is '-' , the standard input shall be used.
The operands arg1 ... argn can be a combination of the
following:
/rexp/[offset]
A file shall be created using the content of the
lines from the current line up to, but not
including, the line that results from the evalua-
tion of the regular expression with offset, if
any, applied. The regular expression rexp shall
follow the rules for basic regular expressions
described in the Base Definitions volume of
IEEE Std 1003.1-2001, Section 9.3, Basic Regular
Expressions. The application shall use the
sequence "\/" to specify a slash character within
the rexp. The optional offset shall be a positive
or negative integer value representing a number
of lines. A positive integer value can be pre-
ceded by '+' . If the selection of lines from an
offset expression of this type would create a
file with zero lines, or one with greater than
the number of lines left in the input file, the
results are unspecified. After the section is
created, the current line shall be set to the
line that results from the evaluation of the reg-
ular expression with any offset applied. If the
current line is the first line in the file and a
regular expression operation has not yet been
performed, the pattern match of rexp shall be
applied from the current line to the end of the
file. Otherwise, the pattern match of rexp shall
be applied from the line following the current
line to the end of the file.
%rexp%[offset]
Equivalent to /rexp/[offset], except that no file
shall be created for the selected section of the
input file. The application shall use the
sequence "\%" to specify a percent-sign character
within the rexp.
line_no
Create a file from the current line up to (but
not including) the line number line_no. Lines in
the file shall be numbered starting at one. The
current line becomes line_no.
{num} Repeat operand. This operand can follow any of
the operands described previously. If it follows
a rexp type operand, that operand shall be
applied num more times. If it follows a line_no
operand, the file shall be split every line_no
lines, num times, from that point.
An error shall be reported if an operand does not refer-
ence a line between the current position and the end of
the file.
STDIN
See the INPUT FILES section.
INPUT FILES
The input file shall be a text file.
ENVIRONMENT VARIABLES
The following environment variables shall affect the
execution of csplit:
LANG Provide a default value for the internationaliza-
tion variables that are unset or null. (See the
Base Definitions volume of IEEE Std 1003.1-2001,
Section 8.2, Internationalization Variables for
the precedence of internationalization variables
used to determine the values of locale cate-
gories.)
LC_ALL If set to a non-empty string value, override the
values of all the other internationalization
variables.
LC_COLLATE
Determine the locale for the behavior of ranges,
equivalence classes, and multi-character collat-
ing elements within regular expressions.
LC_CTYPE
Determine the locale for the interpretation of
sequences of bytes of text data as characters
(for example, single-byte as opposed to multi-
byte characters in arguments and input files) and
the behavior of character classes within regular
expressions.
LC_MESSAGES
Determine the locale that should be used to
affect the format and contents of diagnostic mes-
sages written to standard error.
NLSPATH
Determine the location of message catalogs for
the processing of LC_MESSAGES .
ASYNCHRONOUS EVENTS
If the -k option is specified, created files shall be
retained. Otherwise, the default action occurs.
STDOUT
Unless the -s option is used, the standard output shall
consist of one line per file created, with a format as
follows:
"%d\n", <file size in bytes>
STDERR
The standard error shall be used only for diagnostic
messages.
OUTPUT FILES
The output files shall contain portions of the original
input file; otherwise, unchanged.
EXTENDED DESCRIPTION
None.
EXIT STATUS
The following exit values shall be returned:
0 Successful completion.
>0 An error occurred.
CONSEQUENCES OF ERRORS
By default, created files shall be removed if an error
occurs. When the -k option is specified, created files
shall not be removed if an error occurs.
The following sections are informative.
APPLICATION USAGE
None.
EXAMPLES
This example creates four files, cobol00 ... cobol03:
csplit -f cobol file '/procedure division/' /par5./ /par16./
After editing the split files, they can be recombined as
follows:
cat cobol0[0-3] > file
Note that this example overwrites the original file.
This example would split the file after the first 99
lines, and every 100 lines thereafter, up to 9999 lines;
this is because lines in the file are numbered from 1
rather than zero, for historical reasons:
csplit -k file 100 {99}
Assuming that prog.c follows the C-language coding con-
vention of ending routines with a '}' at the beginning
of the line, this example creates a file containing each
separate C routine (up to 21) in prog.c:
csplit -k prog.c '%main(%' '/^}/+1' {20}
RATIONALE
The -n option was added to extend the range of filenames
that could be handled.
Consideration was given to adding a -a flag to use the
alphabetic filename generation used by the historical
split utility, but the functionality added by the -n
option was deemed to make alphabetic naming unnecessary.
FUTURE DIRECTIONS
None.
SEE ALSO
sed , split
COPYRIGHT
Portions of this text are reprinted and reproduced in
electronic form from IEEE Std 1003.1, 2003 Edition,
Standard for Information Technology -- Portable Operat-
ing System Interface (POSIX), The Open Group Base Speci-
fications Issue 6, Copyright (C) 2001-2003 by the Insti-
tute of Electrical and Electronics Engineers, Inc and
The Open Group. In the event of any discrepancy between
this version and the original IEEE and The Open Group
Standard, the original IEEE and The Open Group Standard
is the referee document. The original Standard can be
obtained online at http://www.open-
group.org/unix/online.html .
POSIX 2003 csplit(P)
|