csplit(1)
csplit --
context split
Synopsis
csplit [-s] [-k] [-f prefix] [-n digits] file arg1 [ ... argn]
Description
csplit
reads
file
and separates it into
n+1
sections,
defined by the arguments
arg1 . . . argn.
By default the sections are placed in
xx00-xx99.
These sections get the following pieces of
file:
00:-
From the start of
file
up to (but not including) the line referenced by
arg1.
01:-
From the line referenced by
arg1
up to the line referenced by
arg2.
. . .-
. . .
n:-
From the line referenced by
argn
to the end of
file.
If the
file
argument is a
-,
then standard input is used.
csplit processes supplementary code set characters,
and recognizes supplementary code set characters in the
prefix given to the -f option (see below)
according to the locale specified in the LC_CTYPE
environment variable (see LANG on
environ(5)).
In basic regular expressions (BREs), pattern searches are performed
on characters, not bytes, as described on
ed(1).
The options to
csplit
are:
-s-
csplit
normally prints the number of bytes in each file created.
If the
-s
option is present,
csplit
suppresses the printing of all byte counts.
-k-
csplit
normally removes created files if an error occurs.
If the
-k
option is present,
csplit
leaves previously created files intact.
-f prefix-
If the
-f
option is used, the created files are named
prefix00 . . . prefixn.
The default is
xx00 . . . xxn.
Supplementary code set characters may be used in prefix.
-n digits-
Use digits decimal digits to form filenames for the file pieces.
The default is 2.
The arguments
(arg1 . . . argn)
to
csplit
can be a combination of the following:
/rexp/-
A file is to be created for the section from the current line
up to (but not including) the line containing the basic regular expression
rexp.
The line containing
rexp
becomes the current line.
This argument may be followed by an optional
+ or
-
some number of lines
(for example, /Page/-5).
See
ed(1)
for a description of how
to specify a regular expression.
%rexp%-
This argument is the same as
/rexp/,
except that no file is created for the section.
lnno-
A file is to be created from the current line up to (but not including)
lnno.
lnno
becomes the current line.
{num}-
Repeat argument.
This argument may follow any of the above arguments.
If it follows a
rexp
type argument, that argument is applied
num
more times.
If it follows
lnno,
the file will be split every
lnno
lines
(num
times)
from that point.
{*}-
Repeats previous operand as many times as necessary to finish input.
Enclose all
rexp
type arguments that contain blanks or other characters meaningful to
the shell in the appropriate quotes.
Basic regular expressions may not contain embedded new-lines.
csplit
does not affect the original file;
it is the user's responsibility to remove it
if it is no longer wanted.
Examples
csplit -f cobol file '/procedure division/' /par5./ /par16./
This example creates four files,
cobol00 . . . cobol03.
After editing the ``split'' files, they can be recombined as follows:
cat cobol0[0-3] > file
Note that this example overwrites the original file.
csplit -k file 100 {99}
This example splits the file at every 100 lines, up to 10,000 lines.
The
-k
option causes the created files to be retained if there
are less than 10,000 lines; however, an error message would
still be printed.
csplit -k prog.c '%main(%´ '/^}/+1' {20}
If
prog.c
follows the normal
C
coding convention
(the last line of a routine consists only of a
}
in the first character position),
this example creates a file for each separate
C
routine (up to 21) in
prog.c.
Files
/usr/lib/locale/locale/LC_MESSAGES/uxdfm-
language-specific message file (See LANG on
environ(5).)
References
ed(1),
regexp(5),
sh(1)
Diagnostics
arg - out of range
-
arg does not reference a line
between the current position and the end of the file.
© 2004 The SCO Group, Inc. All rights reserved.
UnixWare 7 Release 7.1.4 - 25 April 2004