
0
1990
Wiley-Liss,
Inc.
Cytometry
11:323-332
(1990)
Data
File
Standard for
Flow
Cytometry
Data File Standards Committee
of
the Society
for
Analytical Cytology
Received for publication November
27,
1989; accepted November
27,
1989
This data file standard for flow cytom-
etry (FCS) provides a detailed descrip-
tion of
a
data file structure designed such
that the file can include all of the infor-
mation necessary to describe fully:
1)
the
instrument used to obtain the data; 2) the
sample measured;
3)
the data obtained;
and
4)
the results of analysis of the data.
The file may contain one
or
more data
sets. Each data set includes data from
a
single sample
or
acquisition run. Each
data set consists of
a
minimum of four
parts-HEADER, TEXT, DATA, and
ANALYSIS-all of which are required.
The file is structured such that the parts
of
a
data set do not have to be in any par-
ticular order, except for the HEADER,
which must always be first. The
HEADER contains pointers to the begin-
ning and end of each of the other three
parts. The file is written
as
a
continuous
byte stream, with no line or page format-
ting. The HEADER, TEXT, and ANALY-
SIS parts are written in ASCII format
and the DATA part can be written in
ASCII, binary integer or floating point.
This Standard, noted as FCS2.0, is
based on
a
standard proposed by Mur-
phy and Chused
(Cytonetry
5:553-555,
1984).
Data
written utilizing their pro-
posed format (FCSl.0)
is
compatible with
the format described here. Future ver-
sions of the standard will maintain com-
patibility with older versions.
1.
2.
3.
4.
5.
6.
7.
GENERAL PRINCIPLES
A data set contains all of the information relative
to a flow cytometric measurement.
Data from more than one measurement may be
stored in
a
single file, as a number of data sets.
Each data
set
contains four mandatory parts
(HEADER, TEXT, DATA, and ANALYSIS) and
any number of additional (user-defined) parts.
Each data set
is
complete unto itself, not depen-
dent on any other data set in the data file.
The HEADER is always the first part of
a
data
set.
It
contains the byte address of the beginning
and end of all other parts of a data set, relative to
the beginning of the data set. This feature means
that the parts do not have to be contiguous or in a
particular order.
The TEXT part of a data set contains
KEY-
WORDS
and their associated
VALUES.
The form
of
a keyword value depends on the nature of the
keyword (e.g., character string or number). There
is
NO
default for any keyword. Keywords and
their values may be of any length (number of
bytes) except that they must contain at least one
character (no nulls). Keywords are case (upper/
lower) insensitive; values are case sensitive.
FCS keywords begin with the
“$”
character. They
are recognized by all programs with formats
as
specified in the Standard. User-defined keywords
must begin with some other character.
8. There are
required
and
optional
keywords. The
required keywords contain all of the information
necessary to read the data part of the data set.
Optional keywords are used to supply additional
information about the measurement.
9.
All space within
a
file which is not being used will
be filled with
a
space character.
SPECIFIC FILE PARTS
HEADER
The HEADER part of each data
set
in
a
data file
begins with the Flow Cytometry Standard version
identifier, which occupies the first 10 bytes. This ver-
sion of the Standard will be entered
as
“FCS2.0” fol-
lowed by four spaces. The next eight
(8)
bytes contain
the offset, in bytes, from the beginning of the data set
to the
start
of
the TEXT part of the data set; i.e., an
offset of 60 bytes points to the 61st byte of the data set.
All offsets are in ASCII, right justified. The next eight
(8)
bytes contain the offset, again in bytes, from the
beginning of the data set to the last byte of the TEXT
part. The next four
(4)
8-byte numbers give the offsets
to the beginning and end of the DATA part, and the
beginning and end of the ANALYSIS part. If any part
of the data set is not included in the file, the offsets can
be zero
(0)
or blank. Additional pairs of locally defined
offsets may follow the standard three
(3)
pairs if de-
sired. Thus, the
first
58 bytes of the data
set
are of fixed
format and contain information that identifies the ver-