******************************************************************
*                                                                *
*                     Stanford Data Center                       *
*                     Stanford University                        *
*                     Stanford, Ca. 94305                        *
*                                                                *
*        (c)Copyright 1994 by the Board of Trustees of the       *
*              Leland Stanford Junior University                 *
*                      All rights reserved                       *
*            Printed in the United States of America             *
*                                                                *
******************************************************************

SPIRES (TM) is a trademark of Stanford University.
This manual is in the process of being written. There are many subjects that fit into the category of data base management but do not and should not belong to other SPIRES manuals. In fact, this manual was created when we began to realize that we did not know which of the currently existing manuals should contain certain new material developed and documented in SPIRES.
This manual currently is a catch-all, a place for homeless documentation of SPIRES components. When time becomes available, it will become a structured document, organized like any other SPIRES manual, as a logically constructed reference manual. For the time being, however, it will remain as a "draft" document, with new sections added as needed in order to document the growing number of SPIRES capabilities that cross the boundaries between File Definition, Formats, Protocols and general file management.
SPIRES has several different capabilities for sorting goal records, that is, placing records in order based on the values of one or more goal record elements. Record sorting is often desirable for reports, where you want all the records having a particular value for a given element to be displayed together, all the records having a second value to be together, and so forth.
The four methods of arranging goal records are discussed and compared in the next section. [See 1.1.] The remaining sections of this chapter will cover the most versatile of the methods, a procedure called SPISORT. [See 1.2.]
This section will discuss and compare four different ways to sort goal records. The remainder of this chapter will be about one of them, usually called SPISORT, although the actual SPISORT program is just one of its steps. [See 1.2.] The others are discussed in other SPIRES manuals; appropriate references are included here.
The four methods are:
1. the SEQUENCE command
2. SPISORT and record sets
3. the FOR INDEX command
4. the FOR SUBFILE command
Below, each of these techniques is briefly discussed:
The simplest of the four techniques to use, the SEQUENCE command sorts the records in the current stack or search result. The resulting stack can then be displayed with the TYPE command. It is commonly used in a situation like this:
-> find date.reported after 1980
-Result: 186 REPORTS
-> sequence date.reported, who.reported
-Stack: 186 REPORTS
-> set format discrepancy report
-> in active clear, type
->
The SEQUENCE command:
- is usually cheaper than SPISORT, but is less versatile;
- is a single command issued online, with the sorted stack of records returned for use immediately;
- cannot sort a large number of records (no absolute number can be specified as a limit; as a rule of thumb, a practical limit is 3,000 records);
- orders records only by the first occurrence of an element;
- requires an existing stack or search result for its input;
- can sort records by either the internal or external forms of the element values;
- is documented in the reference manual "SPIRES Searching and Updating".
Several steps are involved in using SPISORT. First you create a "set" of records, using the Global FOR commands DEFINE SET and GENERATE SET. Next you issue a SPISORT command to sort the records in the set. Then you may process the sorted records under the Global FOR command FOR SET.
The SPISORT procedure:
- is usually the most expensive method, but is certainly the most powerful and versatile, handling many situations the other methods cannot;
- requires that several commands be issued, and a batch job be run (the SPISORT command runs a batch job to execute the SPISORT program);
- can sort an arbitrarily large number of records;
- can sort records by multiply occurring elements and structures (the same record may appear several times in the set, once for each occurrence of the element on which the records are being sorted);
- does not require a pre-existing result or stack to be created;
- can sort records by either the internal or external form of element values, in ascending or descending order.
With this technique and FOR SUBFILE below, you are taking advantage of "automatic sorting" done by SPIRES in maintaining the data base. The indexes of the subfile in effect sort goal records by element value. Using the FOR INDEX command with Global FOR processing commands, you can display records in order without having to sort them specially. In addition, using the SEQUENCE option on the FOR INDEX command, you can sort the records retrieved at a given index node (i.e., having the same value for the indexed element) by secondary elements, in a manner similar to the SEQUENCE command. Here is a sample session using the FOR INDEX command:
-> for index name
+> set format $report Name City Donation.Amount
+> in active display all
+>
The report in the active file would display records in alphabetic order by values passed from goal records to the NAME index, which presumably contains the values from the NAME element.
The FOR INDEX command:
- is cheaper than SPISORT or the SEQUENCE command, but is just slightly more expensive than the FOR SUBFILE command;
- requires that the primary element for the sorting be indexed in a simple index;
- sorts the records by the primary element only in its internal, indexed form, but secondary sorting elements may be used in either their internal or external form;
- sorts the records only in ascending order by the primary, but sorts them in either ascending or descending order for secondary elements;
- handles only the goal records that are represented in the index;
- is an easy-to-use "subset" of the capabilities of the FOR PATH command, which has more power than FOR INDEX;
- is documented in the reference manual "Sequential Record Processing in SPIRES: Global FOR".
If the goal records are to be sorted by the key element of the record, then no special sorting is required at all -- the records are maintained in key order in the tree. Either FOR TREE or FOR SUBFILE (to pick up deferred queue records) processing can be used to retrieve the records in that order. Since the key is unique for each record, there are no secondary elements for sorting. Of the four techniques discussed, this method is the cheapest, since it requires no special sorting.
FOR SUBFILE processing is also discussed in the Global FOR manual mentioned above.
SPISORT is a name used to describe a three-step procedure for sorting goal records, though strictly speaking the name refers only to the batch program used in the second step. The three steps you follow are:
-> select my biblio
-> for subfile where fiction yes
+> define set fictin, elem author(tv=all) title
+> generate set
+> endfor
-'ORV.GQ.DOC.FICTIN' has 192 sort entries
-End of Global FOR
->
-> spisort fictin to fictout
->
-> for set fictout
+> in active, display all
+>
Each of these steps will be described in turn. The next section will discuss the first step, in which a set for sorting is created. [See 1.3.] Following that will be a discussion of the SPISORT command [See 1.4.] followed by more information on the FOR SET command, used to process sets. [See 1.5.]
Discarding the set could be considered a significant fourth step to the procedure. Since the set is stored as an ORVYL data set on your account, it will accrue storage charges, so you should get rid of it when you no longer need it. To discard a set, issue the ORVYL command ERASE:
ERASE setname
Multiple occurrences of an element used as a sort field cause multiple "sort records" to be created in the set. Thus, after the set is sorted, FOR SET processing may cause the same goal record to appear several times, once for each occurrence of the sort element. By default, in fact, when the goal record is displayed under FOR SET, only the occurrence of the sort element that caused the record to appear at that point in the set is displayed; the others are "filtered out" automatically. The filters can be removed, if desired. Filters add a great deal of power and sophistication to SPISORT work, and they are discussed separately later in this chapter. [See 1.7.]
By default, during FOR SET processing, SPIRES retrieves a pointer from the set for each goal record, and then fetches the goal record from the tree (or deferred queue) using that pointer. Thus, each record is, in effect, retrieved twice: once to fetch the sort data, and once to fetch the data for FOR SET processing. In some cases, you can save substantially by creating a "direct" set, in which the set itself contains all the data used for sorting and used for displaying under FOR SET processing. Direct sets and their use are discussed in a later section. [See 1.8.]
A set is created with a combination of Global FOR commands: DEFINE SET, GENERATE SET and ENDFOR.
In addition, the DEFINE SET command tells SPIRES which elements to put into the set. [See 1.3.1.]
You must be in Global FOR mode to issue this series of commands. The Global FOR command "FOR class" initiates Global FOR, telling SPIRES which records to process. WHERE clauses and SET SCAN commands may also be incorporated into the procedure as desired. For more information about Global FOR in general (and about the "FOR class" command in particular), see the reference manual "Sequential Record Processing in SPIRES: Global FOR".
The next sections will discuss the DEFINE SET and GENERATE SET commands in detail.
A SPISORT input file is created by the DEFINE SET command. Unsorted records are placed in it by a GENERATE SET command. Both of these commands can be issued only when Global FOR is in effect. Any global FOR class, WHERE clause, SET SCAN commands, etc., can be used to specify which records will be placed in the input file. The input file may be created only on your own account.
The form of the DEFINE SET command is
DEFINE SET setname [REPLACE] [DIRECT [EXTERNAL] [SCAN] [ALL]] [TV=ALL] ...
       ... ELEMENTS [=] element-list [+ direct-list] [- direct-list]
or
DEFINE SET setname CONTINUE
"Setname" is the name of an ORVYL file that will hold the sort input data. If this is the name of an existing file, either the CONTINUE, APPEND (the same as CONTINUE) or REPLACE options should be used. (If an existing file is named, and REPLACE is not specified, then the system will ask permission to replace the file. See the examples below for more information on the CONTINUE option.) You may use a fully qualified ORVYL data set name ("ORV.gg.uuu.name" where "gg.uuu" must be your account number) or type only the "name" portion; the "name" portion may not exceed 33 characters in length.
The DIRECT option and its accompanying "direct-list" (a list of elements) are discussed in the section on direct sets. [See 1.8.]
The TV=ALL option specifies that all occurrences of the elements named in the element-list are to be processed, unless the element includes a TV=n option to override the global TV=ALL option. Use the TV=ALL option once before the ELEMENTS keyword as an alternative to including TV=ALL as an option for each element in the element-list.
The ELEMENTS keyword signals that a list of the elements to be used for sorting follows. Here is the syntax of the "element-list" portion of DEFINE SET:
ELEMENTS [=] elemname [(options)] [elemname [(options)] ... ]
For example, to define a set where entries are sorted first by CITY and then by ADD.DATE,
-> define set citysort elements = city(x,tv=all), add.date(d)
Up to sixteen elements may be specified, separated by commas or blanks. The "sort portion" of a sort record, i.e., the elements in the ELEMENTS list, can be about 1000 bytes long per record. (The total length of the sort record, including direct set information, can be 5500 bytes.) When creating sort records under GENERATE SET, SPIRES will start with the value of the first element and continue till the end of the list of elements or the 1000-byte limit is reached, whichever comes first.
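As an illustration of the truncation, here is a minimal Python sketch. This is not SPIRES code: the concatenate-and-truncate mechanism and the function name are assumptions made for the example, and a 20-byte budget is used instead of 1000 so the cutoff is visible:

```python
# Illustrative sketch: build a sort key by concatenating element values,
# stopping when a byte budget is exhausted (SPIRES allows about 1000
# bytes for the sort portion; 20 is used here so the cutoff shows).
def build_sort_key(values, limit=20):
    key = b""
    for v in values:
        chunk = v.encode("ascii")
        if len(key) + len(chunk) > limit:
            key += chunk[:limit - len(key)]  # take only what fits
            break
        key += chunk
    return key

print(build_sort_key(["SMITH, JOHN", "94305", "1984-06-22"]))
# b'SMITH, JOHN943051984'  (exactly 20 bytes; the date was cut short)
```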
The elements may include elements from phantom structures, as well as dynamic elements. With dynamic elements, however, be aware that SPIRES will create the dynamic element's value for the sort file, but will otherwise know nothing about the dynamic element. If you want to use the dynamic element when processing the set later, you will need to define it again; and since SPIRES will recompute its value when you want to see it, that value may differ from the value at the time the set was created. (This is not a problem with direct sets.) [See 1.8.1.]
Use parentheses around groups of elements to specify that several different elements should be used for sorting at a particular level. For example, to specify the primary sorting element as AUTHOR or EDITOR (whichever occurs) and the secondary sorting element as TITLE, use a command such as:
-> define set namelist tv=all elements = (author editor) title
The element list is made up of a series of element-name/option pairs. There are several possible options, and all options following an individual element mnemonic must be enclosed in a single pair of parentheses. The possible options are any combination of the following:
Forced to upper for sorting:     Unforced:
Aleph                            alpha
alpha                            beta
Beast                            Aleph
beta                             Beast
Note that lowercase sorts before uppercase if values are not forced to uppercase. (The default is to force character string values to uppercase for sorting.)
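The two orderings can be reproduced in Python. SPIRES runs on an EBCDIC machine, where the lowercase letters collate before the uppercase letters; Python's native ordering differs, so the unforced case is simulated here with an explicit key (the simulation is an illustration, not the actual SPIRES comparison routine):

```python
names = ["beta", "Aleph", "Beast", "alpha"]

# Forced to uppercase for sorting (the SPIRES default):
forced = sorted(names, key=str.upper)

# Unforced: simulate EBCDIC collation, where lowercase letters
# collate before uppercase letters.
def ebcdic_like(s):
    return [(0 if c.islower() else 1, c.lower()) for c in s]

unforced = sorted(names, key=ebcdic_like)

print(forced)    # ['Aleph', 'alpha', 'Beast', 'beta']
print(unforced)  # ['alpha', 'beta', 'Aleph', 'Beast']
```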
The following additional options all have the form
option = n
where "n" is an integer. The "=" is optional, and may be replaced by a blank. The TV option is the one most commonly used; the others are used very infrequently -- their effects can be duplicated with various combinations of the SET FILTER command. [See 1.3.2.] If you are using SPISORT because you are trying to sort more records than the SEQUENCE command can handle, you need not use any of these options.
Below are some sample DEFINE SET commands:
-> define set test1 continue

This command presumes that a DEFINE SET TEST1 has been issued before, specifying the sorting options desired. No new options can be specified with the CONTINUE option.
-> define set outrec, elements = name zip-code

A file called OUTREC is to be created to hold sort input data. The records that go into this file will be sorted first by NAME, then by ZIP-CODE.
-> define set sortout, elems name(x) zip-code(tv=all)

A file called SORTOUT is to be created to hold sort input data. The records are to be sorted first by the value of the NAME element after it has been passed through the OUTPROC for that element in the file definition. The records are then sorted by all occurrences of the ZIP-CODE element. If multiple occurrences of ZIP-CODE appear in any one record, that record will appear in the set multiple times.
-> define set contacts elements = (home.city work.city) name

The records in the CONTACTS set will be sorted first by city, whether the city is in the HOME.CITY element or the WORK.CITY element. That is, the set will have interfiled HOME.CITY and WORK.CITY values. The records are then sorted by NAME within each city.
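The interfiling in the CONTACTS example can be pictured in Python; the records below are invented for illustration, and the key function is only a sketch of the "(home.city work.city)" behavior (use whichever element occurs as the primary sort value, then NAME as the secondary):

```python
contacts = [
    {"name": "Ng", "home.city": "Palo Alto"},
    {"name": "Ortiz", "work.city": "Menlo Park"},
    {"name": "Lee", "home.city": "Menlo Park"},
]

# Primary key: HOME.CITY if present, otherwise WORK.CITY (interfiled);
# secondary key: NAME within each city.
entries = sorted(contacts,
                 key=lambda r: (r.get("home.city") or r.get("work.city"),
                                r["name"]))

for r in entries:
    print(r["name"])
# Lee
# Ortiz
# Ng
```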
In the following examples, the element ITEM.DATE is a multiply occurring element within the multiply occurring structure ITEM:
-> define set sortie, element = item.date(tv=all)

The records in the set SORTIE will be sorted by all occurrences of the ITEM.DATE element. Hence, if any one record contains multiple occurrences of ITEM.DATE, the record will appear in the set one time for each occurrence. Note that TV=ALL means "all occurrences of the element in the record", even if the element is within a structure. SV is not needed here (see below).
-> define set sortof, elem item.date(tv=all,sv=1)

The records will be sorted by the first occurrence of the ITEM.DATE element within all occurrences of the ITEM structure. TV=ALL is modified by SV=1: fetch all occurrences of ITEM.DATE that are the first occurrence within the ITEM structure.
-> define set sortoe, element item.date(tv=5,ts=3)

Assuming an ample number of occurrences of ITEM.DATE in each record, the records will be sorted by the third through seventh (five) occurrences of ITEM.DATE. All the occurrences may be within the first occurrence of the ITEM structure, or they may be scattered across multiple occurrences of it.
In effect, the last three examples show that TV and TS, when used with an element in a structure, ignore structural boundaries, counting absolute occurrences of the element. Using SV and SS can re-establish those boundaries to some degree.
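The occurrence selection in the last three examples can be modeled in Python. Here a record's ITEM structure is a list of occurrences, each holding a list of ITEM.DATE values; the selection logic below is inferred from the examples above, not taken from SPIRES itself:

```python
# ITEM is a multiply occurring structure; each occurrence holds a list
# of ITEM.DATE values.
item = [["d1", "d2"], ["d3"], ["d4", "d5", "d6", "d7", "d8"]]

# tv=all: every occurrence of ITEM.DATE, ignoring structure boundaries.
flat = [d for occ in item for d in occ]

# tv=all, sv=1: the first ITEM.DATE within each ITEM occurrence.
first_per_struct = [occ[0] for occ in item if occ]

# tv=5, ts=3: five occurrences starting with the third, counted
# absolutely across structure boundaries.
tv5_ts3 = flat[2:2 + 5]

print(flat)              # ['d1', 'd2', 'd3', 'd4', 'd5', 'd6', 'd7', 'd8']
print(first_per_struct)  # ['d1', 'd3', 'd4']
print(tv5_ts3)           # ['d3', 'd4', 'd5', 'd6', 'd7']
```

As the slices show, TV and TS operate on the flattened list of values, while SV restores the structure boundaries.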
After defining the set, the next step is to put records into it, using the GENERATE SET command, discussed in the next section.
If you want to see the DEFINE SET command issued to create a set, issue the SHOW SET INFORMATION command. [See 1.6.]
After the DEFINE SET command has been issued, you must tell SPIRES to place unsorted goal records into the set, which is done with the GENERATE SET command. Remember that the DEFINE SET and GENERATE SET commands can be issued only when Global FOR is in effect; GENERATE SET can be issued only after a DEFINE SET command has been issued.
The GENERATE SET command works in Global FOR like many other commands such as DISPLAY, REMOVE, etc. Its syntax is:
GENERATE SET [ALL|FIRST|*|NEXT|n|REST|LAST] [END='end clause']
The first options indicate which records in the Global FOR class are to be processed by the command. If no option is specified, ALL is assumed, and every record in the class is processed into the input file.
A typical sequence of commands to create a sort input file might be:
-? select mail file
-? for tree where zip-code prefix 94
+? define set sortout, elements = zip-code name
+? generate set all
+? endfor
-'ORV.GQ.DOC.SORTOUT' has 3900 sort entries
-End of Global FOR
-?
An ENDFOR command (or a command that causes an END OF GLOBAL FOR condition, such as CLEAR SELECT) must occur before the SET can be sorted. Note that the response shown above to the ENDFOR command gives the name of the unsorted file (the set), as well as a count of the number of "entries" in it. The number of entries may or may not be equal to the number of records processed; if you are sorting on multiple occurrences of an element, then a single record may create multiple entries in the set (see Technical Note below).
The GENERATE SET command can be issued more than once while Global FOR is in effect. When Global FOR ends, or when Global FOR begins again, the set named in the DEFINE SET command is closed and the message giving the number of entries in it is output. If more sort input data is to be added to the same set, the DEFINE SET command will have to be issued with the CONTINUE option before any GENERATE SET commands are valid. Note that you cannot specify an element list if the CONTINUE option is used; the element list of the original DEFINE SET command that created the set is used. [See 1.3.1.]
You may set element filters (using the SET FILTERS command) to limit or control the occurrences of values used to create sets. The SET FILTER command(s) you need must be issued before the GENERATE SET command. For example, if you want to sort on the last occurrence of the MOD.DATE element in each record, you could issue these commands:
-> for result
+> define set lastmod, elements = mod.date, id.number
+> set filter for mod.date (last)
+> generate set
+> endfor
-'ORV.GQ.DOC.LASTMOD' has 397 sort entries
-End of Global FOR
->
Using filters as in the example, you can filter out occurrences of an element that you do not want to use for sorting.
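A sketch of what such a filter does to an element's occurrences before sort entries are generated (the function below is illustrative, not part of SPIRES; only the FIRST and LAST cases are shown):

```python
def apply_filter(values, which=None):
    """Keep only the requested occurrence(s) of an element's values."""
    if which == "first":
        return values[:1]
    if which == "last":
        return values[-1:]
    return values  # no filter: all occurrences pass through

mod_dates = ["1984-01-05", "1984-03-12", "1984-06-22"]
print(apply_filter(mod_dates, "last"))   # ['1984-06-22']
print(apply_filter(mod_dates, "first"))  # ['1984-01-05']
```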
The next step is to issue the SPISORT command, the subject of the next section. [See 1.4.]
Note that you can process the generated set under the FOR SET command even before it has been sorted, if desired. [See 1.5.]
How does SPIRES determine the number of "sort entries" to make for a given record? Do sort entries get made for a record having no occurrences of the sort elements?
The answer to the first question depends on several factors. The most important is whether the TV option is specified on any of the elements in the DEFINE SET command. If not, there will be one sort entry for each goal record processed by the Global FOR command GENERATE SET. (For example, the command GENERATE SET 5 would presumably process five goal records and create five sort entries.)
If any of the sort elements do not have a value in a record, SPIRES essentially assigns a null value to the element for sorting purposes. Even if none of the sort elements has a value, SPIRES still creates a sort entry for that record. It is important to realize that the DEFINE SET command does not determine what records deserve sort entries -- each goal record processed by the GENERATE SET command (as determined by the "FOR class" command, the WHERE clause, SET SCAN commands, and the GENERATE SET command itself) generates at least one sort entry. The DEFINE SET and SET FILTER commands may determine how many sort entries are created for each record, but at least one will be created for each goal record processed by GENERATE SET.
Returning to the question of how many sort entries are created for a given goal record, consider the effect of the TV option for an element on the DEFINE SET command. For example,
-> define set fbifile, element pseudonyms(tv=all)
If a goal record processed by the GENERATE SET command has two occurrences of the PSEUDONYMS element, then two sort entries will be created -- one for each value. But consider this example:
-> define set fbifile, elem pseudonyms(tv=4)
From one to four sort entries will be created for each record, depending on the number of occurrences of PSEUDONYMS. (Remember, if there are no occurrences of PSEUDONYMS in a record, one sort entry will be created.)
If filters are in effect, they are applied first, before the TV, SV, TS and SS options are applied.
The number of sort entries grows substantially when two or more sort elements are multiply occurring. You can use a simple formula to determine that number for any given goal record: multiply together, for each sort element, the number of occurrences selected for that element by its TV, TS, SV and SS options, counting one for an element with no selected occurrences.
Below is a simple example. Suppose that you want to know how many sort entries will be created for record ABC that has 10 each of the three sort elements named in the DEFINE SET command:
-> stack ABC
-Stack: 1 RECORD
-> for stack
+> define set test, element x(tv=all), y(tv=3), z(tv=2,ts=5)
+> generate set
+> endfor
-'ORV.GQ.DOC.TEST' has 60 sort entries
-End of Global FOR
->
If element X were the only sort element, 10 entries would be created; if element Y, 3 would be created; and if element Z, two would be created. Multiplying these together gives 10 x 3 x 2 = 60.
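The counting rule can be written as a short Python function. The names and keyword arguments are illustrative; the rule itself (multiply the per-element occurrence counts, with a minimum of one per element) is as described above:

```python
def entries_for_element(occurrences, tv=None, ts=1):
    """Number of values selected for one sort element.

    occurrences -- how many times the element occurs in the record
    tv          -- target value count; None means TV=ALL
    ts          -- starting occurrence (1-based)
    """
    available = max(0, occurrences - (ts - 1))
    selected = available if tv is None else min(tv, available)
    # A record with no selected occurrences still yields one
    # (null-valued) sort entry.
    return max(1, selected)

def sort_entries(*element_specs):
    total = 1
    for spec in element_specs:
        total *= entries_for_element(**spec)
    return total

# Record ABC: 10 occurrences each of X, Y and Z
print(sort_entries({"occurrences": 10},                      # x(tv=all)
                   {"occurrences": 10, "tv": 3},             # y(tv=3)
                   {"occurrences": 10, "tv": 2, "ts": 5}))   # 60
```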
To sort the input set and create a sorted output file, issue a SPISORT command. The syntax is:
SPISORT infile [TO outfile] [REPLACE] [TIME = n] ...
        ... [ORDER = (sort order list)]
The SPISORT command verb may not be abbreviated.
"Infile" (which is required) is the name of the input file created by your DEFINE SET command.
"Outfile" is the name of the output file to contain the sorted data. If you omit the "TO outfile" parameter, then your input file will be overwritten by the sorted output file.
If you do name an output file, the REPLACE parameter may be used to specify that an existing file of that name may be replaced with the new data. (If you omit the REPLACE parameter and the file already exists, you will be asked whether it's ok to replace the file.)
With the TIME option, you may increase the time allotted for the sort job. The default is 1 minute. (Note that the sort job may not run in CLASS=F with a high time limit; this could cause the sort job to queue, especially if the SPISORT command is being issued from a job, e.g. Batwyl.)
The ORDER option allows you to reorder the sort elements specified in your DEFINE SET command. Only elements named in the ELEMENT list (not the "direct" list; see 1.8) may be named here. The parentheses around the order list are optional, but if they are omitted, the ORDER parameter must be last in your SPISORT command.
Each element may be followed by "(A)" or "(D)" to indicate ascending or descending order; whichever was specified for the element in the DEFINE SET command is the default.
The elements may be specified by name (the name given in the DEFINE SET command, though any structural path information should be omitted) or number, where the number is the element's position in the ELEMENT list in the DEFINE SET command (see example below).
By the way, if the element names in the DEFINE SET command are themselves numbers, then SPISORT will assume the numbers in the ORDER parameter are element names, not position numbers. Naturally, however, the possibility of confusion is much greater if you use numbers as element names in the DEFINE SET command, particularly if you might need to use the ORDER parameter. You can see what DEFINE SET was issued to create a set by issuing the command SHOW SET INFORMATION. [See 1.6.]
Here are a few examples of SPISORT commands:
-> spisort books

In this simple form, the input data in BOOKS will be sorted, and the sorted output will overwrite the input file.
-> spisort booksin to booksout

Here, separate input and output files are specified. If BOOKSOUT already exists, SPIRES will ask if it is OK to replace it. (You could avoid the question by including the REPLACE option.)
-> spisort books time=2 order = price(d),author

The ORDER option here determines the sorting elements. The sort order here overrides the sort order specified in the DEFINE SET command.
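For comparison, a mixed ascending/descending sort like ORDER = price(d),author can be expressed in Python by negating the numeric key; the book records are invented for illustration:

```python
books = [
    {"author": "Adams", "price": 10},
    {"author": "Baker", "price": 25},
    {"author": "Clark", "price": 10},
    {"author": "Adams", "price": 25},
]

# ORDER = price(d),author : price descending, then author ascending.
# Numeric descending order can be expressed by negating the key.
books.sort(key=lambda r: (-r["price"], r["author"]))

for r in books:
    print(r["author"], r["price"])
# Adams 25
# Baker 25
# Adams 10
# Clark 10
```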
The SPISORT command actually constructs JCL and runs a batch job on your behalf, to accomplish the sorting of your input file. For this reason, your "current job" number will be affected when you issue a SPISORT command. In addition, SPISORT makes use of the $ASK, $WDST, and $WDSR variables.
If a SPISORT command fails, an error code will be supplied. Listed below are the possible codes for SPISORT failures. You may get online explanations, too, with the EXPLAIN command. For example:
-> spisort boooks
-SPISORT terminated, code U119
-Not a legal or complete command
-> explain u119
IN=file does not exist
->
The error code is also recorded in the $SORTCODE variable, and $NO is set to true when the SPISORT command fails.
0001 Attempt to define the same parameter twice, or a single parameter value exceeds 22 characters.
0002 Invalid parameter option.
0003 Either IN or OUT (or both) have not been specified.
0004 Main block of IN not correct. Did you specify the DEFINE SET file name as the IN=file?
0005 IN file does not define any data to sort. Did you GENERATE SET after doing the DEFINE SET?
0006 Data blocks of IN not correct. Did you PUT APPEND something to the IN=file?
0007 OUT=file exists and REP option not specified.
0008 SORT did not output the required number of records.
0009 IN=file did not contain all the records to sort that it was supposed to contain.
0010 ORD field in the parm list is invalid. Are the element names or numbers valid? Did you specify a sort option other than "(A)" or "(D)"?
0011 IN=file did not have any sort fields.
0090 ORVYL file system unavailable.
0094 IN=file name illegal, probably invalid characters.
0096 Permanent I/O error on IN=file.
0097 Permanent I/O error on OUT=file.
0099 IN=file not available. Did you ENDFOR before running?
0105 OUT=file not available. Did you have it attached?
0107 IN=file access not permitted. Is the file yours?
0108 IN=file read access prohibited.
0109 OUT=file write access prohibited.
0112 IN=file missing a required block.
0117 OUT=file storage limit exceeded. Get more ORVYL blocks.
0119 IN=file does not exist.
0121 ORVYL out of block space. Tell systems.
0194 OUT=file name illegal, probably invalid characters.
0198 OUT=file not available. Do you have it attached?
0206 OUT=file access not permitted. Is it yours?
0210 OUT=file storage limit exceeded. Get more ORVYL blocks.
0214 OUT=file name overflows dictionary. Tell systems.
0226 OUT=file overflows system tables. Tell systems.
0230 ORVYL out of block space. Tell systems.
1000 The SPISORT failed to parse correctly. You may have improperly spelled parameters.
1001 SPISORT could not sort your set.
The interactive SPISORT command was implemented in March 1989. Prior to that time, the sorting step of the SPISORT procedure had to be accomplished by running a batch job to execute the SPISORT program. (In fact, the SPISORT command runs a batch job, constructing the JCL and submitting the job on your behalf.)
It is still possible, of course, to run the batch job yourself. The paragraphs below describe the JCL to run a batch SPISORT job.
To sort the input set and create a sorted output file, a batch job must be run using the following JCL to invoke the SPISORT program:
// JOB
// EXEC SPISORT,PARM='parameters'
The parameters specify the name(s) of the input and output files, and whether an existing file is to be replaced by a new output file. The forms of the parameters are:
PARM='IN=OUT=setname'
     The input file, called "setname", is overwritten by the sorted output file.
PARM='IN=setname1,OUT=setname2'
     The input file "setname1" is read, and the output file "setname2" is created. "Setname2" must not exist; if it does, use the next example.
PARM='IN=setname1,OUT=setname2,REP'
     A file called "setname2" is to be created. If it already exists, it is replaced.
PARM='IN=setname1,OUT=setname2,REP,ORD=(sort-list)'
     The "sort-list" allows you to reorder the sort elements specified in the DEFINE SET command. Only elements named in the ELEMENT list (not the "direct" list; see 1.8) may be named here. The elements may be specified by name (the name given in the DEFINE SET command, though any structural path information should be omitted) or number, where the number is the element's position in the ELEMENT list in the DEFINE SET command (see example below). Each element may be followed by "(A)" or "(D)" to indicate ascending or descending order; whichever was specified for the element in the DEFINE SET command is the default.
Here is an example using the ORD parameter. Suppose your DEFINE SET command was:
-> define set eyes, elements color size weight(D)
You could change the sort order later on the EXEC card:
// JOB
// EXEC SPISORT,PARM='IN=OUT=EYES,ORD=(3(A),1,2)'
SPISORT will sort the elements in the order WEIGHT (in ascending order), COLOR and SIZE.
By the way, if the element names in the DEFINE SET command are themselves numbers, then SPISORT will assume the numbers in the ORD parameter are element names, not position numbers. Naturally, however, the possibility of confusion is much greater if you use numbers as element names in the DEFINE SET command, particularly if you might need to use the ORD parameter. You can see what DEFINE SET was issued to create a set by issuing the command SHOW SET INFORMATION. [See 1.6.]
Issue the WYLBUR command RUN to submit the SPISORT job for execution.
If your batch job fails, a SPISORT RETURN code will be supplied. The codes are the same as those listed earlier for the SPISORT command. [See 1.4.1.] The code is prefixed with a "U" in the HASP job log messages.
Sets of records created by the DEFINE SET and GENERATE SET commands are processed under the Global FOR command FOR SET. That command has the following form:
FOR SET setname [UNFILTERED|DIRECT] [WHERE clause]
or
FOR source VIA SET setname [UNFILTERED|DIRECT] [WHERE clause]
where "setname" is the name of the ORVYL data set created by the DEFINE SET and GENERATE SET commands or is a stored result or stack. If the set is stored as an ORVYL data set under some account other than your own, then use the form "FOR SET ORV.gg.uuu.setname" where "gg.uuu" is the account number under which the set is stored. For more information about using a stored result or stack with FOR SET, see the "Global FOR" manual; online, EXPLAIN FOR SET COMMAND, IN GLOBAL FOR.
The "source" may be any of the usual Global FOR access classes, e.g. SUBFILE, TREE, UPDATES, etc. In the first form, SUBFILE is assumed to be the source.
The UNFILTERED option tells SPIRES to process the entire goal record, rather than the version whose sort elements are filtered by path occurrence information gathered when the SPISORT sort entries were created. This option is explained in detail in the next section. [See 1.7.]
The DIRECT option is used when you created a direct set (by using the DIRECT option on the DEFINE SET command). The DIRECT option tells SPIRES not to retrieve the goal record that a sort entry in the set represents, but instead to use the data actually in the sort entry. Hence, "FOR source VIA SET setname DIRECT" is exactly the same as "FOR SET setname DIRECT" -- the "source" is always the direct set when DIRECT is specified. Direct sets are discussed in a later section. [See 1.8.]
You can get information about generated sets stored on your account by issuing the SHOW SET INFORMATION command:
SHOW SET INFORMATION [setname]
You can issue this command without the "setname" option if you are processing records under a FOR SET command; if not, you must name the set you want information about.
The information displayed includes the complete DEFINE SET command you issued when creating the set, the number of sort entries in the set, the date and time it was created, etc., as you can see from the example below:
-> show set information sortfight
DEF SET SORTFIGHT rep, elem opponent(tv=all), date
Set ORV.GQ.JNK.SORTFIGHT created 06/22/1984
from subfile FENCING of file ORV.GQ.DOC.FENCING
The set has not been sorted
SORT order: OPPONENT, DATE
The set has 192 records   5 blocks
->
The first line shows the DEFINE SET command used to create the set. The next line shows the ORVYL data set containing the sort data, and the date, time and account that created it. The next line tells what file and subfile the data comes from. (No subfile is listed if the set was generated after an ATTACH command rather than a SELECT command.)
The next line announces whether the set has been sorted or not, with the next line telling by which elements the set was sorted. (Remember that the ORDER parameter on the SPISORT command allows you to specify a different order for sorting than appears in the DEFINE SET command.) Finally, at least in the example, the display tells you how many sort entry records were generated for the set and how many ORVYL blocks the set uses for storage.
For direct sets, discussed elsewhere, SPIRES tells you the minimum, maximum and average length of the sort entry records at the end of the display. [See 1.8.]
When a set is created with the DEFINE SET and GENERATE SET commands, "path information" is created to tell SPIRES how to retrieve appropriate element occurrences involved in the sorting. SPIRES uses the path information when records are processed under the FOR SET command; by default, only the particular occurrence of the element that caused the record to be sorted in that position will be retrieved when the record is displayed. (When multiply occurring elements or structures are named in the DEFINE SET command and TV other than 1 is specified, multiple copies of a single goal record in the set may be created.) This "automatic filtering" applies to formatted or non-formatted output commands.
For example, suppose a set is defined to sort records on the multiply occurring element EMPLOYEE. A record containing two employees appears twice in the set, but each time the record is displayed under the FOR SET command, only a single employee's name appears:
-> for subfile
+> display first
OFFICE = ACCOUNTING;
EMPLOYEE = Eight, Sharon;
EMPLOYEE = Nine, Calvin;
+> define set names element employee
+> generate set *
+> endfor
-'ORV.GQ.DOC.NAMES' has 2 sort entries
-End of Global FOR
-> for set names
+> display all
OFFICE = ACCOUNTING;
EMPLOYEE = Eight, Sharon;

OFFICE = ACCOUNTING;
EMPLOYEE = Nine, Calvin;
SPIRES automatically filtered out the occurrences of EMPLOYEE that did not cause the record to appear at that position in the set. The automatic filtering occurs regardless of whether the set is sorted or not -- the path information is created when the set is created, not by the SPISORT program.
The UNFILTERED option can be added to the FOR SET command if the automatic filtering described above is not desired:
+> for set names unfiltered
+> display all
OFFICE = ACCOUNTING;
EMPLOYEE = Eight, Sharon;
EMPLOYEE = Nine, Calvin;

OFFICE = ACCOUNTING;
EMPLOYEE = Eight, Sharon;
EMPLOYEE = Nine, Calvin;
+>
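The path information and the filtered versus unfiltered behavior can be sketched conceptually. This Python fragment (hypothetical; not SPIRES code) generates one "sort entry" per EMPLOYEE occurrence and uses the entry's path to filter the record on display:

```python
# Conceptual sketch only -- not SPIRES code.  Models the path
# information kept for each sort entry and the automatic filtering
# applied under FOR SET.
record = {"OFFICE": ["ACCOUNTING"],
          "EMPLOYEE": ["Eight, Sharon", "Nine, Calvin"]}

# GENERATE SET: one entry per EMPLOYEE occurrence, each remembering
# which occurrence put the record at that position in the set.
entries = [{"sort_value": v, "path": ("EMPLOYEE", i)}
           for i, v in enumerate(record["EMPLOYEE"])]

def display(record, path=None):
    """Render the record; with path info, keep only that occurrence.
    path=None corresponds to the UNFILTERED option."""
    lines = ["OFFICE = %s;" % record["OFFICE"][0]]
    for i, v in enumerate(record["EMPLOYEE"]):
        if path is None or path == ("EMPLOYEE", i):
            lines.append("EMPLOYEE = %s;" % v)
    return lines
```

Displaying each entry with its own path reproduces the filtered output shown above; passing no path reproduces the UNFILTERED output.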
Explicit filtering can be done with the SET FILTER command, which may be used to control the sort entries created for a goal record when the set is created. For example, compare these two sets of commands applied to the same subfile:
1) -> for subfile
   +> define set parties, elements guest(tv=all) date
   +> generate set first
   +> endfor
   -'ORV.GQ.DOC.PARTIES' has 25 sort entries
   -End of Global FOR
   ->

2) -> for subfile
   +> define set parties, elements guest(tv=all) date
   +> set filter for guest(1/5)
   +> generate set first
   +> endfor
   -'ORV.GQ.DOC.PARTIES' has 5 sort entries
   -End of Global FOR
   ->
The command sequences are identical, except for the SET FILTER command in the second example, which tells SPIRES to treat the goal records as if only the first five occurrences of the GUEST element exist.
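The occurrence-windowing that SET FILTER performs can be sketched conceptually. This Python fragment (hypothetical GUEST data; not SPIRES code) shows how limiting a 25-occurrence element to its first five occurrences changes the number of sort entries generated:

```python
# Conceptual sketch only -- not SPIRES code.  Hypothetical GUEST data
# shows how SET FILTER FOR guest(1/5) makes GENERATE SET behave as if
# only the first five occurrences of the element exist.
guests = ["Guest %d" % n for n in range(1, 26)]   # 25 occurrences

def generate_entries(occurrences, filter_range=None):
    """One sort entry per surviving occurrence; filter_range is the
    (first, last) occurrence window, e.g. (1, 5)."""
    if filter_range is not None:
        first, last = filter_range
        occurrences = occurrences[first - 1:last]
    return [{"sort_value": g} for g in occurrences]
```

With no filter, 25 entries are generated; with the (1, 5) window, only 5 are, matching the two command sequences above.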
When you use explicit SET FILTER commands on the sort elements to create a set, you should clear those filters when you process the set. (Note that SPIRES will be forgiving if you leave those filters set; that is, the output will be the same whether they are set or not. However, if you were to set other filters on the sort elements when processing the set, the results might be unpredictable. Hence, it is recommended that you not set any filters on the sort elements when you are processing the set under FOR SET.)
Information about filters with direct sets appears in the next section. [See 1.8.] General information about element filters appears later in this manual. [See 21.] The path information may be applied selectively in custom-designed SPIRES formats, if desired. See the manual "SPIRES Formats" for details on the PATH and NPATH options.
Under FOR SET processing, SPIRES extracts from each sort entry a pointer, used to retrieve a goal record for processing. A special type of set, called a "direct set", can be created that already contains the goal record data desired for FOR SET processing -- that is, SPIRES does not need to retrieve each goal record for processing, because the desired data for display is already in the set.
Processing a direct set under FOR SET is more efficient than processing a non-direct set. Direct sets are a benefit primarily to large sorting applications, where thousands of sort entries are sorted and processed; and to sorting applications where the same set is sorted multiple times for multiple reports.
The subsections of this section will discuss direct sets in detail. The first subsection will cover the additional options on the DEFINE SET command that are used to create direct sets. [See 1.8.1.] The next will discuss the processing of direct sets, using the DIRECT option on the FOR SET command. [See 1.8.2.] Following that will be a discussion on using filters with direct sets. [See 1.8.3.]
Direct sets are created by adding the DIRECT option to the DEFINE SET command, and used by adding the DIRECT option to the FOR SET command. [See 1.8.2.] Here again is the syntax of the DEFINE SET command:
DEFINE SET setname [REPLACE] [DIRECT [EXTERNAL] [SCAN] [ALL]] [TV=ALL] ...
   ... ELEMENTS [=] element-list [+ direct-list] [- direct-list]
REPLACE, "setname", TV=ALL, and "element-list" were discussed in the section on the DEFINE SET command. [See 1.3.1.]
The DIRECT option tells SPIRES that the set should be a direct set. The other options relating to direct sets, which are discussed in detail below, are:
- EXTERNAL -- tells SPIRES to store the external form of all elements named in the "element-list" and the "direct-list" as part of the direct data. It does not affect the "sort" form used for the elements in the "element list", which means that elements specified in their internal form for sorting will have both forms stored in the set: internal for sorting and external for displaying. It can be overridden on an element by element basis in the "direct-list" by putting "(I)", for "internal", after the element name; an example appears below.
- SCAN -- Use the WHERE clause in effect on the Global FOR command to control what sort entries are created. If the generated sort entry would not pass the WHERE clause criteria, it is not added to the set.
- ALL -- All the data in the record that passes through the filtering in effect when the GENERATE SET command is issued will go into the sort entries.
- + direct-list -- This is a list of the elements to be included in the set for each generated sort entry. These "direct-set" elements are tag-alongs; they are not used for sorting; all sorting elements must appear in the "element-list". Each element in the "direct-list" will be added to the set in its internal form unless it is followed by "(X)", or unless the EXTERNAL option is specified. (If the EXTERNAL option is specified, an element can be followed by "(I)" to request the internal form for that element.)
- - direct-list -- This is a list of elements to be excluded from the set for each generated sort entry; it is useful when you have specified ALL (see above) but want or need to eliminate one or a few elements.
All the elements in "element-list" -- that is, the sort elements -- will be included in the direct set, so in general, there is no reason to list them also in the "direct-list" (but see the notes below). The other options, ALL and the "direct-lists", let you add other elements from the goal records to the direct set, in a manner similar to the SET ELEMENTS command. For example, if DIRECT ALL is specified with no "direct-lists", then all the data in the record that passes through the filtering in effect when the GENERATE SET command is issued will go into the sort entries generated for the direct set.
Important: DIRECT ALL does not mean that all occurrences of a multiply occurring sort element will go into the set -- only the occurrence of the element that is generating that entry will. (The others are being filtered out, and hence are not included in the sort entry.) If you will need to display all occurrences of a sort element, rather than just the occurrence for that sort entry, do not use direct sets.
The "direct-list" options can be used to add or subtract elements for storage in the direct set. In general, if you use the ALL option, you might subtract elements you did not need to be included in the set; if you do not use the ALL option, you might need to add elements to the direct set. You can, however, add or subtract elements as desired. For instance, you can subtract an entire structure from ALL, and then add back a single element within it:
-> define set x direct all, elems a b c - d + d@e
All the elements in the goal record except structure D but including element E in structure D will go into set X.
Any elements defined in the file definition for the goal record can be named as sort elements or "direct elements"; additionally, elements within phantom structures may be named in a DEFINE SET command for a direct set.
Dynamic elements, including elements from dynamically-defined phantom structures, may also be included. SPIRES will compute the value of a dynamic element at the time the set is generated, storing that value in the set. Later, under "FOR SET name DIRECT", SPIRES will use the stored value as the element value. [See 1.8.2.] In fact, you don't need to redefine the element when you use the "FOR SET name DIRECT" command; if you do, SPIRES will ignore it (and any other dynamic element definitions) until after the next ENDFOR. (If you SHOW DYNAMIC ELEMENTS under "FOR SET name DIRECT", SPIRES will show it simply as "DEFINE ELEMENT name", with no further information, which indicates that the definition is irrelevant under this form of FOR SET processing.)
The SCAN option lets you apply WHERE clause processing as the sort entries are created for the direct set. SCAN causes the GENERATE SET command to apply the current WHERE clause criteria to each sort entry to see if the entry should be included in the direct set.
Here is a very simple example to illustrate the effect. Here is a sample record:
ID = 1; A = Apple; A = Orange; B = Cat; B = Dog;
And here are commands to select records and create the set, first without the SCAN option:
-> for subfile where a = apple and b = cat
+> define set apples direct tv=all elements=a,b
+> generate set
+> endfor
-'ORV.GQ.JLS.APPLES' has 4 sort entries
-End of global FOR
-> for set apples
+> display all
ID = 1; A = Apple; B = Cat;
ID = 1; A = Apple; B = Dog;
ID = 1; A = Orange; B = Cat;
ID = 1; A = Orange; B = Dog;
The WHERE clause was applied to choose records to go in the direct set, but the set has entries for combinations besides the "apple/cat" one.
By adding the SCAN option to the DEFINE SET command, the WHERE clause will be applied as the set entries are created:
-> for subfile where a = apple and b = cat
+> define set apples direct scan tv=all elements=a,b
+> generate set
+> endfor
-'ORV.GQ.JLS.APPLES' has 1 sort entry
-End of global FOR
-> for set apples
+> display all
ID = 1; A = Apple; B = Cat;
Note that the same effect could have been achieved by adding a WHERE clause to the FOR SET command in the first example (FOR SET APPLES WHERE A = APPLE AND B = CAT). The difference is that in the second example, only one entry was created at all for the direct set. The WHERE clause processing happened when the set was created (GENERATE SET), not when the set was used (FOR SET).
The SCAN option must immediately follow the word DIRECT in the DEFINE SET command.
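The SCAN behavior above can be sketched conceptually. This Python fragment (hypothetical; not SPIRES code) builds one entry per combination of occurrences, as TV=ALL on both elements does, and then applies the WHERE test at generation time:

```python
# Conceptual sketch only -- not SPIRES code.  Shows why DIRECT with
# TV=ALL on both elements produced four entries for one record, and how
# SCAN trims them while the set is being generated.
from itertools import product

record = {"ID": 1, "A": ["Apple", "Orange"], "B": ["Cat", "Dog"]}
where = lambda e: e["A"] == "Apple" and e["B"] == "Cat"

# TV=ALL on A and B: one sort entry per combination of occurrences.
entries = [{"ID": record["ID"], "A": a, "B": b}
           for a, b in product(record["A"], record["B"])]

# SCAN: apply the WHERE clause at GENERATE SET time, keeping only
# entries that pass; without SCAN the same clause would have to be
# applied later, on the FOR SET command.
scanned = [e for e in entries if where(e)]
```

Four candidate entries collapse to the single "apple/cat" entry, which is exactly the difference between the two examples above.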
When generating a set, SPIRES by default places the internal form of both sort elements and direct elements into the set. For sort elements, however, you may want SPIRES to sequence on the external form, in which case you include the "X" parameter in the sort options for the element:
+> define set sortsox, element color(X)
Similarly, you can request that a direct element be put in the set in its external form:
+> define set sortsox direct, element color(X) + size(X)
This technique is primarily useful when you will be processing the direct set multiple times -- the element does not have to be processed through its OUTPROC rules for each report. That can save a great deal of processing, particularly when the OUTPROC rules are complex, e.g., requiring other records to be fetched for table lookups (the $LOOKUP and $SUBF.LOOKUP procs).
You can request that all the elements in the "direct-list" and in the "element-list" be saved in the set in their external forms for output by adding the EXTERNAL option to the DEFINE SET command, following the DIRECT option. Important: That does not affect what form is stored for each element in the "element-list" for sorting purposes. In other words, if EXTERNAL is specified for the direct set and all the elements in the "element-list" for sorting are to be sorted in their internal form, then SPIRES will place both the internal and external form of each in the set.
You can override the EXTERNAL option for elements in the "direct-list" on an individual basis by adding the "(I)" option after its name in the list. For example:
-> define set sortaray direct external elem type + name contact(i)
In that case, SPIRES will generate sort entries in which the TYPE element is stored in its internal form for sorting and in its external form for display. Also, the NAME element will be stored in its external form, and the CONTACT element in its internal form. That might be useful in a table-lookup situation where you want the lookup to be done at the time of the display under FOR SET, not at the time the set is generated. (See below.)
Note these implications of storing the external form of a direct element in a set -- the direct element behaves as if it had no processing rules at all, which means:
- If an element normally does a lookup to another record-type to get its value on output, that value will be looked up at the time the set is generated, not at the time the set is used. If that is a problem, you can either request the internal form of the element, so that the lookup must be done when the set is processed, or process the set under FOR SET without the DIRECT option.
- both $UVAL and $CVAL will be the same in a format label group containing a GETELEM for that element;
- the functions $GETUVAL, $GETCVAL, $GETIVAL and $GETXVAL, used in a format, will retrieve the same value for that element;
- if you need the internal form, you will probably have to use the $PROCSUBG function to retrieve it.
Be aware that elements defined as "OUTPROC-required" by the file definition can be placed in the set only in their external form, and the above implications will apply to them too.
You can request that a sort element be sorted by its internal form but be stored as a direct element in its external form by explicitly adding it to the direct element list, e.g.,
+> define set sortsox direct, element color + color(X)
Conversely, you can request that the sort element be sorted by its external form but be stored as a direct element in its internal form in one of the following ways:
+> define set sortsox direct, element color(X) + color(I)
or
+> define set sortsox direct, element color(X,I)
The "I" option indicates that the element should be carried as a direct element in its internal form.
You can specify that SPIRES not include a sort element in the collection of direct elements by using the "- direct-list" option, as in this example:
+> define set sortsox direct, elem=sex,size,color - color
The COLOR element would be used as a sort element but would not be stored as a direct element, and thus would not be accessible under direct set processing.
Virtual elements can be specified as either sort or direct elements. They become "real" elements when the set is created, in that their current value (at the time the set is created) is stored in the set. Either the internal or external form may be specified for the set. The external form for direct set storage is determined by executing the virtual element's OUTPROC rules; the internal form is determined by executing its OUTPROC rules and then its INPROCs.
You cannot use sets to circumvent security provisions. For example, elements whose values are hidden from your account may not be placed into a direct set. SPIRES may not display an error message when you issue the DEFINE SET command, but the GENERATE SET command will certainly not place the hidden data into the set.
The GENERATE SET command works the same for direct sets as for non-direct sets, except that the generated set will be larger. [See 1.3.2.] There is an absolute limit of 5500 bytes per sort entry, but as sort entries become very large, they cause inefficient processing. Hence, do not include non-sort elements in a direct set just to carry them along; be sure you need them in your final product.
The special form in which SPIRES stores direct data in the set requires eight extra bytes of overhead for each element or structure. It may thus be more efficient to place entire structure occurrences in the set (eight bytes for each occurrence of the entire structure) than a few individual elements from the structure (eight bytes for each element value). Of course, for overall storage savings, it is best to erase sets as soon as you no longer need them, since they are duplicating data already stored. [See 1.2.]
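The storage trade-off above is simple arithmetic. This Python fragment (hypothetical structure shape; not SPIRES code) compares the overhead of naming a few elements individually against storing whole structure occurrences:

```python
# Back-of-the-envelope sketch of the eight-byte overhead above; the
# structure shape (10 occurrences, 3 elements picked from each) is
# hypothetical.
OVERHEAD = 8                 # bytes per stored element value or
                             # structure occurrence
occurrences = 10
elements_picked = 3

# Naming the 3 elements individually: 8 bytes per element value.
per_element = OVERHEAD * occurrences * elements_picked

# Storing whole structure occurrences: 8 bytes per occurrence,
# no matter how many elements each occurrence contains.
per_structure = OVERHEAD * occurrences
```

Here the per-element approach costs 240 bytes of overhead against 80 for whole occurrences; the element data itself is the same either way, so the structure-occurrence form wins once more than one element per occurrence is needed.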
Direct sets are processed like other sets -- under the FOR SET command. However, you must use the DIRECT option to request that the set be processed as a direct set, rather than just as a regular set.
Here again is the syntax of the FOR SET command:
FOR SET setname [DIRECT|UNFILTERED] [WHERE clause]
or
FOR source VIA SET setname [DIRECT|UNFILTERED] [WHERE clause]
Only the DIRECT option is discussed here. [See 1.5.]
Even if a set is a direct set, it can be processed as if it were a regular set, by omitting the DIRECT option. However, if the DIRECT option is specified, SPIRES will not retrieve the goal record that a sort entry points to, but will instead use the data within the sort entry for any record processing under FOR SET.
Only the following Global FOR record processing commands may be used to process "direct records": DISPLAY, SHOW KEYS and SKIP. Most other Global FOR commands, such as TRANSFER, REMOVE, MERGE, DEQUEUE, UNQUEUE and REFERENCE, will not work when a direct set is being processed. Note that the STACK command does work, though any stack created will retrieve goal records, not direct records.
You should not create a direct set unless all of the data you will need for reporting is included in the lists of sort elements or direct elements. Otherwise, you will be unable to use the DIRECT option, since you will need other data in the goal records. It is easy, for example, to forget elements that are retrieved by a report format with a $GETxVAL function or by the $GETELEM (A79) processing rule.
Virtual elements are handled in an interesting way in direct sets. As noted earlier, they become "real" elements when the set is created: their current value at generation time is stored in the set, in either its internal or external form. [See 1.8.1.] If you display a direct set record in the SPIRES standard format, any virtual elements in it will be displayed too, without your having to name them explicitly in a SET ELEMENTS command.
Any direct element stored in its internal form is converted to external form by processing it through its OUTPROC rule string. This is true even for virtual elements -- the internal form (which was created under GENERATE SET by executing the element's OUTPROC rules and then its INPROC rules) is run through the OUTPROC rules again to get the external form.
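The form conversions above can be sketched with toy processing rules. In this Python fragment (hypothetical; the rule bodies are stand-ins for real OUTPROC/INPROC rule strings, using a date-like element), display always applies OUTPROC to whatever internal form is stored:

```python
# Conceptual sketch only -- the OUTPROC/INPROC rule strings here are
# toy stand-ins (a date element), not real SPIRES processing rules.
def outproc(internal):        # internal form -> external form
    y, m, d = internal
    return "%02d/%02d/%02d" % (m, d, y % 100)

def inproc(external):         # external form -> internal form
    m, d, y = (int(x) for x in external.split("/"))
    return (1900 + y, m, d)

# A direct element stored internally is run through OUTPROC on display.
stored = (1987, 3, 21)
displayed = outproc(stored)

# A virtual element's internal form is built at GENERATE SET time by
# OUTPROC then INPROC, and run through OUTPROC again for display.
virtual_internal = inproc(outproc(stored))
redisplayed = outproc(virtual_internal)
```

When the two rules are inverses, as here, the virtual element's round trip lands on the same external form; the point is only the order in which the rules are applied.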
Since you are allowed to issue the GENERATE SET command during direct set processing, you can create a direct set from another direct set. This technique can be considerably more efficient than generating multiple sets from the goal records for several different reports. If you need to sort the goal records several times on different elements, consider creating a direct set for the first one and creating other direct sets from it for the others.
There is no guarantee that using direct sets will save you money. If set entries are very large because you have many direct elements, the overhead to process them could offset the savings from not processing the goal records. If you are concerned about costs and your set will have many direct elements and/or sort elements, you should certainly run some tests using real data before deciding to use direct sets.
If you have explicit filters set when you begin processing a direct set, SPIRES will clear those filters:
-> set filter for wearer(first)
-> for set sortsox direct
-Warning: Filters have been cleared until ENDFOR
+> show filters
-No filters found
+> endfor
-Filters have been reestablished
-End of Global FOR
-> show filters
SET FILTER FOR wearer(first)
->
SPIRES clears the explicit filters because when you start processing a direct set, you are in effect working with an entirely different type of record than when you were working with the subfile's goal records, i.e., when the explicit filters were set. Such explicit filters for the goal records could cause problems if they were applied to the direct set. For example, such filters would not work properly with direct elements stored in their external forms.
Although pre-existing explicit filters are cleared when the direct set processing begins, you can set other filters (with the SET FILTER command) after issuing the "FOR SET setname DIRECT" command. Thus, you are allowed to continue filtering the direct set, but only after direct set processing has been initiated. As soon as an ENDFOR condition occurs, however, these filters are discarded, and the explicit filters that were automatically cleared are reestablished by SPIRES, as shown in the example above.
Display sets are a variation of direct sets. [See 1.8.] Display sets are created with the DEFINE SET and GENERATE SET commands, but unlike direct sets, display sets are not placed in ORVYL data sets. Instead, you specify an output area in which the display set should be placed (e.g. your terminal screen or your active file). The GENERATE SET command sends the set directly to the output area, so you do not need to use the FOR SET command to use the set.
Since display sets do not produce ORVYL data sets, you can't use SPISORT to sort the records in the set. But you may be able to take advantage of the automatic sorting in your record keys (using FOR SUBFILE) or in an index (using FOR INDEX) to control the order in which entries in the set appear. Or, since you can generate a set from a record stack, you could use the SEQUENCE command to arrange records in a particular order before you use the GENERATE SET command.
An advantage of this difference between display sets and normal sets is that display sets do not have the 5500-byte size limit that normal sets have.
Display sets are created by adding a DISPLAY option to the DEFINE SET command. Here is the syntax for display sets:
DEFINE DISPLAY SET [SCAN] [ALL] [TV=ALL] [EXTERNAL] ...
   ... ELEMENTS [=] element-list [+ display-list] [- display-list]
The ALL, TV=ALL, EXTERNAL, and "element-list" options were discussed earlier. [See 1.3.1.] The "display-list" options are similar to the "direct-list" options for direct sets. [See 1.8.1.] They let you add or subtract elements for inclusion in the display set. Note that you do not supply a setname for a display set.
The SCAN option works the same as for direct sets. It lets you apply WHERE clause processing as the sort entries are created for the display set. SCAN causes the GENERATE SET command to apply the current WHERE clause criteria to each sort entry to see if the entry should be included in the display set. [See 1.8.1 for an example.] The SCAN option must immediately follow the words DISPLAY SET.
Use the GENERATE SET command to send the display set to an output area:
[IN ACTIVE|areaname] GENERATE SET [ALL|FIRST|*|NEXT|n|REST|LAST]
If you omit the "IN areaname" prefix, the generated display set will be displayed at your terminal. The second option indicates which records in the Global FOR class are to be processed for the display set. The default is ALL records in the class.
Here is an example showing how to create a display set. Suppose you have a subfile whose goal records consist of a supply item, with structures for each order made for the item. The structures naturally occur in chronological order. You can create a set with an entry for each order this way:
-> display 1
ID = 1;
ITEM = Typewriter ribbon;
ORDER;
DATE.ORDERED = 03/21/87;
SUPPLIER = Congden and Crome;
QUANTITY = 3;
ORDER;
DATE.ORDERED = 04/01/87;
SUPPLIER = Smith Brothers;
QUANTITY = 5;
ORDER;
DATE.ORDERED = 05/04/87;
SUPPLIER = Congden and Crome;
QUANTITY = 1;
-> for index item
+> define display set elements = date.ordered(tv=all) + item id qty
+> generate set all
ID = 2;
ITEM = Light bulbs;
ORDER;
DATE.ORDERED = 03/21/87;
QUANTITY = 3 boxes;

ID = 2;
ITEM = Light bulbs;
ORDER;
DATE.ORDERED = 04/13/87;
QUANTITY = 2 boxes;

ID = 2;
ITEM = Light bulbs;
ORDER;
DATE.ORDERED = 05/12/87;
QUANTITY = 5 boxes;

ID = 4;
ITEM = Quadrille pads;
ORDER;
DATE.ORDERED = 01/05/87;
QUANTITY = 25;

ID = 4;
ITEM = Quadrille pads;
ORDER;
DATE.ORDERED = 03/04/87;
QUANTITY = 13;

ID = 3;
ITEM = Rubber stamp;
ORDER;
DATE.ORDERED = 06/16/87;
QUANTITY = 1;

ID = 1;
ITEM = Typewriter ribbon;
ORDER;
DATE.ORDERED = 03/21/87;
QUANTITY = 3;

ID = 1;
ITEM = Typewriter ribbon;
ORDER;
DATE.ORDERED = 04/01/87;
QUANTITY = 5;
Since file owners have almost complete freedom in choosing subfile names, the possibility always exists that you may have access to two or more subfiles with the same name. If you SELECT one of them, SPIRES will ask you which one you meant, showing you the file names of the subfiles to distinguish between them. Alternatively, when you SELECT one of the subfiles, you can specify all or part of the file name before the subfile name, in order to uniquely identify the desired subfile. This alternate syntax for the subfile name is also available:
- in SPIBILD for the "ESTABLISH subfile-name" command
- in the $LOOKSUBF function
- in system proc $SUBF.LOOKUP (action A65)
- in the SUBFILE statement in a format definition
- in the SUBFILE statement in the PHANTOM section of a file definition
The syntax for the subfile name in these commands is:
[&gg.uuu.filename] subfile-name
where the bracketed material is the name of the file as it appears in the first line of the file definition. You may also include a comma after the "filename" if it helps clarify the command syntax for you.
Starting from the left end, only as much of the file name as is needed to distinguish the subfile name from any other is necessary. Thus, for a file named "GA.JNK.MUSIC", you could specify "&GA.JNK.MUSIC" or as little as "&G". The "&" (ampersand) character is required if any part of the file name is given, since it tells SPIRES that what follows immediately is a file name rather than the subfile name it would otherwise expect.
Here are examples using the file name "GA.JNK.MUSIC" and a "RECORDS" subfile:
SELECT &GA.JNK.MUSIC, RECORDS
ESTABLISH &GA.JNK RECORDS
SELECT &G RECORDS
Though it is not recommended, an even shorter method is available. In some cases, if the file being named is your own, you can replace your account number with an asterisk (*) or a period (.), as in "&*MUSIC" or even "&.". This technique is not recommended when you are writing code that will be executed by other users, since SPIRES may try to substitute their account numbers instead of yours.
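The shortest-usable-prefix idea can be sketched conceptually. This Python fragment (hypothetical file names; not SPIRES code) finds the shortest leading piece of a file name that matches no other file you can access:

```python
# Conceptual sketch only -- not SPIRES code.  How short can a "&prefix"
# be and still single out one file?  The file names are hypothetical.
def shortest_unique_prefix(target, candidates):
    """Shortest leading piece of target matching no other candidate."""
    others = [c for c in candidates if c != target]
    for n in range(1, len(target) + 1):
        prefix = target[:n]
        if not any(c.startswith(prefix) for c in others):
            return prefix
    return target

files = ["GA.JNK.MUSIC", "XX.ABC.BOOKS", "ZZ.DOC.NAMES"]
# Only one of these files starts with "G", so "&G" is enough here.
```

If another accessible file also started with "G" (say "GQ.DOC.NAMES"), the prefix would have to grow to "GA" before it became unambiguous.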
All of the SPIRES system subfiles, such as FILEDEF and FORMATS, control access to their records by means of the record key, which begins with the account number of the user owning the record. By default, only you, the owner, can display and update your records in those subfiles.
Using a system subfile called METACCT ("met-account"), however, you can allow other users to display and possibly update your records in almost all system subfiles. Hence, you can share your system-subfile records with other users, letting them examine or copy a file definition, for example, or add their own procs to an EXTDEF record of yours. You simply add a record to the METACCT subfile that tells SPIRES what users have what access ("see-only" or "update") to your records in which subfiles. The only exception is METACCT itself, which is not affected by METACCT. A complete list of the affected subfiles appears in the next section.
To work with your records, users must first issue the SET METACCT command, naming the account whose records they want to see or use (yours). Their subsequent commands referring to your records, whether for display, update, or even compiling, will succeed or fail depending on the level of access you gave them in your METACCT record.
The next section of this chapter describes in detail the METACCT record that you the owner must create in order to give users access to your records. [See 3.1.] The section following that talks about this feature from your user's standpoint, describing the SET METACCT command in particular. [See 3.2.]
Having several people updating your system-subfile records can lead to confusion regarding the "current" version of a record. For example, suppose a user gets a copy of a file definition and makes changes to it; meanwhile, you do the same, updating the record before the other user does. When the other user's update occurs, your changes will be discarded.
To help avoid that type of problem, records in system subfiles may contain a special structure, called VERSION-STR. If used, this structure would in effect block the other user from updating the record until he or she takes your update into account. Details on VERSION-STR appear later in this chapter. [See 3.3.]
To give other users access to your records in one or more system subfiles, you must create a record stored under your account in the METACCT subfile. In addition, unless the access is limited to See-Only, the users can add new records to those subfiles for your account, make changes to records and update them, compile them, or even remove them. In a sense, by putting together a METACCT record, you are allowing some user or users to be you, at least in regard to some system subfiles (possibly limited to certain records; see KEY-PREFIX below). [See 3.2 for specific details on what users with METACCT access can do.]
The basic structure of the METACCT goal record-type is this:
    ACCT = gg.uuu;              - your account number (record key)
    COMMENTS = comments;        - optional comments
    ACCESS = level;             - "See-only" or "Update"
    ACCOUNTS = gg.uuu, ...;     - accounts given that access
    [SUBFILE = subfile;]        - specific subfile it pertains to
    [KEY-PREFIX = gg.uuu...]    - specific record key prefixes
Since you may have only a single record in the subfile, all special access to your records must be defined in this record. To that end, ACCOUNTS with SUBFILE may repeat under ACCESS, and ACCESS may repeat as well, forming multiple access-structures (see example below).
In case of contradictory statements regarding a specific account or subfile, SPIRES always chooses the least restrictive interpretation. For instance, if you give another account the ability to update records in one or all subfiles, no other statements in the record can override that. The examples below will make this point clear.
ACCESS may have either of two values, and each entry in ACCOUNTS may take any of several forms:
- See-Only -- users may only examine records;
- Update -- users may add, update, and remove records;
- PUBLIC -- to give access to all users;
- gg.uuu -- to give access to the named account;
- gg.... -- to give access to all users in the named group;
- g..... -- to give access to all users in the named community.
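The ACCOUNTS forms above amount to prefix patterns over account numbers. As a rough illustration only (the function and its behavior are a sketch for this manual, not SPIRES code), here is how such matching might look in Python:

```python
def account_matches(pattern, account):
    """Check whether an account (form "gg.uuu") matches an ACCOUNTS value.

    PUBLIC matches everyone; "gg.uuu" matches only that account;
    "gg...." matches the whole group; "g....." matches the community.
    """
    if pattern == "PUBLIC":
        return True
    if pattern.endswith("....."):           # community form: g.....
        return account.startswith(pattern[:1])
    if pattern.endswith("...."):            # group form: gg....
        return account.startswith(pattern[:2])
    return pattern == account               # exact account: gg.uuu
```

Note that the community test must come before the group test, since a community pattern such as "A....." also ends in four periods.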
These are the system subfiles whose records can be shared through METACCT:

    FILEDEF      RECDEF     BACKFILE   STATCHAR
    FORMATS      EXTDEF     BACKRECS   FORCHAR
    INDEX        VGROUPS    RECHAR     BACKCHAR
    SYS PROTO    FORSTAT    STATIC     BACKDEFS
    COMP PROTO   FORLOAD    FORCHAR
The only significant omission from the list of system subfiles above is METACCT itself. You cannot give other users access to your own METACCT record; only you can add it or change it.
If you give Update access to the FORCHAR subfile, then you are permitting the other user to remove any compiled formats for any of your files, even if the format's id belongs to neither you nor the other user but to a third user.
Similarly, if you give a user Update access to the COMP PROTO subfile, that user can remove any compiled protocols in any of your protocol files, even if others defined the SYS PROTO records for those protocols.
The METACCT record takes effect as soon as you add it to the METACCT subfile.
Here are some sample METACCT records:
    ACCT = AB.USE;
    ACCESS = Update;
    ACCOUNTS = AB....;
That record gives all accounts in group AB access to user AB.USE's records in all the system subfiles listed above.
The next example is more complicated:
    ACCT = AM.USE;
    ACCESS = See-Only;
    ACCOUNTS = LO.USE, AM....;
    SUBFILE = FORMATS;
    ACCESS = Update;
    ACCOUNTS = AM.UCK;
In this example, account AM.UCK gets Update access to all the system subfiles listed above, including FORMATS. The See-Only limitation for group AM users for the FORMATS subfile doesn't apply to AM.UCK, because the specific account reference to AM.UCK under "ACCESS = Update" is less restrictive than the specific subfile reference for the AM group.
Once the barn door is opened and an access-structure gives Update access to an account for some or all the subfiles, that access will not be revoked by any other access-structure. Below is a bad example, in that the record owner is trying to allow a particular user to update everything but the FORMATS subfile; it will not achieve the desired aim:
    ACCT = GA.TES;
    ACCESS = Update;
    ACCOUNTS = BY.TES;
    ACCESS = See-Only;
    ACCOUNTS = BY.TES;
    SUBFILE = FORMATS;
Since the first access-structure has already given Update access to all subfiles for account BY.TES, the second structure does not override it; hence, BY.TES has Update access to the records in the FORMATS subfile belonging to GA.TES.
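The "barn door" rule can be modeled as taking the least restrictive (highest) access level granted by any matching access-structure. The following Python sketch is purely an illustration of that rule; the data layout and function names are invented for this example and are not part of SPIRES:

```python
# Access levels ordered so that the least restrictive level wins.
LEVELS = {"None": 0, "See-Only": 1, "Update": 2}

def effective_access(structures, account, subfile):
    """Return the least restrictive access any structure grants.

    Each structure is (level, accounts, subfile-or-None); a None
    subfile means the grant applies to all system subfiles.
    """
    best = "None"
    for level, accounts, sub in structures:
        applies_to_subfile = sub is None or sub == subfile
        if applies_to_subfile and account in accounts:
            if LEVELS[level] > LEVELS[best]:
                best = level
    return best

# The GA.TES record from the example above: the See-Only structure
# for FORMATS cannot revoke the earlier blanket Update grant.
record = [
    ("Update",   {"BY.TES"}, None),
    ("See-Only", {"BY.TES"}, "FORMATS"),
]
```

Running `effective_access(record, "BY.TES", "FORMATS")` under this model yields "Update", matching SPIRES's interpretation of the third example.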
If it is really necessary to give an account Update access to all but one or two subfiles to which it should have See-Only access, you must type the name of each of the subfiles for Update access. In practical usage, however, this seems to be an infrequent need. In most cases, you should be able to state the requirements fairly simply.
You could easily concoct more complicated examples, but generally speaking, practical uses are relatively simple, more like the first two examples than the third.
If SPIRES's interpretation of that third example record still strikes you as odd, consider how you would interpret the same record if the See-Only account for the FORMATS subfile were listed as BY.... (i.e., group BY) rather than the specific BY.TES. Is the record owner trying to allow group BY to have See-Only access to FORMATS while also allowing account BY.TES to have Update access to all the subfiles? Or does the record owner really mean to prevent BY.TES from updating those FORMATS records? The former interpretation seems more plausible, and it is the way SPIRES interprets that METACCT record.
This section describes the privileges that METACCT access provides to you as a user of someone else's records, as well as the procedure you must follow in order to use them.
Generally speaking, you find out that you have been given METACCT access to someone's system-subfile records because he or she tells you about it. Aside from trial and error, you have no way of finding out whether you have access to other people's system-subfile records.
If you have been granted access to someone else's records, you can begin using them by issuing the SET METACCT command in SPIRES:
SET METACCT gg.uuu[, gg.uuu...]
where "gg.uuu" is the other person's account number, i.e., the account of the user whose records are to be retrieved or updated or compiled. By including multiple accounts in the command, you can request METACCT access to the records of several accounts at one time.
SET METACCT is a session command, remaining in effect for the duration of the SPIRES session, or until it is cancelled.
CLEAR METACCT cancels the current METACCT access. Issuing another SET METACCT command also cancels the current METACCT access, and then establishes access for the new list of accounts. SHOW METACCT shows you a list of the accounts currently set, or displays the message: "No accounts defined." (Syntax note: You can also spell out METACCOUNT in these commands if you desire.)
Generally, Update access to a system subfile through METACCT means that SPIRES treats you as if you were using both your account and the other user's when you have the subfile selected. Thus, you can see their records with the DISPLAY command, make changes to their records using the MERGE command or the TRANSFER/UPDATE sequence, or discard their records using the REMOVE command. You can even add new records for their account.
If you have been given access to compilable records, such as file, record, vgroup or format definitions, you can compile or recompile those records as well. Note, however, that for file definitions, you must have WRITE access (via ORVYL permits) to the owner's account. Additionally, if the compilation creates new ORVYL data sets, they will not have the normal PUBLIC access permits that are set when files are compiled or recompiled on the owning account. These will need to be set appropriately on the owning account. See section B.12.1 of the manual "SPIRES File Definition"; online, [EXPLAIN ORVYL FILES, PERMITS FOR IMMEDIATE INDEXING.]
See-Only access is a subset of the Update privileges. As its name implies, See-Only access allows you only to see the other person's records (for example, with the DISPLAY or TYPE commands), but not to add new ones, change existing ones or remove old ones.
As described in the previous section [See 3.1.], the owner may grant this access to his or her records in one, some, or all of the following system subfiles:
    FILEDEF      RECDEF     BACKFILE   STATCHAR
    FORMATS      EXTDEF     BACKRECS   FORCHAR
    INDEX        VGROUPS    RECHAR     BACKCHAR
    SYS PROTO    FORSTAT    STATIC     BACKDEFS
    COMP PROTO   FORLOAD    FORCHAR
Other commands that work with source records will probably work through METACCT access. A good example is PERFORM FILEDEF SUMMARY, which summarizes the file definition for the selected subfile. If you have at least See-Only access to FILEDEF for someone else's file definitions, you can issue this command when you have one of their subfiles selected. [See 28.4.] PERFORM PRINT is also allowed when appropriate METACCT access is in effect. [See 28.1.]
Several system subfiles in SPIRES (e.g., FILEDEF, FORMATS) contain a structure called VERSION-STR. This structure can help you keep track of what "version" (i.e., which copy) of a system-subfile record you are working with. Because VERSION-STR is designed to solve a particular problem, it is necessary to understand the problem in order to understand how VERSION-STR works.
Suppose you and a co-worker are both working on an application. Via SET METACCT [See 3.2.] you transfer the application's file definition from FILEDEF and begin to make some changes to it. Your pal needs to make some changes to it too, so independently of you, she transfers the file definition, makes her changes, and updates it. Then you finish your work and update the record, losing her changes.
The data maintained by the VERSION-STR structure can prevent this problem. When you transfer your copy of the record to work with, the VERSION-STR structure contains a version number, such as "12". Similarly, your co-worker, getting a copy shortly thereafter, would also get version 12 of the record. You both make your changes, leaving the VERSION-STR untouched. When she issues the UPDATE command, SPIRES compares the version number of the stored record with the version number in the input; since they are the same, the record goes back in without an error. At that time, SPIRES assigns the record the next version number, which here is 13.
When you issue the UPDATE command, the command fails, because the version number of the stored record, 13, doesn't match the version number in the input data, 12. Hence you are warned that the record has been updated since you retrieved your copy of it. Presumably, you'll get a copy of the record in its latest form, and merge your changes in with that one. [You could also simply choose to change the version number in your copy of the record to 13 and issue the UPDATE command again. The VERSION-STR feature serves only as a warning; it cannot absolutely prevent records from being updated inappropriately.]
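This scheme is a form of optimistic concurrency control: updates proceed without locks, and a conflict is detected at update time by comparing version numbers. The following Python sketch illustrates the logic; the function and record layout are invented for this example and do not reflect SPIRES internals:

```python
class StaleVersionError(Exception):
    """Raised when an update is based on an outdated copy."""

def update_record(stored, incoming):
    """Accept an update only if the input carries the stored version.

    On success, the stored record receives the new data and the next
    version number, wrapping back to 1 after 999999 as SPIRES does.
    """
    if incoming["version"] != stored["version"]:
        raise StaleVersionError("record changed since you copied it")
    stored["data"] = incoming["data"]
    stored["version"] = stored["version"] % 999999 + 1
    return stored
```

In the scenario above, your co-worker's update of version 12 succeeds and produces version 13; your later attempt to update with version 12 then fails, warning you to merge your changes with the current record.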
To request that version information be maintained for a system-subfile record, you simply add the VERSION-STR structure to the record:
    [VERSION-STR;]
    VERSION-NUMBER = n;
    [VERSION-ACCT = gg.uuu;]
Because VERSION-NUMBER is the key of the VERSION-STR structure, you do not need to include the VERSION-STR statement.
For VERSION-NUMBER, "n" must be a positive integer no greater than 999999; it's most often set to "1". (After 999999, VERSION-NUMBER wraps around to 1 again.)
If you omit VERSION-ACCT, SPIRES supplies your account number, i.e., the account of the logged-on user. SPIRES does not verify the account value, so it will accept anything you type, even an invalid account number or form. Similarly, if you use an account abbreviation such as ".", which means "your account" in most other SPIRES contexts, it is stored as the literal character typed and is not translated into your account number here. SPIRES will, however, display the account number to you in lowercase, for a reason explained below.
The VERSION-ACCT is significant because SPIRES will ignore the VERSION-STR if the value for VERSION-ACCT doesn't match the account in the key of the record. The example described below will explain why that is useful.
The VERSION-STR structure is available in these system subfiles:
    FILEDEF    FORMATS    VGROUPS
    RECDEF     BACKFILE   BACKRECS
If you are the only one who updates your system-subfile records, VERSION-STR will be of limited use. You could use it to keep track of copies of a record by their version numbers, just as you might use the MODDATE and MODTIME elements, e.g., to verify that the paper copy you have is the latest version.
When several users may be updating your system-subfile records, VERSION-STR is more useful, as described in the earlier example. Here, in more detail, is how you might use it.
Suppose that one of your applications keeps its production code under account GQ.PRD, and its test code under GQ.TES. Within each GQ.PRD record, you could add the VERSION-STR structure, like this:
    FILE = GQ.PRD.ALMANAC;
    ...
    VERSION-NUMBER = 1;
    VERSION-ACCT = gq.prd;
On account GQ.PRD, put the record back into the system subfile (FILEDEF, for this example). SPIRES will immediately change the record's version number to "2" internally.
Sometime later, you get a copy of file definition GQ.PRD.ALMANAC, moving it under your test account, GQ.TES. With the WYLBUR command CHANGE, you change "GQ.PRD" to "GQ.TES". (Since the VERSION-ACCT value is in lowercase, it remains "gq.prd".) You then make other changes you want to make, adding the record to FILEDEF as GQ.TES.ALMANAC. Because the VERSION-ACCT doesn't match the account in the key of the record, the VERSION-STR data is ignored, and isn't updated.
Eventually you are ready to replace the GQ.PRD.ALMANAC file definition with the new version under GQ.TES. Again you issue the CHANGE command to change occurrences of "GQ.TES" to "GQ.PRD" (again leaving the VERSION-ACCT value untouched), and then update the file definition with the new copy.
Because the VERSION-ACCT value now matches the account in the record key, SPIRES will pay attention to the structure. If the input version number ("2") matches the version number in the stored record, then the update can continue. If it doesn't match, then the record has been updated since the time you made the copy of it; SPIRES will reject the new copy, meaning you need to resolve the discrepancies between them.
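The account-matching gate that makes this workflow possible can be expressed as a simple prefix test. This sketch is only an illustration of the behavior described above (the function name is invented); the comparison is shown case-insensitively, since the stored VERSION-ACCT value is kept in lowercase:

```python
def version_check_applies(record_key, version_acct):
    """SPIRES honors VERSION-STR only when VERSION-ACCT matches the
    account prefix of the record key, e.g. key "GQ.PRD.ALMANAC"
    against VERSION-ACCT "gq.prd"."""
    return record_key.lower().startswith(version_acct.lower() + ".")
```

Under this model, the copy keyed GQ.TES.ALMANAC with VERSION-ACCT still "gq.prd" fails the test, so its version data is ignored; once the key is changed back to GQ.PRD.ALMANAC, the test passes and the version check is enforced again.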
WARNING: If you use the MERGE command to update a system-subfile record (rarely done under any circumstances, we hope) and the stored record contains the VERSION-STR structure, be sure to include the VERSION-STR in your input as well. If you don't, the VERSION-NUMBER will not get checked/updated appropriately.
The warning also applies to updates done through partial processing; be sure to open and close the VERSION-STR structure during your processing.
Generally speaking, the only way to eliminate the VERSION-STR structure once you have begun using it is to get a copy of the record, remove the record from the subfile, delete the VERSION-STR structure from the copy, and then add the record back into the subfile again.
Users of a system subfile with secure-switch 4 set (only people involved with SPIRES system administration) may also eliminate the VERSION-STR structure of records directly, simply by transferring them, deleting the VERSION-STR lines, and updating them.
In SPIRES, the following kinds of data are or can be compiled:
- file definitions, stored in the FILEDEF subfile
- format definitions, stored in the FORMATS subfile
- file record-type definitions, stored in the RECDEF subfile
- protocols, whose control information is stored in the SYS PROTO subfile
- variable group definitions, stored in the VGROUPS subfile
Whenever one of these kinds of data is compiled, SPIRES stores the compiled object code. For file definitions, the compiled characteristics are stored in an ORVYL file on the file-owner's account, called "filename.MSTR". For all other object code, SPIRES creates a record in another SPIRES system file. Likewise, when a user issues a STORE STATIC command, SPIRES creates a record in a SPIRES system file.
To allow you to manage (i.e., remove) records created in these other SPIRES system files by the COMPILE and STORE STATIC commands, a set of ZAP commands is available in SPIRES to delete these records, and optionally to delete the source records from which they were derived.
The general form of these commands is:
ZAP source-subfile source-record-key [SOURCE]
where "source-subfile" is: FORMATS, RECDEF, SYS PROTO or VGROUPS. (Note: the source-subfile names can be abbreviated to three or more characters.) The "source-record-key" is the key of the record given in the COMPILE command; as in these commands, the key need not be fully qualified by the user's account number, since the user can only ZAP object-code records defined by the logged-on account. A ZAP STATIC command is available to remove data records for stored variables.
To use any of these ZAP commands, the source-record must be stored in the appropriate SPIRES system file. That is, to ZAP the compiled code for a format, the definition for that format must be in the public FORMATS subfile. If the source record is not available, then more manual methods of object-code management must be used. Contact the SPIRES consultant for more information.
The following sections describe most of the various ZAP commands available in more detail.
To remove object code generated when a format is compiled, the following ZAP command is available in SPIRES:
ZAP FORMAT[S] source-record-key [SOURCE]
The source-record-key is the value of the ID statement in the format, which is the key of the FORMATS subfile record. The source-record-key need not be prefixed by the logged-on user's account number. If the SOURCE option is used, then the source record is removed from the FORMATS subfile.
For example:
    ZAP FORMAT XA.G01.DISPLAY
    ZAP FORMAT DISPLAY SOURCE
    ZAP FORMATS DISPLAY
You can zap the object code of a format only when you own the format; whether or not you own the file to which the format applies is immaterial. The file owner, therefore, can zap only the formats he or she created for the file.
To remove object code generated when a record definition is compiled from the RECDEF subfile, the following ZAP command is available in SPIRES:
ZAP RECDEF source-record-key [SOURCE]
The source-record-key is the value of the ID statement in the record definition, which is the key of the RECDEF subfile record. The source-record-key need not be prefixed by the logged-on user's account number. If the SOURCE option is used, then the source record is removed from the RECDEF subfile.
For example:
    ZAP RECDEF GC.JCB.SIMPLE.INDEX
    ZAP RECDEF SIMPLE.INDEX SOURCE
    ZAP REC SIMPLE.INDEX
To remove object code generated when a protocol is compiled using the older SYS PROTO method of compilation, the following ZAP command is available in SPIRES:
ZAP {SYS PROTO|SYSPROTO} control-record-key [SOURCE]
The control-record-key is the value of the ID statement in the protocol's compiler-control definition, which is the key of the SYS PROTO subfile record. The control-record-key need not be prefixed by the logged-on user's account number. If the SOURCE option is used, then the control record is removed from the SYS PROTO subfile.
For example:
    ZAP SYS PROTO AC.DBA.FULLFACE.PROTOCOLS
    ZAP SYSPROTO FULLFACE.PROTOCOLS SOURCE
    ZAP PROTOCOL FULLFACE.PROTOCOLS
If the protocol is compiled directly from the source subfile (the newer method of compiling protocols), you should use the ZAP PROTOCOL command:
ZAP PROTOCOL protocol-name [SOURCE] OF protocol-subfile
To remove object code generated when a variable group definition is compiled from the VGROUPS subfile, the following ZAP command is available in SPIRES:
ZAP VGROUP[S] source-record-key [SOURCE]
The source-record-key is the value of the VGROUP statement in the variable-group definition, which is the key of the VGROUPS subfile record. The source-record-key need not be prefixed by the logged-on user's account number. If the SOURCE option is used, then the source record is removed from the VGROUPS subfile.
For example:
    ZAP VGROUP BG.VLJ.GLOBAL.VARIABLES
    ZAP VGROUP GLOBAL.VARIABLES SOURCE
    ZAP VGROUPS GLOBAL.VARIABLES
This command removes a particular set of stored static variables created by the STORE STATIC command, or it can remove all sets of stored static variables for a particular variable group.
Three separate command forms are allowed:
    ZAP STATIC storage-record-name OF vgroup-name
    ZAP STATIC vgroup-name ALL
    ZAP STATIC * ALL
The first form removes a particular stored set of static variables; "storage-record-name" is the name given to the stored vgroup when the "STORE STATIC vgroup-name TO storage-record-name" command is issued. The second removes all stored static groups belonging to you for a particular vgroup. The third form eliminates all stored static groups belonging to you for any vgroups.
The ZAP STATIC commands will remove stored sets of static variables only if they belong to you, regardless of whose vgroup they apply to.
SPIRES offers great flexibility with files containing multiply defined element mnemonics -- that is, two or more elements with the same name occurring in a file definition. For example, the element COMMENTS might occur in several different structures in a file definition. Also, "floating structures" [See "SPIRES File Definition", section B.3.6.] cause multiply defined element mnemonics.
The SET ELEMENTS, TYPE, ALSO and SEQUENCE commands allow specification of elements either by a simple element mnemonic or by a "structure@...@element" form (see examples below) which specifies a structural path to the desired element. Note that this second form is needed only if the element mnemonic is not unique in the file definition, i.e., the same name is used for several different elements.
If you use any of these commands with a simple element mnemonic when that element is multiply defined, not all of the occurrences of that element throughout a record are examined or displayed. Think of record-level elements as being at the highest level, elements in a record-level structure at the next lower level, and so on down to the elements of the innermost-nested structures at the lowest level. With a simple mnemonic, SPIRES locates only the first appearance of the mnemonic at the highest level at which it is found in the file definition. The "structural path" form, by contrast, can locate any specific appearance of the mnemonic.
The SET ELEMENTS and TYPE commands also allow the "@element" form, which means "all elements having the specified mnemonic". Again, this form is only needed if the same element mnemonic occurs more than once in the file definition.
The effects of these features are best explained with examples. You will note that different aliases for the same mnemonic can be used for clarity. The following is a skeletal part of a file definition, followed by three records from the file:
     1. RECORD-NAME = ID;
     2. SLOT;
     3. OPTIONAL;
     4. ELEM = X;
     5. ALIAS = XR;
     6. ELEM = S;
     7. TYPE = STR;
     8. STRUCTURE = S;
     9. REQUIRED;
    10. ELEM = X;
    11. ALIAS = XS;
Note that element X appears twice in the definition -- once at the record level (alias "XR") and once within a structure (alias "XS").
    ID = 1;        ID = 2;        ID = 3;
    X = Afford;    S;             X = Change;
    S;             X = Before;
    X = Delete;
The command "SEQUENCE X" would arrange these records in the order 2,1,3. SPIRES examines values of X only at the highest level at which the element occurs in any record -- in this case, the record level. Since null values are listed first, record 2, with no value for X at the record level, comes first, followed by records 1 (X = Afford;) and 3 (X = Change;). The command "SEQUENCE XR" would give the same sequence.
However, you can type "SEQUENCE S@X" and the arrangement becomes 3,2,1. SPIRES looks for occurrences of X specifically in the structure S. Thus, record 3 with no occurrences of the element at that level comes first, then records 2 (X = Before;) and 1 (X = Delete;). "SEQUENCE XS" would also have the same result.
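The sequencing behavior can be modeled as sorting on the element's value at the chosen level, with records lacking a value at that level sorting first. The following Python sketch is an illustration of that behavior only, using an invented dictionary layout for the three example records:

```python
# The three example records; the "S" key models the S structure.
records = [
    {"ID": "1", "X": "Afford", "S": {"X": "Delete"}},
    {"ID": "2",                "S": {"X": "Before"}},
    {"ID": "3", "X": "Change", "S": {}},
]

def sequence(recs, path):
    """Sort records on an element value; null values sort first.

    path is ("X",) for the record level, or ("S", "X") for S@X.
    """
    def key(rec):
        value = rec
        for part in path:
            value = value.get(part) if isinstance(value, dict) else None
            if value is None:
                return (0, "")          # null values sort first
        return (1, value)
    return [r["ID"] for r in sorted(recs, key=key)]
```

With this model, `sequence(records, ("X",))` yields the order 2,1,3 and `sequence(records, ("S", "X"))` yields 3,2,1, matching the SEQUENCE examples above.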
If you have a search result containing these three records, and you issue the command "ALSO X STRING FOR", only record 1 will be retained. SPIRES examines only the occurrences of the X element at the highest level the element can appear -- the record level, in this case. Thus, only the "XR" occurrences are examined. ("ALSO XR STRING FOR" would give the same result.)
"ALSO S@X STRING FOR" retains only record 2. ("ALSO XS STRING FOR" would also give the same result.)
The SET ELEMENTS and TYPE commands are affected similarly to the ALSO command, though they both have an additional feature, discussed below. For example, "SET ELEMENT X" will cause commands that display the records to display only the "XR" values -- that is, occurrences of the first element X found at the highest possible level, which in this case is the record level.
"SET ELEMENT S@X" will set the element with the "XS" alias -- that is, all occurrences of the element X within the "S" structure.
"SET ELEMENTS @X" is the additional feature: all occurrences of the element X at any level in a record will be "set".
The values for the element list on a TYPE command can also use the "structure@...@element" or "@element" options described here, or they can use neither option, with the same results as described above for the SET ELEMENTS command.
Remember that this information is only relevant to files with multiply defined element mnemonics, which are relatively rare.
Each SPIRES subfile has a special element, the "-" (hyphen) element, that can be used during data entry for comments. When a record containing occurrences of this element is added to the subfile, the values for the "-" element are thrown away; they are not stored within the record. But why would anyone want to type comments that are thrown away on input?
Some SPIRES users find that they update the same large record or records over and over again. Each time they update the record, however, they may not remember why a particular element or occurrence of an element has a particular value. What these users do is to insert these "throw-away" comments throughout their input data and then save the input data separately from the subfile before adding the record to the subfile. The extra saved copy becomes the master copy of the record; when the record needs to be updated, the master is changed, saved again, and then used to replace the record in the subfile. When the record is added or updated in the subfile, these extraneous comments that are saved in the master copy somewhere else are not stored in the subfile copy.
Here is how some input using this element might look:
    NAME = John Klemm;
    ADDRESS = 188H Pine Hall;
    - This address may change at the end of the year.;
    CITY = Stanford;
    STATE = CA;
    PHONE = 497-4420;
    AREA-CODE = 415;
    - will probably keep this phone number after move.;
As you can see, the "-" element can appear anywhere and any number of times in the input data, when you are using the standard SPIRES format. Of course, the value must end with a semicolon.
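Maintaining a commented master copy outside the subfile amounts to a simple filtering step: every "-" element line is dropped before (or, conceptually, upon) input. This Python sketch is only an illustration of that idea (SPIRES does the discarding itself on input), and it assumes the simple case of one statement per line:

```python
def strip_comments(master_lines):
    """Drop the throw-away "-" element lines from a master copy.

    Assumes one "ELEMENT = value;" statement per line, as in the
    address example above.
    """
    return [line for line in master_lines
            if not line.lstrip().startswith("- ")]
```

Applied to the example record above, the two "-" lines disappear and the remaining statements are exactly what would be stored in the subfile.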
Declare data, the commands and metadata that define declared elements and output control packets, is normally found in protocols. However, in situations where you'd like to use the same declare data in multiple protocols or repeatedly in command mode, you can store the declare data in a separate subfile and execute the declaration from there. The subfile may be one of your own making, but system subfiles are also available in which you may store your declare data.
The first section of this chapter describes how to set up a subfile for declare data. [See 6.1.] The following one describes how to use your stored declare data. [See 6.2.]
You can either create your own subfiles in which to store declare data or, if the declare data is for declared elements or for output control packets, use one of two system subfiles.
Two system subfiles already exist for declare data; anyone can put declare data into them:
    DATA MOVE DECLARES     - for declared element data
    DATA OUTPUT CONTROL    - for output control packets
Additionally, anyone can refer to those records, although only the record owner, plus anyone to whom the record owner gives METACCT access, can update or remove them. If privacy of the declare data itself is an issue for you, you should create your own subfile(s) to hold it, as described below.
For each declared element or each output control package you want to store in one of the above subfiles, you just add an ID statement at the top and add it into the appropriate subfile:
    DATA MOVE DECLARES subfile      DATA OUTPUT CONTROL subfile

    ID = gg.uuu.name;               ID = gg.uuu.name;
    [declare element                COMMENTS = optional comments;
     statements]                    PACKET = name;
    ...                             [output control statements]
                                    ...
                                    PACKET = name;
                                    [output control statements]
                                    ...
The next section describes the commands you issue to use these records when you need them. [See 6.2.]
You can create your own subfiles to hold your declare data, by defining the subfile's goal record-type with a DEFINED-BY statement that names a SPIRES system-supplied definition:
    for elements:          DEFINED-BY = $ELEMENT;
    for output control:    DEFINED-BY = $OUT.CONTROL;
As in the DATA MOVE DECLARES and DATA OUTPUT CONTROL records, the records you add to the subfiles need to have an ID key supplied, which you will use in the WITH DECLARE prefix of the commands you issue to use the stored declare data.
For more information about the specific form for all of these different records, see the documentation for each type, e.g., declared elements or output control. [See 7.1, 20.3.]
To use stored declared data, you need to:
- 1) through a path, select the subfile where it is stored
- 2) point SPIRES to that path (SET DECLARE PATH command)
- 3) issue the desired DECLARE command, using the WITH DECLARE prefix to identify the record holding the declared data you want to use
Here are the steps in detail:
The first step is to select, through a path, the subfile where the declare data is stored. Of course, this implies that you have first selected a primary subfile to use.
In this example, data defining a declared element is stored in the DATA MOVE DECLARES subfile:
    -> select almanac
    -> through 1, select data move declares
    -Path established: 1
Use the SET DECLARE PATH command:
SET DECLARE PATH {pathname|pathnum} FOR declare-type
where "declare-type" is one of:
    ELEMENT          - for declared elements
    OUTPUT CONTROL   - for output control
    TABLE            - for declared tables
    EXTERNAL DATA    - for external file processing
-> set declare path 1 for element
The WITH DECLARE prefix, used on the DECLARE command, provides the key of the record holding the stored declare data:
WITH DECLARE key-value DECLARE ...
Hence, the "key-value" is the key of a record within the named path for the particular type of declare data.
The rest of the command is the normal DECLARE command, like DECLARE ELEMENT or DECLARE OUTPUT CONTROL or DECLARE TABLE... You do not use the ENDDECLARE command, as you do when using the DECLARE command without the WITH DECLARE prefix, in a protocol.
    -> with declare $name.middle declare element midname for name
    -> set element name midname
    -> display 1
    NAME = Karl Friedrich Abel;
    MIDNAME = Friedrich;
In the example, a system-defined declared element definition called $NAME.MIDDLE is used to redefine the NAME element of the selected subfile. This technique could be used in any SPIRES subfile for any element that uses the standard name-handling processing rules, whenever you want to return just the middle parts of a name. There are also $NAME.FIRST and $NAME.LAST declared element definitions, among others, that could be used in the same way. See the keys in the DATA MOVE DECLARES subfile for a complete list of declared element definitions, and see the DATA OUTPUT CONTROL subfile and the TABLES subfile for the corresponding types of declared definitions.
A SPIRES feature called output control lets you produce multiple reports or output processes while examining a set of subfile goal records just one time. By running multiple reports simultaneously rather than in sequence, you can save significant amounts of I/O and hence, in many cases, run time.
Additionally, output control's capabilities provide new options for creating output. For example, you can create output from multiple formats with the data going into a single device area. You may also have different filtering criteria set for the different reports or output processes.
Output control is established through a DECLARE command, DECLARE OUTPUT CONTROL. Like other declare processes, output control may be established in two ways:
- in a protocol in which the output control declaration is defined; or
- either from command mode or in a protocol, when the output control declaration is a record in a declare data subfile.
The first section of this chapter describes the output control declaration statements; the second describes how to use output control using the WITH OUTPUT CONTROL prefix. [See 7.1, 7.2.]
Output control is defined by a collection of statements known as "an output control declaration". The output control declaration consists of one or more "output control packets", each of which describes a piece of the output processing to be done. If you were using output control to write three different reports, you would probably have three output control packets, one for each report format that needed to be set.
The heart of an output control declaration looks like this:
PACKET = 1st-packet-identifier;
output-control statements ...
PACKET = 2nd-packet-identifier;
output-control statements ...
Up to 36 packets may be defined in a single declaration. The output control statements are individually discussed below.
Each PACKET statement signals the start of another output control packet. The identifier value may be anything; there are no restrictions on it. The packets remain and hence are executed in the order in which they are defined; they are not sorted by the PACKET value.
If you are storing the declaration in a declare data subfile (probably the DATA OUTPUT CONTROL subfile), you need to add an ID statement at the top of the declaration:
ID = gg.uuu.name;
where "gg.uuu" is your account number and "name" can be any alphanumeric name (it may include periods as well). [See 6 for more information on storing output control declarations.]
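For example, a stored declaration under a hypothetical account GG.UUU, saved with the illustrative name MY.REPORTS, might begin like this sketch:

```
ID = GG.UUU.MY.REPORTS;
PACKET = 1;
output-control statements ...
```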
On the other hand, to define the output control declaration within a protocol, you need to surround it with the DECLARE OUTPUT CONTROL and ENDDECLARE commands:
DECLARE OUTPUT CONTROL
PACKET = 1st-packet-identifier;
output-control statements ...
ENDDECLARE
Below is a description of each of the possible output control statements in a packet, in the order in which they would be stored in a declare data subfile. (The order in which you enter them is irrelevant, aside from the placement of the PACKET statements, described above.) Each of the statements is optional, though the occurrence or value of one statement might cause another one to be required.
First, here's a summary list of all the output control statements:
PACKET = packet-identifier;
FORMAT = format-name;
USING.FRAME = frame-name;
PARMS = format-parameters;
AREA = area-name;
OUTPUT.OPTIONS = option1, option2...;
TRACE;
WHERE = clause;
FILTER = FOR element WHERE clause;
FOR.EACH = elem1, elem2, ...;
PLUS.ELEM = elem1, elem2, ...;
GEN.SET;
CONTROL.OPTIONS = option1, ...;
BYPASS.LIST = elem1, elem2, ...;
EXCEPTION.FILE = orvyl.file.name;
TABLE.NAME = table-name;
TABLE.WHERE = clause;
The FORMAT statement names the format that will be in control during the output for this packet. The format is set at the time the DECLARE statement is executed; startup frames are executed as normal. However, unless the startup frame does something to call attention to itself (like allocating a global vgroup in shared mode; see CONTROL.OPTIONS below), the format, including the setting of it, is invisible outside of the declaration.
If no FORMAT statement appears, the format that is set at the time the output command (that is, the command with the WITH OUTPUT CONTROL prefix) is issued will be used. To explicitly request the standard SPIRES format, issue a CLEAR FORMAT command before issuing the output command. Note that vgroup sharing (one of the CONTROL.OPTIONS below) is automatically in effect for a packet with no FORMAT statement.
You may also specify a system format like $REPORT or $PROMPT. However, the options that you would specify on the SET FORMAT command after the format name may not be specified here; you must enter them in PARMS statements (see below).
You can use the USING.FRAME option to name a specific frame to execute within this output packet. The frame must also be defined with a USAGE of NAMED. See the Formats manual, section D.1.1.1 for more information; online, EXPLAIN USING FRAME COMMAND PREFIX.
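As a sketch, a packet that runs a specific named frame might look like the following; the format and frame names here are illustrative only, and the frame would need to be defined with a USAGE of NAMED:

```
PACKET = 1;
FORMAT = My.Format;
USING.FRAME = My.Named.Frame;
```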
The multiply occurring PARMS statements supply the parameters you would normally specify on the SET FORMAT (following the format name) and SET FORMAT * commands to set the format from command mode. For example, if you would issue these commands to set the format:
-> set format $report Dept Name EMail
-> set format * grouped by Dept
then in output control, your packet would include:
FORMAT = $REPORT;
PARMS = Dept Name Email;
PARMS = grouped by Dept;
In the AREA statement, you name the device services area to serve as the destination for this packet's output. In most cases, you would define and assign the area(s) to be used prior to the output control declaration (see the example below).
If you omit this option, the output is sent to the terminal.
An interesting twist for output control is to use the Subfile (SBF) area to use the output data directly as input for records into another subfile. EXPLAIN SBF AREA for more information on this technique. Note that exception file processing is available if needed; see the EXCEPTION.FILE statement, described below.
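For instance, directing one packet's output into another subfile might look like the following sketch; it assumes the target subfile has been selected on path 2, and the area and format names are illustrative only:

```
-> define area newrecs (1,80) on sbf
-> assign area newrecs to subfile path 2
-> declare output control
PACKET = 1;
FORMAT = Make.Input;
AREA = NEWRECS;
ENDDECLARE
```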
There are two ways to use the active file. If you are directing the output of only one packet to the active file, you can specify ACTIVE as the area. (In fact, you may specify ACTIVE for multiple packets to interleave the output from the different packets.)
However, if you don't want interleaving, use the second technique: define an area on the FILE area, and assign it to an active file; then name that area in the AREA statement. To define several areas on the active file, assign them to different active files (using the "new" option):
-> define area active1 on file
-> define area active2 on file
-> assign area active1 to active file new
-> assign area active2 to active file new
-> declare output control
PACKET = 1;
AREA = active1;
...
PACKET = 2;
AREA = active2;
...
Use WYLBUR's SHOW ACTIVES ALL command to help you locate the output:
-> show actives all
   1  ACTIVE   Area ACTIVE1    6 lines
=> 2  ACTIVE   Area ACTIVE2   11 lines
->
In the OUTPUT.OPTIONS statement, you can specify one or two of these traditional options:
- CLEAR or CONTINUE (or APPEND) -- this option tells SPIRES either to clear the device area (see above) prior to putting output in it, or to continue (append) the output there without clearing it first. If neither is specified, and the output is directed to the non-empty current active file, SPIRES will ask "OK to clear?" when the output command is issued.
- CLEAN -- this option tells SPIRES to suppress the separating lines that normally appear when multiple records are displayed in non-report mode.
- REPORT -- this option sets report mode for the output of the packet.
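For example, a packet that clears its destination area first and produces report-mode output could include the following sketch (the area name is illustrative):

```
PACKET = 1;
AREA = MYAREA;
OUTPUT.OPTIONS = CLEAR, REPORT;
```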
The TRACE statement, which takes no value, turns on format tracing (SET FTRACE ALL) for the packet. If tracing is requested in more than one packet, the trace data will be interleaved. You may use the SET TLOG command to send the trace data to a log file rather than your terminal.
The WHERE statement lets you specify a WHERE clause; only records that pass the WHERE clause criteria will then be processed by output control for this packet. This is useful when you need different subsets of records to be processed by different packets. Remember that the entire set of records being processed by all the packets in the output control declaration can also be limited by a Global FOR command with a WHERE clause.
FOR SUBFILE WHERE PURCHASE.DATE = 1996
DECLARE OUTPUT CONTROL
PACKET = January;
WHERE = "PURCHASE.DATE = 1/1996";
...
PACKET = February;
WHERE = "PURCHASE.DATE = 2/1996";
...
ENDDECLARE
WITH OUTPUT CONTROL DISPLAY ALL
In this example, output control will process all 1996 records, but only the January 1996 records will be processed by the January packet, etc.
Since you rarely write a WHERE clause as an element value, be careful to follow data entry rules for the standard SPIRES format when you add one to an output control packet.
With the multiply occurring FILTER statement, you can specify overlay filters for the packet that will augment any filters already set globally. Since the only type of overlay filter allowed in this context is a display filter, which is the default, you need not specify the filter type. [See 21.1.1.]
In the syntax, "element" is the name of the element being filtered; any element can be filtered, including a virtual or dynamic element. [See 20.] The element to be filtered can also be a structure, which is quite common.
"WHERE clause" is a clause following the same rules as a WHERE clause in Global FOR. [See the SPIRES Global FOR manual for more information on WHERE clauses.] Among other uses, the "WHERE clause" option lets you filter an element's occurrences according to each occurrence's value.
Note that the "(occ)", SEQUENCE and "IN limit" options of the SET FILTER command are not available for overlay filters. If you need to use them, they must appear on the first SET FILTER command for the element, which for output control would have to be issued before the output control declaration and which would thus apply to all the output control packets.
If the format in the output control packet contains SET FILTER Uprocs, they are executed after the filters specified here in the FILTER statement.
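As an illustration, using hypothetical element and value names, a packet could filter a multiply occurring AUTHOR element so that only the occurrences passing the clause are displayed:

```
PACKET = 1;
FILTER = FOR AUTHOR WHERE AUTHOR = Smith;
```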
The FOR.EACH statement sets up output control's equivalent to the DEFINE DISPLAY SET command; the elements listed here have the same effect as the DEFINE DISPLAY SET command's element list. Basically, display sets create the effect of multiple records from one record when the named elements occur multiple times within it. [See 1.9.] This is typically used to create tabular output from hierarchical records.
The DEFINE DISPLAY SET command that is issued for you behind the scenes looks like this:
DEFINE DISPLAY SET TV=ALL, EXTERNAL, SCAN, ELEM = elem1, elem2, ...
That means, in other words, that the FOR.EACH option has these effects:
- TV=ALL -- all occurrences of all the elements in the element list will be used to generate occurrences of the record. You can figure for any given record that the product of the number of occurrences (or 1, whichever is greater) of each element in the list will be the number of records created. For example, if you name elements A, B and C in the list, and in one record, element A occurs 2 times, B 3 times and C no times, then the number of records generated for that record will be 2 times 3 times 1, i.e. 6 records generated.
- EXTERNAL -- all occurrences of the named elements will be generated for processing in the external (i.e., post-Outproc) form. This is important to know if you are processing records through a format -- the element values processed in the format will start out in their external form, not their internal form as normal. You will not have access to the internal form -- even a function like $GETIVAL will retrieve the external form of the element.
- SCAN -- If any filters have been set either outside and ahead of the output control declaration, or have been set within the packet in FILTER statements, they will be applied as SPIRES determines what occurrences of the elements to use in the display.
Important: When the FOR.EACH option is used, only elements listed here and in the PLUS.ELEM list below are available for direct output in the packet. (But elements that are used in the creation of the elements in either list, e.g., in a virtual element's Userproc, do not need to be included in the list, unless you specifically need to use them in the packet as well.)
As noted just above, if you use the FOR.EACH statement, you may need to use the PLUS.ELEM statement too. Elements listed here do not create more tabular occurrences of a goal record; they merely tag along in the "set". Only elements listed in either list are available to the output control packet when the FOR.EACH statement is used (with the exception noted in the previous paragraph).
For example, if you have a FAMILY record, with three occurrences of the PERSON structure:
FAMILY = Tucket;
COUNTRY = United States;
STATE = Rhode Island;
PERSON;
  NAME = Paw;
PERSON;
  NAME = Maw;
PERSON;
  NAME = Nan;
You might code the following if you wanted each PERSON to cause a separate occurrence of the record in the output and want the output to include the FAMILY and STATE elements too:
FOR.EACH = NAME; PLUS.ELEM = FAMILY, STATE;
In standard format, the output created from the sample record would thus be:
****
NAME = Paw;
FAMILY = Tucket;
STATE = Rhode Island;
;
****
NAME = Maw;
FAMILY = Tucket;
STATE = Rhode Island;
;
****
NAME = Nan;
FAMILY = Tucket;
STATE = Rhode Island;
;
Note that the COUNTRY element is not included, since it was not named in either list. [See 1.8.1 for an explanation of direct lists, the direct set equivalent of PLUS.ELEM lists.]
Besides using the single pass through the records to generate multiple types of output, you may also use it to generate a set, which might be useful if you need to sort the same group of records in a different way, or if you just need to save the set of records gathered here so that you can process them again in the future.
To do that, you must issue a DEFINE SET command outside and ahead of the output control declaration. Inside the output control declaration, you include an output control packet that includes GEN.SET and, optionally, a WHERE and/or multiple FILTER statements.
Structurally, adding set generation to output control in a protocol looks like this:
FOR class ...        <- global FOR command to establish records to be
                        processed under output control as well as to go
                        into the set
DEFINE SET xxx ...
DECLARE OUTPUT CONTROL
PACKET = PACKET01;
...
PACKET = PACKET02;
...
PACKET = PACKET03;
GEN.SET;
WHERE = where-clause;
FILTER = filter-clause;
ENDDECLARE
WITH OUTPUT CONTROL DISPLAY ALL
The result: besides creating the output defined in the first two packets, SPIRES also generates the set named "xxx".
You can include an additional WHERE clause in the GEN.SET output control packet, which has the effect of further restricting the records processed by the packet. Additionally, you can include one or more FILTER statements, which are treated as overlay filters of type SCAN, meaning that they will affect the set entries that are created. [See 21.1.] Aside from WHERE and FILTER statements, any other statements in a packet containing GEN.SET are ignored; so be sure to treat set generation as a separate output packet from any others.
You can request one or both of these options by coding the CONTROL.OPTIONS statement:
- SHARE.VGROUPS -- This option should be specified if you want any vgroups allocated by the format in this packet to be shared by other formats in other packets (they too must have this statement) or by the calling protocol. Hidden vgroups may not be shared.
- GENERATE.CHANGES -- This option requests information about changes between the tree and defq copies of a record. It is described elsewhere in this manual; online, EXPLAIN CHANGE GENERATION. [See 9.]
Used in conjunction with the GENERATE.CHANGES control option, the BYPASS.LIST statement names elements whose values should not be compared between the tree and defq copies of a record. In other words, even if the values of these elements change between the two copies of the record, they will not generate change information. [See 9.]
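Sketched together, a packet requesting change generation while ignoring changes to bookkeeping elements might contain the following (the element names are illustrative only):

```
PACKET = 1;
CONTROL.OPTIONS = GENERATE.CHANGES;
BYPASS.LIST = MODDATE, MODTIME;
```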
If you use the SBF (SuBFile) area (see the AREA statement above) to direct output data into a subfile as input, you can code the EXCEPTION.FILE statement to request exception file processing. For more information about exception files, EXPLAIN EXCEPTION FILE PROCEDURE.
Add the REPLACE option to request that SPIRES replace the data set, if it already exists, without asking you for permission to do so.
If several packets write to separate SBF areas, you must specify a separate exception file for each packet.
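For instance, a packet writing to an SBF area might name an exception file like this sketch (the area and file names are illustrative; the REPLACE option, if wanted, would be added to the statement):

```
PACKET = 2;
AREA = NEWRECS;
EXCEPTION.FILE = Bad.Records;
```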
Output control can invoke tables that have been pre-declared with the DECLARE TABLE command. [See 17, 17.2.] This tool lets you, in effect, re-map a SPIRES subfile into one or more flat, relational tables. In the TABLE.NAME statement, you name the pre-declared table you want to use.
Note that if you use the $REPORT or $PROMPT format, the elements you would name in the PARMS statement would be "table" elements (that is, elements defined in the table declaration), not the primary subfile's elements. Note too that other set-related statements, such as FOR.EACH and PLUS.ELEM, are not used in the same packet as TABLE.NAME because they are relevant to normal sets, not sets that are generated via tables.
You add the TABLE.WHERE statement if you want to filter the entries in the table with where-clause criteria. This is equivalent to adding the "SET FILTER FOR * WHERE clause" command in the table path prior to generating the table set. [See 17.2.] It limits the "row" output to only those rows that match the criteria expressed in the where clause. The elements named in the clause should be elements defined in the table declaration.
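Putting the two statements together, a packet built on a hypothetical pre-declared table might look like the following sketch; the table name, the element names in PARMS, and the clause are all illustrative only:

```
PACKET = 1;
FORMAT = $REPORT;
PARMS = Name BirthYear;
TABLE.NAME = PEOPLE.TABLE;
TABLE.WHERE = "BirthYear = 1900";
```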
Once the output control definition has been declared, you request output control processing by adding the WITH OUTPUT CONTROL prefix to an output command, either TYPE or DISPLAY.
WITH OUTPUT CONTROL TYPE or WITH OUTPUT CONTROL DISPLAY ...
Note that with this prefix you cannot also use the IN ACTIVE prefix to direct output that would normally go to the terminal to the active file instead. If you want output from output control to go to the active file, you must specify that within the output control declaration, as described in the previous section. [See 7.1.]
You can issue the CLEAR OUTPUT CONTROL command to cancel output control for the selected subfile. A new DECLARE OUTPUT CONTROL command will replace any output control declaration already in effect (unless the new one fails, in which case the previous declaration remains in effect).
Here is an example demonstrating how to use output control. On a daily basis, you use a subfile (OUR DEPARTMENT PEOPLE) to both create a report listing staff members and add any new staff members in that subfile into another subfile, called MY PHONE BOOK. A protocol to do that might include these commands:
SELECT OUR DEPARTMENT PEOPLE
- set up the subfile path for adding into PHONE BOOK
THROUGH 1, SELECT MY PHONE BOOK
DEFINE AREA PHONEBOOK (1,80) ON SBF
ASSIGN AREA PHONEBOOK TO SUBFILE PATH pathnum
- Now the output control declaration....
DECLARE OUTPUT CONTROL
PACKET = Daily Report;
  FORMAT = Daily.List;
  OUTPUT.OPTIONS = REPORT;
  AREA = ACTIVE;
PACKET = Add New Staff;
  FORMAT = Add.From.ODP;
  AREA = PHONEBOOK;
  WHERE = Add.Date = Yesterday;
  EXCEPTION.FILE = Bad.Phone.Records;
ENDDECLARE
- Set up the records to process and process them
FOR INDEX NAME
WITH OUTPUT CONTROL DISPLAY ALL
MAIL TO GQ.JNK TITLE "Staff's Daily List"
CLEAR OUTPUT CONTROL
In the first output control packet, the Daily.List report is set, with its output directed to the active file; after all the processing is over, the report is mailed to GQ.JNK. In the second packet, a WHERE clause restricts the records processed to just those that were "added yesterday". The Add.From.ODP format transforms a couple of elements from the OUR DEPARTMENT PEOPLE records into the input for a MY PHONE BOOK record. The PHONEBOOK area, into which that data is directed, is defined on the SBF (SuBFile) area, which in turn passes the output to MY PHONE BOOK for input.
Note that the protocol uses the FOR INDEX command to take advantage of the natural Name ordering of the records in the NAME index, thereby establishing the processing order of the records.
DATA MOVE is the name given to a process whose task is to "move" data from a SPIRES "Source" hierarchical data base (Subfile) to one or more "Target" device areas. Typically, this is a process of creating external tables or other subfiles that have a table nature.
The DATA MOVE process is controlled by a SPIRES command called PERFORM DATA MOVE. The PERFORM DATA MOVE command is in turn under the control of information supplied by a SPIRES meta data record stored in the SPIRES Data Move subfile. [EXPLAIN PERFORM DATA MOVE COMMAND.]
Internally, DATA MOVE utilizes the OUTPUT CONTROL process to direct the movement of data from the Source subfile to multiple Target areas. These target areas may be any SPIRES Output Device Areas -- the ACTIVE file, OS or ORVYL files, or other SPIRES subfiles (SBF devices). The DATA MOVE meta data records provide the information needed by SPIRES to build an Output Control Declaration structure and to issue the command under Global FOR processing that will output the source information to the one or more target areas. [See 7, 7.2.]
DATA MOVE Subfile Information
Those familiar with SPIRES terminology and SPIRES tools will see and understand the DATA MOVE process most easily by studying the data values stored in DATA MOVE subfile records. The metadata record consists mainly of three separate structures that are used to control data movement and transformation. These structures (SOURCE.INFO, TARGET.INFO, and TARGET.AREAS) are described in detail below.
You should note that source data value descriptions and target data values are optional and may not even exist in some DATA MOVE records. In this case the source/target information is described by individual TABLE declaration records defined in a separate TABLE subfile. [See 17.1.] You can see an example of the setup and generation of Declared Tables and how they may be used by DATA MOVE. [See 7.5.]
RECORD Level Elements of the DATA MOVE Subfile
These values form the SPIRES information used by any system supplied meta-data record structure.
SOURCE.INFO Structure Elements
At this point you have a choice of doing Subfile output or Table output. Choose one of the following paths:
- Subfile output. [See 7.3.1.]
- Table output. [See 7.3.2.]
Continuation of DATA MOVE processing. [See 7.3.]
For Subfile output, the SOURCE.INFO structure in DATA MOVE may contain:
SOURCE.VALUES Structure Elements
- A source subfile data element name.
- A source subfile phantom data element name.
- A dynamic element name (See DECLARE.KEY below).
- A String value or "Literal" (See SOURCE.TYPE below).
TARGET.AREAS Structure Elements
- An OS File Name would have the form: WYL.gg.uuu.name.
- An ORVYL File Name might be ORV.gg.uuu.name.
TARGET.INFO for Target Subfile output
The following information is given for each TARGET Subfile:
- SOURCE.WHERE = University = Stanford;
- Note: the Global FOR "class" is provided during execution of the PERFORM DATA MOVE command as an input parameter. [EXPLAIN PERFORM DATA MOVE.]
- SOURCE.FILTER = FOR element.name WHERE where.clause;
TARGET.VALUES Structure Elements
OUTPUT.CONTROL Packets
Continuation of DATA MOVE processing. [See 7.3.]
For Table output, the SOURCE.INFO structure in DATA MOVE may contain:
TARGET.AREAS Structure Elements
- An OS File Name would have the form: WYL.gg.uuu.name.
- An ORVYL File Name might be ORV.gg.uuu.name.
TARGET.INFO Structure Elements in DATA MOVE
TARGET.INFO for Table output
- SOURCE.WHERE = University = Stanford;
- Note: the Global FOR "class" is provided during execution of the PERFORM DATA MOVE command as an input parameter. [EXPLAIN PERFORM DATA MOVE.]
- SOURCE.FILTER = FOR element.name WHERE where.clause;
"table.options" consist of the following key words and their options:
- TABLE.TYPE=DELIM|FIXED|BTF|SQL
- DELIM is variable column width separated by a delimiter character (tab is the default delimiter). The first row contains the table name (or title), the second row contains the field names, and successive rows contain data. This is the default type.
- FIXED is fixed column, non-delimited. Column size is derived by the WIDTH "as element parm", or by $Report using eleminfo or, in the absence of both, dividing the fields evenly across the available space. The first row contains field names (without underscores), successive rows contain data.
- BTF is Wylbur's Basic Table Format used in the Forsythe RPC Server. It includes the 100, 101, 103, and 109 lines.
- SQL means SPIRES will generate a "table" of SQL commands, either INSERTs or DELETEs, depending on whether SQL is followed by ADD or REM, as in "TYPE=SQL,ADD". See the information on TYPE=SQL at the end of this section.
- HEADING=Y|N -- This option controls the field names occurring in the first row for TYPE=DELIM or FIXED. HEADING is the default. This option has no effect when type is BTF. Field names are derived from the "Rdbms_Column" ElemInfo value.
- DELIMITER='character' or X'character'. This option changes the default delimiter in types DELIM and BTF. It adds a delimiter character to type FIXED. "X" indicates hex.
- TITLE='string'. This option is used as the name of the table in the first line. If this option is not given, the table "name" is used. If TYPE=SQL, the TITLE value is used as the SQL table name in the INSERT or DELETE statements created; again, if no TITLE option is specified, the table "name" is used.
- EOR='character' or X'character'. This option establishes an end-of-record character to be printed at the end of each row of the table. For style BTF, the EOR character is ";". For other styles, the default is null. "X" indicates hex.
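To make the default style concrete, here is a sketch of a small TYPE=DELIM table built from illustrative genealogy data (the tab delimiters between the columns are shown here as aligned whitespace; the table name, field names, and values are hypothetical):

```
GENERATIONS
id      name
1       Abel, Karl Friedrich
2       Abel, Leopold August
```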
OUTPUT.CONTROL Packets
DATA MOVE and DECLARE TABLE Source Values may be generated through Dynamic Elements. A convenient way to represent the dynamic element process is to use pre-defined dynamic element structures.
A DECLARE ELEMENT Subfile may be any SPIRES Subfile that is defined by the system $ELEMENT record definition in the RECDEF subfile. A subfile that is defined by $ELEMENT accepts the same data values as DECLAREd dynamic elements. [See 20.3.]
The DECLARE ELEMENT Subfile is referenced by the DECLARE.SUBFILE term in DATA MOVE and DECLARE TABLE processes.
DATA MOVE DECLARES -- The SPIRES System Declared Elements
A number of Declared Dynamic elements are currently available in the DATA MOVE DECLARES subfile. These have proven useful in data transformations from SPIRES to RDBMS tables. As time goes on we hope to build up this library. You reference this library by including:
DECLARE.SUBFILE = Data Move Declares;
Any one of the following system values may be given in DECLARE.KEY.
There are several other records in the DATA MOVE DECLARES subfile which you can view by selecting that subfile, browsing the goal records, and displaying records of interest.
DECLARE ELEMENT Subfile record example
The $NAME.FIRST process is coded as follows:
ID = $NAME.FIRST;
OUTPROC = $Call(First.Name);
USERDEFS;
USERPROC = FIRST.NAME;
UPROC = Set Val = $Parse($Rsub($Val,', '),', ',);
UPROC = return;
As you can see, the form of the internal data is the same as you would code for a DECLAREd dynamic element.
If you have coded the following SOURCE.VALUES structure:
SOURCE.VALUE = FirstName;
DECLARE.KEY = $NAME.FIRST;
DECLARE.FOR = Name;
DATA MOVE generates the following DECLAREd dynamic element.
Declare Element FirstName FOR Name
OUTPROC = $Call(First.Name);
USERDEFS;
USERPROC = FIRST.NAME;
UPROC = Set Val = $Parse($Rsub($Val,', '),', ',);
UPROC = return;
Enddeclare
This section gives you a sample SPIRES command stream intended to help you see and use the set of tools that have been implemented to let you generate Tables from SPIRES hierarchical data bases.
As an aid to your understanding, you should issue the EXPLAIN commands that are shown in the sample commands.
The following File definition shows how to build a database for your own DECLARE TABLE and DECLARE ELEMENT descriptions. Usr_Tables will be used to store TABLE declarations. [EXPLAIN DECLARE TABLE COMMAND.] Usr_Elements may be used to hold your own Declared Elements. [EXPLAIN DECLARE ELEMENT COMMAND.]
Replace GENERATIONS with your own subfile name in these examples. Also, replace GP.USR with your own account. Names beginning with * are resolved under your own account, so they do not need to be altered.
> select filedef
> display *usr_tables
FILE = GP.USR.USR_TABLES;
DEFDATE = MON. NOV. 7, 1994;
MODDATE = WED. JAN. 19, 2000;
MODTIME = 07:58:40;
BIN = PURGE;
RECORD-NAME = REC01;
  DEFINED-BY = $TABLE;
  REMOVED;
RECORD-NAME = REC02;
  COMBINE = REC01;
  DEFINED-BY = $ELEMENT;
  REMOVED;
SUBFILE-NAME = USR_TABLES;
  GOAL-RECORD = REC01;
  ACCOUNTS = GP.USR;
SUBFILE-NAME = USR_ELEMENTS;
  GOAL-RECORD = REC02;
  ACCOUNTS = GP.USR;
Our sample database will be one holding Genealogy records.
> select generations
> show element characteristics
Subfile GENERATIONS

Sec Occ  Len Type   St/El Element
--- ---- --- ------ ----- -------
Fix Sing   4 Int    00/00 key   ID
Req Sing     String 00/01       NAME
Opt Sing     String 00/02       CALLED
Opt Sing     String 00/03       SEX, S
Opt Mult   4 Int    00/04       PARENT, P
Opt Sing     Hex    00/05       BIRTH, BD, BIRTH.DATE
Opt Sing     String 00/06       BIRTH.PLACE, BP, PLACE
Opt Sing     String 00/07       BIRTH.COUNTRY
Opt Sing     String 00/08       BIRTH.HOME, BH
Opt Sing     Hex    00/09       DEATH, DD, DEATH.DATE
Opt Sing     String 00/0A       DEATH.PLACE, DP
Opt Sing     String 00/0B       DEATH.COUNTRY
Opt Mult     String 00/0C       NOTE, C
Opt Mult   4 Int    00/0D       CHILD, POINTER
Opt Mult     Struc  00/0E       MARRIAGE.STR
Fix Sing   4 Int    01/00 key   . SPOUSE
Opt Sing     Hex    01/01       . MARRIAGE.DATE, MD
Opt Sing     String 01/02       . MARRIAGE.PLACE, MP
Opt Sing     String 01/03       . MARRIAGE.COUNTRY
Opt Sing     String 01/04       . MARRIED.NAME, MN
Opt Sing   0 String 01/05       . DIVORCED, DV
Vir Sing   4 String 01/07       . SPOUSE.NAME
Vir Sing   4 Struc  01/08 phan  . SPOUSE.STR
Vir --       ---    --/--       . . (Record REC1)
Vir Sing     String 00/0F       NAME.FIRST
Vir Sing     String 00/10       NAME.LAST
Vir Mult   4 String 00/11       CHILD.FIRST.NAME
Vir Sing     String 00/12       CHILD.COUNT
Vir Mult   4 Struc  00/13 phan  PARENT.STR
Vir --       ---    --/--       . (Record REC1)
Vir Mult   4 Struc  00/14 phan  CHILD.STR
Vir --       ---    --/--       . (Record REC1)
Vir Sing     String 00/15       BIRTH.INFO
Vir Sing     String 00/16       DEATH.INFO
Vir Sing     String 00/17       SORT.NAME
It's a good idea to set up ELEMINFO data for each element, especially "Width" and "Input-Occ" for multiply occurring data elements. This information will be carried over to the tables that are generated.
> show elem info
Subfile GENERATIONS
Key ID -- Id
  Width: 4    Adjust: RIGHT   Indent:   Edit:
  Description: Record Id assigned as integer
Element NAME
  Width: 28   Adjust:         Indent:   Edit:
  Description: Person's name at birth
Element CALLED
  Width: 12   Adjust:         Indent:   Edit:   Input occ: 1
  Description: Called first name (optional)
Element SEX, S -- Sex
  Width: 3    Adjust: CENTER  Indent:   Edit:
  Description: Enter M or F
Element PARENT, P -- Parents
  Width: 9    Adjust: RIGHT   Indent:   Edit:   Input occ: 2
  Description: Parent record Id numbers
Element BIRTH, BD, BIRTH.DATE -- Birth Date
  Width: 12   Adjust: RIGHT   Indent:   Edit:
  Description: Birth date
Element BIRTH.PLACE, BP, PLACE
  Width: 24   Adjust:         Indent:   Edit:
  Description: Actual city and state of birth
Element BIRTH.COUNTRY
  Width: 16   Adjust:         Indent:   Edit:
  Description: Country of birth if not USA
Element BIRTH.HOME, BH
  Width: 20   Adjust:         Indent:   Edit:
  Description: Residence at birth (optional)
Element DEATH, DD, DEATH.DATE -- Death Date
  Width: 12   Adjust: RIGHT   Indent:   Edit:
  Description: Date of death
..............................
Here is how to generate TABLE records in your $Table subfile. [EXPLAIN PERFORM TABLE CREATE DECLARE.]
> perform table create declare subfile generations, type sybase, options mult, dest Usr_Tables
> select usr_tables
> show subfile transactions
01/27/2000      Transaction Log of File GP.USR.USR_TABLES for Record-type REC01
                File last processed: 01/20/2000 at 00:19:48

Date     Time     Account  Id  Type  Command  Grp  Key Value
01/27/00 09:54:48 GP.USR       UPD   Update        *NOTE
01/27/00 09:54:48 GP.USR       UPD   Update        *CHILD
01/27/00 09:54:48 GP.USR       UPD   Update        *MARRIAGE.STR
01/27/00 09:54:48 GP.USR       UPD   Update        *GENERATIONS
> display *generations
Note: Four of the generated column names have been removed from the following display.
ID = *GENERATIONS;
COMMENTS = Table Generation for Record Level Elements;
DEFDATE = THUR. JAN. 27, 2000; MODDATE = THUR. JAN. 27, 2000; MODTIME = 13:22:20;
SUBFILE.NAME = GENERATIONS;
DECLARE.SUBFILE = Data Move Declares;
COLNAME = ID; COMMENTS = Required; ISKEY; COLTYPE = INT; COLWIDTH = 4;
   RDBMS_COLUMN = id; RDBMS_DATATYPE = INT; RDBMS_DATALENGTH = 4;
COLNAME = NAME; COMMENTS = Required; COLWIDTH = 28;
   RDBMS_COLUMN = name; RDBMS_DATATYPE = VARCHAR; RDBMS_DATALENGTH = 28;
COLNAME = SEX; COLWIDTH = 3;
   RDBMS_COLUMN = sex; RDBMS_DATATYPE = VARCHAR; RDBMS_DATALENGTH = 1;
COLNAME = PARENT_1; COLTYPE = INT; COLWIDTH = 9; SOURCE.ELEM = PARENT;
   DECLARE.KEY = $VALUE.EXTERNAL; SOURCE.OCC = 1;
   RDBMS_COLUMN = parent_1; RDBMS_DATATYPE = INT; RDBMS_DATALENGTH = 4;
COLNAME = PARENT_2; COLTYPE = INT; COLWIDTH = 9; SOURCE.ELEM = PARENT;
   DECLARE.KEY = $VALUE.EXTERNAL; SOURCE.OCC = 2;
   RDBMS_COLUMN = parent_2; RDBMS_DATATYPE = INT; RDBMS_DATALENGTH = 4;
COLNAME = BIRTH; COLTYPE = DATE; COLWIDTH = 12; SOURCE.ELEM = BIRTH;
   DECLARE.KEY = $DATEOUT.CCYY;
   RDBMS_COLUMN = birth; RDBMS_DATATYPE = DATETIME;
COLNAME = BIRTH.PLACE; COLWIDTH = 24;
   RDBMS_COLUMN = birth_place; RDBMS_DATATYPE = VARCHAR; RDBMS_DATALENGTH = 24;
COLNAME = BIRTH.COUNTRY; COLWIDTH = 16;
   RDBMS_COLUMN = birth_country; RDBMS_DATATYPE = VARCHAR; RDBMS_DATALENGTH = 16;
COLNAME = DEATH; COLTYPE = DATE; COLWIDTH = 12; SOURCE.ELEM = DEATH;
   DECLARE.KEY = $DATEOUT.CCYY;
   RDBMS_COLUMN = death; RDBMS_DATATYPE = DATETIME;
FILE = GP.USR.GENERATIONS;
TABLE.NUM = 001;
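The RDBMS_COLUMN and RDBMS_DATATYPE values above follow a simple pattern. As a rough sketch of that pattern (the function and rules below are inferred from the sample output, not taken from SPIRES itself):

```python
# Sketch (not SPIRES source) of the datatype mapping visible in the
# *GENERATIONS record above: element names become lower-case column
# names with "." replaced by "_"; INT columns stay INT; DATE columns
# become DATETIME; everything else becomes VARCHAR of the element width.

def rdbms_column(elem_name, col_type, width):
    """Map a declared table column to an (rdbms_column, rdbms_datatype) pair."""
    rdbms_name = elem_name.lower().replace(".", "_")  # BIRTH.PLACE -> birth_place
    if col_type == "INT":
        return rdbms_name, "INT"
    if col_type == "DATE":
        return rdbms_name, "DATETIME"                 # CCYY dates load as DATETIME
    return rdbms_name, "VARCHAR(%d)" % width

# e.g. rdbms_column("BIRTH.PLACE", "STRING", 24) -> ("birth_place", "VARCHAR(24)")
```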
A separate table has been created for the "Child" data element, since there may be any number of children. The "Child.Occ" column was generated to hold the occurrence number of a particular child.
> display *child
ID = *CHILD;
COMMENTS = Table Generation for Element CHILD;
DEFDATE = THUR. JAN. 27, 2000; MODDATE = THUR. JAN. 27, 2000; MODTIME = 09:54:48;
SUBFILE.NAME = GENERATIONS;
DECLARE.SUBFILE = Data Move Declares;
COLNAME = ID; ISKEY; COLTYPE = INT; COLWIDTH = 4;
   RDBMS_COLUMN = id; RDBMS_DATATYPE = INT; RDBMS_DATALENGTH = 4;
COLNAME = CHILD.OCC; COMMENTS = Occurrence Number; ISKEY; COLTYPE = INT;
   SOURCE.ELEM = CHILD; DECLARE.KEY = $VALUE.OCC; SOURCE.SINGLE;
   RDBMS_COLUMN = child_occ; RDBMS_DATATYPE = SMALLINT; RDBMS_DATALENGTH = 4;
COLNAME = CHILD; COMMENTS = Multiple; COLTYPE = INT; COLWIDTH = 5;
   RDBMS_COLUMN = child; RDBMS_DATATYPE = INT; RDBMS_DATALENGTH = 4;
FILE = GP.USR.GENERATIONS;
MULTI.ELEMENT = CHILD;
TABLE.NUM = 003;
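The occurrence-number scheme used by the *CHILD table can be sketched in a few lines. The function below is illustrative only; SPIRES generates these rows itself via the $VALUE.OCC declare:

```python
# Sketch of how a multiply occurring element is flattened into its own
# table keyed by (id, occurrence number). Illustrative only.

def child_table_rows(record_id, child_ids):
    """Yield one (id, child_occ, child) row per CHILD occurrence."""
    return [(record_id, occ, child_id)
            for occ, child_id in enumerate(child_ids, start=1)]

# A record with three CHILD occurrences produces three child-table rows.
rows = child_table_rows(239, [242, 243, 244])
```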
A number of Declared elements were called in by these table descriptions. These element transformation structures are in the system DATA MOVE DECLARES subfile. [EXPLAIN DATA MOVE DECLARES SUBFILE.] You can build your own database for this purpose (see the definition for Usr_Elements above) or store your Declared elements in the system subfile.
If you are going to move the tables to an external RDBMS database, you can generate DDL statements from your Declared Tables. [EXPLAIN PERFORM TABLE CREATE DDL.]
> perform table create ddl subfile generations from usr_tables
- DDL Statement Totals for Subfile generations

  Table: generations    Columns:  9   Row length:  81
  Table: note           Columns:  3   Row length:  78
  Table: child          Columns:  3   Row length:  10
  Table: marriage_str   Columns:  8   Row length:  68
                                ------
  Totals   Tables: 4    Columns: 23   Row length: 237
> list unn
CREATE TABLE generations (id INT NOT NULL, name VARCHAR(28) NOT NULL,
  sex VARCHAR(1), parent_1 INT, parent_2 INT, birth DATETIME,
  birth_place VARCHAR(24), birth_country VARCHAR(16), death DATETIME,
  CONSTRAINT generations_pk PRIMARY KEY (id));

CREATE TABLE note (id INT NOT NULL, note_occ SMALLINT NOT NULL,
  note VARCHAR(72),
  CONSTRAINT note_pk PRIMARY KEY (id, note_occ),
  CONSTRAINT note_generations_fk FOREIGN KEY (id)
    REFERENCES generations (id));

CREATE TABLE child (id INT NOT NULL, child_occ SMALLINT NOT NULL,
  child INT,
  CONSTRAINT child_pk PRIMARY KEY (id, child_occ),
  CONSTRAINT child_generations_fk FOREIGN KEY (id)
    REFERENCES generations (id));

CREATE TABLE marriage_str (id INT NOT NULL, spouse INT NOT NULL,
  marriage_date DATETIME, marriage_place VARCHAR(24),
  marriage_country VARCHAR(16), married_name VARCHAR(15),
  divorced VARCHAR(1), spouse_str VARCHAR(4),
  CONSTRAINT marriage_str_pk PRIMARY KEY (id, spouse),
  CONSTRAINT marriage_str_generations_fk FOREIGN KEY (id)
    REFERENCES generations (id),
  CONSTRAINT marriage_str__fk FOREIGN KEY (spouse_str) REFERENCES (id));
In order to extract table data, we must build a DATA MOVE record. [EXPLAIN PERFORM TABLE CREATE DATA MOVE.]
> perform table create data move subfile generations from usr_tables
You can issue [EXPLAIN DATA MOVE PROCESSING.] to see a description of the following DATA MOVE record fields.
> list unn
Id = *GENERATIONS_TABLE_MOVE;
Subfile.Name = &GP.USR.GENERATIONS GENERATIONS;
Declare.Subfile = Data Move Declares;
Table.Subfile = &GP.USR.USR_TABLES USR_TABLES;
Area.Name = Table1; Device.Type = OS File;
   File.Name = WYL.GP.USR.GENERATI.ONS; Record.Length = 117;
   Device.Options = var lrecl=117 repl refor temp;
Area.Name = Table2; Device.Type = OS File;
   File.Name = WYL.GP.USR.NOTE; Record.Length = 82;
   Device.Options = var lrecl=82 repl refor temp;
Area.Name = Table3; Device.Type = OS File;
   File.Name = WYL.GP.USR.CHILD; Comment = Row data length is 15;
   Record.Length = 19; Device.Options = var lrecl=19 repl refor temp;
Area.Name = Table4; Device.Type = OS File;
   File.Name = WYL.GP.USR.MARRIAGE.STR; Comment = Row data length is 81;
   Record.Length = 89; Device.Options = var lrecl=89 repl refor temp;
Target.Name = GENERATIONS; Generate.Table = *GENERATIONS; External.Area = Table1;
Target.Name = NOTE; Generate.Table = *NOTE; External.Area = Table2;
Target.Name = CHILD; Generate.Table = *CHILD; External.Area = Table3;
Target.Name = MARRIAGE_STR; Generate.Table = *MARRIAGE.STR; External.Area = Table4;
Do not expect this record to be in its "final" form. An ensuing DATA MOVE request may fail because not enough space is allocated on the OS volume; this can happen when you are extracting more data than the OS default allocation (16 extents) provides.
It is a good idea to get an estimate of the amount of space needed. You can issue the PERFORM DATA MOVE command with a "COUNT" value (e.g. COUNT = 1000) [EXPLAIN PERFORM DATA MOVE COMMAND.] and then extrapolate the resulting total length values for each table based upon the total number of records to be extracted. You should then replace the "lrecl=value" field in the "Device.Options" data value with something like "MBYTE=value". See [EXPLAIN ASSIGN AREA COMMAND.] and the "tracks=n" option.
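The extrapolation arithmetic can be sketched as follows. The figures and the 25% safety margin are hypothetical, not SPIRES defaults:

```python
# Hypothetical space estimate: scale the total length reported by a
# COUNT-limited PERFORM DATA MOVE run up to the full record count,
# then add a safety margin before choosing an MBYTE= value.

def estimate_mbytes(sample_total_len, sample_count, full_count, margin=1.25):
    """Extrapolate a sample run's total byte length to the full extraction."""
    projected = sample_total_len * (full_count / sample_count) * margin
    return projected / (1024 * 1024)  # bytes -> megabytes

# e.g. a COUNT = 1000 run that wrote 1,775 bytes for one table, scaled
# to 250,000 records, suggests roughly half a megabyte for that table.
mb = estimate_mbytes(1775, 1000, 250000)
```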
Now let's add the above DATA MOVE record to the system subfile and perform the DATA MOVE to generate the tables.
> select data move
> add
> perform data move from *generations_table_move for subfile
Totals for Subfile GENERATIONS Table Move              Jan 27, 2000

Table Name        Table   Row     Max    Line    Total
                  Type    Count   Length Count   Length
================= =====  ======= ====== ======= =========
generations       DELIM       32      76     33      1775
note              DELIM       20      66     21       870
child             DELIM       82       9     83       740
marriage_str      DELIM       20      40     21       613
The default output is tab delimited data with a header line.
> use wyl.gp.usr.generati.ons clear
> ch x'05' to '|' nol
> list unn
id|name|sex|parent_1|parent_2|birth|birth_place|birth_country|death
123|Josephs Father|M||||||
124|Josephs Mother|F||||||
133|Josephs Uncle|M||||||
237|Joseph John Hickner|M|123|124|02/24/1825|Baden|Germany|04/12/1890
238|Elizabeth Fischer|F|||1825|Baden|Germany|
239|Frank Joseph Hickner|M|237|238|08/15/1854|Louisville, KY||02/16/1933
240|Anna Isabelle Easterwood|F|563|564|04/29/1855|Rising Sun, IN||10/01/1892
241|Cordelia Pearson Allen|F|||04/07/1861|Ohio||12/22/1921
242|Joseph Jacob Hickner|M|239|240|10/09/1879|Osgood, IN||03/--/1953
243|Mary Agnes Hickner|F|239|240|07/22/1883|||06/04/1901
244|Peter Francis Hickner|M|239|240|09/22/1884|Osgood, IN||03/22/1968
245|Hazel Virginia Greenwalt|F|||12/24/1889|||01/04/1943
246|John William Hickner|M|239|240|04/27/1886|Osgood, IN||09/06/1945
247|Rose Mary Schmitt|F|||09/21/1886|||01/09/1980
248|Edward Hickner|M|239|240||||
249|Mary Hickner|F|239|240||||
250|John Hickner|M|239|240||||
252|George Hickner|M|239|240|1889|||09/27/1892
291|Charles Joseph Hickner|M|237|238|1849|Baden|Germany|
292|Mary Magdalena Hickner|F|237|238|08/25/1856|Louisville, KY||07/10/1907
293|Floren Hickner|M|237|238|1859|Louisville, KY||
294|George Hickner|M|237|238|12/17/1863|Louisville, KY||12/05/1947
295|Emma Elizabeth Darling|F||||||10/03/1912
296|Joseph Hickner|M|237|295|12/23/1872|Cleves, Ohio||10/27/1953
297|Katherine Hickner|F|237|295|10/25/1877|Cleves, Ohio||01/31/1958
299|Frank Hickner|M|239|240|09/--/1892|||10/05/1892
326|Rose Elizabeth Altherr|F|||02/22/1884|Tipton, IN||09/21/1946
563|Eli Easterwood|M|||1832|Ohio||06/15/1864
564|Mary Ann Trader|F|||1835|Dearborn, Co. IN||10/01/1892
565|John Easterwood|M|563|564|06/28/1857|Dearborn, Co. IN||08/31/1915
566|Magdalena Hickner|F|||08/25/1856|Kentucky||07/10/1907
567|William Easterwood|M|563|564||Kentucky||
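On the receiving side, the extract is an ordinary delimited file with a header line. A minimal reader might look like the sketch below (the delimiter is X'05' as written; here it has already been CHANGEd to "|"):

```python
# Minimal sketch of a loader for the delimited extract shown above,
# after the X'05' delimiter has been changed to '|'. Empty fields are
# kept as empty strings, matching the adjacent-delimiter convention.

def read_table(lines, delim="|"):
    """Return one dict per data row, keyed by the header-line column names."""
    header = lines[0].rstrip("\n").split(delim)
    return [dict(zip(header, line.rstrip("\n").split(delim)))
            for line in lines[1:]]

rows = read_table(["id|name|sex|birth\n",
                   "237|Joseph John Hickner|M|02/24/1825\n",
                   "123|Josephs Father|M|\n"])
# rows[0]["name"] == "Joseph John Hickner"; rows[1]["birth"] == ""
```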
A SPIRES file can be defined in such a way that one of its record-types is an External record-type. This is done by coding the element EXTERNAL-TYPE in the record-type definition. A subfile which has an EXTERNAL goal record-type has some properties that are quite different from those of other SPIRES subfiles. The subfile looks similar to normal subfiles in that its Deferred Queue can be used for retrieval and update, but the "tree" portion of the subfile is "external" to SPIRES. That is, the "tree" is not in a SPIRES RECn data set. Rather, it is located either on a SPIRES Device or on some medium foreign to the SPIRES environment.
This "foreign" information source refers to any source of information that can be moved in some fashion into a SPIRES Device area. This data could come from a WYLBUR data set, or from a remote database accessed through a "perl" script that creates a WYLBUR data set. Whatever its source, we will refer to it in this document as "Remote" data.
Data from the external "tree" of a subfile is accessed through normal SPIRES Device Services areas, using Formats which transform the information in exactly the same way that INPUT formats do. In fact, INPUT formats are used to transform the information from its external form into the SPIRES internal record form. Thus the DISPLAY of a record of an external subfile involves extracting that record from the external device via an input format before presenting it to its final destination, possibly via an OUTPUT format.
Similarly, the movement of a record to an external device (the external "tree") is done using an OUTPUT format which converts data from its internal SPIRES form to the form that must be presented to that external device.
You may ask why anyone would want to access and/or store data in this manner when SPIRES has always had the capability to move data to and from device Areas such as the ACTIVE file or other SPIRES files and, more recently, OS files. Several advantages come to mind.
- The generality of SPIRES internal data
You can define and manipulate external data in ways that take advantage of the generality, variability and power that is inherent in data stored in a SPIRES subfile. In simple terms you can deal with external data described by the SPIRES record definition language rather than dealing with it through more rigid Vgroup definitions.
- Accessing the data as a subfile
You have the ability to access external data in its context as a subfile, using SPIRES single record display, search commands and sequential scan commands. You are also able to filter the data through normal SPIRES filters, to sequence the data, to access it through Paths and subgoals and to transform it with Output formats or $REPORT, in effect acting upon that information with the full capability of SPIRES.
- Utilization of the SPIRES file Deferred Queue
The Deferred Queue is unique to SPIRES and has a number of properties that can also be used to advantage with external subfile manipulation.
For example, the Defq may be used as a place holder for updates to the external subfile, increasing efficiency.
- The Defq may be used to store accessed external records so that they need not be re-accessed if remote changes are not a factor.
- Multiple updates to the external records can be made without having to ship them to a remote platform each time.
- SPIRES Fastbild Services are available for quick loading.
- Data may be acquired once from a remote platform and shared by many on the SPIRES side.
The Defq may also be used to advantage in conjunction with Transaction Group processing and record locking activity.
- Allows data migration to remote platforms
Data that is currently stored in SPIRES data sets as SPIRES subfiles may be moved to other remote platforms for various reasons. Perhaps data acquisition and maintenance can be better achieved there. Converting the subfile from its current SPIRES form into an External subfile could greatly simplify and smooth the transition.
[See 8.1.]
The actual process of record retrieval from an external subfile consists of two or three phases of activity. The optional first phase is essentially a non-SPIRES phase, in that SPIRES itself does not have control of the process. SPIRES does have control during the second phase, though the external file definer controls the type of activity that takes place. The final phase is totally under SPIRES control.
- Data Acquisition Phase -- From Remote Medium to SPIRES Device
This phase consists of the processes needed to move remote information into a SPIRES medium, a Device Area known and understood by SPIRES, with a possible transformation of that data into a form that is to be read by a SPIRES format.
Example: A simple example is to USE a WYLBUR data set. The ACTIVE file would then be used as the FILE device, that is, the SPIRES Device Area. A more complex activity would be to execute a program such as a "perl" script which routes requests to other machines (such as a SYBASE server) or which causes the execution of batch processes that extract data from some other external source.
- Data Input Phase -- From SPIRES Device to Subfile Record
The execution of this phase is under SPIRES control, but the subfile definer provides information to govern that control (through the EXTERNAL DATA declaration). Data on the SPIRES device is transformed by an input format and presented to the SPIRES environment as a subfile record. The step of USEing a WYLBUR data set in phase 1 above could be eliminated by using the OS FILE Device Area.
- Data Presentation Phase -- From Subfile Record to Final form
This phase mimics the normal SPIRES process once a subfile record has been accessed in a "tree" data set. The accessed subfile record, in its internal form, is presented to the calling SPIRES process (FIND, DISPLAY, Global FOR, etc.).
The preceding discussion has been geared to the activity of retrieving records from an external subfile. The updating of external information involves processes similar to those described for the phases of retrieval.
The phases of external file update activity are the reverse of the phases of retrieval activity.
- Data Update Phase -- From SPIRES external form to Subfile record.
This phase of the update activity is totally controlled by SPIRES and concludes with the generation of a subfile record to be added, updated or removed.
- Data Output Phase -- From Subfile Record to SPIRES Device
This phase consists of two distinct types depending upon the Control information specified by the external subfile definer. These types are the following.
- Deferred Data Output -- This activity consists of accumulating the external subfile modifications in the Defq, to be PROCESSed as a "batch" of updates at a later time (via the SPIRES PROCESS command).
- Direct Data Output -- Each update request is passed directly to the Device Area.
Whether the Data Output phase is Direct or Deferred, the data must be transformed into its external form through an OUTPUT format such as $OUTPUT or a format designated by the subfile definer.
This phase may be modified by the subfile definer by setting control information for the commands which are used to cause data to flow out to a Device Area (and to a remote medium if one is specified).
- Data Storage Phase (Optional) -- From SPIRES Device to Remote Platform
This phase is triggered by a SPIRES or user provided process that can transform and/or move the data to its remote destination.
SPIRES provides a metadata structure developed in conjunction with External subfile support. The purpose of this model is to hold all of the information that the External subfile definer needs to control SPIRES interaction with his or her data. A subfile definer who utilizes the features made available through this structure should find it much easier to interact with the data, and the subfile's user community will find that their interaction with the data is the same as if the primary data source were a SPIRES data base.
Any permanent external subfile must be defined with the EXTERNAL-TYPE data element coded within its goal record definition. EXTERNAL-TYPE may be coded as a null value (EXTERNAL-TYPE;), which indicates that the EXTERNAL DATA information will be DECLAREd dynamically, or it may contain the key of a record in the EXTERNAL system subfile (e.g. EXTERNAL-TYPE = *External-data-record;). The data elements that currently make up the External data package are as follows:
Subfile EXTERNAL
Type       Element                  Description
String     key ID                   - External-Type record key
String     COMMENTS
String     AUTHOR
Hex        DEFDATE
Hex        MODDATE
Hex        MODTIME
Structure  RDBMS                    - External Database Structure
String     key . DATABASE           - External Database name
String     . SERVER                 - Server/Host Machine name
String     . USER                   - RDBMS user name
String     . PASSWORD               - RDBMS user password
Structure  . HIERARCHY (mult)       - RDBMS table structure
String     key . . TABLE            - RDBMS Table or View
String     . . KEY.COLUMN           - RDBMS Key Column
String     . . FROM (mult)          - SQL FROM statement
String     . . WHERE.X (mult)       - SQL WHERE statement
String     . . GROUP.X (mult)       - SQL GROUP BY statement
String     . . ORDER.X (mult)       - SQL ORDER BY statement
String     . . EXCLUDE.LIST         - Data element Exclusion list
Int        . . WAIT.TIME            - SQL Search Wait Time
String     . PORT                   - Host Port number
Structure  TRANSFORM                - Data transformation structure
String     key . DIRECTION          - Input or Output direction
String     . AREA                   - Device Area name
String     . FORMAT                 - Data transformation Format name
String     . PARMS                  - Parms to pass to format
String     . TRACE                  - To set Format tracing
String     . HIDDEN                 - To hide transformation data
String     . ASSIGN.TO              - Device Area assignment statement
String     PTRACE                   - To set Elem processing tracing
Structure  COMMAND.INFO (mult)      - SPIRES command control structure
Int        key . COMMAND            - Command value (Select, Search etc.)
Hex        . COMMAND.OPTIONS        - Command control options
String     . PROCS                  - External process to call for Command
String     . USING.FRAME            - Using Frame name for Command
String     . LOAD.PROC              - External process to call for load
String     LIBRARY                  - Library name for External Processes
String     LOGGING                  - Continue with any logging (Tlog, Elog)
Struc      USERDEFS                 - USERPROC definition structure
....................
[See 8.3.]
SPIRES provides a number of mechanisms to aid in the processing control of data movement from its external to SPIRES internal form (data retrieval), or from its SPIRES internal form to external form (data update). The type of control provided, and its timing, is determined by the values coded for the various data elements in the External Data Declaration structure shown above. You could circumvent these mechanisms by providing alternative methods, but there should be no good reason to do so. SPIRES external subfile support provides the best and most efficient processing in a manner that most closely approximates the feel to which its users have become accustomed.
This phase consists of the acquisition of data from the remote data medium and its movement into the SPIRES Device Area. The data element values that control this activity are the following:
Example: Examples of acquisition processes are the RD_SRCH and RD_DISP XPROCs written for the Search and Display exits to access remote RDBMS data. FIND and DISPLAY key-value requests to the external subfile are reinterpreted in terms understood by the relational data system in which the remote data is stored and maintained. Note that these XPROCs are no longer needed if you choose to utilize the NIO (Network) Device Area.
SPIRES must be told how to locate, transform and process the external data. Information must be supplied by the external subfile definer for each of these three aspects of processing.
Data transformation information -- SPIRES must know the name of the format to use to transform data from its Device Area image into its SPIRES internal record image. This format may be a system supplied format coded to transform data on the Device Area in one of a few standard forms (e.g. $INPUT, $RDBMSCAN, etc.). This format is established at the time of subfile Select, kept hidden (thus not interfering with user supplied formats), and cleared when the subfile is cleared. The data elements involved are:
Command processing control information -- SPIRES must be told what activity it is to perform for various kinds of commands. Note that this control information may be used only if the "tree" of the external subfile is to be accessed. If an external subfile record is found in the Defq during a DISPLAY request then this control information has no effect.
The following information is used for this purpose:
- SELECT -- Initiate a process at the time of subfile Select. You may wish to load an entire external subfile or there may be some "handshaking" activity between SPIRES and a remote system to determine such things as how to transform RDBMS column names into SPIRES element or index names.
- SEARCH -- Initiate a process whenever a FIND, AND or OR command is invoked. The search request is transformed by an internal SPIRES process and the resulting retrieval parameters may be sent to the external source. Data is returned to SPIRES in the form of a RESULT stack.
- SINGLE -- Initiate a process when a single "tree" record access takes place (e.g. for DISPLAY key-value or $Lookup requests).
- SCAN -- Initiate a process for a number of sequential external subfile access requests which will access the "tree" (Global FOR SUBFILE or FOR TREE).
- CLEAR -- Remove all goal records in the defq prior to servicing the specific command.
- LOAD -- Load any records that are accessed from the Device Area. These records will be either ADDed or UPDATEd to the external subfile's Defq.
- NEW -- Load only the NEW records that are accessed. In other words, records are only ADDed.
- ALL -- All records in the Device Area are loaded into the subfile defq. Then the specific command is serviced.
During SELECT you may decide that it is most efficient to load the entire data base. This could be quite advantageous if the subfile is shared publicly: if you ask for LOAD NEW, loading takes place only once for that subfile, by the first SELECT following a PROCESS. The SEARCH process creates an in-core RESULT of the records that meet the external subfile "search" criteria. You have the option to LOAD full records as well as to generate a RESULT, but if the result can be large, or if iterative searches are likely, it is generally best not to code the LOAD option.
SPIRES provides convenient and straightforward means to control External subfile Acquisition and input processing. Some of this control is very loose and dynamic and open to extensive change by the user. Other control is tightly maintained and may be changed only by the subfile owner or by some SPIRES system process.
Like the Retrieval Input control aspects, the Update Output processing involves the TRANSFORM structure along with COMMAND control options that are exercised during specific update activity. Because of data integrity considerations SPIRES must also enable control of lower level operations such as record locking, commit and decommit.
Command processing control information -- SPIRES expects external subfile control information during any subfile update request. The subfile definer determines the timing and the type of activity that is to take place.
- ADD -- Initiate a process to add an external subfile record.
- UPDATE -- Initiate a process to update an external subfile record.
- REMOVE -- Initiate a process to remove an external subfile record.
- LOCK -- The process defined by the Lock "command" control information is initiated when it is necessary to synchronize data within the external subfile and the remote database. The lock process is used as follows: If the LOCK command option (see below) is specified during an UPDATE command, then SPIRES executes the LOCK processor. The PROC is expected to issue a request to the remote database to COMMIT the record being updated and to retrieve that record for testing by SPIRES. If the retrieved record does not match the pre-updated SPIRES record, then SPIRES returns an error and calls the LOCK process to DECOMMIT the transaction in the remote database. If the records do match, then the update is allowed to proceed.
- PROCESS -- This process is initiated at the beginning and at the conclusion of activity during the subfile PROCESS command activity for the external subfile. It is intended to control the initial and ending activity for a "batch" of updates to the remote database. The individual updates are still handled by the processes coded for ADD, UPDATE and REMOVE commands.
- PROCESS -- Normally, SPIRES adds or updates the given external subfile record in the usual manner; that is, the transaction is placed in the subfile's Deferred Queue. If the PROCESS option is coded, then the transaction is moved immediately to its remote site.
- DEQUEUE -- If this option is coded then SPIRES will not hold the PROCESSed record in the external subfile defq. The transaction will be DEQueued once the update has been PROCESSed.
- LOCK -- If this option is specified for a given update command, then SPIRES will invoke the LOCK command process. See the description above for information about the use of the LOCK command process.
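The LOCK interaction described above amounts to an optimistic-concurrency check. It can be sketched as follows; the callables are placeholders for whatever the LOCK PROC actually does against the remote database, not SPIRES APIs:

```python
# Sketch of the LOCK command process flow: fetch the remote copy, compare
# it with the pre-update SPIRES copy, decommit on a mismatch, otherwise
# let the update proceed. The callables are placeholders, not SPIRES APIs.

def locked_update(pre_update_record, fetch_remote, apply_update, decommit):
    """Apply the update only if the remote record still matches."""
    remote = fetch_remote()          # LOCK proc: commit and retrieve the record
    if remote != pre_update_record:  # changed remotely since we read it
        decommit()                   # roll back the remote transaction
        return None                  # SPIRES would return an error here
    return apply_update()            # records match: the update proceeds
```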
This phase is triggered by a SPIRES or user provided process that can transform and/or move the data to its remote destination. See the description of control during retrieval acquisition above for information about the RDBMS structure and the PROCS and LIBRARY data elements.
[See 8.4.]
The following sample File Definition provides an example of how one might use this facility to access, and even update, data in a remote RDBMS database:
FILE = GQ.WCK.PATIENTS;
DEFDATE = WED. JAN. 16, 1985; MODDATE = THUR. JULY 13, 2000; MODTIME = 08:43:40;
BIN = PURGE;
NOAUTOGEN;
RECORD-NAME = REC01;
DEFINED-BY = GQ.DOC.SYB.PATIENTS;
EXTERNAL-TYPE = gq.wck.patients;
SUBFILE-NAME = WCK PATIENTS;
GOAL-RECORD = REC01;
ACCOUNTS = GQ.WCK;
The Record definition defined by "GQ.DOC.SYB.PATIENTS" forms a structure which matches a table that is to be accessed through the use of a record in the EXTERNAL subfile whose key is "GQ.WCK.PATIENTS".
This EXTERNAL record is as follows:
> select external
> display gq.wck.patients
ID = GQ.WCK.PATIENTS;
DEFDATE = TUES. AUG. 15, 1995; MODDATE = THUR. JULY 13, 2000; MODTIME = 08:41:12;
DATABASE = mtp_test; SERVER = renoir; USER = gqdoc; PASSWORD = *******;
TABLE = patients; KEY.COLUMN = patient_id;
PORT = 201;
DIRECTION = INPUT; AREA = NIO; FORMAT = $tbtf.read;
DIRECTION = OUTPUT; AREA = NIO; FORMAT = $tdbm.updt;
COMMAND = SEARCH; COMMAND.OPTIONS = CLEAR; USING.FRAME = TABLE;
COMMAND = SINGLE; COMMAND.OPTIONS = LOAD, NEW; USING.FRAME = SINGLE;
COMMAND = ADD; COMMAND.OPTIONS = PROCESS; USING.FRAME = ADD;
COMMAND = UPDATE; COMMAND.OPTIONS = PROCESS; USING.FRAME = UPDATE;
COMMAND = REMOVE; COMMAND.OPTIONS = PROCESS; USING.FRAME = REMOVE;
This record is set up to access a particular SYBASE table named "patients" through the Server and Port shown. The RDBMS table has a SPIRES equivalent defined by the RECDEF. The table structure can be shown as follows:
> select @orv.gq.doc.syb.patients
> show elem char
Subfile @GQ.DOC.SYB.PATIENTS

Sec Occ  Len Type   St/El Element
--- ---- --- ------ ----- -------
Fix Sing   2 Int    00/00 key PATIENT_ID
Opt Sing     String 00/01 NAME
Opt Sing     String 00/02 ADDRESS1
Opt Sing     String 00/03 ADDRESS2
Opt Sing     String 00/04 PHONE
Opt Sing     String 00/05 INSURANCE

> select wck patients
> show indexes
Goal Records: GOAL

RDBMS Search term Relationships
Goal-elem-name: PATIENT_ID   Column-name: patient_id (binary)
Goal-elem-name: NAME         Column-name: name (char)
Goal-elem-name: ADDRESS1     Column-name: address1 (char)
Goal-elem-name: ADDRESS2     Column-name: address2 (char)
Goal-elem-name: PHONE        Column-name: phone (char)
Goal-elem-name: INSURANCE    Column-name: insurance (char)
You may use the above search terms in an "external" find request.
> via external find name str Sickly
-Result: 6 RECORDS
Note that since the Search COMMAND.OPTIONS specifies CLEAR, SPIRES will ZAP the DEFQ at the time the search is issued. The external search request is sent to the remote host, and the keys of the search result are returned.
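Conceptually, the SEARCH process turns that FIND into a keys-only query against the mapped column. The SQL text below illustrates the idea using the WCK PATIENTS column mapping; it is not what the search processor literally emits:

```python
# Illustrative translation of a SPIRES "find <elem> str <value>" request
# into a keys-only SQL search, using the WCK PATIENTS column mapping.
# A conceptual sketch, not the actual search processor's output.

def find_to_sql(table, key_column, column, value):
    """Build a SQL query returning only the keys of matching rows."""
    return ("SELECT %s FROM %s WHERE %s LIKE '%%%s%%'"
            % (key_column, table, column, value))

sql = find_to_sql("patients", "patient_id", "name", "Sickly")
# "SELECT patient_id FROM patients WHERE name LIKE '%Sickly%'"
```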
> sho file trans
07/13/2000      Transaction Log of File GQ.WCK.PATIENTS for Record-type REC01

Date     Time     Account Id Type Command    Grp Key Value

> type patient_id name
PATIENT_ID = 2;  NAME = Sickly, I. M.;
PATIENT_ID = 8;  NAME = Sickly, A. B.;
PATIENT_ID = 9;  NAME = Sickly, C. D.;
PATIENT_ID = 52; NAME = Sickly, U. M.;
PATIENT_ID = 82; NAME = Sickly, I. R.;
PATIENT_ID = 92; NAME = Sickly, U. R.;
>
Since the command to access SINGLE records specifies "LOAD, NEW", SPIRES retrieves the records and loads each of them into the Defq in addition to displaying them. The input format $tbtf.read handles the conversion of the column values from each row into the data element values seen in the TYPE request.
> sho file trans
07/13/2000      Transaction Log of File GQ.WCK.PATIENTS for Record-type REC01

Date     Time     Account Id Type Command    Grp Key Value
07/13/00 14:52:09 GQ.WCK2    ADD  Add            2
07/13/00 14:52:10 GQ.WCK2    ADD  Add            8
07/13/00 14:52:13 GQ.WCK2    ADD  Add            9
07/13/00 14:52:15 GQ.WCK2    ADD  Add            52
07/13/00 14:52:17 GQ.WCK2    ADD  Add            82
07/13/00 14:52:19 GQ.WCK2    ADD  Add            92
>
You can also cause a record to load through DISPLAY.
> display 6
PATIENT_ID = 6;
NAME = Person, D. O. A.;
ADDRESS1 = 386 Samson Court;
ADDRESS2 = Homebody, IO 84501;
PHONE = (814) 555-1212;
INSURANCE = Quibble and Dragout;
>
In the client-server environment, data is frequently moved or, rather, copied from hierarchical SPIRES records into flat tables in a data base (e.g., a Sybase database) on a different machine. The master record currently resides in SPIRES, where it is updated by Prism users or other mainframe processes. But the updates need to be applied not only to the SPIRES file but also to the server data base.
Records added to the SPIRES subfile are easily handled by the tables: the table entries are generated, and inserted. Removed records are relatively simple too: the table entries for the tree copy of the removed record are generated, and then passed to the tables marked as deletes. But updated records are more complicated, combining the add and remove procedures: the table entries for the tree copy are generated and then passed to the tables as deletes; and then the table entries for the defq copy of the record are generated, and passed to the tables as inserts. Obviously, if only one element value is changed in the updated record, a lot of unnecessary inserts and deletes would be generated.
With the feature called "change generation", SPIRES can help sort out the update data, returning only the changes that need to be made to the tables. This can save a great deal of processing time on the server, radically reducing the number of table updates that need to be done to keep the table data in synch with the SPIRES records, as well as reducing the amount of data that needs to be schlepped from the mainframe to the server.
The rest of this introduction is devoted to an example that may help clarify these points. The rest of the chapter, starting with the next section, describes how to use this feature. [See 9.1.]
Suppose you have this source record and its update:
Original                          Update
----------------------------      ------------------------------
ID = 49322;                       ID = 49322;
NAME = Lottie Dah;                NAME = Lottie Dah;
ADDRESS;                          ADDRESS;
ADDRESS-TYPE = Home;              ADDRESS-TYPE = Summer;
STREET = Lackluster Lane;         STREET = Lackluster Lane;
ADDRESS;
ADDRESS-TYPE = Work;
STREET = Drab Drive;
CHILD;                            CHILD;
NAME = Veran;                     NAME = Veran;
BIRTHDATE = 05/05/92;             BIRTHDATE = 05/05/92;
                                  CHILD;
                                  NAME = Du;
                                  BIRTHDATE = 02/29/96;
When you first copy the data of the original record from SPIRES to flat tables, you flatten it into these records, as new records to add:
Address table
    Insert   49322   Lottie Dah   Home   Lackluster Lane
    Insert   49322   Lottie Dah   Work   Drab Drive

Child table
    Insert   49322   Lottie Dah   Veran   05/05/92
But after the record is updated in SPIRES with the new data, shown on the right above, you want the two tables to reflect that data too, with the entries in the tables ending up like this:
Address table
    49322   Lottie Dah   Summer   Lackluster Lane

Child table
    49322   Lottie Dah   Veran   05/05/92
    49322   Lottie Dah   Du      02/29/96
The question is, how much data do you need to pass to update the tables? Do you want to pass all the data from the original record, marked as data for removal from the table, followed by all the data from the updated record, to be added?
Address table
    Delete   49322   Lottie Dah   Home     Lackluster Lane
    Delete   49322   Lottie Dah   Work     Drab Drive
    Insert   49322   Lottie Dah   Summer   Lackluster Lane

Child table
    Delete   49322   Lottie Dah   Veran   05/05/92
    Insert   49322   Lottie Dah   Veran   05/05/92
    Insert   49322   Lottie Dah   Du      02/29/96
Deleting all of the table entries for the old copy and inserting all of them for the new copy can lead to much unnecessary work. Because there are several changes between the old and new versions of our sample record, there isn't much unnecessary work there: only the deletion and insertion of the Veran child isn't necessary. However, in a larger record, with only a single piece of data being changed, this method of handling changes could lead to an enormous number of unnecessary data updates to the tables.
The alternative that SPIRES offers is "change generation". As it does when it determines what indexes need updating when records are changed, SPIRES will examine the tree and defq copies of a record, determine what has changed, and only report those changes back. In the above example, SPIRES would generate only the changes that needed to be made to the table, eliminating the changes that would cancel each other out:
Address table
    Delete   49322   Lottie Dah   Home     Lackluster Lane
    Delete   49322   Lottie Dah   Work     Drab Drive
    Insert   49322   Lottie Dah   Summer   Lackluster Lane

Child table
    Insert   49322   Lottie Dah   Du   02/29/96
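The underlying idea can be illustrated with a small sketch in Python (not SPIRES): generate the table rows for both copies of the record, cancel the rows that appear in both, and report only what remains as deletes and inserts. The tuple row layout is an assumption for the example; multiset counting handles repeated rows correctly.

```python
# A hedged sketch of the idea behind "change generation": rows common to
# the old (tree) and new (defq) copies cancel each other out, leaving
# only the deletes and inserts that actually need to reach the table.

from collections import Counter

def generate_changes(old_rows, new_rows):
    """Return (deletes, inserts) with rows common to both copies cancelled."""
    old_count = Counter(old_rows)
    new_count = Counter(new_rows)
    deletes = list((old_count - new_count).elements())
    inserts = list((new_count - old_count).elements())
    return deletes, inserts
```

Applied to the Child table of the example, the Veran row appears in both copies and cancels, so only the insert for Du survives.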
The rest of this chapter describes how to set up change generation for your application.
To make change generation work for your application, you need the following:
- the "source" subfile containing the data whose changes you are monitoring.
- the ability to tie, or at least coordinate, change generation for that subfile with the subfile's file-processing procedure. This is necessary because change generation uses the defq as its source of change information. Suppose, for example, that the two were not coordinated: file processing happens at midnight, but change generation happens at 10 pm. Any changes made to the file between 10 pm and midnight would never make it into the change generation process, since they would be cleared from the defq before the next change generation run began; your change generation data would therefore be incomplete.
- a "changes" subfile that will hold the change generation data. This might be part of an external file, i.e., a direct connection to a table in another DBMS, or it might be a regular SPIRES subfile; in either case, the "changes" subfile serves as a staging area where you prepare the change data for another DBMS. It will NOT, therefore, be the final destination for the table data updates. That's because this process can only add data to the changes subfile, not remove it; so the data being passed to the changes subfile is the data to be added to or removed from the destination table, plus the instruction to add or remove. More details about designing the changes subfile appear in the next section. [See 9.2.]
- an output format for the source subfile that creates input for the changes subfile. This must be written according to the standards for using the SBF (SuBFile) area to use records being output from one subfile as input data for another subfile. See the manual "SPIRES Formats", section B.16.4; online, EXPLAIN SBF AREA.
- an output control declaration to pull most of the pieces together (not strictly necessary, but the most common method). [See 7.1.]
Understanding the process of using the change generation feature will help you understand exactly what the feature can do for you. Chances are that you will do these steps in a protocol, but here we'll demonstrate it as if you were doing it interactively. At the end, we'll discuss the changes you would make to use change generation with output control in a protocol.
Select the source subfile.
-> select my-source-subfile
Through a path, select the changes subfile.
-> through next, select my-changes-subfile
-Path established: 1
Define an area on the SBF (SuBFile) device, and then assign that area to the path, naming the path you just opened.
-> define area my-changes on sbf
-> assign area my-changes to subfile path 1
Set the output format written for the source subfile that generates input for the changes subfile.
-> set format change-gen
Establish Global FOR, choosing the class of records you want to examine for changes. [Not all Global FOR classes make sense in this context; in particular, do not use FOR TREE, since that rules out examination of the deferred queue completely.] Chances are, you want to work with all the deferred queue data. The best way to establish that is shown in the example below:
-> for defq          <- all defq data, including removes
+>
Note: the DEFQ class under Global FOR usually does not include removed records, but it does include them under change generation.
The elements that are to be passed from the source subfile to the changes subfile are identified in the format set in step 4. In this step, you define a display set that will flatten the hierarchical SPIRES records into the table entries you need to create. The display set must name all of the elements being passed -- essentially, all the elements in the source record that were named in the output format of step 4. This is also where you add the option that requests that the generated entries be limited to those that represent changes: either the CHANGES option on the DEFINE DISPLAY SET command, or the GENERATE.CHANGES control option in the output control packet. [See 1.9, 7.1.]
The elements that determine the number of set entries should be specified in the ELEMENTS list on the DEFINE DISPLAY SET command (or in the FOR.EACH statement in the output control packet); other elements whose data you want to pass should be included in the "+ elements" list (or in the PLUS.ELEM statement in output control). So, for instance, using part of the record structure of the example in the previous section:
ID (key)
NAME
ADDRESS
   ADDRESS-TYPE (key)
   STREET
you would define a display set that generated table entries for the Address table based on the ID and ADDRESS-TYPE elements, since you want an Address entry for each ID and ADDRESS-TYPE combination in the record. You would then add the NAME and STREET elements as "plus elements" to be passed along to the table.
+> define display set changes tv=all elements id address-type ...
... + name street
+>
Remember, all elements from the source subfile that you refer to in the output format must be named here as well. Another reminder: if it's appropriate, and it usually is, don't forget to include the TV=ALL option to ensure the inclusion of all occurrences.
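The role of the ELEMENTS list versus the "+ elements" list can be sketched in Python (not SPIRES). Here the ID and ADDRESS-TYPE occurrences determine how many table entries are generated -- one per combination -- while NAME and STREET simply tag along with each entry. The dictionary record layout is an assumption for illustration.

```python
# A sketch of the display-set flattening described above: key elements
# (ID, ADDRESS-TYPE) drive entry generation; plus elements (NAME, STREET)
# are carried along into each generated entry.

def address_entries(record):
    """One entry per ID x ADDRESS-TYPE occurrence, with NAME and STREET attached."""
    entries = []
    for addr in record.get("ADDRESS", []):               # each ADDRESS-TYPE occurrence
        entries.append({"ID": record["ID"],              # key element
                        "NAME": record["NAME"],          # plus element
                        "ADDRESS-TYPE": addr["ADDRESS-TYPE"],  # key element
                        "STREET": addr["STREET"]})       # plus element
    return entries
```

A record with two ADDRESS occurrences thus yields two Address-table entries, each carrying the singly occurring NAME value.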
Use the GENERATE SET command to generate the changes, adding them into the changes subfile:
+> in my-changes generate set all
+> endfor
To get an idea of what records have been generated, see "Continuing the Example" below.
It is undoubtedly more common for a change generation procedure to be constructed using output control.
Changes you would make to the above procedure would probably include:
- Step 4: You would name the output format in the output control packet (see below).
- Step 6: Replace the display-set step with the output control declaration, where you put the elements whose occurrences generate table entries (those in the DEFINE DISPLAY SET's ELEMENTS option) in the FOR.EACH statement, and the tag-along elements (those in the "+ elem" list) in the PLUS.ELEM statement. Of course, you may also use any other output control statements that would be useful to you as well. [See 7.1.] For our example, it would look like this:
DECLARE OUTPUT CONTROL
PACKET 1;
   FORMAT = Change.Gen;
   AREA = My-Changes;
   FOR.EACH = ID, Address-Type;
   PLUS.ELEM = Name, Street;
   CONTROL.OPTIONS = Generate.Changes;
ENDDECLARE
- Step 7: Replace the GENERATE SET command with the DISPLAY command, using the WITH OUTPUT CONTROL prefix. Note that you may instead use the TYPE command with a search result with output control to do change generation.
WITH OUTPUT CONTROL DISPLAY ALL
When you complete your work, SPIRES will have created records in the changes subfile based on the changes to the source records.
-> set path 1          <- selecting the changes subfile
-> for adds
+> display all
****
CHANGE.NUMBER = 376;
ID = 49322;
NAME = Lottie Dah;
ADDRESS-TYPE = Home;
STREET = Lackluster Lane;
DELETE;
;
****
CHANGE.NUMBER = 377;
ID = 49322;
NAME = Lottie Dah;
ADDRESS-TYPE = Work;
STREET = Drab Drive;
DELETE;
;
****
CHANGE.NUMBER = 378;
ID = 49322;
NAME = Lottie Dah;
ADDRESS-TYPE = Summer;
STREET = Lackluster Lane;
;
+>
If you compare those records to the changes we identified in the previous section that would need to be made to the tables, you'll see they are basically the same:
Delete   49322   Lottie Dah   Home     Lackluster Lane
Delete   49322   Lottie Dah   Work     Drab Drive
Insert   49322   Lottie Dah   Summer   Lackluster Lane
The notable difference is the appearance of the DELETE element, an element whose presence indicates whether the data is meant to be deleted (representing the old, removed data) or added (the new, updated data).
Your next step would be to take the newly added records in the changes subfile and massage them into input for the tables you want to update. It would be quite easy, for instance, to write an output format that converts these change records into INSERT or DELETE commands for SQL.
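One possible massage step is sketched below in Python rather than as a SPIRES format: each change record becomes one SQL statement, with the presence of the DELETE element selecting the verb. The table name, column names, and naive quoting are assumptions made purely for illustration.

```python
# A hedged sketch of converting a change record into a SQL statement.
# A real implementation would use the target DBMS's quoting/escaping
# rules; this example assumes clean data.

def change_to_sql(change, table="address"):
    cols = ("id", "name", "addr_type", "street")
    vals = (change["ID"], change["NAME"],
            change["ADDRESS-TYPE"], change["STREET"])
    if "DELETE" in change:   # old data: remove the matching row
        conds = " AND ".join("%s = '%s'" % (c, v) for c, v in zip(cols, vals))
        return "DELETE FROM %s WHERE %s" % (table, conds)
    quoted = ", ".join("'%s'" % v for v in vals)   # new data: insert a row
    return "INSERT INTO %s (%s) VALUES (%s)" % (table, ", ".join(cols), quoted)
```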
In the next section of this chapter, we'll cover aspects of the design of the changes subfile. That will include what was needed to make the DELETE element work. [See 9.2.]
For anyone interested in the specific details of how this works, here is an explanation of the steps taken by SPIRES, record by record, when change generation is triggered:
1. Retrieve the latest copy of the record of the Global FOR Class.
2. Process the record according to the DEFQ transaction type:
- a. If the transaction type is an ADD, process the record normally. (The normal process is to generate all of the display set's entries/rows and move them to the SBF area.) Go to step 5.
- b. If the transaction type is UPDATE, then process the record normally but do not move the entries to the output device. Instead save them in core. Go to step 3.
- c. If the transaction type is REMOVE, then go to step 3.
- d. If the transaction type is DEQUEUE or the record came from the tree, then skip all operations on this record. No changes have been made. Go to step 6.
3. Retrieve the tree copy of this record if the transaction type was UPDATE or REMOVE.
4. Process the tree copy as follows:
- a. Set the $DELETE system variable, a new variable that indicates that the deletion phase is taking place. This is used by action A130 (see section 9.2) in creating occurrences of a "delete" element like the DELETE element in our example above. It would also be useful to you in the output format if you are creating your own delete indicator.
- b. If the transaction type is REMOVE, then process the record normally but with $DELETE set. All set entries are generated and moved to the Output device.
- c. If the process was an UPDATE, then as each set entry is generated, compare its value with the copies stored in core (see 2b above). If the entry matches a core entry, then delete the core entry. If the entry does not match, then save this entry in core as a DELETE entry.
5. SPIRES generates the two sets of core entries if any exist. First $DELETE is cleared and the entries generated from the DEFQ copy of the record are moved to the output device. Next $DELETE is set and the entries generated from the TREE copy of the record are moved to the output device.
6. Proceed to step 1 to access the next record.
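The UPDATE path of the steps above can be sketched in Python (not SPIRES): the defq entries are held "in core" (step 2b), each tree entry is compared against them (step 4c), and whatever survives is output -- inserts first with the delete flag clear, then deletes with it set (step 5). Entries are modelled as plain tuples; the $DELETE flag is modelled as a field in the output.

```python
# A sketch of change generation for a single updated record, following
# the numbered steps in the text.

def process_update(defq_entries, tree_entries):
    core = list(defq_entries)            # step 2b: save defq entries in core
    delete_core = []
    for entry in tree_entries:           # step 4c: compare each tree entry
        if entry in core:
            core.remove(entry)           # matching pair cancels out
        else:
            delete_core.append(entry)    # unmatched old entry => delete
    output = [("insert", e) for e in core]          # step 5: flag clear
    output += [("delete", e) for e in delete_core]  # step 5: flag set
    return output
```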
As the description of the change generation procedure explained earlier [See 9.1.], the changes subfile serves as a staging area for moving data from the SPIRES subfile to the target data base, which is usually one or more tables in another DBMS. From the changes subfile, you may create records in the appropriate format for the target DBMS, or perhaps SQL statements such as INSERT and DELETE, using custom SPIRES formats or the DEFINE TABLE facility.
The changes subfile may be designed however you like, since it is your tool, for your convenience. You may want to place all the changes from one SPIRES subfile in a single changes subfile, or you may want to generate changes into a different changes subfile for each target table. You may decide that the data in the changes subfile is worth keeping in SPIRES for awhile as a transaction log; or you may decide to make it part of a temporary file that exists only for the duration of the change generation process.
One key element that you will want to add to your changes subfile is a flag that signals whether the data of the change record is new data that needs to be inserted into the target DBMS or is old data that needs to be removed from it. Here is one easy way to set a "delete" signal element up.
In the technical details of how SPIRES generates change records, described in the previous section, you may have read about how SPIRES generates change records from the tree copy of removed and updated records in the source subfile with a system variable called $DELETE set; it is not set when change records from the defq copy of new and updated records are generated. Since the records are added into the changes subfile as they are generated, the $DELETE flag remains set, if it was set.
So when the change records for the source record's tree copy are added into the changes subfile, the $DELETE flag is set. In the changes subfile, you have created an element whose definition looks like this:
ELEM = DELETE; OCC = 1; LEN = 0; INPROC = A130;
Action A130, an Inclose rule, is designed to test the value of $DELETE; if it is set, then A130 assigns a zero-length value to the element. If it is not set, no value is assigned, and hence the element does not occur. That means that change records generated from the tree copy of a record from the source subfile will have an occurrence of the DELETE element; those generated from the defq copy will not. Whatever you use to format the change records for input to the target DBMS can thus use the DELETE element as a signal for whether the change represents data to be added to the DBMS or removed from it.
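The effect of action A130 can be sketched in a few lines of Python (not SPIRES); the function name is invented for illustration, and the deletion-phase flag stands in for $DELETE.

```python
# A sketch of the A130 behavior described above: during the deletion
# phase ($DELETE set), assign one zero-length value so the DELETE element
# occurs; otherwise assign nothing, so the element is absent.

def build_delete_element(delete_phase):
    """Return the DELETE element's occurrences for one change record."""
    if delete_phase:
        return [""]      # one zero-length occurrence: data is to be removed
    return []            # no occurrence: data is to be inserted
```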
Because you can test the $DELETE flag yourself either in the source subfile's output format (the one that creates input for the changes subfile) or in Userprocs called from INPROC strings of the elements in the changes subfile, you may develop your own way to create a flag element like this one.
A new SPIRES facility called XEQ DATA is being made available as a means of offering applications programmers a radically new method of control for certain applications. It is believed that this technique can result in greatly simplified code as well as a much more efficient processing environment for these applications.
This new technique is probably most useful to those applications that use meta-data records to hold the basic information controlling the flow of the application program. There may be other useful opportunities for applying XEQ DATA concepts, but these have not yet been examined.
XEQ DATA processing forces you to think in new ways about the flow of control in an application: the meta-data record itself, along with its structures and data values, is the primary means of control. This type of processing, by the way, is not new to SPIRES; the SPIRES Compiler has used this technique to compile meta-data records ever since its inception.
This document will describe the various components brought together to provide this new service -- the MSEMPROC and XSEMPROC actions which provide the control, and the XEQ DATA commands which have been built to trigger the processing.
[See 10.1.]
A number of SPIRES applications have already been built that utilize a meta-data subfile to hold information that is used to control the flow of the application or to control a part of an application. These records might contain structures that have data pertinent to a particular individual or to a certain type of activity that might be performed. The data can be quite complex and variable in its structure, and it has to be extracted from the record by a SPIRES format and stored into multi-dimensioned arrays before the program can begin to operate on its contents.
XEQ DATA processing eliminates the need for the extracting format and the multi-dimensioned Global Vgroups through the use of XSEMPROC actions which are stored with the database itself and dictate how the data is to be used and which portion of the application is to execute and in what order.
Two new data elements must be coded in a meta-data record definition to produce these results. The MSEMPROC element is to be coded in the record level structure and XSEMPROC elements are coded in lower level structures.
MSEMPROC and XSEMPROC values are coded in the same manner as INPROC and OUTPROC values, in that they are made up of strings of Action codes with P1, P2 and P3 values. These actions generally name a data element of the meta-data record and specify how the data values are to be used to control the application.
A later section [See 10.4.] describes the Actions that are available for this purpose. That description, along with the XEQ DATA example given in [See 10.5.] and [See 10.6.], should provide enough information to convey the general idea of how this feature is used.
[See 10.2.]
You can tell SPIRES that a particular subfile is to be used in the Xeq Data process by issuing the command:
SET XEQDATA (Subfile-name)
This command is similar to the SET XEQ command in that the subfile designated (or the currently selected subfile, if no name is given) is set aside for this particular use. Note that only one XEQDATA subfile can be in effect at a time; the SET XEQDATA command clears out any preceding subfile set for XEQ DATA purposes.
You may clear out an Xeq Data subfile in the same manner as you clear out the XEQ subfile.
SET NOXEQDATA
[See 10.3.]
You activate the XEQ DATA process by issuing the XEQ DATA command. This command has the following form:
XEQ DATA Key-value
where the given Key-value specifies the key of a meta-data record within the current XEQDATA subfile. This command should be issued within a protocol, which normally has locally defined variables and labels that are referenced by the MSEMPROC / XSEMPROC actions. The XEQDATA subfile is called "tcontrol" in these examples. [See 10.5.]
The flow of control at this point is as follows:
1. SPIRES accesses the XEQ DATA subfile and reads the record whose key is given in Key-value.

2. The record level MSEMPROC data is accessed and the flow of control is dictated by the successive actions specified. An action may specify that a value of the record is to be picked up and stored in a currently allocated Vgroup. It may direct that a structure is to be entered, in which case processing continues based upon the values of the XSEMPROC actions that were coded for that structure. If an action specifies that execution is to proceed at a label in the currently executing protocol, then activity continues at that label, where a subsequent RETURN will enable the program to return to XEQ DATA control, resuming execution based upon the next value in the data record.

3. XEQ DATA processing will continue until the data under control of MSEMPROC / XSEMPROC actions has "run out". Your application may wish to cause a premature halt to this activity, however, because a serious error has been detected. This may be done by issuing the command:

      SET XDATA END

   This sets the $XEQDATAEND condition. When your protocol RETURNs, XEQ DATA detects the call to stop the activity and continues with subsequent steps that follow the XEQ DATA call. The control for this process then would follow this pattern:

      Select tcontrol
      set xeq data
      /xeq data #DataKey
      if $No then Jump Error
      if $XEQDATAEND Then Jump Abnormal.End
      ... continue ....
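A loose analogy for this flow can be sketched in Python (not SPIRES): the meta-data record drives execution, either storing values into variables (much as A104 stores into a Vgroup) or transferring control to a labelled handler that returns to the dispatcher (much as A103 jumps to a protocol label). A stop flag plays the role of SET XDATA END. The action names and record layout here are invented for illustration.

```python
# A hedged sketch of data-driven dispatch in the spirit of XEQ DATA:
# the record, not the program, dictates the order of operations.

def xeq_data(record, handlers):
    vars, stop = {}, [False]
    def set_end():
        stop[0] = True                   # analogue of SET XDATA END
    for action, name, value in record:
        if stop[0]:
            break                        # analogue of testing $XEQDATAEND
        if action == "store":            # pick up a value into a variable
            vars[name] = value
        elif action == "call":           # continue at a label, then return
            handlers[name](vars, set_end)
    return vars
```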
[See 10.4.]
Action A101 :P1, P2 , P3
If the data element (P2) exists and is a structure element then the structure will be entered. The value in P3 will be ignored.
Action A102, P2
The structure named is reopened in order to proceed to the next XsemProc action. This action generally precedes Action A101:1 above. If the P2 data element name in Action A101 is within the current structure then you may code A102:0.
Action A103, P2
Protocol execution continues at the label (P2) within the protocol. Execution is similar to an XEQ PROC, and the executed code should RETURN.
Action A104 :P1, P2 , P3
Note: If the element's Outproc rules include the $BUILD PROC (Action A82) that action is ignored when creating the external form of the element. Each occurrence of the element will be moved to the array.
Action A105 :P1, P2
[See 10.5.]
The following record definition is similar to the Declare Output Control description. This will be used to show how multiple reports can be output and controlled by the data. A sample record from the XEQDATA subfile is also shown.
A. Subfile Record Definition (FILEDEF/RECDEF) for the XEQDATA subfile
RECORD-NAME = REC1;
REQUIRED;
  KEY = ID;
    INPROC = A30/ AX53:4/ AS23,8/ AS49:1,' '/ A123,*OUT.CONTROL,1;
    OUTPROC = A53:7;
OPTIONAL;
  ELEM = SUBFILE; OCCURS = 1;
    INPROC = A30/ A40;
  ELEM = PARTS; TYPE = STRUCTURE;
    INPROC = AS138:1;

STRUCTURE = PARTS;
  KEY = OUTPUT;
  ELEM = FORMAT; OCCURS = 1;
  ELEM = USING.FRAME; OCCURS = 1;
    INPROC = "A51/ AS23:0,1/ AS49:1,' ''"":,/<(|)>;&~=@'/ AW22:1,16/ A30/ AS140,FORMAT";
    ALIASES = USING;
  ELEM = PARMS;
    INPROC = A123,,1;
  ELEM = AREA; OCCURS = 1;
    INPROC = "A44,' ',/ AS49:1,'"",/<|>;&~='/ A30";
  ELEM = OPTIONS; OCCURS = 1; LENGTH = 1;
    INPROC = A30/ A44,' ',/ A48, CONTINUE, CON, CONT, CON, APPEND, CON, CLEAR, CLR/ AS48, REPORT,1, CON,2, CLEAN,3, CLR,4/ AS50:1,1;
    OUTPROC = A50,1/ A48, 1,REPORT, 2,CONTINUE, 3,CLEAN, 4,CLEAR;
  ELEM = WHERE; OCCURS = 1;
    INPROC = A51;
  ELEM = FILTER;
    INPROC = A51;
  XSEMPROC = A101,Output,New.Report/ A104,Format,FormatName/ A104:2,Parms,ParmCnt/ A104:1,Parms,ParmVals/ A104,Using.Frame,FrameName/ A104,Area,AreaName/ A104,Options,Options/ A104,Where,WhereVal/ A104:2,Filter,FilterCnt/ A104:1,Filter,FilterVals/ A103,Process.Report;

MSEMPROC = A101,ID,Start.Report/ A101,Subfile,Select.subfile/ A101,Parts,/ A103,End.Report;
B. Sample Meta-Data record within the XEQDATA subfile
ID = GQ.WCK.OUTC;
SUBFILE = TAG1;
OUTPUT = 1;
  FORMAT = $prompt;
  PARMS = id name city;
  AREA = AREAX;
  FILTER = for s1 where l str T;
  FILTER = for xx where xx str x;
OUTPUT = 2;
  FORMAT = test;
  USING.FRAME = OA;
  PARMS;
  AREA = AREAY;
  FILTER = for s1 where l str T;
  FILTER = for name where name str bill;
OUTPUT = 3;
  FORMAT = $report;
  PARMS = id;
  PARMS = + name city state;
  PARMS = + zip;
  AREA = AREAX;
  OPTIONS = CONTINUE;
[See 10.6.]
Declare Vgroup Local
  var FormatName; Len 32;
  Var = ParmVals; length 132; occurs = 16; Indexed-By = ParmNum;
  var = ParmCnt; type int;
  Var = ParmNum; Type int;
  Var = FrameName; Len 32;
  Var = AreaName; Len 16;
  Var = Options; Len 32;
  Var = WhereVal; Len 132;
  Var = FilterVals; Len 132; occurs 16; Indexed-By FilterNum;
  Var = FilterCnt; type int;
  Var = FilterNum; type int;
  Var = SetTrace; Type flag;
  Var = Prefix; Len 64;
  Var = ProcessError; Type Flag;
  Var = DataKey; Len 64; Value = '*outc';
Enddeclare

++Init
* Begin Output Control process
If $Ask Then Let DataKey = $Ask     - DataKey defaults to *outc if $Ask is null.
select tcontrol
set xeq data
define area areax(1,80) on file
define area areay(1,80) on file
assign area areax to filex edit,replace
assign area areay to filey edit,replace
/xeq data #DataKey
If $No Then Jump Error
close area areax
close area areay
Return

++Start.Report
/* Begin processing reports for $Parm
Return

++Select.Subfile
/Select $parm
If $No Then Jump Error
Return

++New.Report
/* Begin processing for report $parm
Eval $Vinit(Local)
Return

++Process.Report
If #FormatName Then Begin
  If #ParmVals::0 Then Let ParmVals::0 = ', '#ParmVals::0
  /Set Format #FormatName #ParmVals
  If $No Then Jump Error
  Let ParmNum = 1
  While #ParmCnt > #ParmNum
    If #ParmVals::I Then Begin
      /Set Format * #ParmVals::I
      If $No Then Jump Error
    Endb
    Let ParmNum = #ParmNum + 1
  EndWhile
Endb
If #WhereVal Then Let WhereVal = 'Where '#WhereVal
/For Subfile #WhereVal
If #AreaName Then Let Prefix = 'In ' #AreaName ' '#Options
If #FrameName Then Let Prefix = #Prefix ' Using '#FrameName ' '
/#Prefix Display 10
If $No Then Jump Error
EndFor
Return

++Error
* Processing Error - Reporting terminated.
Let ProcessError = $True
Return

++End.Report
* Finished Processing Reports
Return
A SPIRES feature called Input Control lets you produce SPIRES goal records of a single subfile from multiple streams of input in the form of relational database tables.
The process is designed to create SPIRES record level data elements or structural data element occurrences from multiple "flat files" -- that is, files in the form of relational tables -- made up of multiple "rows", each row consisting of one or more "columns" of information. Additionally, the various streams of "table" input must be ordered and structured in a precise way if reasonable hierarchical SPIRES record structures are to be realized.
In this respect Input Control is much more restrictive than Output Control which may be used to generate multiple streams of output of extremely variable forms. [See 7.]
Input control processing is established through a DECLARE command, DECLARE INPUT CONTROL. Like other declare processes, input control may be established in two ways:
- in a protocol in which the input control declaration is defined; or
- either from command mode or in a protocol, when the input control declaration is a record in a declare data subfile.
The first section of this chapter describes the input control declaration statements; the second describes how to use input control using the WITH INPUT CONTROL prefix. [See 11.1, 11.2.]
Input control is defined by a collection of statements known as "an input control declaration". The input control declaration consists of one or more "input control packets", each of which describes a piece of the input processing to be done. The expectation is that the input stream represented by a particular packet will be tabular in nature and will be read and converted to represent all or a portion of a particular structure or record level occurrence of a SPIRES goal record.
The heart of an input control declaration looks like this:
PACKET = 1st-packet-identifier;
   input-control statements ...

PACKET = 2nd-packet-identifier;
   input-control statements ...
Up to 36 packets may be defined in a single declaration. The input control statements are individually discussed below.
Each PACKET statement signals the start of another input control packet. The identifier value may be anything; there are no restrictions on it. The packets may be defined in any order but will possibly be executed in a different order based upon the structure of the destination goal record.
If you are storing the declaration in a declare input data subfile, you need to add an ID statement at the top of the declaration:
ID = gg.uuu.name;
where "gg.uuu" is your account number and "name" can be any alphanumeric name (it may include periods as well).
On the other hand, to define the input control declaration within a protocol, you need to surround it with the DECLARE INPUT CONTROL and ENDDECLARE commands:
DECLARE INPUT CONTROL
PACKET = 1st-packet-identifier;
   input-control statements ...
ENDDECLARE
Below is a description of each of the possible input control statements in a packet, in the order in which they would be stored in a declare input data subfile. (The order in which you enter them is irrelevant.) Each of the statements is optional, though the occurrence or value of one statement might cause another one to be required.
First, here's a summary list of all the input control statements:
PACKET = packet-identifier;
FORMAT = format-name;
USING.FRAME = frame-name;
PARMS = format-parameters;
AREA = area-name;
TRACE;
CONTROL.OPTIONS = option1, ...;
TABLE.NAME = table-name;
This statement names the format that will be in control during the input for this packet. It will be set at the time the DECLARE statement is executed; startup frames will be executed as normal. However, unless the startup frame does something to call attention to itself (like allocating a global vgroup in shared mode; see CONTROL.OPTIONS below), the format, including the setting of it, is invisible outside of the declaration.
Currently the FORMAT statement is required to be present.
At present, only one format has been utilized for any input control process -- the SPIRES standard $INPUT.CONTROL format. If you choose to write your own format, however, note that the options you would specify on the SET FORMAT command after the format name may not be specified here; you must enter them in PARMS statements (see below).
You can use the USING.FRAME option to name a specific frame to execute within this input packet. The frame must also be defined with a USAGE of NAMED. See the Formats manual, section D.1.1.1 for more information; online, EXPLAIN USING FRAME COMMAND PREFIX.
These multiply occurring statements are the parameters you would normally specify on SET FORMAT (following the format name) and SET FORMAT * commands to set the format from command mode. For example, if you would issue these commands to set the format:
-> set format myformat parms1
-> set format * parms2
then in input control, your packet would include:
FORMAT = MYFORMAT;
PARMS = parms1;
PARMS = parms2;
Here you name the device services area to serve as the source for this packet's input. At present the only device area type that has been utilized is the subfile device (SBF) assigned for input. The data is read directly from a subfile path which has been defined as if it were a Declared Table. You must define and assign the area(s) to be used prior to the input control declaration (see the example below).
This statement, which takes no value, turns on format tracing (SET FTRACE ALL) for the packet. Interleaving of trace data will appear if tracing is requested in more than one packet. You may use the SET TLOG command to send the trace data to a log file rather than your terminal.
You can request the one current option by coding the CONTROL.OPTIONS statement:
- SHARE.VGROUPS -- This option should be specified if you want any vgroups allocated by the format in this packet to be shared by other formats in other packets (they too must have this statement) or by the calling protocol. Hidden vgroups may not be shared.
Input control can invoke tables that have been pre-declared with the DECLARE INPUT TABLE command. [See 17.3.] This tool lets you in effect re-map a SPIRES subfile of tabular data into a set of data elements in the destination SPIRES subfile. In this statement, you name the pre-declared input table you want to use.
Once the input control definition has been declared, you request input control processing by adding the WITH INPUT CONTROL prefix to an input command, for example, INPUT ADD, INPUT ADDMERGE, etc.:
WITH INPUT CONTROL INPUT ADDMERGE
   or
WITH INPUT CONTROL INPUT BATCH
But input control comprises much more than this single command. The setup for any usage of this process requires knowledge of a number of commands and principles.
Input Control processing is in many ways much more restrictive than Output Control processing. Those who are familiar with Output Control are aware of the wide range of possibilities it offers, even in its "hidden" forms -- such as the one utilized by DATA MOVE.
Input Control is, by its very nature, a highly controlled process. The final objective of the process is to create a specifically defined SPIRES goal record from a set of specifically defined SPIRES tables. These tables are read and transformed by a specifically written SPIRES format, which expects the incoming data to be sorted based upon the relationship of structures within the destination goal record definition.
Since data setup and processing are so restrictive, we have built a set of SPIRES commands that should go a long way toward helping you achieve your final goal: building final-form SPIRES records from multiple streams of related RDBMS tables.
The best way to present these commands is in the form of a tutorial showing the building of simple SPIRES database records from multiple tabular data.
We will use a clone of the ubiquitous PATIENTS subfile as the destination subfile. This is a simple file with one multiply occurring structure. All of the data elements except ADDRESS are singly occurring, as shown below. ADDRESS, however, has the ELEMINFO element INPUT-OCC = 2, which will be used when generating Table structures.
> select patient.records
> show elem characteristics
Subfile PATIENT.RECORDS
Sec Occ  Len Type   St/El Element
--- ---- --- ------ ----- -------
Fix Sing   4 Int    00/00 slot PATIENT, POINTER
Req Sing     String 00/01 NAME
Opt Mult     String 00/02 ADDRESS
Opt Sing     String 00/03 PHONE
Opt Sing     String 00/04 INSURANCE
Opt Mult     Struc  00/05 VISIT
Fix Sing   4 Hex    01/00 . DATE
Fix Sing   3 String 01/01 . TCODE
Fix Sing   7 Pack   01/02 . COST
We have set up a database to hold Table definitions that will be used by Input Control:
> select filedef
> display *patient.tables
FILE = gg.uuu.PATIENT.TABLES;
DEFDATE = WED. SEPT. 12, 2001;
MODDATE = WED. SEPT. 12, 2001;
MODTIME = 14:23:56;
BIN = PURGE;
RECORD-NAME = REC01; DEFINED-BY = $TABLE; REMOVED;
RECORD-NAME = REC02; COMBINE = REC01; DEFINED-BY = $INPUT.TABLE; REMOVED;
SUBFILE-NAME = PATIENT_TABLES; GOAL-RECORD = REC01; ACCOUNTS = gg.uuu;
SUBFILE-NAME = PATIENT_INTABLES; GOAL-RECORD = REC02; ACCOUNTS = gg.uuu;
Now we can generate tables that match the structures for this database. Several PERFORM commands are used in the following presentation; for more information, issue EXPLAIN PERFORM TABLE CREATE INPUT COMMANDS.
> perform table create declare subfile patient.records, type spires, options mult, destination patient_tables
The table record statements resulting from this command are shown in Sample 1. [See 11.2.1.]
From the Declared Tables generated for output we can generate a set of Input Tables as follows:
> perform table create input declare subfile patient.records from patient_tables dest patient_intables
The input table records resulting from this command are shown in Sample 2. [See 11.2.2.]
If you compare the Input Table records to the corresponding records for output tables you can see that they are very similar in appearance. The primary difference is that the SOURCE... statements in output tables have become DEST... statements in input tables.
Input Control will use these input table constructs in its work via a system format to extract source column data from the table subfiles to store Patient.records subfile data elements (DEST.ELEMs) based upon any Patient.records structure (DEST.STRUCTURE) information.
Note that it will probably be necessary to modify the output table declaration, the input table declaration, or both, depending upon the characteristics of the destination goal records.
The next task is to construct table subfiles which may be used to describe (and contain) the source data for input. The following command should go a long way toward accomplishing this task, since it builds a record structure based upon the input table declaration.
> perform table create input recdef subfile patient.records from patient_intables
The table record definitions resulting from this command are shown in Sample 3. [See 11.2.3.]
There is another PERFORM TABLE CREATE INPUT command which will generate a protocol that shows the steps needed to actually perform the input. The commands generated must be modified for your particular situation and of course data must be moved into the table subfiles that serve to provide source data.
> perform table create input control subfile patient.records from patient_intables
The protocol statements resulting from this command are shown in Sample 4. [See 11.2.4.]
If you look carefully at the protocol in Sample 4, you may see some commands that are unfamiliar to you. In particular, you have probably not seen this command construct:
/Assign Area Table1 to subfile path $PathNum input
The "input" option is now available for the SBF device processor. This option was developed for this specific use during input control. Also, note that DECLARE INPUT CONTROL utilizes a specific system format $INPUT.CONTROL along with the "share.vgroup" control option. [See 11.1.]
There are also some steps omitted from the sample. Primarily: where did the subfiles "SUBF.PATIENTS" and "SUBF.VISIT" come from?
These subfiles represent the SPIRES view of the tables that are to be read by the WITH INPUT CONTROL INPUT ADDMERGE command. It is up to you to define and populate these subfiles (and rename them if you wish) with data from whatever source you choose. The SUBF table subfiles may be defined by the RECDEFs shown in Sample 3 above. If the tables are RDBMS tables -- say, from the ORACLE or SYBASE relational systems -- then the population can be done through the VIA EXTERNAL FIND command.
We have chosen to simplify this example by populating our tables from the existing PATIENTS data base and we will set up our environment to show the actual activity of input control.
Our record level subfile is constructed as follows:
> select subf.patients
> show elem characteristics
Subfile SUBF.PATIENTS
Sec Occ  Len Type   St/El Element
--- ---- --- ------ ----- -------
Req Sing     Int    00/00 key PATIENT
Opt Sing     String 00/01 NAME
Opt Sing     String 00/02 ADDRESS_1
Opt Sing     String 00/03 ADDRESS_2
Opt Sing     String 00/04 PHONE
Opt Sing     String 00/05 INSURANCE
> for subfile
> display 2
PATIENT = 1;
NAME = Heartburn, Sarah Jean;
ADDRESS_1 = 1414 Ocean Way;
ADDRESS_2 = Homebody, IO 84501;
PHONE = (415) 921-761;
INSURANCE;
PATIENT = 2;
NAME = Sickly, I. M.;
ADDRESS_1 = 15 Hospital Drive;
ADDRESS_2 = Homebody, IO 84501;
PHONE = (814) 555-5135;
INSURANCE = Quib;
>
And our Visit structure subfile looks like this:
> select subf.visit
> show elem char
Subfile SUBF.VISIT
Sec Occ  Len Type   St/El Element
--- ---- --- ------ ----- -------
Fix Sing   8 Struc  00/00 key KEYSTR
Fix Sing   4 Int    01/00 key . PATIENT
Fix Sing   4 Int    01/01 . VISIT.OCC
Opt Sing     String 00/01 DATE
Opt Sing     String 00/02 TCODE
Opt Sing     String 00/03 COST
> for subfile
> dis 3
KEYSTR = 1, 1;
DATE = 05/13/1983;
TCODE = X12;
COST = $69.69;
KEYSTR = 1, 2;
DATE = 05/23/1983;
TCODE = X12;
COST = $69.69;
KEYSTR = 1, 3;
DATE = 04/12/1984;
TCODE = X01;
COST = $69.69;
>
Note that input control will read data from each table subfile in the order that the table records appear sequentially in the subfile. This order is of paramount importance, which is why the RECDEFs have fixed-length structured keys; SPIRES uses this method to arrange the order of input. The number of data elements forming the structured key increases as the depth of the final destination goal record structure increases.
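The role played by the ordered structured keys can be sketched outside of SPIRES. The following is an illustrative Python model only, not SPIRES code: it mimics how input control walks a record-level table stream and a structure-level stream whose (PATIENT, VISIT.OCC) key is kept in ascending order, merging them into nested goal records. The row contents are simplified (only NAME and DATE are carried).

```python
# Illustrative sketch only -- not SPIRES code. It mimics how input
# control consumes two table streams: record-level rows and structure
# rows ordered by a structured key (PATIENT, VISIT.OCC).

from itertools import groupby

# Record-level table (like SUBF.PATIENTS), one row per goal record.
patients = [
    {"PATIENT": 1, "NAME": "Heartburn, Sarah Jean"},
    {"PATIENT": 2, "NAME": "Sickly, I. M."},
]

# Structure table (like SUBF.VISIT); the (PATIENT, VISIT.OCC) key
# must be ascending or occurrences land in the wrong record.
visits = [
    {"PATIENT": 1, "VISIT.OCC": 1, "DATE": "05/13/1983"},
    {"PATIENT": 1, "VISIT.OCC": 2, "DATE": "05/23/1983"},
    {"PATIENT": 2, "VISIT.OCC": 1, "DATE": "01/01/1983"},
]

def build_goal_records(patients, visits):
    """Merge the key-ordered streams into nested goal records."""
    visits = sorted(visits, key=lambda r: (r["PATIENT"], r["VISIT.OCC"]))
    by_patient = {k: list(g) for k, g in
                  groupby(visits, key=lambda r: r["PATIENT"])}
    records = []
    for row in patients:
        rec = dict(row)
        # Attach each visit occurrence, in key order, as a VISIT struct.
        rec["VISIT"] = [{"DATE": v["DATE"]} for v in
                        by_patient.get(row["PATIENT"], [])]
        records.append(rec)
    return records
```

The sketch sorts explicitly for safety; in the SPIRES process the subfile itself must already hold the rows in structured-key order.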
The actual execution of this protocol is shown in Sample 5. [See 11.2.5.]
In order to understand the context of this example [See 11.2.]
These commands show how to generate Declare Table structures for the sample PATIENT.RECORDS database. [See 17.1.]
> perform table create declare subfile patient.records, type spires, options mult, destination patient_tables
> select patient_tables
> for defq
> display all
ID = gg.uuu.PATIENT.RECORDS;
COMMENTS = Table Generation for Record Level Elements;
DEFDATE = WED. SEPT. 12, 2001;
MODDATE = WED. SEPT. 12, 2001;
MODTIME = 14:30:20;
SUBFILE.NAME = PATIENT.RECORDS;
DECLARE.SUBFILE = Data Move Declares;
COLNAME = PATIENT; ISKEY; COLTYPE = INT; COLWIDTH = 7;
COLNAME = NAME; COMMENTS = Required; COLWIDTH = 20;
COLNAME = ADDRESS_1; COMMENTS = Multiple; COLWIDTH = 20; SOURCE.ELEM = ADDRESS; DECLARE.KEY = $VALUE.EXTERNAL; SOURCE.OCC = 1;
COLNAME = ADDRESS_2; COMMENTS = Multiple; COLWIDTH = 20; SOURCE.ELEM = ADDRESS; DECLARE.KEY = $VALUE.EXTERNAL; SOURCE.OCC = 2;
COLNAME = PHONE; COLWIDTH = 15;
COLNAME = INSURANCE; COLWIDTH = 20;
FILE = gg.uuu.PATIENT.RECORDS;
TABLE.NUM = 001;
RDBMS_TABLENAME = patient_records;
ID = gg.uuu.VISIT;
COMMENTS = Table Generation for Structure VISIT Elements;
DEFDATE = WED. SEPT. 12, 2001;
MODDATE = WED. SEPT. 12, 2001;
MODTIME = 14:30:20;
SUBFILE.NAME = PATIENT.RECORDS;
DECLARE.SUBFILE = Data Move Declares;
COLNAME = PATIENT; COMMENTS = REF PATIENT.RECORDS; ISKEY; COLTYPE = INT; COLWIDTH = 7; COLUMN.OPTIONS = Dependent, NoSubtree;
COLNAME = VISIT.OCC; COMMENTS = Occurrence Number; ISKEY; COLTYPE = INT; SOURCE.ELEM = VISIT; DECLARE.KEY = $VALUE.OCC; SOURCE.SINGLE; COLUMN.OPTIONS = Dependent, NoSubtree;
COLNAME = DATE; COMMENTS = Required; COLTYPE = DATE; COLWIDTH = 8; SOURCE.ELEM = DATE; DECLARE.KEY = $DATEOUT.CCYY;
COLNAME = TCODE; COMMENTS = Required; COLWIDTH = 5;
COLNAME = COST; COMMENTS = Required; COLTYPE = PACK;
FILE = gg.uuu.PATIENT.RECORDS;
SOURCE.STRUCTURE = VISIT;
TABLE.NUM = 002;
RDBMS_TABLENAME = visit;
>
In order to understand the context of this example [See 11.2.]
From the Declared Tables generated for output we can generate a set of Input Tables as follows: [See 17.3.]
> select patient_tables
> perform table create input declare subfile patient.records from patient_tables dest patient_intables
> select patient_intables
> for defq
> display all
ID = gg.uuu.PATIENT.RECORDS;
DEFDATE = WED. SEPT. 12, 2001;
MODDATE = WED. SEPT. 12, 2001;
MODTIME = 15:08:36;
SUBFILE.NAME = PATIENT.RECORDS;
DECLARE.SUBFILE = Data Move Declares;
COLNAME = PATIENT; ISKEY; COLTYPE = INT; DEST.ELEM = PATIENT;
COLNAME = NAME; DEST.ELEM = NAME;
COLNAME = ADDRESS_1; COMMENTS = Dest.Occ = 1; COMMENTS = Declare.Key = $VALUE.EXTERNAL; DEST.ELEM = ADDRESS;
COLNAME = ADDRESS_2; COMMENTS = Dest.Occ = 2; COMMENTS = Declare.Key = $VALUE.EXTERNAL; DEST.ELEM = ADDRESS;
COLNAME = PHONE; DEST.ELEM = PHONE;
COLNAME = INSURANCE; DEST.ELEM = INSURANCE;
FILE = gg.uuu.PATIENT.RECORDS;
TABLE.NUM = 001;
RDBMS_TABLENAME = patient_records;
ID = gg.uuu.VISIT;
DEFDATE = WED. SEPT. 12, 2001;
MODDATE = WED. SEPT. 12, 2001;
MODTIME = 15:08:36;
SUBFILE.NAME = PATIENT.RECORDS;
DECLARE.SUBFILE = Data Move Declares;
COLNAME = PATIENT; COMMENTS = Column.Options = Dependent, NoSubtree; ISKEY; COLTYPE = INT; DEST.ELEM = PATIENT;
COLNAME = VISIT.OCC; COMMENTS = Column.Options = Dependent, NoSubtree; ISKEY; COLTYPE = INT; DEST.ELEM = VISIT;
COLNAME = DATE; COMMENTS = Declare.Key = $DATEOUT.CCYY; COLTYPE = DATE; DEST.ELEM = DATE;
COLNAME = TCODE; DEST.ELEM = TCODE;
COLNAME = COST; COLTYPE = PACK; DEST.ELEM = COST;
FILE = gg.uuu.PATIENT.RECORDS;
DEST.STRUCTURE = VISIT;
TABLE.NUM = 002;
RDBMS_TABLENAME = visit;
>
In order to understand the context of this example [See 11.2.]
Record Definitions may be generated from Declare Input Tables.
> perform table create input recdef subfile patient.records from patient_intables
> list
 1. ****
 2. ID = gg.uuu.PATIENT.RECORDS;
 3. COMMENTS = Record Def generated from Input Table of subfile PATIENT.RECORDS;
 4. COMMENTS = File: gg.uuu.PATIENT.RECORDS;
 5. COMMENTS = Rdbms_tablename: patient_records;
 6. MODDATE = THUR. SEPT. 13, 2001;
 7. DEFDATE = THUR. SEPT. 13, 2001;
 8. KEY = PATIENT;
 9. INPROC = $Int;
10. OUTPROC = $Int.Out;
11. COMMENTS = Destination Elem: PATIENT;
12. ELEM = NAME;
13. OCCURS = 1;
14. COMMENTS = Destination Elem: NAME;
15. ELEM = ADDRESS_1;
16. OCCURS = 1;
17. COMMENTS = Destination Elem: ADDRESS;
18. COMMENTS = Destination Occ: 1;
19. ELEM = ADDRESS_2;
20. OCCURS = 1;
21. COMMENTS = Destination Elem: ADDRESS;
22. COMMENTS = Destination Occ: 2;
23. ELEM = PHONE;
24. OCCURS = 1;
25. COMMENTS = Destination Elem: PHONE;
26. ELEM = INSURANCE;
27. OCCURS = 1;
28. COMMENTS = Destination Elem: INSURANCE;
29. ;
30. ****
31. ID = gg.uuu.VISIT;
32. COMMENTS = Record Def generated from Input Table of subfile PATIENT.RECORDS;
33. COMMENTS = File: gg.uuu.PATIENT.RECORDS;
34. COMMENTS = Rdbms_tablename: visit;
35. COMMENTS = Destination subfile structure: VISIT;
36. MODDATE = THUR. SEPT. 13, 2001;
37. DEFDATE = THUR. SEPT. 13, 2001;
38. FIXED;
39. KEY = KEYSTR;
40. LENGTH = 8;
41. TYPE = STRUCTURE;
42. INPROC = $Struc.IN(2);
43. OUTPROC = $Struc.OUT(2);
44. OPTIONAL;
45. ELEM = DATE;
46. OCCURS = 1;
47. COMMENTS = Destination Elem: DATE;
48. ELEM = TCODE;
49. OCCURS = 1;
50. COMMENTS = Destination Elem: TCODE;
51. ELEM = COST;
52. OCCURS = 1;
53. COMMENTS = Destination Elem: COST;
57. STRUCTURE = KEYSTR;
58. FIXED;
59. KEY = PATIENT;
60. LENGTH = 4;
61. INPROC = $Int;
62. OUTPROC = $Int.Out;
63. COMMENTS = Destination Elem: PATIENT;
64. ELEMINFO;
65. WIDTH = 4;
66. ELEM = VISIT.OCC;
67. OCCURS = 1;
68. LENGTH = 4;
69. INPROC = $Int;
70. OUTPROC = $Int.Out;
71. COMMENTS = Destination Elem: VISIT;
72. ELEMINFO;
73. WIDTH = 4;
74. ;
>
In order to understand the context of this example [See 11.2.]
The following command can be used to get you started with the task of running an input control process.
> perform table create input control subfile patient.records from patient_intables
> list
 1.
 2. Select PATIENT.RECORDS
 3.
 4. Thru Next Select PATIENT_INTABLES
 5. /set declare path $pathnum for INPUT TABLES
 6.
 7. Define Area Table1 (1,512) on SBF
 8. Define Area Table2 (1,512) on SBF
 9.
10. - NOTE: Please alter the following commands to reflect the
11. - desired source subfile assignment for each table.
12. - You must change the "Select " commands in the following
13.
14. Thru Next Select SUBF.PATIENT
15. /Assign Area Table1 to subfile path $PathNum input
16. Thru Next Select SUBF.VISIT
17. /Assign Area Table2 to subfile path $PathNum input
18.
19. With Declare gg.uuu.PATIENT.RECORDS Declare Input Table PATIENT.RECORDS
20. With Declare gg.uuu.VISIT Declare Input Table VISIT
21.
22. Declare Input Control
23. Packet = Packet1;
24. Table.Name = PATIENT.RECORDS;
25. Format = $Input.control;
26. Area = Table1;
27. control.options = share.vgroups;
28. Packet = Packet2;
29. Table.Name = VISIT;
30. Format = $Input.control;
31. Area = Table2;
32. control.options = share.vgroups;
33. EndDeclare
34.
35. - Batch the input data based upon the Input Control Statement.
36.
37. With Input Control input Addmerge
38.
39. Close Area Table1
40. Close Area Table2
In order to understand the context of this example [See 11.2.]
The following protocol statement execution demonstrates the final phase of input control. The first two complete goal records are then displayed.
-> spires (w)
-Welcome to SPIRES 01.05 ... if in trouble, try HELP
> set echo
> xeq
-? Select PATIENT.RECORDS
-? Thru Next Select PATIENT_INTABLES
-Path established: 1
-? /set declare path $pathnum for INPUT TABLES
-? Define Area Table1 (1,512) on SBF
-? Define Area Table2 (1,512) on SBF
-? Thru Next Select SUBF.PATIENTS
-Path established: 2
-? /Assign Area Table1 to subfile path $PathNum input
-? Thru Next Select SUBF.VISIT
-Path established: 3
-? /Assign Area Table2 to subfile path $PathNum input
-? With Declare gg.uuu.PATIENT.RECORDS Declare Input Table PATIENT.RECORDS
-? With Declare gg.uuu.VISIT Declare Input Table VISIT
-? Declare Input Control
- Batch the input data based upon the Input Control Statement.
-? With Input Control input Addmerge
- ADD @Line 1. Key = 1
- ADD @Line 2. Key = 2
- ADD @Line 3. Key = 3
- ADD @Line 4. Key = 4
- ADD @Line 5. Key = 5
- ADD @Line 6. Key = 6
- ADD @Line 7. Key = 7
- ADD @Line 8. Key = 8
- ADD @Line 9. Key = 9
- ADD @Line 10. Key = 17
- ADD @Line 11. Key = 62
- Requests/Success: ADD 11 11 SUM 11 11
-End of batch input
-? Close Area Table1
-? Close Area Table2
> for subfile
> display 2
PATIENT = 1;
NAME = Heartburn, Sarah Jean;
ADDRESS = 1414 Ocean Way;
ADDRESS = Homebody, IO 84501;
PHONE = (415) 921-761;
VISIT; DATE = 05/13/83; TCODE = X12; COST = $69.69;
VISIT; DATE = 05/23/83; TCODE = X12; COST = $69.69;
VISIT; DATE = 04/12/84; TCODE = X01; COST = $69.69;
PATIENT = 2;
NAME = Sickly, I. M.;
ADDRESS = 15 Hospital Drive;
ADDRESS = Homebody, IO 84501;
PHONE = (814) 555-5135;
INSURANCE = Quib;
VISIT; DATE = 01/01/83; TCODE = A01; COST = $69.69;
VISIT; DATE = 01/15/83; TCODE = A15; COST = $69.69;
VISIT; DATE = 02/11/83; TCODE = Y32; COST = $69.69;
VISIT; DATE = 02/28/83; TCODE = A01; COST = $69.69;
VISIT; DATE = 03/01/83; TCODE = Z55; COST = $69.69;
VISIT; DATE = 05/15/84; TCODE = X12; COST = $69.69;
VISIT; DATE = 05/25/84; TCODE = N15; COST = $69.69;
The REFERENCE command can be used to bring a record from the selected subfile (or attached file) into main memory. Once there, element values can be accessed and placed into variables with the $GETxVAL functions. The REFERENCE command is also used in Partial FOR processing, which will be explained in the upcoming manual "Partial Record Processing in SPIRES".
The syntax of the REFERENCE command is:
REFERENCE key [NOUPDATE [FILTERED]|RESTRICTED [FILTERED]]
where "key" is the key of the record to be brought into memory from the file. The REFERENCE command can also be used under Global FOR processing, in which cast the syntax is:
REFERENCE [FIRST|*|NEXT|n|LAST] [NOUPDATE|RESTRICTED]
Since only one record can be "referenced" at a time, using the value "n" means that you want to reference the "nth" record from the current one. The options FIRST, *, NEXT, "n" and LAST are standard Global FOR options. If none of them is used, NEXT is assumed, and the next record in the class that fits the WHERE clause criteria, if any, is referenced.
The NOUPDATE option tells SPIRES that the record is being referenced only for data element retrieval and display purposes -- the record will not be updated using partial processing techniques. If secure-switch 10 is set for the subfile, a record cannot, by default, be referenced simultaneously by multiple users; this prevents any possibility of several people updating the same record at the same time. Because a NOUPDATE reference implies no intention to update, SPIRES allows it even if someone else already has the record referenced. Thus, the NOUPDATE option lets you override the locking implied by secure-switch 10 when you do not want to update the record.
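The locking behavior described above can be modeled in a few lines. This is a hypothetical Python sketch, not the SPIRES implementation; the class and method names are invented for illustration. It shows why a NOUPDATE reference is always safe to grant: a read-only reference cannot conflict with a pending update.

```python
# Hypothetical sketch (not SPIRES internals) of the record locking
# implied by secure-switch 10, and how NOUPDATE bypasses it.

class Subfile:
    def __init__(self, lock_on_reference=True):  # secure-switch 10 set
        self.lock_on_reference = lock_on_reference
        self.referenced = {}          # key -> user holding the lock

    def reference(self, key, user, noupdate=False):
        """Bring a record into memory; lock it unless NOUPDATE."""
        holder = self.referenced.get(key)
        if self.lock_on_reference and not noupdate:
            if holder is not None and holder != user:
                raise RuntimeError(f"record {key} referenced by {holder}")
            self.referenced[key] = user   # exclusive: update intended
        return {"key": key, "updatable": not noupdate}

    def clear_reference(self, key, user):
        """Release the record (as CLEAR REFERENCE would)."""
        if self.referenced.get(key) == user:
            del self.referenced[key]
```

In this model a second user's plain REFERENCE of a held record fails, while the same user's NOUPDATE reference succeeds but returns a record that cannot be updated.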
RESTRICTED is similar to NOUPDATE but additionally blocks any partial processing of the record. Only the FOR * command may be used for processing the record. [See 12.6.] RESTRICTED is useful for core management, in that SPIRES does not reserve the space (twice the amount specified by SUPERMAX) it would set aside for record expansion during partial processing.
Both NOUPDATE and RESTRICTED may be followed by the FILTERED option, which applies any filters in effect to the record as it is built in core. [See 21.1.] Thus, the internal memory used for the record can be considerably smaller than might be required for the entire record.
Be aware that SPIRES "builds" the referenced record internally just as it builds a new record being added. If any required elements have been filtered out when the FILTERED option is invoked, an S419 error ("A required data element value was not input for the record being built.") will result.
Once a record is brought into main memory, the $GETUVAL, $GETCVAL, $GETXVAL and $GETIVAL functions can be used to access the unconverted or converted values of any occurrence of any element in the record.
-> select records
-> for tree
+> reference next
+> show eval $GETIVAL(composer)
Bach, Johann Sebastian
+>
See "SPIRES Protocols", or use the EXPLAIN command, for more information about these functions.
While it is possible with whole record modification to add or remove partial data from records, the whole record must be "unpackaged" and presented to the user and then entirely "repackaged" after the modifications are made. This is costly, particularly if only a small part of the overall record is involved. Furthermore, CRTs and other fixed-dimension devices do not have the text editor's practically unlimited capacity; hence, the capability must exist for the presentation and manipulation of records in a piecemeal, or partial fashion.
Global FOR allows linear traversal of a set of records by means of commands such as SKIP, DISPLAY, REMOVE, etc. The set of added records (FOR ADDS) is distinctly different from the set of updated records (FOR UPDATES). Each set of records could be thought of as multiple occurrences of a structural element, and those occurrences constitute a single level of control called the record level. But other levels exist due to the hierarchical nature of records. Structural elements within records form subtrees, and the REFERENCE command provides access to these subtrees by means of "Partial FOR" commands. Such accessing is called "partial record processing" or simply (and more commonly) "partial processing".
At the record level, the following commands establish what is called a "referenced record":
REFERENCE [key-value] [NOUPDATE|RESTRICTED]
REFERENCE [occurrence] [NOUPDATE|RESTRICTED]
GENERATE REFERENCE
The two forms of the REFERENCE command establish a referenced record by retrieving an existing goal record from the data base. (The second form is used in Global FOR processing.) If the NOUPDATE option is given, the UPDATE command (shown below) is blocked, and secure-switch 10 record locking is not performed. RESTRICTED is the same as NOUPDATE, except that in addition, partial processing commands are blocked; only FOR * processing is allowed under RESTRICTED. [See 12.6.]
The GENERATE REFERENCE command establishes an empty referenced record.
At the record level, a referenced record may be returned to the data base (updating the deferred queue) by the following commands:
UPDATE [key-value] [CLEAR]
ADD [CLEAR]
The key-value in the UPDATE command is not needed if the referenced record was established by a REFERENCE command, and the key in the record being updated is the same as the original key.
The CLEAR option on UPDATE or ADD may be either CLEar or CLR. If CLEAR is not specified, the referenced record is retained. This allows the user to do further modification of the referenced record, including its key, so that another ADD or UPDATE can be done using the same basic referenced record. When CLEAR is specified, and the UPDATE or ADD command succeeds, the referenced record is released.
A referenced record may be released at any time by the command:
CLEAR REFERENCE
Once a referenced record is established, Partial FOR commands may refer to data elements, beginning with those defined at the record level; and once Partial FOR has chosen an element, other partial processing commands can be used to manipulate the chosen element. One of those partial processing commands is another form of the REFERENCE command, and if Partial FOR has chosen a structural element, the referencing of an existing occurrence of that structure causes Partial FOR to then refer to the data elements defined for that structure. Therefore, the structural hierarchy of a referenced record may be traversed by nested pairings of Partial FOR against a structural element, and REFERENCE of an existing occurrence of that structure.
The general form of the Partial FOR command is:
FOR element [WHERE clause]
The "FOR element" command provides the basic mechanism for all other partial processing. It specifies either a structural or simple data element, and the optional WHERE clause can specify criteria to be applied in locating occurrences of that element. If the element is a simple data element, the WHERE clause may only refer to that element. Therefore, the special form
FOR element relational-operator value
may be used as a shorthand for
FOR element WHERE element relational-operator value
Note that for a structural element, the WHERE clause may refer to any element contained at any level within the structure.
Partial FOR may only refer to a particular set of data elements, those at the last referenced structural level. If no Partial FOR commands are in effect for a referenced record, Partial FOR may refer to only record level elements. The SHOW REFERENCE ELEMENTS command lists all the primary element names that may be specified in a subsequent Partial FOR command. The form of the command is:
SHOW REFERENCE ELEMENTS
As stated earlier, it is possible to move down several levels in the structural hierarchy of a referenced record by nested pairings of Partial FOR and REFERENCE against structural elements.
The ENDFOR command is used to back up one or more levels. When used with Partial FOR, the general form of this command is:
ENDFOR [element]
ENDFOR cancels Partial FOR commands, and if no element name is given, the Partial FOR associated with the current level is cancelled (moving back up one level). Otherwise, the specified element name must correspond to one of the active Partial FOR commands, and all levels up through that Partial FOR are cancelled.
A condition known as a "referenced state" exists immediately following either the ENDFOR of a Partial FOR, or the REFERENCE of an existing occurrence of a structural element specified by Partial FOR, or the establishment of a referenced record by the record-level REFERENCE or GENERATE REFERENCE command. When a referenced state exists, Partial FOR commands may only refer to data elements at the next level in the hierarchy. Otherwise, Partial FOR commands may replace themselves at the current level since they may only refer to elements at that level. Therefore, a referenced state shifts by one level the allowed set of data elements that Partial FOR may specify. A Partial FOR command issued when a referenced state exists moves to the next level in the hierarchy; but when a referenced state does not exist, Partial FOR commands remain at the current level.
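The level-shifting rule above is dense, so here is a sketch of the bookkeeping it implies. This is an illustrative Python model, not SPIRES internals; the class and method names are invented. A cursor holds a stack of active Partial FORs plus a "referenced state" flag; when the flag is set, the next Partial FOR descends one level, and when it is clear, the next Partial FOR merely replaces the FOR at the current level.

```python
# Illustrative model (not SPIRES code) of the "referenced state" rule
# governing whether a Partial FOR descends a level or stays put.

class PartialForCursor:
    def __init__(self):
        # As if a record-level REFERENCE or GENERATE REFERENCE was
        # just issued: a referenced state exists, no Partial FORs yet.
        self.fors = []          # active Partial FOR elements, by level
        self.referenced = True

    def partial_for(self, element):
        if self.referenced:
            self.fors.append(element)   # descend to the next level
            self.referenced = False
        else:
            self.fors[-1] = element     # replace FOR at this level

    def reference_occurrence(self):
        # REFERENCE of a structure occurrence re-establishes the state.
        self.referenced = True

    def endfor(self):
        # Cancel the Partial FOR at the current level; the enclosing
        # occurrence's referenced state holds again.
        self.fors.pop()
        self.referenced = True

    def level(self):
        return len(self.fors)
```

Running the model through the traversal pattern of the text (FOR S; REFERENCE; FOR T; ENDFOR) shows the descent and ascent, and shows that two consecutive FORs without an intervening REFERENCE stay at one level.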
These commands provide manipulative control over the element named in the preceding FOR command:
FOR **
FOR * [WHERE clause]
SKIP [occurrence] [END clause]
REFERENCE [occurrence] [END clause]
DISPLAY [range] [END clause]
TRANSFER [range] [END clause] [CLEAR]
UPDATE [range] [USING line-range] [END clause]
MERGE [range] [USING line-range] [END clause]
REMOVE [range] [END clause]
ADD [AFTER] [USING line-range]
ADD BEFORE [USING line-range]
SET REFERENCE ELEMENTS [element-list]
INCLOSE [range] [END clause]
The optional "occurrence" or "range" specifies which occurrence (or occurrences) of an existing FOR element should be processed. If the FOR command specified a WHERE clause, then only occurrences which meet the criteria are considered for processing, although every existing occurrence in any range is examined. Any WHERE clause is ignored for the ADD commands.
The allowed occurrence or range specifications are:
* the "current" occurrence, which is the one last processed by a preceding command. FIRST the first occurrence LAST the last occurrence NEXT the next occurrence. n a specific number (not 0) starting with NEXT. REMAINING those remaining (NEXT through LAST inclusive). REST (same as REMAINING) ALL all occurrences (FIRST through LAST inclusive).
When a "FOR element" command is issued, the following conditions exist: the NEXT occurrence is the FIRST; no occurrences have been processed, so the * specification is not allowed; and a referenced state does not exist. The FOR ** command reinstates these conditions. It cancels any referenced state, and sets NEXT equal to FIRST. On the other hand, the FOR * command retains any established referenced state, and does not reset NEXT back to FIRST. It only eliminates any previous WHERE clause, and establishes a new WHERE clause if one is given with the command. FOR * allows you to change WHERE clauses in the middle of processing.
For most partial processing commands, if no occurrence or range is specified, NEXT is assumed. The * occurrence or range specification is not allowed following any FOR command until some other occurrence specification has been given (including the default NEXT). It also cannot be used if the last occurrence processed was a removed occurrence (REMOVE, MERGE, and UPDATE can do "removal").
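The occurrence and range rules above can be condensed into a small resolver. The following Python function is an illustrative model of the stated rules only, not SPIRES code; its name and signature are invented. It maps a specification, the cursor's NEXT position, and the number of existing occurrences to a list of 1-based occurrence numbers.

```python
# Illustrative resolver (not SPIRES code) for the occurrence/range
# specifications: *, FIRST, LAST, NEXT, n, REMAINING, REST, ALL.

def resolve_range(spec, next_occ, last, current=None):
    """Return the 1-based occurrence numbers a spec selects.

    next_occ -- the cursor's NEXT position (1-based)
    last     -- the number of existing occurrences
    current  -- the last occurrence processed, or None if none yet
    """
    if spec == "FIRST":
        return [1]
    if spec == "LAST":
        return [last]
    if spec == "NEXT":
        return [next_occ] if next_occ <= last else []
    if spec in ("REMAINING", "REST"):
        return list(range(next_occ, last + 1))
    if spec == "ALL":
        return list(range(1, last + 1))
    if spec == "*":
        if current is None:
            raise ValueError("* not allowed: no occurrence processed yet")
        return [current]
    n = int(spec)                  # a count (not 0), starting with NEXT
    if n == 0:
        raise ValueError("0 is not a valid occurrence count")
    return list(range(next_occ, min(next_occ + n - 1, last) + 1))
```

For example, with five occurrences and NEXT at 3, REMAINING resolves to occurrences 3 through 5, while the count 2 resolves to occurrences 3 and 4.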
The optional END clause provides command processing whenever no occurrence can be found to process. The general form of the END clause is:
END [=] command
where "command" is either a single command verb like RETURN or is an enclosed string like 'JUMP LABEL'.
Whenever an occurrence cannot be found, an END condition is signaled. Such a condition is cleared by an END clause, by an ENDFOR command if in XEQ mode, or else automatically. If an END condition is not cleared, no other partial processing or FOR commands are accepted until an ENDFOR command is processed.
The partial processing commands operate as follows:
When there is no referenced record, the UPDATE command may also include the WITH DATA option, to specify that the data to be used is included on the UPDATE command. [See 25.]
When there is no referenced record, the MERGE command may also include the WITH DATA option, to specify that the data to be used is included on the MERGE command. [See 25.]
When there is no referenced record, the ADD command may also include the WITH DATA option, to specify that the data to be used is included on the ADD command. [See 25.]
Both UPDATE and MERGE have unique capabilities in Partial Processing. The first thing these commands do is input one or more occurrences of the Partial FOR element, possibly via a FRAME-TYPE=STRUCTURE frame with proper USAGE and DIRECTION. These input occurrences usually match the range of occurrences processed by these commands on a one-for-one basis. The first occurrence found to process (according to any WHERE clause criteria) is occurrence number one, the second is occurrence number two, etc. However, it is possible to number the input occurrences such that they don't always match with the processed occurrences. Consider the following:
Levels down -->
  KEY
  A          This chart depicts a sample record hierarchy.
  S          KEY, A, and S are record-level elements, and
  . T        S is a structural element consisting of T, V,
  . . U      and H elements.  All the element names that
  . . . B    begin with letters from the first part of the
  . . . C    alphabet are simple elements, and those from
  . . D      the last part of the alphabet are structural.
  . V
  . . E      There are four levels shown, and structures
  . . W      have at least two elements defined at their
  . . . F    structural level.  Thus, structure V consists
  . . . G    of elements E and W at the next level.
  . H
A sample record might look like the following:
KEY = 1; A(1) = A1; A(2) = A2;
S(1);
  T(1);
    U(1); B(1) = B1111; B(2) = B1112;
    U(2); B(1) = B1121; B(2) = B1122;
    D(1) = D111; D(2) = D112;
  T(2);
    U(1); B(1) = B1211; B(2) = B1212;
    U(2); B(1) = B1221; B(2) = B1222;
    D(1) = D121; D(2) = D122;
  V(1);
    E(1) = E111; E(2) = E112;
    W(1); F(1) = F1111; F(2) = F1112;
    W(2); F(1) = F1121; F(2) = F1122;
  V(2);
    E(1) = E121; E(2) = E122;
    W(1); F(1) = F1211; F(2) = F1212;
    W(2); F(1) = F1221; F(2) = F1222;
  H(1) = H11; H(2) = H12;
S(2);
  T(1);
    U(1); B(1) = B2111; B(2) = B2112;
    U(2); B(1) = B2121; B(2) = B2122;
    D(1) = D211; D(2) = D212;
  T(2);
    U(1); B(1) = B2211; B(2) = B2212;
    U(2); B(1) = B2221; B(2) = B2222;
    D(1) = D221; D(2) = D222;
  V(1);
    E(1) = E211; E(2) = E212;
    W(1); F(1) = F2111; F(2) = F2112;
    W(2); F(1) = F2121; F(2) = F2122;
  V(2);
    E(1) = E221; E(2) = E222;
    W(1); F(1) = F2211; F(2) = F2212;
    W(2); F(1) = F2221; F(2) = F2222;
  H(1) = H21; H(2) = H22;
All elements are shown with an occurrence number in parentheses. That was done by declaring PRIV-TAG numbers on each element, and then assigning those numbers to either CONSTRAINT or NOUPDATE. To conserve space, multiple occurrences of the simple elements were placed on the same line. The values of the simple elements show the complete occurrence path leading to them. Thus, the value B1212 represents the occurrence of B given by the path: S(1); T(2); U(1); B(2);.
Now consider the following sequence of commands:
REFERENCE 1;    establishes referenced record.
FOR S;          we wish to process occurrences of S.
REFERENCE;      NEXT = FIRST by default at this point.
FOR T;          we wish to process T's in 1st S.
TRANSFER ALL;   T's in 1st S to the ACTIVE file.
UPDATE;         same occurrences processed by TRANSFER.
The output by TRANSFER might look like the following:
T(1); U(1); B(1) = B1111; B(2) = B1112;
      U(2); B(1) = B1121; B(2) = B1122;
      D(1) = D111; D(2) = D112;
T(2); U(1); B(1) = B1211; B(2) = B1212;
      U(2); B(1) = B1221; B(2) = B1222;
      D(1) = D121; D(2) = D122;
If everything in T is updateable, then the UPDATE command would "replace" these same two occurrences by whatever is input as the replacement. But consider the following input:
T(2); U(2); B(2) = B<1222>; B(3) = B<1223>;
      D(2) = D<122>;
T(3); U(1); B(1) = B<1311>;
The occurrence numbers shown in the input don't match with those of the occurrences to be processed. T(1) does not occur in the input, so the first occurrence processed will be "removed". T(2) refers to the second occurrence, so that occurrence will be "replaced". T(3) now comes along, and there are no more occurrences to process since we are only updating two (the same ones found by TRANSFER ALL). This and any other extra input occurrences are considered an "addition" to be inserted immediately following the last occurrence processed. Following the UPDATE, if we requested TRANSFER ALL again, we'd get:
T(1); U(1); B(1) = B<1222>; B(2) = B<1223>;
      D(1) = D<122>;
T(2); U(1); B(1) = B<1311>;
On the surface, it seems as though the UPDATE caused all the occurrences to simply be renumbered. But that would not necessarily always be the case. If there had been three or more occurrences of the T structure, and a WHERE clause on the FOR T command chose non-adjacent occurrences to process, then "removal" of the first and "replacement" of the second could cause a non-processed occurrence to fall between the two processed occurrences. Here is an example, where the original record contains:
T(1); D(1) = 111;
T(2); D(1) = 121;
T(3); D(1) = 131;
T(4); D(1) = 142;
to be processed by:
FOR T WHERE NOT D STRING 2
UPDATE ALL     ;which chooses T(1) and T(3) only.
with input of:
T(2); D(1) = <122>; D(2) = <123>;
T(3); D(1) = <132>;
which would result in the following:
FOR T          ;no WHERE clause, so we can see all occurrences.
DISPLAY ALL
T(1); D(1) = 121;
T(2); D(1) = <122>; D(2) = <123>;
T(3); D(1) = <132>;
T(4); D(1) = 142;
This example serves to illustrate a couple of points. First, notice that none of the final occurrences meet the original WHERE clause criteria. Second, close examination of the result shows that the original T(1) is gone, and that what was the old T(2) is now T(1). The old T(3) is also gone, with the new T(2) taking its place. The new T(3) is an "addition", followed by the old T(4).
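The matching logic behind UPDATE can be summarized with a small simulation. The Python sketch below is hypothetical code, not part of SPIRES; the names partial_update, processed_idx, and input_occs are invented for illustration. It treats the chosen occurrences as 0-based positions in the occurrence list and the input as a mapping from input occurrence number to new content:

```python
def partial_update(occurrences, processed_idx, input_occs):
    """Hypothetical simulation of UPDATE occurrence matching.

    occurrences   -- list of occurrence values (e.g. the T structures)
    processed_idx -- 0-based positions chosen by FOR/WHERE, in order;
                     the n-th entry is "occurrence n" as the input sees it
    input_occs    -- dict: input occurrence number (1-based) -> new value
    """
    result = list(occurrences)
    removals = []
    for n, pos in enumerate(processed_idx, start=1):
        if n in input_occs:
            result[pos] = input_occs[n]      # input present: "replacement"
        else:
            removals.append(pos)             # input absent: "removal"
    # input numbers beyond the process range become "additions",
    # inserted immediately following the last processed occurrence
    extras = [v for n, v in sorted(input_occs.items())
              if n > len(processed_idx)]
    insert_at = processed_idx[-1] + 1
    for offset, value in enumerate(extras):
        result.insert(insert_at + offset, value)
    for pos in sorted(removals, reverse=True):
        del result[pos]                      # do removals last
    return result
```

Running this on the four-occurrence example reproduces the DISPLAY ALL result: the old T(1) is removed, the old T(3) is replaced, and the extra input occurrence is inserted after the last processed position.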
In all of the examples thus far there has been an assumption that "everything in T is updateable". But what would happen if an occurrence of T contained non-updateable or invisible elements? The process of "removal" would not eliminate such an occurrence, but would only remove all updateable information, leaving just the non-updateable or invisible portion. Likewise, a "replacement" operation would merge the non-updateable or invisible portion of the old occurrence into the input replacement. All non-updateable material is dropped from the input data before the input is used for either "replacement" or "addition". The basic rule for UPDATE is that all updateable material in the original occurrences is discarded, and the input supplies new updateable material, either as a "replacement" or "addition".
For both UPDATE and MERGE processing, if input occurrences of the current Partial FOR element are not numbered, they are simply considered to be sequentially numbered from 1 on up. Thus, the following are equivalent inputs:
T; D = <12>;                T(1); D(1) = <12>;
T; D = <3>; D = <4>;        T(2); D(1) = <3>; D(2) = <4>;
T; D(2) = <5>;              T(3); D(2) = <5>;
Warning: Do not mix numbered and unnumbered occurrences of the Partial FOR element in a single input. In the example above, T was either always numbered or unnumbered. The same warning applies to multiple occurrences of each element that occurs within any particular structural occurrence. In the example above, it would be improper to have something like D(2) = <3>; D = <4>; within the second occurrence of T. Processing results are unpredictable when such mixtures occur.
It's important to realize that the processed occurrences are always considered to be numbered beginning with 1, regardless of where the range begins within the set of actual occurrences. For example, if there were ten actual occurrences of T, and six of them met WHERE clause criteria, then:
FOR T WHERE <criteria that finds 6 out of 10>
TRANSFER 3;    these would be numbered T(1), T(2), T(3).
TRANSFER 3;    and so would these next three!
Although each range of occurrences begins at a different place within the set of all occurrences, the first occurrence processed by a range is always numbered 1, the second is numbered 2, etc. This rule applies to all commands that specify a Range to process: DISPLAY, TRANSFER, UPDATE, MERGE, and REMOVE. If the input to UPDATE or MERGE has occurrences that fall beyond the process range, then those occurrences are considered "additions" to be placed immediately following the last processed occurrence of the range. All other input corresponds to occurrences within the process range. If an occurrence in the process range doesn't have corresponding input, the UPDATE command recognizes that as a signal to do "removal" processing against that occurrence, while the MERGE command simply "skips" that occurrence, leaving it unchanged.
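The renumbering rule can be pictured as chopping the matching occurrences into successive ranges, each numbered from 1. The following Python sketch is hypothetical (process_ranges is an invented name), but it mirrors how two consecutive TRANSFER 3 commands would each see occurrences numbered 1 through 3:

```python
import itertools

def process_ranges(occurrences, n):
    """Hypothetical sketch: yield successive Ranges of n occurrences,
    each renumbered from 1, the way repeated TRANSFER n commands
    would see them."""
    it = iter(occurrences)
    while True:
        chunk = list(itertools.islice(it, n))
        if not chunk:
            return
        # numbering restarts at 1 for every range
        yield list(enumerate(chunk, start=1))
```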
The basic rule for MERGE is that updateable material in the original occurrences is retained, unless the input supplies new updateable material, either as a "replacement" or "addition". MERGE can also indicate "removal" by specifying a negative occurrence number in the input. Consider the first example given for UPDATE, but this time for MERGE:
REFERENCE 1;    establishes referenced record.
FOR S;          we wish to process occurrences of S.
REFERENCE;      NEXT = FIRST by default at this point.
FOR T;          we wish to process T's in 1st S.
TRANSFER ALL;   T's in 1st S to the ACTIVE file.
MERGE;          same occurrences processed by TRANSFER.
The output by TRANSFER might look like the following:
T(1); U(1); B(1) = B1111; B(2) = B1112;
      U(2); B(1) = B1121; B(2) = B1122;
      D(1) = D111; D(2) = D112;
T(2); U(1); B(1) = B1211; B(2) = B1212;
      U(2); B(1) = B1221; B(2) = B1222;
      D(1) = D121; D(2) = D122;
Again assume that everything in T is updateable, and that the input for the MERGE command was the same as shown for UPDATE:
T(2); U(2); B(2) = B<1222>; B(3) = B<1223>;
      D(2) = D<122>;
T(3); U(1); B(1) = B<1311>;
The result would be:
T(1); U(1); B(1) = B1111; B(2) = B1112;
      U(2); B(1) = B1121; B(2) = B1122;
      D(1) = D111; D(2) = D112;
T(2); U(1); B(1) = B1211; B(2) = B1212;
      U(2); B(1) = B1221; B(2) = B<1222>; B(3) = B<1223>;
      D(1) = D121; D(2) = D<122>;
T(3); U(1); B(1) = B<1311>;
The first occurrence of T was left unchanged. Within the second occurrence, selective values of B and D were replaced or added. T(3) was an "addition" just like in the UPDATE example.
Under MERGE processing, element(-n); does "removal" processing for the n-th occurrence of the specified element. If the input had specified things like B(-2); or D(-1); then the selected occurrence (positive equivalent) would be removed. That would even be true for the Partial FOR element itself; thus if the input had been only T(-1); the result of a command like MERGE LAST would be the same as if REMOVE LAST had been done.
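For comparison, the MERGE matching logic differs from UPDATE only in how missing and negative input occurrence numbers are treated. This hypothetical Python sketch uses invented names; note that real MERGE merges element values within an occurrence, while the sketch replaces the occurrence wholesale:

```python
def partial_merge(occurrences, processed_idx, input_occs):
    """Hypothetical simulation of MERGE occurrence matching.

    Unlike UPDATE, an occurrence with no corresponding input is
    skipped (left unchanged); a negative occurrence number in the
    input, element(-n);, requests "removal".
    """
    result = list(occurrences)
    removals = []
    for n, pos in enumerate(processed_idx, start=1):
        if -n in input_occs:
            removals.append(pos)             # element(-n); means "removal"
        elif n in input_occs:
            result[pos] = input_occs[n]      # "replacement"
        # no input for occurrence n: skipped, left unchanged
    extras = [v for n, v in sorted(input_occs.items())
              if n > len(processed_idx)]
    insert_at = processed_idx[-1] + 1
    for offset, value in enumerate(extras):
        result.insert(insert_at + offset, value)
    for pos in sorted(removals, reverse=True):
        del result[pos]
    return result
```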
Finally, the * Occurrence or Range specification is invalid following an UPDATE or MERGE if the last occurrence processed was totally removed and no "additions" occurred. If "additions" occur, the last one added defines the * position. Otherwise, the last processed occurrence (not removed) defines the * position.
The following command provides useful information concerning partial processing:
SHOW LEVELS
The SHOW LEVELS command shows the Processed and Examined counts for Global FOR (if applicable) and each nested Partial FOR. Each Partial FOR also shows the corresponding FOR element name.
The Processed and Examined counts vary according to the following rules. The "FOR *" command sets the Processed count to zero, but leaves the Examined count unchanged. The "FOR element" and "FOR **" commands both set the Processed and Examined counts to zero. Partial Processing commands which specify an Occurrence or Range specification of FIRST, LAST, or ALL also set the Processed and Examined counts to zero, after which processing proceeds to find at least one occurrence starting from the FIRST position. As a general rule, any command which sets both the Processed and Examined counts to zero causes existing occurrences to be "renumbered". An Occurrence or Range specification of NEXT, "n", REST, or REMAINING begins with the Processed and Examined counts at their current values, and processing then proceeds to find at least one occurrence starting from the NEXT position. FIRST, LAST, NEXT, and "n"=1 represent a single occurrence. The * specification, if valid, also represents the single occurrence at the "current" position.
The number of occurrences actually processed by a single Partial Processing command is called the "process range". The number of occurrences examined to satisfy any particular process range is the corresponding "examined range". The Processed and Examined counts are altered by adding the corresponding range amount following the completion of a command. The ADD command does not alter the Processed count, and only occurrences added by ADD BEFORE increment the Examined count. Also, UPDATE and MERGE can have extra "additions" inserted following the last processed occurrence. The total number of such "additions" increments the Examined count. UPDATE, MERGE, and REMOVE can do total "removal" of occurrences. The total number of such "removals" decrements the Examined count. When * is used as an occurrence or range specification, the Processed and Examined counts are not varied unless an UPDATE, MERGE, or REMOVE causes total "removal" of that occurrence, in which case the Examined count is decremented by one, and * becomes an invalid specification.
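As a rough mental model, the bookkeeping for a single Partial FOR level might be sketched as follows. This is hypothetical Python; the method names are invented and do not correspond to SPIRES internals:

```python
class ForLevel:
    """Hypothetical bookkeeping for one Partial FOR level's
    Processed and Examined counts."""

    def __init__(self):
        self.processed = 0
        self.examined = 0

    def for_element(self):
        # FOR element and FOR ** reset both counts (renumbering)
        self.processed = self.examined = 0

    def for_star(self):
        # FOR * resets only the Processed count
        self.processed = 0

    def command_done(self, process_range, examined_range):
        # counts grow by the range amounts when a command completes
        self.processed += process_range
        self.examined += examined_range

    def extra_additions(self, n):
        # UPDATE/MERGE "additions" beyond the range increment Examined
        self.examined += n

    def total_removals(self, n):
        # total "removals" by UPDATE/MERGE/REMOVE decrement Examined
        self.examined -= n
```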
The two system variables $PXCOUNT and $PPCOUNT contain the count of elements examined and processed respectively at the current Partial FOR level.
When a referenced record has been established, and no Partial FOR commands are currently active, the FOR * command may be used to establish a special mode of processing. The form of this command is just:
FOR *
There is no WHERE clause allowed. When this special mode is established, DISPLAY, TRANSFER, and MERGE commands may be used to process the entire referenced record, not as individual elements, but as a single unit. All the elements of the record can be accessed at once. Since the full record is processed each time, the DISPLAY, TRANSFER, or MERGE commands do not require a Range specification, and an END clause is meaningless. Formatted output and input can be done using record-level frames (FRAME-TYPE = DATA;).
ENDFOR (or ENDFOR *) terminates this special FOR * mode.
Occasionally a situation arises where a set of records is to be found and processed with Global-FOR WHERE criteria such that the conditions of the WHERE clause cannot be guaranteed. For example, using the sample record described in section 12.7, find the records which have "D STRING 2" and "B STRING 3" in the same occurrence of the T structure. D belongs to the T structure, but B belongs to the U structure, which in turn belongs to T. They are not in "exactly the same structure". Yet it is possible that some occurrence of U within a particular T could have the required B value, and D in that same T could have its required value. It would not be possible to use the @-sign to indicate "same structure processing", since B and D are in different structures. But the Global-FOR WHERE clause could at least specify that the records chosen should have both conditions satisfied, although possibly from different occurrences of the T structure.
FOR <class> WHERE D STRING 2 AND B STRING 3
REFERENCE END '<command to do when none found>'
This retrieves a candidate record. The first thing that might come to your mind is: "This requires retrieving the record; isn't that expensive?" Not really, because the record needs to be retrieved to check out the WHERE clause criteria, and doing a REFERENCE of that record doesn't retrieve it again. Next you might ask: "What good does it do to REFERENCE the record?" The answer is that partial processing can now determine whether this record meets the true criteria, the B and D conditions being met in the same occurrence of T. A protocol to do this might look like the following:
    FOR <class> WHERE D STRING 2 AND B STRING 3
++NEXT.REC
    REFERENCE END RETURN
    FOR S
++NEXT.S
    REFERENCE END 'JUMP ALL.DONE'
    FOR T WHERE D STRING 2 AND B STRING 3
    REFERENCE END 'JUMP NEXT.S'
    ENDFOR S;           Hurray! This record meets the conditions.
    - FOR *             ; record level processing
    - <whatever needs to be done>
    - ENDFOR *
    - <UPDATE or ADD if needed>
++ALL.DONE
    CLEAR REFERENCE;    we are done with this record.
    JUMP NEXT.REC
The difference here is that the WHERE clause on FOR T is restricted to examining the B and D criteria within each occurrence of T, not across all occurrences. So Partial-FOR WHERE clauses exhibit a "same occurrence" property for criteria that occur at different levels within a particular structural occurrence of the FOR element. Of course, "same structure" processing (indicated by the @-sign) could still be done for elements which are within a single structure, such as B and @C or F and @G, or even multiple occurrences of a single element within a structure, such as H and @H, D and @D, or B and @B.
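The "same occurrence" test can be modeled in a few lines. The following hypothetical Python sketch uses invented names and an invented data shape (dicts of lists standing in for structural occurrences) to check both conditions within a single T:

```python
def t_occurrence_qualifies(t):
    """True when one occurrence of T has '2' in some D value and
    '3' in some B value (B sits in U, one level below T)."""
    has_d = any('2' in d for d in t.get('D', []))
    has_b = any('3' in b for u in t.get('U', []) for b in u.get('B', []))
    return has_d and has_b

def record_qualifies(record):
    """The 'true' criterion: both conditions met in the SAME T."""
    return any(t_occurrence_qualifies(t)
               for s in record.get('S', []) for t in s.get('T', []))
```

A record where one T has the D value and a different T has the B value would pass the Global-FOR WHERE clause but fail record_qualifies, which is exactly the gap the protocol closes.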
As a final example, assume a goal record with the following structure:
Donor Goal Record
    SLOT (key)
    NAME
    ADDRESS
    DONATION (multiply-occurring structure)
        DATE
        AMOUNT
        FUND (multiply-occurring structure)
            FUND-ID
            FUND-YR
Suppose the processing requirement is to purge all FUND structures from the subfile where FUND-ID=8 and FUND-YR < 1980 are satisfied in the same occurrence of the structure. As an additional requirement, the AMOUNT of the donation must be less than 10000. The request could be done with the following protocol:
SELECT DONORS
LET UPDCNT = 0
FOR TREE WHERE AMOUNT < 10000 AND FUND-ID=8 AND @FUND-YR < 1980
++DONOR.LOOP
REFERENCE END='/ RETURN * #UPDCNT Donor Records Processed'
LET CHANGE = $FALSE
FOR DONATION WHERE AMOUNT < 10000 AND FUND-ID=8 AND @FUND-YR < 1980
++DONATION.LOOP
REFERENCE END='GOTO UPDATE.RECORD'
FOR FUND WHERE FUND-ID=8 AND @FUND-YR < 1980
REMOVE ALL END='GOTO DONATION.LOOP'
ENDFOR FUND
LET CHANGE = $TRUE
GOTO DONATION.LOOP
++UPDATE.RECORD
IF #CHANGE THEN UPDATE
   THEN LET UPDCNT = #UPDCNT + 1
CLEAR REFERENCE
GOTO DONOR.LOOP
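For readers more comfortable with a conventional language, here is a rough Python analogue of the purge protocol. It is a sketch, not SPIRES code, and the data shape is invented: each donor is a dict holding a DONATION list, and each donation holds a FUND list:

```python
def purge_funds(donors):
    """Hypothetical Python analogue of the purge protocol.
    Returns the number of donor records changed, like #UPDCNT."""
    updcnt = 0
    for donor in donors:
        change = False
        for donation in donor['DONATION']:
            if donation['AMOUNT'] >= 10000:
                continue                      # AMOUNT must be < 10000
            kept = [f for f in donation['FUND']
                    if not (f['FUND-ID'] == 8 and f['FUND-YR'] < 1980)]
            if len(kept) != len(donation['FUND']):
                donation['FUND'] = kept       # REMOVE ALL matching FUNDs
                change = True
        if change:
            updcnt += 1                       # UPDATE only changed records
    return updcnt
```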
A frequent partial processing situation involves multiple merges into the referenced record where it would be useful to have SPIRES execute various Inclose tasks as the record's processing continues, without having to send the record to the deferred queue. For instance, you might want to retrieve element values that are determined from other elements and are thus not computed till record closeout. Or, for another example, you might need to sort the occurrences of an element for display.
The INCLOSE command provides a handy way of getting SPIRES to execute Inclose rules for a record without committing the record to the subfile (though that is usually done later). All the Inclose rules are executed as if a subfile transaction were taking place, but the referenced record is neither added nor updated.
Once the INCLOSE activity is done, those elements whose values are set by Inclose rules will have values that can be retrieved through continuing Partial FOR processing.
The command's syntax is:
INCLOSE [range] [END clause]
The options are valid only when you are working with structural occurrences, to close out the structure's contained elements. The range values are described elsewhere. [See 12.3.] The range is assumed to be NEXT if not specified. (See below for more information about INCLOSE with structures.)
Otherwise, INCLOSE (with no options) can be issued when a record is referenced, from the record-level, or under "FOR *" processing; it succeeds only when there have been changes to the data (elements added, updated or removed) since you began working with the record or since the last time you issued the INCLOSE command for this record.
A common sequence for working with INCLOSE is the following:
REFERENCE record
FOR *
  -- do processing against the referenced record
INCLOSE
  -- continue processing the referenced record
If you make no further changes to the record after the INCLOSE command and then issue a final ADD or UPDATE command to send the record to the deferred queue, the Inclose processing will not be repeated (with the exception noted below). So, for example, a "time-updated" element would contain the time the INCLOSE command was executed, not the time the record went into the deferred queue.
However, if you do make more changes to the record after an INCLOSE command, the ADD or UPDATE command will invoke the Inclose processing again. Thus the "time-updated" element would get a new value, because SPIRES would replace the old value with a new one. Note though that all the elements will already have values, so Inclose rules that provide values only if the element has none (e.g., a "date-added" element) will not get a new value.
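The two rule styles just described behave differently on a second closeout. A hypothetical Python sketch (the element names mirror the examples above; the "clock" is passed in for illustration):

```python
def run_inclose(record, now):
    """Hypothetical sketch of two common Inclose rule styles."""
    record = dict(record)
    record['time-updated'] = now          # replaced on every closeout
    record.setdefault('date-added', now)  # supplied only when absent
    return record
```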
Note: the processing rules for slot keys and augmented keys involve some Inclose processing that occurs only when the record is moved into the deferred queue. Hence the final form of the key may not yet be available after an INCLOSE command. If the goal record has a slot key, you will get a warning message to this effect when you issue the INCLOSE command. If it has an augmented key, the key will not be in its final form but will be in an intermediate form of little use to you.
Under "FOR structure", INCLOSE performs closeout against the range of occurrences that are found. Of course, the "FOR structure" command can have a WHERE clause that picks certain structures based on content criteria.
If a serious error occurs during INCLOSE processing, no further processing is done. You can use $PRTCNT, $GPCOUNT and $GXCOUNT variables to determine when an error has occurred.
The INCLOSE command does not execute the Inclose rules of the structure itself, just those of the elements contained within the structure's occurrence. For example, if the structure itself has a $SORT Inclose rule, the INCLOSE command will not cause the sorting to be done; that won't happen till normal closeout of the whole record, or if INCLOSE is issued for the whole record or any structure containing this structure. Also, keep in mind that the sort won't happen at all unless the structure has been modified or referenced during referenced record processing.
SPIRES provides two monitoring facilities that allow a user or programmer to determine when ORVYL data sets (that is, the ORVYL files that comprise SPIRES files) are attached, and to collect statistics on the number of reads and writes to those files.
You will find the SHOW FILE COUNTS facility the easiest to use. It displays a table of read/write counts to the various data sets attached, and does not need to be "turned on" in order to collect the data (though it can be reset). It is available in SPIRES and, in the guise of the SET FILE COUNTS command, in SPIBILD as well. The newer of the two methods, it is described in the last section of this chapter. [See 13.5.]
The older monitoring facility, called "SINFO" for "SPIRES INFOrmation", is also available in both SPIRES and SPIBILD. The facility only gathers statistics when it is enabled. Its table of read/write counts is considerably more difficult to interpret than the SHOW FILE COUNTS display. On the other hand, when you SET SINFO, SPIRES also displays the name of each ORVYL data set and its "device identifier" when it is attached. [See 13.1.]
The SET SINFO command is used to enable I/O statistics monitoring and reporting. Once the command is issued, the system will report the file name and device identifier (a number indicating an ordinal position in an internal ORVYL table) of every file or device as it is attached; also, the system will begin recording the number of reads and writes to those devices.
For example:
-? select mail file
-? set sinfo
-? set format test
ORV.GG.SPI.FORMATS.REC1                  6
ORV.GG.SPI.FORMATS.DEFQ                  7
ORV.GG.SPI.FORMATS.RES                   8
-?
The display indicates that device 6 is REC1 of the system Formats file, device 7 is the deferred queue of the system Formats file, etc. No information is displayed to indicate when a device is detached; however, subsequent attaches will reuse the device identifier freed by an earlier detach.
The "S" parameter is available on the SPIRES and the SPIBILD commands to SET SINFO internally before any files are attached by the processor.
The SET NOSINFO command is issued to disable I/O monitoring and statistics gathering. File attaches are no longer displayed, and read and write statistics are no longer updated; statistics already gathered are not affected.
The SHOW SINFO command displays counters indicating the number of reads and writes to each attached device while SINFO is enabled.
SHOW SINFO [CLEAR]
The CLEAR option resets the counters to zero after the display, as if you had issued a SHOW SINFO / CLEAR SINFO combination. [See 13.4.]
The command, available in SPIRES or SPIBILD, currently displays two sets of 24 counters each. The first set shows the number of reads for devices 1 through 24; the second set shows the number of writes for devices 1 through 24. The counters are displayed on three rows as follows: the first row represents read counts for devices 1 through 16, and the first half of the second row represents read counts for devices 17 through 24; the third row represents write counts for devices 1 through 16, and the second half of the second row represents write counts for devices 17 through 24.
The drawings below show how the devices are represented for reads and writes on the three rows of the SHOW SINFO display.
        1 - R -- 8    9 - R - 16
  1.   -----------   -----------

       17 - R - 24   25 - R - 32
  2.   -----------   -----------
                     17 - W - 24

        1 - W -- 8    9 - W - 16
  3.   -----------   -----------
       25 - W - 32
For example:
-? set sinfo
-? set format test
ORV.GG.SPI.FORMATS.REC1                  6
ORV.GG.SPI.FORMATS.DEFQ                  7
ORV.GG.SPI.FORMATS.RES                   8
-? show sinfo
 0  0  0  0  0  2  1  1  0  0  0  0  0  0  0  0
 0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0
 0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0
This display indicates that there were two reads of device 6 (REC1 of the system Formats file), 1 read of device 7 (the deferred queue of the system Formats file), and 1 read of device 8 (the residual of the system Formats file), to execute the user's command to SET FORMAT.
Note that a single counter may indicate I/O's to more than one file: within the span of a single command's execution, one device identifier may be used for more than one file if files are attached and detached during a command; the display of attaches and detaches from the SET SINFO command will indicate this situation.
Device 1 represents the terminal itself, for which reads and writes are not monitored; thus the value of the first counter in each set will always be zero.
If more than 24 devices are used for I/O, then the 25th and higher-numbered devices "overflow" their counters into counters used for other devices. For devices 25 through 32, the second half of the second line is incremented by reads, and the first half of the third line is incremented by writes.
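One way to make the layout concrete is as a mapping from device number and operation to a counter position. The following Python sketch is hypothetical code (sinfo_slot is an invented name) and represents one interpretation of the rows described above:

```python
def sinfo_slot(device, op):
    """Map a device number and operation ('R' or 'W') to the 1-based
    (row, column) of its SHOW SINFO counter, including the overflow
    sharing for devices 25 through 32."""
    if op == 'R':
        if device <= 16:
            return (1, device)           # row 1: reads 1-16
        return (2, device - 16)          # 17-24, and 25-32 overflow
    if device <= 16:
        return (3, device)               # row 3: writes 1-16
    if device <= 24:
        return (2, device - 8)           # second half of row 2
    return (3, device - 24)              # 25-32 overflow into writes 1-8
```

Note how the sketch shows the overflow sharing: a read of device 25 lands in the same counter as a write of device 17, and a write of device 25 in the same counter as a write of device 1.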
The display from the SHOW SINFO command is automatically presented in a SPIBILD session for which SINFO is invoked; the display is presented after each SPIBILD command that does file processing (e.g., PROCESS, BATCH, etc.).
The ORVYL command SHOW FILES ATTACHED can be helpful when you are trying to figure out which file is associated with each device identifier. It shows all the ORVYL data sets that are currently attached, each with its device identifier number.
The CLEAR SINFO command resets all read and write counters to zero; monitoring remains enabled.
You can get much of the same information from the SHOW SINFO display in an easier to read form, using the SHOW FILE COUNTS facility, which includes the SHOW FILE COUNTS, CLEAR FILE COUNTS and (in SPIBILD) SET FILE COUNTS commands. Even better, SHOW FILE COUNTS breaks down the I/O by record-types, not just by data sets.
In SPIRES, issue the SHOW FILE COUNTS command to see the number of reads and writes from and to the attached ORVYL data sets and specific record-types of the attached SPIRES files:
[IN ACTIVE [CLEAR|CONTINUE]] SHOW FILE COUNTS [FILE|SYSTEM|ALL]
The options are:
- FILE -- the output is concerned with the I/O activity involving the SPIRES file of the currently selected subfile.
- SYSTEM -- the output displays the I/O activity involving SPIRES system files, such as the file containing compiled formats and protocols.
- ALL -- the output displays the same data as both FILE and SYSTEM, as well as the I/O activity for any other files attached (through alternate subfile paths or as subgoals) and for other ORVYL data sets used, such as generated loads, exception files, or file areas for device services.
The example below will demonstrate the ALL display.
You can reset the file counts by issuing the CLEAR FILE COUNTS command:
CLEAR FILE COUNTS
which has no options.
The SHOW FILE COUNTS command cannot be issued in SPIBILD; instead, you issue the SET FILE COUNTS command before a SPIBILD processing command (such as INPUT BATCH or INPUT MERGE):
SET FILE COUNTS
The SET FILE COUNTS command has no options -- it is similar to SHOW FILE COUNTS ALL in SPIRES. The file counts display for each file being used appears as it is detached; the display for system files appears as you exit SPIBILD. Once set in SPIBILD, the file counts facility cannot be turned off except by exiting SPIBILD. [EXIT QUIET in SPIBILD will not suppress the file counts display.]
There is no CLEAR FILE COUNTS command in SPIBILD, so you cannot reset the counts except by exiting and calling SPIBILD again.
Here is an example showing the type of information shown by the SHOW FILE COUNTS and SET FILE COUNTS commands.
Command> call spires
-Welcome to SPIRES-3 ... If in trouble, try 'HELP'
-> select almanac
-> find name = johannes brahms
-Result: 1 ENTRY
-> for result
+> merge
+> endfor
-> show file counts

   02/24/87          File Read/Write Counts Information

   File: GQ.JNK.ALMANAC           Reads    Writes
     DEFQ data set                    5         2
     MSTR data set                    4         0
     Residual data set                2         0
     Rec-type ENTRY                   1         0
     Rec-type REC4                    2         0
       Total for file:               14         2

   System files                   Reads    Writes
     DEFQ data set                    7         0
     Residual data set               10         0
     Rec-type REC2                   14         0
     Rec-type REC4                    2         0
       Total for file:               33         0

       Total for all files:          47         2
+>
For each file involved, SPIRES displays the number of ORVYL blocks read and written since the primary file was attached (i.e., the file holding the subfile selected through the primary path), or since the CLEAR FILE COUNTS command was issued. Specifically, the read/write data is displayed for:
- the DEFQ data set, which holds the deferred queue
- the MSTR data set, which contains the file's master characteristics
- the Residual data set (RES)
- the file's private checkpoint data set (CKPT) if it has one
- the individual record-types of the file
So the example above shows, for example, that SPIRES read five blocks of the GQ.JNK.ALMANAC file's deferred queue, and wrote two. Keep in mind that the numbers reflect the number of read/write operations, not the number of distinct blocks read or written. Since the counts are cumulative over the SPIRES session, the same block may be read or written multiple times, with each operation being counted.
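The "operations, not blocks" point can be made concrete with a toy counter. This hypothetical Python sketch (invented names; not the actual SPIRES implementation) counts every operation, even repeated operations on the same block:

```python
from collections import Counter

class FileCounts:
    """Hypothetical per-operation counters in the spirit of SHOW FILE
    COUNTS: reading the same block twice counts as two reads."""

    def __init__(self):
        self.reads = Counter()
        self.writes = Counter()

    def read(self, dataset, block):
        self.reads[dataset] += 1      # operations, not distinct blocks

    def write(self, dataset, block):
        self.writes[dataset] += 1

    def clear(self):                  # like CLEAR FILE COUNTS
        self.reads.clear()
        self.writes.clear()
```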
For system files, SPIRES displays similar counts, except that they are combined into one display. That is, the "System files" section shows counts as if they all came from a single large system file, rather than from individual files. For example, REC2 represents a record-type in the system's FILEDEF file that is read when users select a subfile, but it also represents a record-type in the system's FORMATS file where compiled formats code is stored.
Here is a list of the record-types you are likely to see in the system files section, along with the types of data being read from them:
REC1 - EXPLAIN explanations
       SHOW SPIRES MAIL
REC2 - subfile selection records
       compiled formats code
       SHOW SYNTAX explanations
REC3 - SHOW EXAMPLE explanations
REC4 - compiled vgroups code
       EXPLAIN explanations for system messages
REC5 - STORE and RESTORE STATIC commands
REC8 - compiled protocols code
The SHOW SUBFILE INFORMATION command provides useful information about the selected subfile. In particular, it tells you the name of the subfile, the name of the file containing the subfile, the name of the goal record-type (the RECORD-NAME element in the file definition), and the name of the set format, if any (FORMAT-ID in the format definition). People defining formats for a subfile who do not have the file definition at hand will need this information. Though it is available through several SPIRES variables ($SELECT, $FILENAME, $GOALREC and $FORMAT), this command makes access to the information easier.
In addition, the SHOW SUBFILE INFORMATION command displays information about various paths that may have been set. [See 14.2.]
The complete syntax of the command is
SHOW SUBFILE INFORMATION
Of course it should only be issued when a subfile is selected.
Below is an example of a session in which the command is issued. The information is shown in a titled display:
-? select records
-? show subfile information

          Information for Selected Subfile RECORDS

Goal Record Information
Type      Subfile-Name   File-Name            Record   Format
Primary   RECORDS        GA.JNK.TRECORDINGS   DISC     ALBUM
RECORDS is a subfile of the file GA.JNK.TRECORDINGS. The goal record of the subfile is called DISC. The set format is ALBUM.
GOAL-TYPE indicates that the information on that line refers to the Primary subfile, i.e., the selected subfile. Additional lines may indicate subfiles and/or record-types that are linked to via subgoal processing in formats or phantom structures:
          Information for Selected Subfile PHANTOM TEST

Goal Record Information
Type      Subfile-Name   File-Name            Record   Format
Primary   PHANTOM TEST   GQ.JNK.ALMANAC       REC3
Path 2    SELECTIONS     GQ.JNK.TRECORDINGS   XREC19   LONG
Subgoal                  GQ.JNK.ALMANAC       ENTRY
Subgoal                  GQ.JNK.TRECORDINGS   REC14

Path Information
Path   Name   Type      Value
2             Subfile   SELECTIONS
The subfile PHANTOM TEST is the primary subfile; SELECTIONS is selected through path 2, an alternate subfile path, as indicated in the path information at the end of the display. Two record-types are accessed via subgoal (or perhaps phantom structures): ENTRY from the almanac file and REC14 from the TRECORDINGS file.
Here is a set of commands that might have preceded the SHOW SUBFILE INFORMATION display that appears above:
-> select phantom test
-> set elements all + sublctr           <- set a phantom structure
-> through 2 select selections
-> through 2 set format long            <- set a format with subgoal processing
-> through 2 in active display 101
-> show subfile information
Although the SET FORMAT command sets a format that will use subgoal processing, the actual link to the subgoal does not occur until the frame that calls the subgoal is executed. Hence, if the SHOW SUBFILE INFORMATION command had been issued prior to the DISPLAY command, the TRECORDINGS -- REC14 subgoal would not have appeared.
Path processing lets you simultaneously have several subfiles selected, and/or several formats set for a subfile, allowing you to access data through each of these paths as if it were the only subfile or format in effect. Thus, you can have several different views of your SPIRES "world" at the same time. This capability may dramatically change not only the way people use SPIRES in the everyday interactive mode but also the way applications networks are constructed in SPIRES.
There are some restrictions on this feature that are discussed below; only subsets of the SPIRES commands are allowed through various types of paths.
In all path processing, there is a primary path to which all other paths are connected. The primary path is established when you select a subfile. The data in the attached file is seen through the view established by the subfile and the restrictions established by the file definition. There is a "goal record-type" established which determines the set of records in the file that you can retrieve, display and update. There is also a format established here for viewing the data -- either the SPIRES default format or a customized output format.
Once the primary path is established, of course, many SPIRES commands become available for processing records. You can add, display or update records, search through indexes, sequentially scan with Global FOR, or look at informational displays concerning the data, such as SHOW INDEXES.
Previously, you were locked into this view of the data: you lost it if you wanted to see data from another subfile or through a different format. Subgoal processing in formats helped relieve this problem somewhat, but not without a lot of premeditation.
Once a primary path is established by issuing a SELECT command, you can open up other paths that allow you to either select other subfiles or set other formats, with neither of these actions eliminating your primary view.
A single command prefix controls most of the path processing. It has the following form:
THROUGH {NEXT|path-number|path-name} spires-command
THROUGH can be abbreviated down to 3 characters; THRU is also allowed as a variation. The term following THROUGH gives a path value that either establishes a new path or refers to a path already established. Up to 32 paths can exist simultaneously, if there are enough system resources to support them. Each path is associated with a number (1 to 32) that you may use when issuing commands through that path. You may choose to have SPIRES assign a path number by using the NEXT option. Alternatively, you may assign an unused path number (1 to 32) or specify a name, in which case SPIRES will also assign a path number. Subsequently you use the paths by including either the name or the number on the THROUGH prefix when issuing a command.
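As an illustration of the numbering scheme, the sketch below models how a path value might be resolved to a number. It is a hypothetical model, not SPIRES code; in particular, the lowest-unused-number rule for NEXT is an assumption, since the manual says only that SPIRES assigns a number.

```python
MAX_PATHS = 32

def allocate_path(open_paths, request):
    """Resolve a THROUGH term to a path number (illustrative model).

    open_paths -- set of path numbers already in use
    request    -- "NEXT", a name string, or an unused integer 1-32

    Assumption: NEXT (or a new name) receives the lowest unused
    number; the real assignment rule is internal to SPIRES.
    """
    if isinstance(request, str):          # NEXT or a path name
        for n in range(1, MAX_PATHS + 1):
            if n not in open_paths:
                open_paths.add(n)
                return n
        raise RuntimeError("all 32 paths are in use")
    if not 1 <= request <= MAX_PATHS or request in open_paths:
        raise ValueError("invalid path: %s" % request)
    open_paths.add(request)
    return request
```

Either way, the caller ends up with a number in the 1-32 range that later THROUGH prefixes can reference by number or by name.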
Each time a path is opened or referenced by a command using the THROUGH prefix, the system variable $PATHNUM is set to the number of the path referred to. This allows a protocol to easily establish and clear a path without affecting other paths that may be set. For example, a THROUGH NEXT SELECT FILEDEF command could be issued, followed later by the command "/THROUGH $PATHNUM CLEAR SELECT" to clear the path.
To establish an alternate subfile path:
THROUGH path-name SELECT subfile-name
where "path-name" is either a name, a number or NEXT, as discussed above.
Two alternate forms, though not "subfile" commands in the strict sense, have similar effects; we will call the paths they create subfile paths as well:
THROUGH path-name ATTACH [{rec-name|rec-number} OF] filename
THROUGH path-name SET SUBGOAL rec-name
The ATTACH command lets you "select" record-types of files to act as if they were goal record-types of a selected subfile; issue EXPLAIN ATTACH COMMAND for more information about it.
The command SET SUBGOAL is similar to ATTACH, except that you can specify only a record-type of the same file housing the subfile selected on the primary path. You cannot do this unless you have SEE-level access to the file (as with ATTACH) or the primary subfile has given you subgoal access to the record-type you name (via the SUBGOAL statement in the file definition).
The function of these commands is to select additional subfiles when a primary subfile has already been selected. There are several reasons for doing this:
- You may want to look at records of another subfile and not lose control of the environment established with the first subfile.
- There may be some relationship between two subfiles that you want to establish (perhaps a link between element values of one record-type and records in the other).
- You may wish to load subfiles that will later be called in through other means (formats) in order to better manage the use of SPIRES core.
Here is an example, showing the commands of a user who wants to change a format definition while examining records using the format:
-? select records
-? set format album
-? display 100                         <--- Record 100 is displayed
-? through next select formats
-Path established: 1
-? through 1 transfer ga.jnk.records.album
-?
Multiple subgoals can be in effect at a single time. They may be combinations of subgoals established by the SUBGOAL statement in the SUBFILE section of the file definition, subgoals established through formats processing, and alternate subfiles selected by this command.
When this command is issued, automatic format selection occurs if a default custom format is set for the subfile. Currently, subfiles with logging or charging can be selected through a path, but only the select and detach of the file will be recorded in the log of the secondary file. Also, subfiles with SELECT-COMMANDs defined may not be selected through a path.
To eliminate the alternate subfile, issue one of the following commands:
THROUGH path-name CLEAR SELECT
THROUGH path-name CLEAR PATH
where "path-name" indicates either the name or the number of the path established by the "THROUGH path-name SELECT" command.
As mentioned before, only a subset of the commands available through the primary path can be issued through an alternate subfile path. Multiple record commands and searching commands -- whether through a FIND command or Global FOR -- are not allowed. However, you can do those things through the alternate path after issuing the SET PATH command. [See 14.6.]
Currently you can use the following SPIRES commands through an alternate subfile path:
TRANSFER key                SHOW ELEMENTS
DISPLAY key                 SHOW FORMATS
ADD                         SHOW FRAMES
UPDATE key                  SHOW SUBFILE SIZE
MERGE key                   SHOW SUBFILE ACTIVITY
REMOVE key                  SHOW SUBFILE STATUS
DEQUEUE key                 SHOW SUBFILE TRANSACTIONS
UNQUEUE key                 SHOW INDEXES/SEARCH TERMS
SET XEQ                     BROWSE
SET COMPXEQ                 WRITE FILE LOG
SET FORMAT format-name      SET/SHOW AUTOGEN
CLEAR FORMAT                SET/SHOW SSW
XEQ FRAME frame-name        SHOW BATCH REQUESTS
SHOW SELECT                 REFERENCE/CLEAR REFERENCE
SET/CLEAR BIG SEARCH        SHOW RECORD OVERHEAD
SHOW EVAL                   IF... THEN
EVAL                        LET
DEFINE ELEMENT              SHOW DYNAMIC ELEMENTS
CLEAR DYNAMIC ELEMENT       CLEAR DYNAMIC ELEMENTS
DEFINE INDEX                SHOW DYNAMIC INDEXES
CLEAR DYNAMIC INDEX name    CLEAR DYNAMIC INDEXES
DEFINE TABLE                GENERATE TABLES
CLEAR TABLES                SHOW TABLES
CLEAR TABLE name            SHOW TRANSACTION GROUPS
The following Global FOR commands are available through a path:
FOR TREE|SUBFILE|GOAL|DEFQ|TRANS|RES DATA|ADDS|UPDATES|REMOVES
    (but not RESULT, STACK, or SET)
ENDFOR
SET/CLEAR SCAN
SKIP/DISPLAY/SHOW KEYS/TRANSFER/REFERENCE
DEFINE SET/GENERATE SET
SHOW LEVELS
The following partial processing commands are allowed through a path:
FOR [element|*]
ENDFOR
SET/CLEAR SCAN
SKIP/DISPLAY/TRANSFER/REFERENCE
SHOW LEVELS
SHOW REFERENCED ELEMENTS
For IF ... THEN and subsequent THEN or ELSE commands, and for any other commands that themselves include subcommands (such as RETURN in a protocol, or a DISPLAY command with an "END=command" clause in Global FOR), any THROUGH prefix is cancelled just before the subcommand is executed. That means that if you want the THROUGH prefix to be in effect for the execution of the subcommand, you must repeat it at the start of the subcommand.
For example, if you have SUBFILE1 selected as the primary subfile and SUBFILE2 is selected through path 2:
-> through 2 if $select = 'SUBFILE2' then show eval $select
SUBFILE1
This at first glance seems preposterous, but on second glance, you'll recall that the THROUGH 2 prefix is cancelled prior to the execution of the subcommand in the THEN clause. In other words, SPIRES interprets the command like this: if the subfile selected in path 2 is SUBFILE2, then show me the subfile selected on the primary path. If you continued with this:
-> then through 2 show eval $select
SUBFILE2
However, be careful about continuing with THEN or ELSE statements and paths. For example, if you now tried this, very similar to the last example:
-> through 2 then show eval $select
SUBFILE1
Again, the THROUGH prefix is cancelled just before the execution of the subcommand, right after the THEN.
One last note on this subject: you cannot use the THROUGH prefix on an IF... THEN command if the THEN clause starts a block construct (e.g., with BEGINBLOCK, REPEAT UNTIL..., etc.).
To establish an alternate format path for the primary subfile:
THROUGH path-name SET FORMAT format-name
where "path-name" (a name, number or NEXT) designates a new path that does not correspond to an existing one.
Once you establish a path of this type, you can issue almost any commands through this path that can be issued through the primary path, since the primary goal record set is still in view through a format path. (SET ELEMENTS, DEFINE TABLE and SET INPUT FORMAT are not allowed.) Multiple record commands (like TYPE and Global FOR commands) are allowed since the only difference is that a new or different format is used.
To clear an alternate format path, issue one of these commands:
THROUGH path-name CLEAR FORMAT
THROUGH path-name CLEAR PATH
If the selected subfile of the primary path has a customized format automatically set, you might want to establish a secondary path with the standard SPIRES format. To establish a "default format path" you issue this command:
THROUGH path-name CLEAR FORMAT
where "path-name" (a name, number or NEXT) designates a new path. Any record-access or update commands issued through this path will use the standard SPIRES format. You may not issue a SET FORMAT command through this path later.
To clear a default format path, you can use either of the commands shown above for clearing a format path:
THROUGH path-name CLEAR FORMAT
THROUGH path-name CLEAR PATH
Note then that the "THROUGH path-name CLEAR FORMAT" command can either establish or clear a path:
-? select records
-? through 1 clear format
-Path established: 1
-? through 1 clear format
-Path cleared: 1
Sometimes you may need to eliminate interference between Static variable groups (VGROUPS). Perhaps a format is using a global vgroup, and you want to establish an alternate format that uses the same vgroup. A convenient way to do this is to first establish an alternate vgroup path:
THROUGH path-name CLEAR VGROUPS
where "path-name" is either NEXT or a new name or number. Next you may issue the command "THROUGH path-name SET FORMAT name", specifying the established path. That format can then be used to access records in the primary subfile with no interference from other vgroups.
The alternate vgroup path can be cleared by issuing the command:
THROUGH path-name CLEAR PATH
With the SET PATH command, you can set one of your subfile paths as the "default" -- all subsequent commands that are not prefixed by a THROUGH prefix will be applied to the default subfile path, not the primary subfile.
SET [DEFAULT] PATH {path-name|path-number}
The "path-name" or "path-number" is the name or number of the path you want to establish as the default path. It must be the name or number of a subfile path, not any other kind of path. DEFAULT is a "noise word" in the command in this form and has no effect -- if it helps you remember what the command does, you are welcome to use it. (It does have a disadvantage, described below.)
When you establish a secondary subfile as a temporary default this way, any commands that would normally be directed to the primary subfile will be applied to the default path instead. More importantly, searching, multiple record processing, and record updating under Global and Partial FOR processing are allowed against the new default subfile.
To return control to the primary path, issue the command:
SET PRIMARY PATH
  or SET PATH 0
  or SET PATH
For the purposes of this command and the $PATHCUR variable, "0" (zero) is the primary path number. Note that SET DEFAULT PATH is not allowed by itself.
SPIRES retains results and Global FOR information for the primary path while another path is used as the default.
If you switch to a different subfile path with a new SET PATH command, then any result, stack and Global FOR conditions for the previous default path are cleared. However, you can retain them by using the SET NEXT PATH command, described below.
When you have set a temporary default path, you may still use the THROUGH prefix to access records in the other established paths. Additionally, you may issue THRU commands that would create more paths, such as THROUGH NEXT SELECT FILEDEF.
SET ELEMENTS is not available in a default path.
As mentioned above, you can retain results, stacks and Global FOR conditions as you move from one default path to another if you switch to the next default path with the SET NEXT PATH command:
SET NEXT PATH {path-name|path-number}
The "path-name" or "path-number" is the name or number of the path you want to establish as the new default path. It must be the name or number of a subfile path that is not already a "next path".
When you use the SET NEXT PATH command, SPIRES stacks the information it needs to retain for the old default path onto the information it retains for the primary. If you then issue another SET NEXT PATH command for another subfile path, the stack of retained information grows bigger.
To return back to the previous default path, you can issue the SET PATH or SET DEFAULT PATH command with the previous path's name or number, or more conveniently, you can issue the SET PREVIOUS PATH command (it has no options) to go back one level in the stack.
Before issuing the SET PREVIOUS PATH command, be aware that you will lose the current default path's result, stack and Global FOR conditions -- that's because you will be "unstacking" the retained information as you go backward. In other words, SET NEXT PATH and SET PREVIOUS PATH are for use in "nesting" situations: when you SET NEXT PATH, you are nesting deeper and deeper, not truly switching from one path to another. When you back out, you lose the conditions of the path you are leaving.
SET PREVIOUS PATH will back you out to the previous path; you can back out multiple paths at once to a specific path by naming it in the SET PATH or SET DEFAULT PATH command -- the retained information for all paths in between will also be discarded. For example, if you set path 1 as the default, then set next path on path 2 and then set next path on path 3, you can return to path 1 as the default with SET PATH 1; but you will lose any results, etc., on paths 2 and 3. You can eliminate all retained information for all default paths by returning to the primary with SET PRIMARY PATH.
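The nesting behavior described above can be pictured as a stack. The sketch below models only those semantics; the class, its method names, and the state strings are invented for illustration and are not part of SPIRES.

```python
class DefaultPathStack:
    """Illustrative model of SET NEXT PATH / SET PREVIOUS PATH nesting.

    Each entry pairs a path number with the state (results, stacks,
    Global FOR conditions) retained for it.  Path 0 is the primary.
    """

    def __init__(self):
        self.stack = [(0, "primary state")]

    def set_next_path(self, path, state):
        # Nest deeper: the old default's retained state stays stacked.
        self.stack.append((path, state))

    def set_previous_path(self):
        # Back out one level, discarding the current default's state.
        if len(self.stack) > 1:
            self.stack.pop()
        return self.stack[-1][0]

    def set_path(self, path):
        # Jumping back to an earlier path discards every level between.
        while len(self.stack) > 1 and self.stack[-1][0] != path:
            self.stack.pop()
        return self.stack[-1][0]
```

Backing all the way out to path 0 in this model corresponds to SET PRIMARY PATH, which discards the retained information for every default path.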
The system variable $PATHCUR, which provides the number of the current path, in conjunction with the $PATHINFO and $PATHFIND functions, can be very helpful in keeping track of which path you are in when you have multiple default paths. [See 14.8.]
To clear all paths, simply issue the CLEAR SELECT command or select a new primary subfile. In the future there may be ways to keep a path open even when the primary subfile is cleared, but currently there are none.
You can clear an individual path (in other words, to clear a single path, you must first make it the current path) by issuing the CLEAR PATH command:
CLEAR PATH
This clears the path and returns you to the primary path (not the default path, if set).
You can reinitialize the state of all established paths without having to reselect the primary subfile and re-establish all the paths. This is done with the CLEAR PATH ENVIRONMENT command:
CLEAR PATH ENVIRONMENT
This command does the following to the primary and all other alternate subfile paths:
- resets the current path to the primary path
- closes and decommits any open transaction group
- clears all results and stacks
- clears Global FOR
- clears referenced records
- clears all filters
- clears all dynamic elements
- clears all formats (but see below)
This command does not do any of the following:
- clear any paths, including alternate format paths and alternate vgroup paths
- clear any global formats
- clear any global vgroups that aren't allocated by formats
- clear any dynamic variables
- clear any subgoals
- clear any phantom structures
- clear anything else not mentioned in the first list above
- re-issue SELECT-COMMANDs (commands issued when a subfile is selected, as defined in the subfile section of the file definition) on the primary subfile
To see a list of all subgoals and alternate subfiles currently in effect, as well as information concerning each path that has been established, issue the SHOW SUBFILE INFORMATION command. For example:
-> select filedef
-> thru newpath select formats
-> show subfile information
          Information for Selected Subfile FILEDEF
Goal Record Information
Goal-Type  Subfile-Name  File-Name  Record  Format
Primary    FILEDEF       $FILEDEF   REC1
Path 1     FORMATS       $FORMATS   REC1
Path Information
Path  Name     Type     Value
1     NEWPATH  Subfile  FORMATS
Another way to obtain similar information about paths is to use the $PATHFIND function or the $PATHINFO function, documented in the manual "SPIRES Protocols".
To oversimplify somewhat, $PATHINFO lets you determine the name of a path or its type (e.g., Subfile or Format) when you give the path's number, whereas $PATHFIND tells you a path's number (counting from 0 for the primary path) when you give the path's name or the name of the subfile on the path. The Protocols manual contains full details on these functions, but the examples below indicate briefly their use. (The same files are selected as in the example for SHOW SUBFILE INFORMATION above.)
-> show eval $pathinfo(1,Type)
Subfile            <--Path 1 is of "Type" Subfile
-> show eval $pathfind('FORMATS',Subfile)
1                  <--The FORMATS Subfile is selected on path number 1
$PATHCUR is an integer variable that contains the number of the current path in use during a SPIRES session. The value will be 0 if the session is currently on the primary path.
Below is a sample session using some of the path processing commands discussed above:
-? - Select the primary subfile
-? select runner names
-? - Establish a format for the primary data
-? set format runner.data
-? - Establish a path for default Spires output
-? through next clear format
-Path established: 1
-? for tree
+? skip 47
+? - Display a record through the format
+? display
          ANGELL FIELD ANCIENTS RUNNER RESULTS
Ronald D. McDonald        Fac/Staff Yes
Birth date 11/10/32       Dept Human Biology
Events        1975-76  1976-77  1977-78  1978-79
------        TIMES    TIMES    TIMES    TIMES
440 YARDS     103      77       66.5     66
880 YARDS     2:27     2:40     2:32
1 MILE        5:18     5:18     5:24     5:16
2 MILE        11:31    11:22    11:42    11:14
3 MILE        17:45    17:56    17:21    17:10
5 or 6 MILE   30:32    37:39    39:39    35:16
10 MILE       62:29    63:46    64:32    61:06
MARATHON      3:01:50  3:01     3:02:47  2:58:32
4 X 440       67       66
4 X 880       2:40     2:31
+? - Display the record through the default format
+? through 1 display *
NAME = MCDONALD, RONALD D. ;
PTR1 = "McDonald, Ronald D. 00101";
PTR3 = "McDonald, Ronald D. 00102";
PTR5 = "McDonald, Ronald D. 00103";
PTR6 = "McDonald, Ronald D. 00104";
PTR7 = "McDonald, Ronald D. 00105";
+? - Establish an alternate subfile path
+? through formats select formats
-Path established: 2 FORMATS
+? - Set format for the alternate path
+? through formats set format charnames
+? - Display a record through the alternate subfile
+? through formats display gg.wck.runner.data
****** - GG.WCK.RUNNER.DATA
GG.WCK.RUNNERS/REC4/RUNNER.DATA
+? - Establish an alternate vgroups path
+? through next clear vgroups
-Path established: 3
+? - Establish alternate format for the primary subfile
+? through signup set format **signup
-Path established: 4 SIGNUP
+? - Allocate a vgroup through this path
+? thru 3 allocate orv.gg.spi.standard
+? - Show the allocated vgroups
+? show allocated
Static Group : GG.WCK.RUNVARS
Static Group : GG.SPI.STANDARD
Static Group : GG.WCK.LOCAL
+? thru 3 show allocated
Static Group : GG.SPI.STANDARD
+? - Show the current path and subgoal data
+?
show subfile information
          Information for Selected Subfile RUNNER NAMES
Goal Record Information
GOAL-TYPE  SUBFILE-NAME  FILE-NAME       RECORD  FORMAT
Primary    RUNNER NAMES  GG.WCK.RUNNERS  REC4    RUNNER.DATA
Path 2     FORMATS       GG.SPI.FORMATS  REC1    CHARNAMES
Subgoal                  GG.WCK.RUNNERS  REC1
Subgoal                  GG.WCK.RUNNERS  REC3
Subgoal                  GG.WCK.RUNNERS  REC5
Subgoal                  GG.WCK.RUNNERS  REC6
Path Information
PATH  NAME     TYPE     VALUE
1              Format   Standard Format
2     FORMATS  Subfile  FORMATS
3              Vgroup
4     SIGNUP   Format   **ORV.GG.WCK.SIGNUP
+? -
+? - Now look at some error diagnostics
+? -
+? - Attempt to establish a path that exists
+? thru 1 select tag1
-Invalid path: 1
+? - Select a subfile that is already selected
+? thru next select formats
-Subfile 'FORMATS' is already selected
+? - Try an invalid command through a subfile path
+? thru 2 type
-Command not allowed through path: 2 FORMATS
+? - Use a path that does not exist
+? thru jack display
-Invalid path: JACK
+? -
+? - Now clear some paths
+? -
+? thru 1 clear path
-Path cleared: 1
+? thru signup clear format
-Path cleared: 4 SIGNUP
+? thru formats clear select
-Path cleared: 2 FORMATS
+? -
+? - Look at the data now
+? show subfile information
          Information for Selected Subfile RUNNER NAMES
Goal Record Information
GOAL-TYPE  SUBFILE-NAME  FILE-NAME       RECORD  FORMAT
Primary    RUNNER NAMES  GG.WCK.RUNNERS  REC4    RUNNER.DATA
Subgoal                  GG.WCK.RUNNERS  REC1
Subgoal                  GG.WCK.RUNNERS  REC3
Subgoal                  GG.WCK.RUNNERS  REC5
Subgoal                  GG.WCK.RUNNERS  REC6
Path Information
PATH  NAME  TYPE    VALUE
3           Vgroup
+? - Clear out everything
+? clear select
-?
It is possible for several different records to be transferred or referenced at once through different paths. The system variable $TRANSFER, which contains the internal form of the key value of the currently transferred or referenced record, has a different value on each path.
For example:
-> select subfileA
-> transfer A
-> show eval $transfer
A
-> through B select subfileB
-Path established: 1 B
-> through B reference B
-> through B show eval $transfer
B
-> update          <-- This command updates record A in path A
->
A maximum of 8 records may be juggled simultaneously that way, although the limit will be one lower when either of the following commands is in effect (two lower if they both are):
SET LOCK
OPEN TRANSACTION GROUP
Subgoal processing is a technique that lets you verify or retrieve data in other record-types while you have a given subfile selected. Records in either another subfile or another record-type of the same file as the selected subfile are accessible through subgoal processing.
Subgoal processing can be done in several ways:
- with the $LOOKSUBF and $LOOKSUBG functions
- with the $LOOKUP and $SUBF.LOOKUP system procs
- with actions A32 and A65
- through phantom structures defined in a file definition
- through subgoal processing in formats
When you use a subgoal, information about that subgoal path to the other record-type remains in internal memory until you CLEAR SELECT or select another subfile. In most situations, that is very useful -- if more subgoal processing is done, the path doesn't need to be re-established, which would involve the same overhead again.
In some situations, though, so many subgoals have been used that they contribute to a memory-management problem -- the upper limit is 40 subfile subgoals at a time. This problem is perhaps more likely to arise in complex applications where several subfiles are selected through path processing. [See 14.] To help control the problem, the CLEAR SUBGOALS command may be issued to eliminate subgoals established from the selected subfile. It may be preceded by the "THROUGH path" prefix to clear subgoals established through a given path.
You should not use the command to arbitrarily clear subgoals; use it in cases where you establish a subfile subgoal for one-time-only data retrieval and the continuing presence of the unneeded subgoal connection is causing memory problems. Definitely avoid using it if the subgoal path would be continually established and cleared.
If a subgoal is established through a given path (say, path 1) and then another path also uses it, it will still be tied to path 1. That means that the THROUGH 1 CLEAR SUBGOALS command would clear that subgoal; subsequent use of it through the other path would re-establish it, tying it to that path.
Non-subfile subgoals (that is, subgoals to other record-types of the file rather than to other subfiles) are eliminated by the CLEAR SUBGOALS command only if the command is issued for the path that first selected a subfile of that file.
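As a summary of the ownership rule, here is a hypothetical model of subfile-subgoal bookkeeping. The function names and table layout are invented for illustration; only the tie-to-first-path and CLEAR SUBGOALS behaviors described above are represented, and the real bookkeeping is internal to SPIRES.

```python
MAX_SUBFILE_SUBGOALS = 40

def use_subgoal(table, target, path):
    """Record use of a subgoal, tying it to the establishing path.

    table -- dict mapping subgoal target name -> owning path number.
    A subgoal reused from another path stays tied to the path that
    first established it (illustrative model only).
    """
    if target not in table:
        if len(table) >= MAX_SUBFILE_SUBGOALS:
            raise RuntimeError("subfile subgoal limit reached")
        table[target] = path          # first use establishes ownership
    return table[target]              # later uses keep the first owner

def clear_subgoals(table, path):
    """Model of THROUGH path CLEAR SUBGOALS: drop that path's entries."""
    for target in [t for t, p in table.items() if p == path]:
        del table[target]
```

In this model, clearing path 1's subgoals and then using one of them again through path 2 would re-enter it in the table owned by path 2, matching the re-establishment behavior described above.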
These commands are generally restricted to those with MASTER privileges.
The DUMP BLOCK command dumps a hexadecimal and character view of a particular block within a specified dataset of the currently attached file into the active file.
Command: DUMP BLOCK
Available in:  SPIRES
Available to:  file owner
               users with SEE access to the file
               those with MASTER mode capability
Related commands:   SHOW SUBFILE BLOCK
Recovery commands:  not applicable
Syntax
DUMP BLOCK <number> OF {CKPT|DEFQ|MSTR|RESidual|RECn} [CLEar|CONtinue]
<number> is the block number to be dumped. It may be specified in decimal, or in hexadecimal with a leading 0. Thus block 15 and 0F are the same block.
<RECn> is a particular physical RECn dataset, from REC1 through RECF.
The CLEar (or CLR) option tells SPIRES to clear the active file before doing the dump. The CONtinue option tells SPIRES to append the dump information to the current active file.
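The decimal-versus-hexadecimal convention for the <number> operand can be expressed compactly. This sketch simply restates the documented rule (a leading 0 marks the value as hexadecimal) in Python; the function is hypothetical and not part of SPIRES.

```python
def parse_block_number(text):
    """Parse a DUMP BLOCK / FIX BLOCK <number> operand.

    A leading 0 marks the value as hexadecimal; otherwise the value
    is decimal.  Sketch of the documented convention only.
    """
    text = text.strip()
    if text.startswith("0") and len(text) > 1:
        return int(text, 16)      # e.g. "0F" -> 15
    return int(text, 10)          # e.g. "15" -> 15
```

Under this rule, block 15 and block 0F name the same block, as noted above.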
The dumped information looks something like this:
d10db0 0000 65030493 00020002 00000000 00000000 |...l............|
d10dc0 0010 00270103 0493e7f0 f1402006 0503022e |.....lX01 ......|
d10dd0 0020 7980ffff ffff0000 00000320 0000c7c7 |..............GG|
d10de0 0030 4be2d7c9 00000001 c00001c0 0001d900 |.SPI..........R.|
d10df0 0040 01400001 01002e00 01002a00 01010001 |. ..............|
d10e00 0050 800022d4 818995a3 85958195 83854081 |...Maintenance a|
d10e10 0060 958440c4 8582a487 87899587 40c39694 |nd Debugging Com|
d10e20 0070 94819584 a2000420 06050300 04200605 |mands...........|
d10e30 0080 03006900 01006520 06050300 01900005 |................|
d10e40 0090 00010001 01005500 01005100 0120004c |...............<|
d10e50 00a0 00010048 e38885a2 85408396 94948195 |....These comman|
d10e60 00b0 84a24081 99854087 85958599 819393a8 |ds are generally|
d10e70 00c0 409985a2 a3998983 a3858440 a39640a3 | restricted to t|
d10e80 00d0 8896a285 40a689a3 8840d4c1 e2e3c5d9 |hose with MASTER|
d10e90 00e0 40979989 a5899385 8785a24b 00000000 | privileges.....|
d10ea0 00f0 to 07ec 00000000                    |................|
d115a0 07f0 00000000 a70800ec 90dc0010 65030493 |....x..........l|
The first column contains a range of physical memory addresses where the block was placed when it was dumped. This is usually of no value to you. The second column contains the relative location within the block of the first byte of information occurring on each line. The 3rd, 4th, 5th, and 6th columns are the hexadecimal representations of four consecutive bytes of data within the block. Sometimes a range of relative locations is shown with a single 4-byte value, which means this value is duplicated through the entire range. The last column, with surrounding vertical bars, is the character representation of the data within the block for this line of the dumped information. Unprintable characters are output as dots (periods).
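For readers who want to examine dumped blocks offline, a routine along these lines reproduces the general shape of the output. It is a simplified sketch: it decodes characters as ASCII (the actual dump shows EBCDIC data), omits the physical-address column, and does not compress repeated runs into a single line.

```python
def dump_lines(data):
    """Format bytes roughly like a DUMP BLOCK line:

        offset  word word word word  |chars|

    where "offset" is the relative location of the line's first
    byte, each "word" is four bytes in hex, and the final column
    shows printable characters (dots for unprintable ones).
    """
    lines = []
    for off in range(0, len(data), 16):
        chunk = data[off:off + 16]
        words = [chunk[i:i + 4].hex() for i in range(0, len(chunk), 4)]
        chars = "".join(chr(b) if 32 <= b < 127 else "." for b in chunk)
        lines.append("%04x %s |%s|" % (off, " ".join(words), chars))
    return lines
```

Sixteen bytes per line, in four 4-byte words, matches the layout of the dump shown above.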
The FIX BLOCK command allows you to view, edit, and rewrite a particular block within a specified dataset of the currently attached file.
Command: FIX BLOCK
Available in:  SPIRES
Available to:  those with MASTER mode capability
Related commands:   DUMP BLOCK
Recovery commands:  this is one of them
Syntax
FIX BLOCK <number> OF {CKPT|DEFQ|MSTR|RESidual|RECn}
<number> is the block number to be fixed. It may be specified in decimal, or in hexadecimal with a leading 0. Thus block 15 and 0F are the same block.
<RECn> is a particular physical RECn dataset, from REC1 through RECF.
Only those with MASTER mode capability can successfully issue this command. You must have the database attached (or selected). Once you issue the FIX BLOCK command, the requested block is located and brought into memory. If it doesn't exist (S96 error), an empty block will be constructed for you.
The FIX BLOCK command then does a Pause operation, placing you within the Emulator, where you can use debugging commands to view (SC) or patch (PC) the block located by @10 (register 10). See the debugging information for details on this aspect of the process. [See 15.3.]
When you are done viewing or editing the block, issue the GO command. This returns control back to SPIRES, and you'll be prompted to WRITE the block. If you respond NO, the block is not altered. If you respond YES or OK, the block is written if you have write permissions to this dataset of the attached file.
The Emulators come equipped with debugging aids. These are primarily available to the system coordinator (GG.SPI). However, the Emulators can be called with a -d option, which signals the desire to debug.
If you have compiled a debugging version of the Emulator (with gcc's -g option), then you can run the Emulator with "gdb", something like this (assumes "emg" Emulator):
% gdb emg
There is a globally addressable routine called "dbugtrap" in debugging Emulators that is called by a DBUG command. Once in gdb, you can set a breakpoint as follows:
> break emsvc.c:dbugtrap
With that breakpoint in place, you can then run the Emulator program:
> run [dash-parms] [target]
Both dash-parms and target are optional. The target is the program you intend to emulate, such as SPIRESH, PL360, ANALYZER, etc. Of course, you must run an Emulator that is compatible with the target. You can give the target in lower case, mixed case, or UPPER case. If you don't give a target, you'll be prompted for it upon entry. A very common dash-parm is -! to cause an immediate TRAP in the target.
Once the target program is running, an attention interrupt may cause gdb to take control. You can examine data with gdb commands, found using the "help" command. You can also examine the Emulator's 16 pseudo-registers (R0 thru R15), the program's starting point, or the current execution point, like this:
(gdb) x/16x &U_Greg
0x9f8f8 <U_Greg>:    0x00000000 0x0000034d 0x001cffb4 0x00000000
0x9f908 <U_Greg+16>: 0x00000000 0x0000005c 0x00000000 0x0068bd80
0x9f918 <U_Greg+32>: 0x00000000 0x0068ad60 0x001cffb0 0x0019b100
0x9f928 <U_Greg+48>: 0x0019a700 0x00199240 0x50c3d666 0x00c3a938
(gdb) x/x &pbase
0x9f8bc <pbase>: 0x000bdf00
(gdb) x/x &PSW
0x9f958 <PSW>: 0x000f8852
Note that you can cause Emulator debugging to be activated by doing the following command:
(gdb) set SEBflg=1
(gdb) cont
Setting SEBflg turns on trapping for branches, which allows you to continue execution and trap on every branch. This can come in handy when the target program runs away: gdb responds to the attention interrupt, and you can set SEBflg=1 and continue in order to trap the runaway loop. Once you get an SEB trap, you can use the Emulator's debugging commands. Of course, if the loop is in the C code (the Emulator itself), then you need to use the gdb debugging commands, but having SEBflg=1 set may still be handy.
1) Dash-parms
The dash-parms are what the Emulator reads when it is entered. There are several, including these:
-! Enable SEB before beginning execution of the target program.
-s Disallow sending commands to the subsystem. This is automatically allowed within a session break, meaning you can send commands then. The default is to allow sending commands to the subsystem.
-q Quiet for output. Do not echo normal terminal output.
-Q Quiet for prompts. Do not display prompts.
-p Prism startup parms passed. This is used with PRISM only.
-d Enable debugging. This is automatic for G-accounts, like GG.SPI. You can toggle debug mode from the command prompt in the executing target; this is done with the %%D command at any prompt. With debugging enabled, you can use the Debug commands below.
-n n is the number of half-megabytes of extra memory to assign. n can be from 1 thru 9, for 0.5 MB thru 4.5 MB extra memory.
2) %%-commands
The following %%-commands can be issued from any command prompt, or from inside protocols, or from $ISSUECMD function (anywhere).
%%      Show a list of the currently attached files.
%%D     Toggles the DEBUG capability.
%%E     Shows the Emulator's expiration date, if any.
%%V     Shows the Emulator's version.
%%P     Shows the GG.UUU's and their associated names and /paths.
%%C=n   Sets a return code for the Emulator (normally rc=0).
3) DEBUG COMMANDS
Most debugging commands can be issued wherever %%-commands are allowed, from a sprung TRAP, or from a system Pause. Most are available directly in SPIRES, such as: SA.
DBUG calls the "dbugtrap" routine. If you have a breakpoint there, you get a "gdb" trap, which allows you to set other breakpoints. This is very handy if you want to debug an emulated instruction in "em.c", or something in "emsvc.c", which you know is executed many times before you get the conditions set up that fail.
SEB -- Set Event Branch. This causes emulated code to trap each successful branch instruction. The trap is sprung AFTER the jump but BEFORE the execution of the instruction at the jump location. Therefore, the "X" command does not work because this is equivalent to a PostTrap caused by X. You must use some form of GO to continue.
CEV -- Clear Event Branch. Turns off SEB.
SF -- Show all four Floating-point registers.
SP -- Show the current Program Address (PSW -- Program Status Word).
STS -- Show all currently set traps. (See ST below)
CTS -- Clear all currently set traps. Does NOT affect SEB.
SA -- Show Address. This command takes an expression that can be any combination of the following items added or subtracted.
1. *             (the current Program Address)
2. Map_Symbol    (from PROGRAM.MAP file, such as: SPIRESH.MAP)
3. <decimal>
4. 0<hexvalue>
5. $<hexvalue>
6. #<hexvalue>
7. @<register>
In all cases, additional values of types 3 and beyond may be added, subtracted, multiplied or divided to adjust the initial address. The symbols +-*/ indicate the operation. Evaluation is strictly left to right. <hexvalue> is a hex number. $0 is the first instruction of the loaded program. #0 is the first location in user-workspace. <register> must be a number from 0 thru 15, and @<register> indicates the address is found in the lower 24-bits of the specified <register>. Thus, @13 indicates register 13 contains the address. Map_Symbol and * may only start an expression.
Examples:
sa 123
sa 0FACE
sa SEMANT+012-$0
sa @3-@5
sa 0-@5
sa *+20
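The strict left-to-right evaluation described above can be sketched as follows. This is an illustrative model in Python, not SPIRES code; the register values, map symbols, and base addresses supplied by the caller are invented for the illustration.

```python
import re

# One alternation per term type, in the order the SA rules imply:
# @<register>, $<hexvalue>, #<hexvalue>, 0<hexvalue>/<decimal>, Map_Symbol
TERM = r'@\d+|\$[0-9A-Fa-f]+|#[0-9A-Fa-f]+|[0-9][0-9A-Fa-f]*|[A-Za-z_][\w.]*'

def term_value(tok, regs, symbols, prog_base, wksp_base):
    if tok[0] == '@':                 # low 24 bits of the named register
        return regs[int(tok[1:])] & 0xFFFFFF
    if tok[0] == '$':                 # offset from the loaded program ($0)
        return prog_base + int(tok[1:], 16)
    if tok[0] == '#':                 # offset from user-workspace (#0)
        return wksp_base + int(tok[1:], 16)
    if tok[0] == '0':                 # leading 0 means a hex value
        return int(tok, 16)
    if tok.isdigit():                 # plain decimal
        return int(tok)
    return symbols[tok]               # Map_Symbol lookup

def eval_sa(expr, regs, symbols, prog_base=0, wksp_base=0, cur_pa=0):
    """Evaluate an SA expression strictly left to right with + - * /."""
    expr = expr.replace(' ', '')
    if expr[0] == '*':                # * may only start an expression
        value, expr = cur_pa, expr[1:]
    else:
        m = re.match(TERM, expr)
        value, expr = term_value(m.group(), regs, symbols,
                                 prog_base, wksp_base), expr[m.end():]
    while expr:
        op, expr = expr[0], expr[1:]
        m = re.match(TERM, expr)
        rhs = term_value(m.group(), regs, symbols, prog_base, wksp_base)
        expr = expr[m.end():]
        if op == '+': value += rhs
        elif op == '-': value -= rhs
        elif op == '*': value *= rhs
        else: value //= rhs           # no precedence: strictly left to right
    return value
```

For example, with SEMANT at location X'500' and the program loaded at X'400', "sa SEMANT+012-$0" yields X'500' + X'12' - X'400'.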
SC -- Show Core. Similar to SA, but the result must be an address that is in-bounds of either the program or workspace.
There are three general forms of SC:
SC address
SC address,length
SC address_one.address_two
The first form is just a special case of the second, with 256 as the default length. Length may be specified as any expression (as in SA). The last form gives two addresses, and signifies showing core from one to two. In all forms, the ending address must be in bounds.
If a resultant address expression is aligned on a 4-byte boundary, you may surround that address expression with an indirection operator which is specified as follows: >(addrexp) . This can be used by SA as well. The indirection operator may be nested.
Examples:
sc >(@12+070),8   [Show 8 bytes using the address contained in @12+070]
sc SEMANT         [Show 256 bytes of the program from SEMANT]
sc @8.>(@8)       [The first word of @8 contains the end address]
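How the three SC forms resolve to a starting point and a length can be sketched like this (an illustration only, not SPIRES code; the addresses passed in are assumed to be already-evaluated expressions):

```python
def sc_extent(address, length=None, end=None):
    """Return (start, length) for the three SC forms.

    SC address           -> default length of 256
    SC address,length    -> the given length
    SC addr_one.addr_two -> from one address through the other
    """
    if end is not None:
        return address, end - address + 1   # two-address form, inclusive
    if length is not None:
        return address, length
    return address, 256                     # bare address: 256 is the default
```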
SG -- Show General Registers. There are three forms:
SG
SG n
SG n,m
The first form shows all 16 General Purpose Registers. The second shows the specific register (n is 0 thru 15). The third form shows the register range, inclusive. Both n and m must be integers from 0 thru 15. If n is bigger than m, then registers are displayed from n thru 15, and then from 0 thru m. So, for example:
SG 14,1 shows registers 14,15,0,1
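The wraparound rule can be sketched as follows (illustration only, not SPIRES code):

```python
def sg_range(n=None, m=None):
    """Return the list of register numbers SG would display."""
    if n is None:                  # SG      -> all 16 registers
        return list(range(16))
    if m is None:                  # SG n    -> just register n
        return [n]
    if n <= m:                     # SG n,m  -> inclusive range
        return list(range(n, m + 1))
    # n bigger than m: wrap from n thru 15, then 0 thru m
    return list(range(n, 16)) + list(range(0, m + 1))
```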
ST -- Set Trap. You specify an address, usually within the program. Each trap set is assigned a unique number. STS shows all traps.
ST SEMANT
ST VALTOBCD+4
CT -- Clear Trap. You specify the trap number.
4) TRAP STOP (OR PAUSE) ONLY COMMANDS
At a TRAP STOP or system Pause, the prompt is: >>
PG -- Patch General Register. The form is:
PG n expression
where n is the General Register to patch (0 thru 15), and the expression is anything that is valid for the SA command.
PC -- Patch Core. This allows you to alter memory contents.
PC address value_expression
The address must be in bounds of the program or user-workspace. The value_expression is either an 'apostrophe string' or a "quote string". Embedded delimiters are permitted. The value_expression may also be a hexadecimal value. A leading zero (0) is NOT needed unless you actually want to start with 0.
Examples:
PC SEMANT-4 'Don't try'
PC @12 47F0
The old value is displayed in a manner similar to SC, and then the new value is displayed.
PP -- Patch Program Address. This allows you to alter the PSW.
PP address
The address must be in bounds of the program. The address is any valid expression resulting in an address, similar to the SA expressions. You must be at a session break AFTER the instruction which caused the break, such as at an SEB break, or after doing the "X" command (see below) to get past the current trap. Successful PP commands show you the new Program Address, just like SP does. Of course, you would then use a form of GO to continue execution.
Examples:
PP @14
PP SEMANT
GO -- Continue execution from a trap. There are several forms:
G    Same as GO; simply continues until the next trap.
GC   GO continuous. Subsequent traps are reported, but execution continues without stopping. You stop only at command prompts, where you can then issue CTS or CEV.
Gn GO for n (decimal) traps, then stop on n+1. SEB traps are included in the counting. G0 = GO
GT address - GO, but set a trap at the given address. This is a temporary trap, cleared when sprung. Typically this is done to trap a little ahead.
GT *+6
X -- Execute the instruction at the current trap, and stop. This is similar to GT, but you don't have to compute the next instruction address. If you GO after X, you continue with the next instruction. Normally, GO executes the trap instruction and continues with the next instruction. The advantage of X is that it lets you execute the trap instruction so you can examine the before and after effects of this single instruction. You can't X again to single-step instructions, but you can "GT *", then "X", etc. You can't use X with SEB traps.
QUIT -- From any trap, this exits from the Emulator with rc=3.
The DUMP RECORD command dumps a hexadecimal and character view of a particular record within the currently attached subfile into the active file.
Command: DUMP RECORD
Available in:  SPIRES
Available to:  file owner
               users with See access to the file
               those with MASTER mode capability
Related commands:  DUMP BLOCK
Recovery commands: not applicable
Syntax
[IN ACTive [CLEar|CONtinue]] DUMP RECORD <key>
IN ACTIVE [CLEAR|CONTINUE] -- The output from the DUMP RECORD command always goes to your active file. If your active file is empty, you do not need this IN ACTIVE prefix. But if your active file isn't empty, to avoid the prompt "OK to clear?" or to append the output to your active file's current contents, use the IN ACTIVE prefix with either the CLEAR or CONTINUE option respectively.
<key> is the external form of the key of the record to be dumped. If the key contains quotes or semi-colons, or leading or trailing blanks, you must surround the key with quotes and double any interior quotes.
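The quoting rule above can be sketched as a small helper (illustration only, not SPIRES code; the function name is invented):

```python
def quote_key(key):
    """Quote a record key for DUMP RECORD when it needs it:
    keys containing quotes or semi-colons, or leading or trailing
    blanks, are surrounded with quotes and interior quotes doubled."""
    needs_quotes = (
        '"' in key
        or ';' in key
        or key != key.strip()      # leading or trailing blanks
    )
    if needs_quotes:
        return '"' + key.replace('"', '""') + '"'
    return key
```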
The dumped information looks something like this:
d10db1 0000 0001c000 01d90001 40000101 002e0001 |.....R.. .......|
d10dc1 0010 002a0001 01000180 0022d481 8995a385 |..........Mainte|
d10dd1 0020 95819583 85408195 8440c485 82a48787 |nance and Debugg|
d10de1 0030 89958740 c3969494 819584a2 00042006 |ing Commands....|
d10df1 0040 05030004 20060503 00690001 00652006 |................|
d10e01 0050 05030001 90000500 01000101 00550001 |................|
d10e11 0060 00510001 20004c00 010048e3 8885a285 |......<....These|
d10e21 0070 40839694 94819584 a2408199 85408785 | commands are ge|
d10e31 0080 95859981 9393a840 9985a2a3 998983a3 |nerally restrict|
d10e41 0090 858440a3 9640a388 96a28540 a689a388 |ed to those with|
d10e51 00a0 40d4c1e2 e3c5d940 979989a5 89938587 | MASTER privileg|
d10e61 00b0 85a24b |es..............|
The first column contains a range of physical memory addresses where the record was placed when it was dumped. This is usually of no value to you. The second column contains the relative location within the record of the first byte of information occurring on each line. The 3rd, 4th, 5th, and 6th columns are the hexadecimal representations of four consecutive bytes of data within the record. Sometimes a range of relative locations is shown with a single 4-byte value, which means this value is duplicated through the entire range. The last column, with surrounding vertical bars, is the character representation of the data within the record for this line of the dumped information. Unprintable characters are output as dots (periods).
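The line layout described above (address, relative offset, four 4-byte hex columns, character column between vertical bars) can be sketched like this. Note the assumptions: the real dump decodes EBCDIC text and collapses duplicated ranges, while this sketch treats bytes as ASCII and prints every line.

```python
def dump_lines(data, base_addr=0):
    """Format bytes in the DUMP RECORD layout:
    address, offset, four 4-byte hex groups, |char column|."""
    lines = []
    for off in range(0, len(data), 16):
        chunk = data[off:off + 16]
        hex_cols = ' '.join(
            chunk[i:i + 4].hex() for i in range(0, len(chunk), 4))
        # Unprintable characters are output as dots (periods)
        chars = ''.join(
            chr(b) if 32 <= b < 127 else '.' for b in chunk)
        lines.append('%06x %04x %s |%-16s|'
                     % (base_addr + off, off, hex_cols, chars))
    return lines
```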
These commands can be used to create object decks that can be loaded with a SPIRES system. The proper subfile must be selected before the command can be issued. For example, one must SELECT FORCHAR prior to a DUMP FORMAT command.
Command Syntax:
-> DUMp ACTIONs {CLEar|CONTinue|KEEp}
-> DUMp CHARs TO <object> (REPlace) ENTry <ep> {BUIld {ALL}}
-> DUMp <type> <item> TO <object> (REPlace) ENTry <ep>
-> DUMp SYSProc TO <object> (REPlace)
where <object> is the name of a SPIRES object deck data set, <ep> specifies the entry-point for the created object deck, <type> specifies the type of dump from the following:
FORmat    "select FORCHAR"
VGRoup    "select STATCHAR"
PROtocol  "select COMPROTO"
MSTr      "attach <rec> of <file>"
GOAl      "attach <rec> of <file>"
RECord    "select RECHAR"
RECord    "select SUBFILE"
<item> specifies the key of the record to dump, and may begin with $ to specify "System account".
-> DUMP CHAR TO VARS.OBJ REP ENTRY VARSTABS BUILD  ;(.$1VAR of $DATA)
-> DUMP CHAR TO FUNS.OBJ REP ENTRY FUNSTABS BUILD  ;(.$2FUN of $DATA)
-> DUMP CHAR TO PKTS.OBJ REP ENTRY PKTSTABS BUILD  ;(.$3PKT of $DATA)
-> DUMP CHAR TO SQLR.OBJ REP ENTRY SQLWORDS BUILD  ;(.$4SQL of $DATA)
-> DUMP CHAR TO FDEF.OBJ REP ENTRY FDEFTABS        ; (select FILEDEF)
-> DUMP CHAR TO FORM.OBJ REP ENTRY FORMTABS        ; (select FORMATS)
-> DUMP CHAR TO EXPS.OBJ REP ENTRY EXPSTABS        ; (select EXPLAIN)
-> DUMP VGROUP $XEXPLAIN TO XEXP.OBJ ENTRY VGROUP1 ; (select STATCHAR)
-> DUMP FORMAT $/../PPROMPT TO FORM1.OBJ ENTRY FORMAT1 ; (select FORCHAR)
-> DUMP SYSPROC TO SYSP.OBJ REPLACE                ; (select EXTDEF)
CHAR, MSTR and GOAL require "attach <rec> of <file>" or "select <subfile>" before they are used. The others are best done when the appropriate subfile is selected, such as selecting FORCHAR for DUMP FORMAT.
Note: DUMP SYSProc should only be done after $FILEDEF has been processed, because updates in the DEFQ are not picked up by the DUMP SYSProc command. This usually means waiting one day.
The source for TGRP.OBJ comes from $TRANSACTION.GROUP in RECDEF.
-> SELECT @$TRANSACTION.GROUP
-> DUMP CHARS TO TGRP.OBJ ENTRY TRANSGRP
You would then move these objects into the ~/USPIRES/spisrc/obj directory. Objects are created in the SPIRES files directory when SPIRES itself creates them.
These commands are meant for system maintenance. They are NOT general user commands, only MASTER mode commands.
Special note: DUMP ACTIONS (CLEAR|CONTINUE|KEEP) generates a list in your Active File of the syntax of all SPIRES actions.
There is a MSGDATA.TXT file that is $COPY'd by MSG.TXT, and this file must be rebuilt whenever SYSTEM MESSAGES are changed. MSG.TXT is then recompiled to create the MSG.OBJ and MSGO.OBJ that are included in SPIRES/SPIBILD links.
The easiest way to create MSGDATA.TXT is with the following command:
-> GENERATE MESSAGES
This command is a Macro pair (GEN -> .GEN.MESSAGES) that creates two files: MSGDATA.TXT and MSGDOC.TXT, of which only MSGDATA.TXT is $COPY'd. The other is just a simple text file that associates message numbers with their message.
You would then move these text files into the ~/USPIRES/spisrc/pl360 directory. Then, in SPIRES, "..compall pl360.msg" to create the Object decks, which then must be moved into the ~/USPIRES/spisrc/obj directory.
The ORVYL "lock" facility provides you with a way to guarantee that a given process is only being performed by one person. Commonly it is used to insure that only a single user is updating a specific record at one time. Both private locks and shared locks are available, and up to eight locks total may be set at one time.
(Another kind of lock, called an "attach lock", keeps the ORVYL data sets of a file attached even after the user has attached a different file. The next section [See 16.1.] discusses attach locks.)
To set a lock, issue the SET PRIVATE LOCK or SET SHARED LOCK command:
SET PRIVATE LOCK value
SET SHARED LOCK value
where "value" is an alphanumeric string from one to 40 characters long. It may not begin with a numeral, though numerals may be included within the value. The commands may be abbreviated to SET PLOCK or SET SLOCK.
A lock is cleared by exiting SPIRES or issuing the CLEAR LOCK command:
CLEAR LOCK value
Setting a private lock means that no other user may subsequently set the same lock until you have cleared it. For example, if you have issued the command SET PRIVATE LOCK LOMOND, then if another user tries to set the same lock:
-? set private lock lomond
-Already locked or cleared
-?
Setting a shared lock means that another user may set the same shared lock but may not set it as a private one. In other words, if you issue the command SET SHARED LOCK NESS, then another user can SET SHARED LOCK NESS but not SET PRIVATE LOCK NESS:
-? set private lock ness
-Already locked or cleared
-? set shared lock ness
-?
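The private/shared semantics shown above can be modeled with a small sketch. This is a toy illustration, not a SPIRES API; the LockTable class and its method names are invented.

```python
class LockTable:
    """Toy model of private and shared locks on values."""

    def __init__(self):
        self.locks = {}                # value -> (mode, set of holders)

    def set_lock(self, user, value, mode):
        value = value.upper()          # case is not significant
        held = self.locks.get(value)
        if held is None:
            self.locks[value] = (mode, {user})
            return True
        held_mode, holders = held
        if mode == 'shared' and held_mode == 'shared':
            holders.add(user)          # shared locks may be shared
            return True
        return False                   # "Already locked or cleared"

    def clear_lock(self, user, value):
        value = value.upper()
        held = self.locks.get(value)
        if held is None or user not in held[1]:
            return False               # not set, or not set by you
        held[1].discard(user)
        if not held[1]:
            del self.locks[value]
        return True
```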
The error message "Already locked or cleared" is also displayed if you try to clear a lock that has not been set or one that has not been set by you:
-? clear lock ascending
-Already locked or cleared
-?
A protocol designed to transfer and update a record might set a lock on the record key to insure that no other user could employ the same protocol at the same time to update the same record. The other user would find that the record key was locked; the protocol would probably test the success of the SET PRIVATE LOCK command using $YES or $NO:
/SET PRIVATE LOCK #NEWKEY
IF $NO
THEN * The record is being updated by another user.
THEN RETURN
Another method of setting locks uses a different lock mechanism though it is used similarly to private locks:
SET LOCK value
CLEAR LOCK
Using this method you can only lock one value at a time; no one else may set the same lock. Note that no value is given in the CLEAR LOCK command when this method is used. Generally, because of its versatility, the first method is the preferred method of locking values. However, the second method does allow you to use some special characters, which may not be used (with the exception of a period) with the first method.
The case (upper, lower or mixed) of the locked value is not significant for either method of locking. The commands "SET PRIVATE LOCK ABC" and "SET PRIVATE LOCK abc" are equivalent.
The SET ATTACH LOCK command, used primarily internally in SPIRES and SPIRES programs like Prism and Folio, can be used to keep some data sets of a file attached even when the file itself is basically detached, e.g., via a CLEAR SELECT command. This is useful for files that would be attached and detached regularly during the course of a session. SPIRES uses its own version of the command to keep the $FORMATS and $FILEDEF files attached to each user's session.
SET ATTACH LOCK comes in two forms:
SET ATTACH LOCK            Attach locking is done for data sets of the selected subfile.
SET ATTACH LOCK file-name  Attach locking will take place for data sets of the named file if and when it is SELECTed.
The effect of attach locking is that the MSTR, DEFQ, CKPT, RES and REC1 data sets are not detached when the file itself is detached. Therefore, on the next SELECT of that file, SPIRES understands that the attaches are locked and does not have to incur the overhead of re-attaching them.
The maximum number of files that can be locked in this manner is four (not counting the two files $FILEDEF and $FORMATS that SPIRES always keeps attached).
Since the command is meant for internal use, please consult your SPIRES consultant before using it in any applications of your own.
The SPIRES capability called "subfile tables" provides a way to define relational tables based on any SPIRES subfile you can select. The purpose of SPIRES tables is to eliminate the incompatibilities between SPIRES hierarchical subfiles and RDBMS tables. Tables may exist only virtually, for the duration of a session, or they may be generated and saved as a separate and permanent data set, accessible in SPIRES through the ATTACH SET command.
Their chief advantage is that they are compiled when they are created; using them is more efficient than using other ways of generating RDBMS tables, such as the current DEFINE TABLE implementation. Also, SPIRES subfile tables, starting from the output form of the SPIRES source data, present the data using SQL data types, such as "date", "integer", etc. Not only does that make the data output from SPIRES compatible with table data from the target or other tables, it also means that when SPIRES is able to interpret SQL search commands (which it cannot yet do), SPIRES will properly handle the data-typing of the search value for comparison with data in the subfile table.
As suggested above, subfile tables are only partially implemented; this documentation is somewhat fragmentary as well. At some point in the future, using tables, you may be able to issue SQL-like queries against a SPIRES subfile using a subfile table as an intermediary. Additionally, the DEFINE TABLE command will likely incorporate this feature, improving its efficiency as well. Your suggestions and requests for future development are welcome.
You create a SPIRES table with the DECLARE TABLE command, which, like most other DECLARE commands, usually appears within a protocol, preceding a set of descriptive declaration statements. [See 17.1.] Other commands you may use to work with tables include DEFINE SET, DEFINE DISPLAY SET and SET TABLE, using the THROUGH PATH prefix. [See 17.2.]
The DECLARE TABLE command defines a table that follows the rules and constructs of SQL using elements selected from the selected SPIRES subfile. It either names a table definition stored elsewhere, which is then loaded into SPIRES memory for your use, or it announces that the table definition statements follow (in the standard SPIRES format, "element = value;"), ending with an ENDDECLARE command. In the latter situation, the DECLARE TABLE command can be issued only from within a SPIRES protocol.
This section describes the DECLARE TABLE command, including the parts of the table definition; the next section describes how to use a table once it is defined and declared.
The syntax of the command is:
[WITH DECLARE record-name] DECLARE TABLE tablename
The "tablename" is a name (1-16 characters, no blanks) you will use in a subsequent command to identify the table you want to use. You may declare multiple tables, each with its own DECLARE TABLE command.
You use the WITH DECLARE prefix when the table definition has been previously created and stored in the public TABLES subfile or in another subfile of tables you have access to. This subfile should be compiled using the DEFINED-BY = $TABLE record definition. The "record-name" is the ID of the table definition, the key value of a record in a previously declared table subfile. [See 6.2.] An example below demonstrates this usage.
In the simplest case, the DECLARE TABLE command is followed by the table definition, executed from within a protocol. That part of the protocol has these commands and statements:
DECLARE TABLE tablename
COLUMNS;
  COLNAME = column-name;         <- structure key
  SOURCE.ELEM = elem-name;
  DECLARE.KEY = value;
  SOURCE.IN = value;
  SOURCE.OCC = n;
  SOURCE.BUILD = value;
  DEFAULT = value;
  LITERAL = value;
  COLTYPE = col-type (see list below);
  COLWIDTH = n;
  DECIMALS = n;
  ISKEY;
  RDBMS_COLUMN = value;
  RDBMS_DATATYPE = value;
  RDBMS_DATALENGTH = n;
  COLUMN.OPTIONS = option1, option2, ... optionN;
  COMMENTS = comments;
  [SOURCE.INDEX = index-name;    <- not yet available]
  COLNAME = next-col-name;
  ...
DECLARE.SUBFILE = subfile-name;
SOURCE.STRUCTURE = structure-elem-name;
SUBFILE.NAME = source-subfile-name;
FILE = source-file-name;
MULTI.ELEMENT;
TABLE.NUM = value;
RDBMS_TABLENAME = VALUE;
ENDDECLARE
The table definition consists of column definitions, which describe how elements in the selected SPIRES subfile (the source elements) should be mapped into columns of the table.
Each new column definition begins with a COLNAME statement (the key of the COLUMNS structure). For each column you want, you enter a column name, and you usually name the source element for the column (in the SOURCE.ELEM statement). The statements in the COLUMNS structure are each described below.
To create a column's value, SPIRES generates the output form of the source element. That value is then processed by system-defined processing rules for the specified COLTYPE (see below), creating the proper form of the column value. Because the COLTYPE processing rules are fixed and are based on SQL data types, a column of type DATE in one table will have exactly the same format as one in a different table, insuring compatibility with SQL data base programs.
The DECLARE.SUBFILE statement appears outside of the column definitions, i.e., not within the COLUMNS structure, if any of the elements are defined in the DATA MOVE DECLARES subfile or any other "element definition" subfile, that is, a subfile defined with the $ELEMENT record definition. Such elements would be referenced in the column definition with the DECLARE.KEY statement, described below. The DECLARE.SUBFILE statement thus names the subfile in which the elements named in DECLARE.KEY statements are defined.
The SOURCE.STRUCTURE statement also appears outside the column definitions. This statement may be helpful or even necessary as an aid to enable SPIRES to properly access a desired source element (by eliminating duplicate element names). Also, if source elements are referenced outside the source structure, this statement will ensure that rows will be generated only for occurrences of the source structure.
Other statements also appear outside the column definitions. These statements have meaning to various SPIRES processes which have been built to deal with the generation and movement of data (e.g. PERFORM TABLE CREATE and PERFORM TABLE MOVE). These statements are SUBFILE.NAME, FILE, MULTI.ELEMENT, TABLE.NUM, and RDBMS_TABLENAME.
The entire table definition ends with an ENDDECLARE command.
Except for the required COLNAME statement, all statements in a column definition are optional. A column definition with only a COLNAME and a SOURCE.ELEM statement, with perhaps a COLTYPE statement, is common. Most of the other statements are for special situations.
After COLNAME, the first group of statements is concerned with the source of the data, which starts here in its external form. The second group describes how that output data from the source is manipulated into an appropriate SQL data type or how SPIRES in the future will interpret SQL search commands against the data.
This required statement gives the name of the column as you want to refer to it using SQL. It should follow SQL column name rules, e.g., 1-16 characters long; SPIRES does not verify that the name is a valid SQL column name, however.
This is the name of the element in the selected subfile that will be the source for the column data. Its name may be preceded by structure names in the "structure1@structure2...@elem-name" format to identify the specific element if the element name is not unique in the subfile.
If the DECLARE.KEY statement is used (see below), then the element named in the SOURCE.ELEM statement is used as a "redefined" element, with its internal values being used as the internal values for the declared element. (Its use is equivalent to the "FOR element" in a DECLARE ELEMENT command.) Either the element definition may have a REDEFINES statement in it, or the SOURCE.ELEM statement may appear here; but you may not have both, and neither may appear with SOURCE.IN (see below).
This is the name of an element defined in an "element definition" subfile (named in the DECLARE.SUBFILE statement; see above). It is the key of a goal record in that subfile; the goal record is an element definition similar to a dynamic element definition. SPIRES will use that element definition to create external values, which will then be used as input values to create column values.
The DECLARE ELEMENT command (to which this is related) has the mutually exclusive options of "FOR element" and "IN structure". Similarly, you may use SOURCE.ELEM if you need the "FOR element" option with your declared element here (see above); or you may use the SOURCE.IN statement for the "IN structure" option (see below).
Used only in conjunction with the DECLARE.KEY statement (above), the SOURCE.IN statement names a structure in the selected subfile; for each occurrence of the structure, an occurrence of the DECLARE.KEY element will be generated. SOURCE.IN and SOURCE.ELEM may not both be specified; similarly, SOURCE.IN is incompatible with a REDEFINES statement in the declared element definition.
This statement may be coded to indicate that the column value is to be derived from a particular source element occurrence. This value generally would be represented by an integer that designates which source element occurrence is to be used in generating a particular column. For example, if the source element ADDRESS is a multiply occurring element and the first occurrence holds the Street address and the second holds the City portion, then you can use SOURCE.OCC to direct the two portions to separate columns. For column STREET you would code "SOURCE.OCC = 1;" and for column CITY you would code "SOURCE.OCC = 2;".
The statement "SOURCE.OCC = N;" provides a variation of this statement and may be used to deal with a specific situation that may arise when generating a table from a SPIRES record. The "N" value tells SPIRES to take the column value from the 1st, 2nd, ... "n"th occurrence of the field, where "n" is the number of the row being generated (typically driven by a second multiply occurring element).
An example of both types of SOURCE.OCC usage will be better than all of these words.
Suppose we have a record that looks like this:
-? select tag1
-? display source.test
ID = SOURCE.TEST;
NAME = My Name;
XX = First occurrence;
XX = Second occurrence;
XX = Third occurrence;
XXNUM = 1;
XXNUM = 2;
XXNUM = 3;
As an illustration of "normal" SOURCE.OCC values, check this:
-? Declare Table Type.1
ColName = Id; IsKey; ColWidth = 12;
ColName = Name; ColWidth = 12;
ColName = XX_Occ1; ColWidth = 15; Source.elem = XX; Source.Occ = 1;
ColName = XX_Occ2; ColWidth = 15; Source.elem = XX; Source.Occ = 2;
ColName = XX_Occ3; ColWidth = 15; Source.elem = XX; Source.Occ = 3;
EndDeclare
-? Stack Source.test
-Stack: 1 RECORD
-? For Stack
-? Define Display set table Type.1
-? generate set all
ID = SOURCE.TEST;
NAME = My Name;
XX_OCC1 = First occurrence;
XX_OCC2 = Second occurrence;
XX_OCC3 = Third occurrence;
Now suppose we want to "line up" the XX values with the corresponding XXNUM values. This is shown in the following example:
-? Declare Table Type.2
ColName = Id; IsKey; ColWidth = 12;
ColName = Name; ColWidth = 12;
ColName = XX_Num; ColWidth = 6; Source.elem = XXNUM;
ColName = XX_Occ; ColWidth = 15; Source.elem = XX; Source.Occ = N;
EndDeclare
-? For Stack
-? Define Display set table Type.2
-? generate set all
ID = SOURCE.TEST; NAME = My Name; XX_NUM = 1; XX_OCC = First occurrence;
ID = SOURCE.TEST; NAME = My Name; XX_NUM = 2; XX_OCC = Second occurrence;
ID = SOURCE.TEST; NAME = My Name; XX_NUM = 3; XX_OCC = Third occurrence;
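The two SOURCE.OCC behaviors can be sketched in a simplified model (illustration only, not SPIRES code): a fixed integer always picks that occurrence of the source element, while "N" picks the occurrence whose number matches the row being generated.

```python
def generate_rows(record, columns, nrows):
    """Simplified row generation.

    record  -- dict of element name -> list of occurrence values
    columns -- dict of column name -> (source element, occ),
               where occ is a fixed integer or the string 'N'
    nrows   -- how many rows to generate
    """
    rows = []
    for row_num in range(1, nrows + 1):
        row = {}
        for name, (elem, occ) in columns.items():
            values = record[elem]
            # 'N' aligns the occurrence with the row number
            idx = row_num if occ == 'N' else occ
            row[name] = values[idx - 1]
        rows.append(row)
    return rows
```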
A particular Column may be described as being derived from the multiple values of a source subfile data element. This new field can be defined for any COLNAME that also contains a SOURCE.ELEM value and is coded as follows:
SOURCE.BUILD;
  or
SOURCE.BUILD = 'characters';
  or
SOURCE.BUILD = X'hex-characters';
When this construct is coded, the resulting output for the specified column will consist of a single value made up of the multiple source values concatenated together, with any specified SOURCE.BUILD string placed between successive values.
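The concatenation behavior can be sketched as follows (illustration only, not SPIRES code):

```python
def build_column(values, separator=''):
    """SOURCE.BUILD;               -> plain concatenation (no separator)
       SOURCE.BUILD = '...';       -> the given string between values"""
    return separator.join(values)
```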
If the SOURCE.ELEM does not have an occurrence, the value given here will be used instead. This value is treated like an external form of the source element; in other words, it will be "processed" by the system-defined rules for the particular COLTYPE (see below) to create the column value.
Note: "DEFAULT = X'hex-characters'" may be used to supply a hex value.
You may choose for a column to have a literal value not derived from elements in the source subfile by specifying the LITERAL statement instead of the SOURCE.ELEM statement. You include the desired value in the statement, e.g., "LITERAL = 9/1/1998;". This value is treated like an external form of the source element; in other words, it will be "processed" by the system-defined rules for the particular COLTYPE (see below) to create the column value.
Note: "LITERAL = X'hex-characters'" may be used to supply a hex value.
Below is the second group of commands, which describe how SPIRES should manipulate the source data to normalize it for RDBMS tables and, in the future, how SPIRES will interpret SQL commands issued against this data.
The COLTYPE statement describes the type of the column as one of a standard set of data types recognized by SQL. SPIRES has an Inproc and Outproc rule string defined for each type. The source data (in its output form) is processed through the Inproc and then the Outproc to produce the column data. This has the effect of normalizing all data of a particular type, putting it in a form that is recognized and understood by SQL and RDBMS data bases. The processing rules are system-defined; columns with a COLTYPE of DATE will have the same format in all SPIRES applications. (At this early stage, there is still some flexibility in the definitions of these types; contact your SPIRES consultant if you have design suggestions.)
COLTYPE values are CHAR, DATE, TIME, BITS, PACK, REAL, INT, HEX, and DELETE. [See 9.1, 9.2 for more information about DELETE processing.]
This is the allowed width of the column data, expressed as an integer. This value essentially becomes a piece of "element information" for the column, equivalent to the WIDTH statement in an eleminfo packet.
The DECIMALS value is an integer that represents the number of decimal places that a column of type PACK will have.
This flag statement, which takes no value, indicates that the column is the key for the table. A key is necessary when SPIRES is building a table, since the table exists as a SPIRES database. A table may also have multiple keys. In this case you would code "ISKEY;" for each COLNAME that is to be included. Also, these columns must be in sequence as the first columns of the table. If there is no unique key column, omit this statement and SPIRES will generate a slot key for each row of the table.
This statement may be used to specify an alternate COLUMN NAME that corresponds to the name of the actual RDBMS column name. This value may be up to 32 characters in length.
This value and the RDBMS values below are used by SPIRES to handle data transformations between SPIRES subfiles and RDBMS databases.
This statement may be used to specify the RDBMS data type for the column in the RDBMS database.
This statement may be used to specify the RDBMS data length for the column in the RDBMS database. This value may differ from that given in COLWIDTH above since the column data may be expressed in differing ways in the different systems.
The final two statements are miscellaneous ones:
This statement is a multi-valued field through which you may indicate various options to direct the table processor in its data transformation task. COLUMN.OPTIONS is expressed as: "COLUMN.OPTIONS = Option1, Option2, ... OptionN;" where each option may take on the following values:
The BYPASS option tells SPIRES to ignore this column when looking for record values that have changed for change generation processing. [See 7.1.]
The SINGULAR option tells SPIRES that the element is multiply occurring by definition but that the element occurs only once in the records that will be processed. This can be used to circumvent S324 errors, which signal that too many source elements are being used to create multiple table entries. The limit is 16 multiply occurring elements. If you are thwarted by this limit, add this statement to any columns whose source elements occur only once but are defined in the SPIRES file definition as multiply occurring.
The DEPENDENT option tells the Table processor that the current source element is dependent upon the existence of other source element values in order for a row to be created. In other words, if the only fields generated within a particular row of a table are "Dependent" fields, then the row will not be generated.
The NOSUBTREE option tells the Table processor that the source element is not within the subtree given by the SOURCE.STRUCTURE value. This will ensure that a second source element with the same name is used.
You may add comments as needed for the column definition.
To use the same table definition in multiple contexts, you may store its definition in a tables subfile and refer to it there.
The subfile can be either the public subfile called TABLES or a subfile of your own ownership, whose goal record-type is defined with the $TABLE record definition:
RECORD-NAME = REC01; DEFINED-BY = $TABLE;
Table goal records have the same elements as the table declaration above, plus a key element called ID (a 1-16 character name, no blanks, preceded by your account number) and several other optional record-level elements, such as AUTHOR, COMMENTS, etc. The only one that is not obvious, perhaps, is the SUBFILE element, where you name the subfile with which the table will be used. (The SUBFILE element seems to be primarily for your convenience as a reference comment, not an element used by table processing.)
To use a pre-defined table, you first select the subfile for which the table is defined. The example below shows that subfile being on the primary path, but you may select it through a subfile path if you need another subfile to be the primary.
-> select races
Next, select the tables subfile through a new path.
-> through next select tables        <- or whatever the name of
-Path established: 1                    your tables subfile is
Next, issue the SET DECLARE PATH command, naming the path you just established to tell SPIRES where to find table definitions that will be referenced in future commands. [See 6.2.]
-> set declare path 1 for tables
Next, declare the table, referencing the key (the ID) of the table goal record that you want to use.
-> with declare gq.doc.race1table, declare table newtable
The table you've named Newtable, based on the stored table definition GQ.DOC.RACE1TABLE, is ready to use. If you had selected the RACES subfile through a path rather than as the primary subfile, you would precede the DECLARE TABLE command not only with the WITH DECLARE prefix but also with the THROUGH prefix to name the subfile path for which the table is being defined.
With subfile tables, you can represent a hierarchical SPIRES goal record-type as a collection of multiple tables, dynamically flattening the structural records for use with SQL commands (in the future) or to create data sets to pass to relational data base management systems as a table version of your SPIRES subfile. This section describes how you set up the tables in SPIRES for whatever uses you put them to.
Notice that the full details of "normal" table use are described first, followed by explanation of a possibly simpler method using output control.
Tables are declared and used through paths. You select the source subfile (either as the primary subfile or as a subfile on a path), declare the table, and then establish a new path that uses that table. For example, assuming these commands are being executed from a protocol:
-> select family
-> declare table familymembers
...                                  <- table declaration statements
-> enddeclare
-> through next set table familymembers
-Path 1 established
->
In the primary path, you continue to work with the original subfile, with the elements of the FAMILY goal records. But in path 1, you now have a different "view" of the subfile, essentially with a new set of goal records (the rows of the table), with new "elements", the columns of the table. Commands issued through the path that refer to elements, such as SET FORMAT $REPORT or SHOW ELEMENT CHARACTERISTICS, would name or display the new column elements of the path, not the original source file elements. You cannot display a record in the primary directly through the table path, however; for instance, a command like THROUGH 1 DISPLAY ALL would not work.
Suppose, for example, that in the FAMILYMEMBERS table are a FAMILY_NAME column derived from the NAME element in the FAMILY subfile's goal record-type and a FIRST_NAME column also derived from the NAME element.
-> through 1 set format $report family_name first_name
->
Unlike other paths, however, a table path is used indirectly. To see data through the table, you must generate a set from the records you want from the original subfile using the table definition as the blueprint for the generated data. For a display set, the procedure might look like this (continuing the example from above):
-> for subfile
+> define display set table = familymembers
+> generate set all
April 24, 1998                                        Page 1
family_name      first_name
---------------- ---------------
Johnson          Howard
Johnson          Ladybird
Wilson           Woodrow
Wilson           Nancy
...
Notice that the data is generated on the primary path, using the table established in a separate path as a database template.
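The flattening that a table performs on a hierarchical record can be sketched roughly as follows. The record layout and field names here are illustrative stand-ins for the FAMILY example, not SPIRES internals: a family-level value is repeated into every row generated from the multiply occurring member data.

```python
# Sketch of flattening a hierarchical goal record into table rows.
# A family record with multiply occurring members yields one row per
# member; the family-level NAME value repeats in every row, which is
# why "Johnson" appears twice in the generated set above.
def flatten(record: dict) -> list[tuple[str, str]]:
    rows = []
    for member in record["members"]:
        rows.append((record["name"], member))  # (family_name, first_name)
    return rows

johnsons = {"name": "Johnson", "members": ["Howard", "Ladybird"]}
print(flatten(johnsons))  # [('Johnson', 'Howard'), ('Johnson', 'Ladybird')]
```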
Commands you use to work with an established table include the SET TABLE and CLEAR TABLE commands:
THROUGH {pathnum|pathname|NEXT} SET TABLE tablename
THROUGH {pathnum|pathname} CLEAR TABLE tablename
As the syntax suggests, tables are always set through a path other than the primary one. The "tablename" must match the name of a declared table. [See 17.1.] The SET TABLE command establishes the table within a path (it must be a new path), and the CLEAR TABLE command clears the path and the declared table as well.
Typically, once you set the table path, you set a format, though you may certainly use just the standard SPIRES format. Additionally, you may want to use the SET FILTER command with the FOR * option to limit the "row" output to the rows you want; rows not matching the criteria expressed in the where clause would be eliminated from the output. For example, after setting the $REPORT format above, you could add the following command (assume the table has a SEX element in its definition):
-> through 1 set filter for * where sex = F
-> for subfile
+> define display set table = familymembers
+> generate set all
April 24, 1998                                        Page 1
family_name      first_name
---------------- ---------------
Johnson          Ladybird
Wilson           Nancy
...
The DEFINE SET command with the TABLE option defines a set using the table's fields as the elements for the set. It can be used with a display set as well (DEFINE DISPLAY SET). The TABLE option hence replaces the lists of elements you would normally place on the command:
DEFINE [DISPLAY] SET TABLE = tablename
where "tablename" is the name of a currently declared table.
If you generate a regular set, SPIRES creates a sequential data set (not a tree-structured data set), which it stores on disk. You can use the ATTACH SET command to work with the set:
[THROUGH path] ATTACH SET setname
This command works only for sets generated via a table. (Likewise, a table-generated set does not work with normal set processing, e.g., "FOR SET setname".) Again, you do not have key access to the "records" in the attached set; for example, a "DISPLAY key" command will fail. (In general, you would process the records through Global FOR.) But otherwise, the attached data set looks and behaves like a regular SPIRES data set or non-indexed subfile.
To use tables with output control, you first declare the tables as described above. Then you declare the output control packets, which would include any packets that you want to use the tables. Those packets would contain the TABLE.NAME statement (to name the table you wanted to use; again, it must be a table that is already declared at the time the output control is declared) and, optionally, a TABLE.WHERE statement, which internally establishes a "SET FILTER FOR * WHERE clause" command using the WHERE clause you provide.
One advantage of using output control with tables is that you do not have to set up the paths yourself; SPIRES does the necessary work behind the scenes.
You can use output control's change generation capabilities with tables to generate changes in the appropriate table form. [See 9.]
Table definitions can be stored in a system subfile called TABLES, in which case using them generally means establishing another path (in which the TABLES subfile is set) as a declare path. That process usually looks like this:
-> select family                     <- the primary subfile is selected
-> thru next select tables
-Path established: 1
-> set declare path 1 for tables
-> with declare gq.jnk.family declare table familymembers
->
The table is now available just as if it had been declared with the DECLARE TABLE command alone. [See 17.1.] You would probably then want to establish the table path in a new path, not the same one as the declare path.
The DECLARE INPUT TABLE command defines a table that may be used in conjunction with Input Control processing to map table columnar data elements into a set of data elements of a hierarchical record. It either names an input table definition stored elsewhere, which is then loaded into SPIRES memory for your use, or it announces that the table definition statements follow (in the standard SPIRES format, "element = value;"), ending with an ENDDECLARE command. In the latter situation, the DECLARE INPUT TABLE command can be issued only from within a SPIRES protocol.
This section describes the DECLARE INPUT TABLE command, including the parts of the table definition; the next section describes how to use a table once it is defined and declared.
The syntax of the command is:
[WITH DECLARE record-name] DECLARE INPUT TABLE tablename
The "tablename" is a name (1-16 characters, no blanks) you will use in a subsequent command to identify the table you want to use. You may declare multiple tables, each with its own DECLARE INPUT TABLE command.
You use the WITH DECLARE prefix when the table definition has been previously created and stored in a subfile of input tables you have access to. This subfile should be compiled using the DEFINED-BY = $INPUT.TABLE record definition. The "record-name" is the ID of the table definition, the key value of a record in a previously declared input table subfile. [See 6.2.]
In the simplest case, the DECLARE INPUT TABLE command is followed by the input table definition, executed from within a protocol. That part of the protocol has these commands and statements:
DECLARE INPUT TABLE tablename
SUBFILE.NAME = destination-subfile-name;
DECLARE.SUBFILE = subfile-name;
COLUMNS;
   COLNAME = column-name;            <- structure key
      COMMENTS = comments;
      ISKEY;
      DEST.ELEM = elem-name;
      DECLARE.KEY = value;
      LITERAL = value;
      COLTYPE = col-type (see list below);
      DEST.OPTIONS = option1, option2, ... optionN;
      RDBMS_COLUMN = value;
      COLWIDTH = value;
   COLNAME = next-col-name;
   ...
FILE = destination-file-name;
DEST.STRUCTURE = destination-structure-elem-name;
MULTI.ELEMENT;
TABLE.NUM = value;
RDBMS_TABLENAME = value;
ENDDECLARE
The table definition consists of column definitions, which describe how "columns" of the input table should be mapped into elements in the selected SPIRES subfile (the destination elements).
Each new column definition begins with a COLNAME statement (the key of the COLUMNS structure). For each column you want, you enter a column name, and you usually name the destination element for the column (in the DEST.ELEM statement). The statements in the COLUMNS structure are each described below.
The following statements appear outside the column definitions: SUBFILE.NAME, DECLARE.SUBFILE, FILE, DEST.STRUCTURE, MULTI.ELEMENT, TABLE.NUM, and RDBMS_TABLENAME.
The entire table definition ends with an ENDDECLARE command.
Except for the required COLNAME statement, all statements in a column definition are optional. A column definition with only a COLNAME and a DEST.ELEM statement is common. Most of the other statements are for special situations.
After COLNAME, the first group of statements is concerned with the source of the data, which starts here in its external form.
This required statement gives the name of the column from the input table that provides the source value to be stored. Since the source data has been defined as a SPIRES input table, the column name is really the name of one of the input table's data elements.
The COLTYPE statement describes the type of the column as one of a standard set of data types. The primary reason to include this field is that SPIRES commands are available to create SPIRES Record definitions (RECDEF) from Declare Input Table structures. SPIRES can then use this information to generate an Inproc and Outproc rule string defined for each type.
This is the name of the element in the selected subfile that will be the destination for the column data. The DEST.ELEM name must name a data element within the destination structure (DEST.STRUCTURE) if that statement has been coded. There are exceptions to this rule if the ISKEY statement is included for this particular COLNAME.
This is the name of an element defined in an "element definition" subfile (named in the DECLARE.SUBFILE statement; see above). It is the key of a goal record in that subfile; the goal record is an element definition similar to a dynamic element definition. SPIRES will use that element definition to create external values, which will then be used as input values to create destination element values.
This flag statement, which takes no value, indicates that the column is the key for the destination structure. If the DEST.STRUCTURE statement has been coded, the DEST.ELEM value is not necessarily restricted to being within the structure. In fact, it is necessary to include all DEST.ELEM "keys" that are needed to "locate" the input row within the destination goal record.
This statement may be used to specify the COLUMN NAME that corresponds to the name of the actual RDBMS column name that provided the source value for the input table column. This value may be up to 32 characters in length.
This statement may be used to specify the width of the columnar value. This value is especially important if the DECLARE INPUT TABLE definition is to be used to generate a RECDEF record. In that case, the value should be coded for any key data element (see the ISKEY statement above).
SPIRES supports a special type of "packed decimal", which is a form for numeric data values that possibly have a decimal portion. The support is system-wide in SPIRES: elements may be stored as packed decimals in subfile records, variables in protocols, formats or USERPROCs may be packed decimals, and general arithmetic operations may be done with packed arithmetic. Compared to floating-point ("real") or integer arithmetic, packed decimal arithmetic has a larger range, more precision and more accuracy.
Packed decimals in SPIRES are not exactly the same as they are in other computer languages. Because they are much more powerful in SPIRES, there are many more capabilities and many more details involved in their use. The first few pages of this chapter may be considered a "primer" to packed decimals in SPIRES. Details on exceptional uses or internal handling are provided later in the chapter.
The packed decimal data type has the largest range of any of the numeric data types supported by SPIRES, which are binary, floating point, and packed decimal. Arithmetic can be performed on values having as many as thirty places of precision and an exponent from -128 to 127. ("Precision" and "exponent" are terms defined below.) Data elements whose values will not be used for arithmetic can have up to 509 places of precision.
Unlike the standard packed decimal data type of other systems, the SPIRES version allows the input value to have a decimal portion, such as "1.36". That statement may sound peculiar, but the word "decimal" in "packed decimal" refers not to the type of input value but to the manner of storage, cf. binary or hexadecimal.
Packed decimal values may be input in one of four forms:
There are four ways that an input value can be transformed to a packed decimal value:
In SPIRES, packed decimals are not stored internally the same way as in other systems. When a value is input, any decimal point is removed and an extra byte is added to the end of the value, telling SPIRES in effect where the decimal point belongs. The fact that the extra byte exists is only important if you want to access packed decimal elements in SPIRES files from a non-SPIRES program.
The adjustment described above creates the two components of a packed decimal: the "integer" and the "exponent". Using the forms of input values shown above, we can say that all input values are converted to the exponential form, where the portion of the value preceding the E is the "integer" and the portion that follows is the exponent. The exponent represents where the decimal place exists in relation to the integer.
Below are some numeric values shown with their exponential equivalents:
input value    packed value
12             12E0    (12 x 1; 1 = 10 to the zero power)
-12            -12E0
12.0           120E_1
12.5           125E_1
1250           1250E0
125E1          125E1
0              0E0
The exponent tells the number of places to move the implied decimal point that follows the integer, moving it to the right if the exponent is positive and to the left if negative. For example "120E_1" indicates that the implied decimal point after the zero in 120 should be moved one place to the left, resulting in "12.0".
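The conversion from an input value to integer-and-exponent form can be sketched in Python. This is an illustration of the notation only, not SPIRES's internal storage code; note that trailing zeroes are kept, as in the table above.

```python
def pack(text: str) -> tuple[int, int]:
    """Convert a decimal string to (integer, exponent) exponential form.
    Trailing zeroes are preserved, so "12.0" becomes (120, -1), not (12, 0).
    """
    if "." in text:
        whole, frac = text.split(".")
        # remove the decimal point; the exponent records where it belonged
        return (int(whole + frac), -len(frac))
    return (int(text), 0)

print(pack("12"))    # (12, 0)   i.e. 12E0
print(pack("12.5"))  # (125, -1) i.e. 125E_1
```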
The number of digits in the integer is the "precision" of the value. The precision of any value (whether it is in exponential form or not) can be determined by counting the number of digits in it, beginning with the left-most non-zero digit and continuing to the right-most digit, zero or not:
value    precision
12       2
12.0     3
12.5     3
1250     4
125E1    3
0        0
Note that the first two values above, though both "equal" to 12, are not exactly the same in packed decimal notation: "12E0" and "120E_1". The two values have different exponents (0 and -1) and different precisions (2 and 3). Still, if SPIRES compared the two values in arithmetic, they would be considered equivalent.
For the value 0 (0E0), be aware that its precision is 0, even though there is one digit in the integer. Remember that you start counting the number of digits with the left-most non-zero digit. Because we will use the terms "precision" and "number of digits in the value" somewhat interchangeably, the second term does not refer to leading zeroes.
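The counting rule for precision can be sketched as a small Python function operating on the exponential notation used in this chapter (underscore for a negative exponent). This is an illustration of the definition, not a SPIRES routine.

```python
def precision(packed: str) -> int:
    """Count digits from the left-most non-zero digit through the
    right-most digit of the integer portion of an exponential value."""
    integer = packed.split("E")[0].lstrip("-")
    significant = integer.lstrip("0")   # leading zeroes do not count
    return len(significant)

print(precision("12E0"))    # 2
print(precision("120E_1"))  # 3
print(precision("0E0"))     # 0 -- zero has precision 0, per the text
```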
In most SPIRES applications that use packed decimals, the difference in precision between 12 and 12.0 is not a valuable distinction; in fact, it may be undesirable. If the values are money values, for instance, you might enter 12 or 12.0, but the value you mean (and want returned) is "12.00". Thus, functions and processing rules (notably the $DECIMAL function and $DECIMAL system proc) are available that can give the value a fixed number of decimal places (that is, set the exponent) and then alter the precision of the value to fit. For example, if you process a packed element through the $DECIMAL system proc, requesting two decimal places, the values 12 and 12.0 would both be converted to 12.00 (1200E_2). On the other hand, a value such as 11.996 would also be converted to 12.00: extra places of precision are removed, and appropriate rounding occurs.
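Python's standard decimal module can mimic this fixed-decimal-places adjustment. This is a rough analogue of what $DECIMAL does, not its actual implementation; in particular, half-up rounding is an assumption here, since the manual says only that "appropriate rounding occurs".

```python
from decimal import Decimal, ROUND_HALF_UP

def decimal_adj(value: str, places: int) -> Decimal:
    """Give a value a fixed number of decimal places (i.e. fix the
    exponent), rounding away extra precision as needed."""
    quantum = Decimal(1).scaleb(-places)   # places=2 -> Decimal("0.01")
    return Decimal(value).quantize(quantum, rounding=ROUND_HALF_UP)

print(decimal_adj("12", 2))      # 12.00
print(decimal_adj("11.996", 2))  # 12.00 -- extra precision removed, rounded
```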
Other functions and system procs can be used to perform the opposite task: giving values a fixed precision and altering their exponents as appropriate. [See 18.1.3.]
The internal storage length of a packed decimal value depends on its precision, i.e., the number of significant digits you want to store:
length = (digits + 3) / 2
where "length" is the length of the value in bytes and "digits" is the number of digits in the value that you want to store. If the result is not an integer, it should be rounded up to the next integer.
Here are some examples using this formula:
input value    packed value    precision    length in bytes
100            100E0           3            3 = (3+3)/2
100.00         10000E_2        5            4 = (5+3)/2
1E2            1E2             1            2 = (1+3)/2
0              0E0             0            2 = (0+3)/2 *
                                            * = rounded up
The formula shows one reason why you might want to fix, or at least limit, the precision of a value: to fix, or at least limit, the length. That type of processing is done by the $PRECISION or $WINDOW functions and system procs.
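The length formula is simple enough to check directly. This sketch just encodes the formula from the text, with the round-up handled by a ceiling:

```python
import math

def packed_length(precision: int) -> int:
    """Storage length in bytes: (digits + 3) / 2, rounded up to the
    next integer if fractional."""
    return math.ceil((precision + 3) / 2)

print(packed_length(3))  # 3, matching the 100 example above
print(packed_length(5))  # 4, matching the 100.00 example
print(packed_length(0))  # 2, the minimum (the value 0)
```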
When you declare a packed decimal-type variable in a vgroup, you either include a LENGTH statement or let the length default to 4 bytes. Remember from the above formula that the length and the precision are related; here is the formula reversed:
digits = (length * 2) - 3
meaning that a four-byte packed decimal variable can store only five places of precision. The input value for a four-byte packed variable will therefore lose precision if it contains more than five places; for example, the input value 123456 would be converted to 12345E1. Consequently, if you want to handle large packed decimal values in vgroups, be sure that you raise the length from the default of four bytes to one suitable for your needs. Eight bytes, allowing 13 places of precision, is usually a good choice.
When packed decimal values are displayed, they will appear in one of the four input forms, depending on the value and on how (or if) it is processed for output. For example, using the $DECIMAL function or system proc for output, the value could be displayed in decimal notation with a fixed number of decimal places. However, by default (that is, through a $PACK.OUT or A82 processing rule for an element, or via the "/*" command for a variable), SPIRES uses the following rules to determine the form:
Users often control the output form used by applying processing rules and functions, such as the $EDIT function or system proc, which lets you specify an edit mask that is applied to the value on output. As mentioned above, the $DECIMAL function or system proc is commonly employed to force all values to have the same number of decimal places, giving them a uniform appearance.
Numeric data elements that may have non-integer values (such as monetary figures) are usually declared as packed decimal elements. About a dozen system procs and a half dozen actions supporting packed decimal elements are available, providing various tests and conversions; these are described later. Listings of them are available by issuing the commands EXPLAIN ACTIONS, PACKED DECIMAL and EXPLAIN SYSTEM PROCS, FOR PACKED DECIMAL VALUES.
Below are examples of INPROC, OUTPROC, SRCPROC and PASSPROC strings that are commonly coded for packed decimal elements. Details on the system procs shown are provided in the reference manual "System Procs" or online through the EXPLAIN command.
Generally speaking, the stored form of packed decimal elements should have a fixed length (i.e., a limited amount of precision) and a fixed number of decimal places (i.e., a fixed exponent). Here are the appropriate statements for the file definition:
INPROC = $PACK/
         $DECIMAL.ADJ(bytes,places);
OUTPROC = $PACK.OUT;
LENGTH = bytes;
where "bytes" is the length in bytes and places is the number of decimal places to appear in the value. To help you determine the appropriate length, think of the longest value you expect for that element, determine its precision (including the decimal places you will allow), and then use the formula introduced earlier:
length = (precision + 3) / 2
where the result is rounded to the next highest integer if it is fractional. (You might want to add an extra byte or two if you are not certain of the length of the largest possible value.)
Here is an example of an element using these processing rule strings:
ELEMENT = METER.READING;
   INPROC = $PACK/
            $PACK.TEST(GE,0)/
            $PACK.TEST(LE,1000)/
            $DECIMAL.ADJ(4,1);
   OUTPROC = $PACK.OUT;
   LENGTH = 4;
The INPROC string first converts the input value to a packed decimal if it can ($PACK); if it cannot do so, a serious error occurs and the input value is rejected. The converted packed value is then verified to be greater than or equal to zero and less than or equal to 1000 ($PACK.TEST); again, an error will occur if either test fails. The value is then adjusted so that it has a single decimal place and is four bytes long. (If the value has too many decimal places, they are truncated, with appropriate rounding applied; if the value is longer than four bytes after any truncation, an error occurs and the value is rejected.) The OUTPROC converts the stored packed value for display.
The four-byte length was determined by figuring the longest allowed value (1000), including the number of decimal places allowed (1000.0), finding its precision (5), and applying the formula ((5+3)/2 = 4).
Here are some sample values as they would be processed:
input value    output value    cause of error (if applicable)
100            100.0
100.55         100.6
1000.01        *               value greater than 1000
-350           *               value less than 0
                               * indicates error
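The effect of that INPROC string can be sketched with Python's decimal module. This is an analogue of the pipeline, not the SPIRES implementation; half-up rounding in the $DECIMAL.ADJ step is an assumption.

```python
from decimal import Decimal, ROUND_HALF_UP, InvalidOperation

def meter_reading(text: str) -> Decimal:
    """Rough analogue of: $PACK / $PACK.TEST(GE,0) / $PACK.TEST(LE,1000)
    / $DECIMAL.ADJ(4,1) -- convert, range-check, fix one decimal place."""
    try:
        value = Decimal(text)                     # $PACK: convert or reject
    except InvalidOperation:
        raise ValueError("not a numeric value")
    if not (0 <= value <= 1000):                  # the two $PACK.TESTs
        raise ValueError("value out of range")
    # $DECIMAL.ADJ: one decimal place, with rounding of extra places
    return value.quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)

print(meter_reading("100.55"))  # 100.6, as in the sample values
```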
If you decide to index a packed element like METER.READING, you would code a SRCPROC and PASSPROC like this:
SRCPROC = $PACK/
          $DECIMAL.ADJ(bytes,places);
PASSPROC = $PASS.ELEM(elemname,NUMERIC);
where "bytes" and "places" are the same values used in the INPROC, and "elemname" is the name of the element being indexed.
If desired, you could include the $PACK.TEST system proc in the SRCPROC string so that search values outside of the allowed range of element values would be rejected.
Note: A limitation of packed decimal elements is that negative values may not be indexed properly for your needs. For example, the number -2 will be indexed between 1 and 3. A search where the equality operator is used (e.g., FIND READING = -2) will work properly, but a range search will not (FIND READING > 1 would retrieve records with "-2" values). If you must index packed values that may be negative and you must perform range searches, you should probably use floating-point (real) numbers rather than packed decimals.
If your packed decimal value is a monetary value, you might prefer to display it on output with an edit mask:
ELEMENT = COST;
   INPROC = $UNEDIT/
            $PACK/
            $DECIMAL.ADJ(6,2);
   OUTPROC = $EDIT('-$Z,ZZZ,ZZZ.99');
The $EDIT system proc converts the value from a packed decimal to a string, adjusting the value into the given mask. [See 19.] The $UNEDIT proc is used on input to strip off the extra commas and dollar signs of the input value, which would cause the value to be rejected by $PACK if they were there.
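A rough Python equivalent of that edit mask can be written with standard format specifiers. The exact $EDIT semantics (fill characters, Z-suppression behavior, sign placement) are simplified here; this only shows the general effect of the '-$Z,ZZZ,ZZZ.99' mask.

```python
def edit_money(value: float) -> str:
    """Approximate '-$Z,ZZZ,ZZZ.99': optional leading minus, dollar
    sign, comma grouping, two fixed decimal places. Zero suppression
    of leading digits falls out of Python's default formatting."""
    sign = "-" if value < 0 else ""
    return f"{sign}${abs(value):,.2f}"

print(edit_money(1234.5))  # $1,234.50
print(edit_money(-42))     # -$42.00
```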
Though making a packed element fixed-length is recommended here, it is not required. If you will not be indexing the element, for example, you may not want to fix the length. [See 18.2.]
Packed decimal variables are frequently used, especially since the default arithmetic in SPIRES uses packed decimals. If a packed decimal variable is declared in a vgroup, it must be assigned a length, usually from 2 to 16 bytes long; if no LENGTH statement is coded, the default length is four bytes. (The shortest packed decimal value, 0, is two bytes long; the longest packed decimal that SPIRES can use to perform arithmetic is 255 bytes long, though values longer than 16 bytes will be "truncated" for arithmetic, and results will be no longer than 16 bytes.) Remember that the chosen length controls the amount of precision the stored variable will have.
To guarantee that a variable value will have a certain number of decimal places, the DISPLAY-DECIMALS statement can be coded in a variable definition, specifying the number of decimal places the variable will have when it is output. The function of this statement is similar to the $DECIMAL function.
Here is a sample variable definition in a vgroup:
VARIABLE = UNIT.COST;
   TYPE = PACKED;
   OCC = 1;
   LENGTH = 6;
   DISPLAY-DECIMALS = 2;
A UNIT.COST variable can have 9 places of precision ((6*2)-3); any more will cause truncation of the value and appropriate rounding. On output, the value will be adjusted to have two decimal places.
The remainder of this chapter covers packed decimals more thoroughly, presenting this material in an expanded, more detailed manner.
The packed decimal data type has the largest range of any of the numeric data types supported by SPIRES (binary, floating point, and packed decimal). Arithmetic can be performed on values having as many as thirty places of precision and an exponent from -128 to 127. ("Precision" and "exponent" are terms defined below.) Data elements whose values will not be used for arithmetic can have up to 509 places of precision.
Unlike the standard packed decimal data type of other systems, the SPIRES version allows the input value to have a decimal portion, such as "1.36". Therefore, data stored as "packed" in a SPIRES file will not be read correctly if another program accesses those values directly and assumes they are packed in the standard manner. Such problems do not usually arise, however.
Though the range and power of packed decimals are much greater than those of integers or floating-point numbers, packed decimals are not unconditionally recommended for all numeric applications. Due to a sorting limitation discussed in detail later, you may not want to use them for indexed elements, for example. [See 18.2 for a complete discussion of packed decimals as data elements.]
Several important terms are used frequently when discussing packed decimals: integer, exponent, precision, and magnitude. These specify components or characteristics of a packed decimal that are described in this section.
When a numeric value is converted to PACKED (for "packed decimal"), it is internally adjusted to have a variable length "integer", followed by an exponent representing where the decimal point exists in relation to the integer. For notating this form, an E is used to separate the integer from the exponent, and an underscore (_), as opposed to a minus sign (-), is used to indicate a negative exponent. (Both the integer and the exponent may be positive or negative; the underscore character for the exponent is used to prevent confusion with the subtraction operation.) The E indicates that the integer is to be multiplied by 10 to the power indicated by the exponent.
Below are some numeric values in "standard" notation and in packed decimal exponential notation:
standard    packed
12          12E0    (12 x 1; 1 = 10 to the zero power)
-12         -12E0
12.0        120E_1
12.5        125E_1
12.53       1253E_2
125         125E0
12500       12500E0
125E2       125E2
0           0E0
As you can tell by the examples, the exponent indicates the number of places to move the decimal point following the integer, moving it to the right if the exponent is positive or to the left if negative. For example, 120E_1 indicates that the implied decimal point after the zero in 120 should be moved one place to the left, resulting in "12.0".
Notice from the examples that trailing zeroes are never discarded in the conversion; also, no value can have a positive exponent unless the value was input in exponential notation, as shown by the penultimate example. These details relate to the concept of "precision".
The integer will always be adjusted to have at least one digit, even if it is only a zero.
The "precision" of the packed decimal value can be expressed as a numeric value: the number of digits in the integer, counting the left-most non-zero digit as "one", and continuing through to the right-most digit, whether it is zero or not.
For example,
packed value    integer    precision
0E0             0          0
6E0             6          1
1452E_2         1452       4
14520E_3        14520      5
1452E6          1452       4
The "exponent" can have a range from -128 to 127 inclusive. The exponent represents the power of ten by which the integer is to be multiplied to get the packed value. When the exponent is negative, its absolute value is the number of decimal places in the packed value. For example, the value 12.35 (1235E_2) has an exponent of -2; the absolute value of -2 is 2, the number of decimal places in the value.
The sum of the packed value's precision and exponent is called the value's "magnitude", a number that is occasionally useful in handling packed decimal values.
Note: the magnitude of the value is also equal to one more than the "power of ten" assigned to the left-most non-zero digit of the non-packed value. For example, 12E_5 (or 0.00012) has a magnitude of -3, which can be calculated by adding the precision (2) and the exponent (-5), or by adding one to the "power of ten" assigned to the "1" (the left-most non-zero digit), which is "-4".
Here are some packed values and their exponents, precisions and magnitudes:
packed value    exponent    precision    magnitude
0E0                  0          0             0
12E0                 0          2             2
-146E_13           -13          3           -10
1E2                  2          1             3
100E0                0          3             3
10000E_2            -2          5             3
You may have noticed that each of the final three values was a different way to write 100 in exponential notation. You can vary the exponent and the precision of a value, but its magnitude will always remain the same. The ability to adjust the precision and exponent of a value without affecting its basic value will be useful later.
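The definitions of precision and magnitude can be stated compactly in Python (again, a sketch, not SPIRES code; the integer is taken as a digit string and the exponent as a signed number):

```python
def precision(integer: str) -> int:
    """Digits counted from the left-most non-zero digit through the
    right-most digit; a bare zero has precision 0."""
    return len(integer.lstrip("-").lstrip("0"))

def magnitude(integer: str, exponent: int) -> int:
    """Magnitude is the sum of the precision and the exponent."""
    return precision(integer) + exponent

# Three ways to write one hundred -- the magnitude never changes:
for i, e in (("1", 2), ("100", 0), ("10000", -2)):
    print(i, e, precision(i), magnitude(i, e))
```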
Use the command EXPLAIN $PACKTEST FUNCTION for more details about how to retrieve precision, exponent, magnitude, and other statistics about packed values.
There are four different ways in which packed decimals may be entered or displayed:
You, the user, have a certain amount of control over how a packed decimal will be displayed. For example, by using the $DECIMAL function or system proc, you can require all values to be displayed with a given number of decimal places in decimal notation.
By default however (that is, when packed decimal values are being displayed by action A82 or by the $PACK.OUT system proc for elements, or with the "/*" command for variables), SPIRES uses the following rules to determine which form to use:
Infinity is always displayed as "-@@" or "@@".
Here are those three ways of looking at one hundred again:
packed value    exponent    precision    magnitude    display value
1E2                  2          1             3       1E2
100E0                0          3             3       100
10000E_2            -2          5             3       100.00
In the first example, the exponent is positive, so exponential notation is used. In the second, the exponent is zero, and thus integer notation is used. In the third, since the exponent is neither positive nor zero, and the magnitude is not less than -3, the value is displayed in decimal notation.
Disregarding the rules for a moment, why wouldn't SPIRES display the first sample value "1E2" as "100"? The reason is that SPIRES does not change the precision or exponent of the value on output, even though "100" may be easier to read than "1E2". Suppose that the packed value 1E2 is stored in a subfile record and that when the record is transferred, the value is displayed as "100". When the record is updated, the value would be reconverted to "100E0" for storage, which is not precisely the same as "1E2", since the exponent and precision are now different. In other words, when SPIRES displays the value, it displays it in a form that, when reconverted to packed, would retain the value in its original packed form.
Notice again that in the example, whenever the exponent is negative, the number of decimal places displayed is the opposite of the exponent. That is, for "10000E_2", the value displayed is "100.00", which has two decimal places; and "2" is the negative of the exponent "-2".
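The display rules illustrated by the three cases above can be sketched in Python. This is a model of the documented behavior, not SPIRES code; the treatment of values with magnitude less than -3 (falling back to exponential notation) is an assumption based on the wording of the rules.

```python
def display(integer: str, exponent: int) -> str:
    """Model of the default display rules for a packed value."""
    sign = "-" if integer.startswith("-") else ""
    digits = integer.lstrip("-")
    if exponent > 0:
        return f"{integer}E{exponent}"          # exponential notation
    if exponent == 0:
        return integer                           # integer notation
    mag = len(digits.lstrip("0")) + exponent     # precision + exponent
    if mag < -3:
        return f"{integer}E_{-exponent}"         # assumed: exponential again
    places = -exponent                           # decimal places = -exponent
    padded = digits.rjust(places + 1, "0")       # keep a digit before the point
    return f"{sign}{padded[:-places]}.{padded[-places:]}"

print(display("10000", -2))   # 100.00
print(display("1", 2))        # 1E2
```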
Because packed decimals can be handled more accurately than floating-point ("real" type) values, it is generally recommended that you use packed decimals when you are working with non-integer (i.e., decimal) arithmetic.
Often SPIRES will make the decision to use packed arithmetic for you. For example, in "default arithmetic", SPIRES will do arithmetic using packed values. That is, if both operands of an arithmetic operation are of types other than PACKED, INTEGER or REAL (usually that means STRING or CHAR) then SPIRES will convert them to packed values in order to do the computation.
For instance, if the variables X and Y are not pre-allocated:
-> let x = 1          <-- #X is now type STRING
-> let y = 3          <-- #Y is now type STRING
-> let z = #x/#y      <-- #Z is now type PACK
-> /* #z
 * 0.333333333333333333333333333
->
Note that if the first operand is PACKED, INTEGER or REAL, the second operand is converted to that type in order to carry out the computation. If the first operand is not one of these types but the second type is, then the first operand is converted to the type of the second for the computation.
Many functions are available for handling packed decimal values. These functions are typically used in protocols, formats and USERPROCs. Most of the capabilities of these functions are also available in processing rules for data elements, which are discussed later. [See 18.2.] All of these functions are described in detail in the reference manual "SPIRES Protocols"; you can use the EXPLAIN command to get information about them online.
The last three functions are particularly interesting and are worth further discussion. First however, it is useful to know a little bit about how packed decimal values are handled by SPIRES, and in particular, how many bytes of storage they require.
A packed decimal in SPIRES can consist of a varying number of bytes in which the integer portion (and sign of the integer) is stored with a byte at the end that contains the exponent. The length in bytes of a packed decimal depends on the length of the integer, that is, the precision of the value. Each two digits of the integer are stored in a single byte. The last byte before the exponent byte contains one digit of the integer and a sign character. Pictorially, it looks something like this:
| d d | d d | d d | d s | exp |
where each "|" character demarcates each byte and where each "d" is a digit of the integer, "s" is a sign character for the integer, and "exp" is the trailing byte that indicates the exponent. If there is an even number of "d"s in the value, the first "d" stored will be a zero (to fill up the byte). In fact, if the allocated length of a packed variable would be longer than the length needed for a given value, the first "d"s will be zero, as needed, to fill up the variable length. Such leading zeroes are ignored on output, however.
The length in bytes of a packed decimal value (including the exponent at the end) can be determined using a simple formula:
length = (digits + 3) / 2
where "digits" is the number of digits in the integer of the packed decimal (i.e., the precision). If there is a remainder after the division by 2, the result should be rounded up to the next integer.
Here are some examples using this formula:
input value    packed value    precision    length in bytes
100            100E0               3        3 = (3+3)/2
100.00         10000E_2            5        4 = (5+3)/2
1E2            1E2                 1        2 = (1+3)/2
2746.32        274632E_2           6        5 = (6+3)/2 *
0              0E0                 0        2 = (0+3)/2 *

* = rounded up to the next integer
Note that the shortest packed decimal value (representing 0, shown above) is two bytes long.
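Both the length formula and its reverse are easy to express in Python (illustrative code; the function names are invented):

```python
def packed_length(precision: int) -> int:
    """Bytes needed for a packed value of the given precision:
    (digits + 3) / 2, rounded up."""
    return -(-(precision + 3) // 2)   # ceiling division

def max_digits(length: int) -> int:
    """The reverse: digits of precision that fit in 'length' bytes."""
    return (length * 2) - 3

for p in (3, 5, 1, 6, 0):
    print(p, packed_length(p))
print(max_digits(4))   # 5
```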
It is worth noting that the formula is also useful when reversed:
digits = (length * 2) - 3
which suggests that if a packed decimal variable has a length of four bytes, it can store a value having five digits of precision. [See 18.3.]
The first three examples above demonstrate the main point that the length of the packed value depends on the precision of the value. All three are "equal to" 100, but each has a different precision and so each has a different length. In some cases you may want to limit the precision of a value so that it is no longer than a certain length, or set the precision of a value so that it is a certain length, and those are the purposes of the $WINDOW and $PRECISION functions respectively.
The $PRECISION function will add zeroes to or subtract digits from the right end of the integer to give the value the specified precision. As this occurs, the exponent is adjusted accordingly. If the packed value 1E2 is to have three digits of precision, it would be changed to 100E0; the packed value 10000E_2, processed under the same conditions, would also be changed to 100E0.
The $WINDOW function works similarly except that it can only subtract precision (thus limiting the length) and cannot add it.
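The effect of $PRECISION can be modeled in Python as follows. This is a sketch, not the SPIRES implementation; when digits are subtracted, simple truncation is assumed here, since the rounding behavior is not specified above.

```python
def set_precision(integer: str, exponent: int, target: int):
    """Model of $PRECISION: pad or truncate the integer to 'target'
    digits, adjusting the exponent to keep the value the same."""
    diff = target - len(integer.lstrip("0"))
    if diff >= 0:
        return integer + "0" * diff, exponent - diff   # add trailing zeroes
    return integer[:diff], exponent - diff             # truncate (assumed)

print(set_precision("1", 2, 3))       # 1E2      -> 100E0
print(set_precision("10000", -2, 3))  # 10000E_2 -> 100E0
```

$WINDOW would behave the same way in the truncating branch, but would leave the value alone rather than add precision.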
Earlier it was shown that when the exponent of a packed decimal is negative, the negative of that exponent is the number of decimal places the value would have. Thus, if the number of decimal places is fixed, all packed values will have the same exponent. The $DECIMAL function, which adjusts a packed value so that it has a fixed number of decimal places, has the opposite purpose of the $PRECISION function. The $DECIMAL function sets the exponent, which may alter the precision of the value; conversely, the $PRECISION function sets the precision, which in turn may alter the exponent.
When the $DECIMAL function changes the precision of a value, it either adds precision by appending zeroes to the value (for example, 100 to 100.00 if two decimal places are called for) or subtracts precision by removing digits from the end of the value and rounding it upward if appropriate (e.g., 32.974 to 32.97, but 32.975 to 32.98).
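Python's standard decimal module can model the $DECIMAL function directly: quantize fixes the exponent, adding or removing precision as needed. Round-half-up is used here to match the 32.975-to-32.98 example; the function name is illustrative.

```python
from decimal import Decimal, ROUND_HALF_UP

def decimal_places(value: str, places: int) -> Decimal:
    """Model of $DECIMAL: fix the number of decimal places
    (i.e., set the exponent), rounding when digits are dropped."""
    return Decimal(value).quantize(Decimal(1).scaleb(-places),
                                   rounding=ROUND_HALF_UP)

print(decimal_places("100", 2))     # 100.00  (precision added)
print(decimal_places("32.974", 2))  # 32.97   (digits dropped)
print(decimal_places("32.975", 2))  # 32.98   (rounded upward)
```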
Numeric data elements that may have non-integer values (such as monetary figures) are usually declared as packed decimal elements. However, in some cases they should not be, while in others, special precautions (i.e., extra processing rules) should be taken in order to use them effectively. About twenty processing rules (actions and system procs) are available for handling packed decimals: to convert an input value to packed decimal, to test the packed decimal's range, to reconvert the packed decimal to string for output, and so forth. This section will discuss the main limitation of packed elements, and present sample processing rule strings that you can use to avoid any problems that may arise from that limitation.
The limitation concerns data element sorting in these four cases:
The type of sorting involved in the above processes (called "character-by-character sorting") will not work properly unless the values being sorted all have the same length and number of decimal places, and none of the values is negative. The other methods of record sorting (the SEQUENCE command and SPISORT) always work properly with packed decimals, and thus are not a problem.
The individual actions and system procs used with packed decimals are listed in the "File Definition" and "System Procs" manuals respectively, or may be shown online by issuing the commands EXPLAIN ACTIONS, PACKED DECIMAL and EXPLAIN SYSTEM PROCS, FOR PACKED DECIMAL VALUES.
Below are some possible "recipes" to follow that suggest various processing rule strings to code, depending on how you want to use your packed decimal elements. Details about the processing (e.g., what types of errors can occur, what parameters are available) are discussed in the SPIRES reference manual "System Procs". The simplest rule strings are discussed first; later, rule strings to handle the sorting considerations mentioned above are discussed. Note too that the simplest rule strings are used only when no LENGTH statement is specified for the element; different rule strings are used (and described further down) when you want a fixed length for the element.
If the packed element does not fall into one of the four categories of sorting considerations discussed above, then the processing rules may be as simple as:
INPROC = $PACK;
OUTPROC = $PACK.OUT;
These are the simplest rule strings possible for packed values, converting the string value to a packed decimal on input and reconverting on output. No other change is made to the value; the precision and exponent of the value will not change. Because the precision of the value is not set, the value may vary in length, as discussed in the previous section. [See 18.1.2.] Thus, you should not code a LENGTH statement for a packed decimal element with the above INPROC. (Another INPROC rule string can be coded to handle that situation; see the category "Rule Strings for Fixed-Length Packed Decimals" below.)
Another simple form is:
INPROC = $PACK/ $DECIMAL(3);
OUTPROC = $PACK.OUT;
The $DECIMAL proc adjusts the packed decimal value to have "3" decimal places here; like the $DECIMAL function, the $DECIMAL proc sets the value to have a given exponent (the opposite of the number of decimal places). Such a declaration guarantees a certain amount of consistency from value to value; all values will have three decimal places. (You can specify another number instead of "3" if you like.)
The $DECIMAL proc can be placed on the OUTPROC instead of the INPROC if desired:
INPROC = $PACK;
OUTPROC = $DECIMAL(3)/ $PACK.OUT;
That means the stored value is the same as the input value, having the same exponent and precision, but for output, values will be adjusted so that they all have the same number of decimal places.
The $DECIMAL proc, by fixing the number of decimal places, may alter the value's precision. For instance, if the value has too many decimal places, the right-most digits are dropped and standard rounding will occur (14.3777 would become 14.378); on the other hand, precision may be added (14 would become 14.000).
If the packed element will be a money figure, you may want to use the following processing rule strings:
INPROC = $UNEDIT/ $PACK/ $DECIMAL(2);
OUTPROC = $EDIT('-$ZZ,ZZZ.99');
On output, the packed decimal value is processed through an edit mask, which adds a comma (if necessary) and a dollar sign to the value for display. On input, the value is first "unedited", that is, commas and dollar signs are removed, and then converted to a packed decimal, which is then adjusted to have two decimal places. You may design your own edit mask; instructions appear later in this manual. [See 19.] No LENGTH statement should be coded with these rule strings either.
Note that the precision of the value may be affected by the $DECIMAL proc. The assumption here is that if you enter a monetary figure, such as $96, you are being precise to the penny: $96.00. However, if you do not want to change the precision of the input value for storage, you may omit the $DECIMAL proc. The precision will be changed for output by the $EDIT proc.
Two other procs may be inserted into the INPROC strings after the $PACK proc that can be used to test the input value. The $PACK.TEST proc can be used to test the range of the packed decimal value; the $RANGE proc can be used to check the range of various parts of the packed decimal value, such as its precision or its magnitude. For example,
INPROC = $UNEDIT/ $PACK/ $PACK.TEST(GE,0)/ $DECIMAL(2);
The above input string, used to process money values, tests that the input value is greater than or equal to zero, thus verifying that the value is not negative. If it is negative, the error flag is set, and the value is rejected.
Fixing the length of a packed decimal element means limiting the precision of the value. If you do fix the length, be sure to pick a size that is adequate for holding the largest number you expect and which allows the desired amount of precision for the value. The formula for doing this is repeated below.
To set a fixed length for a packed decimal, you can code statements like this:
INPROC = $PACK(CHAR,length);
OUTPROC = $PACK.OUT;
LENGTH = length;
where "length" is the desired length in bytes. To determine "length", you should figure out how many digits of precision you will allow in the packed value and then use the equation discussed previously:
length = (digits + 3) / 2
with "length" rounded up to the next highest integer if it has a fraction. [See 18.1.2.]
Another set of rule strings can be used if you want the value to have a given number of decimal places and to have a fixed length:
INPROC = $PACK/ $DECIMAL.ADJ(length,places);
OUTPROC = $PACK.OUT;
LENGTH = length;
where "length" is again the desired length and "places" is the number of decimal places desired. Using the formula above, the number of digits in the value is "(length*2)-3"; of these, "places" will be decimal places.
For a fixed-length monetary element, the statements might be like this:
INPROC = $UNEDIT/ $PACK/ $DECIMAL.ADJ(length,places);
OUTPROC = $EDIT('-$ZZ,ZZZ.ZZ');
LENGTH = length;
This set is simply a combination of the monetary rule strings and the fixed-length "decimal" strings shown above.
The main effect of the sorting limitation that SPIRES has with packed decimals is that in the four circumstances described at the start of this chapter, negative packed decimal values will not sort properly, but will in fact be intermingled with the positive values.
That means, for example, that packed decimal values to be indexed should probably not be negative. ("Probably" is used here because the index will work properly for searches that use the equality operator, such as "FIND TEMPERATURE = -15", but not for those using a range operator, such as "FIND TEMPERATURE > -15".)
Also, in order for the sorting and range searching of positive values to work properly, the values must have the same number of decimal places and the same length in bytes, both of which can be handled by the $DECIMAL.ADJ proc used above.
In many indexing cases, it is preferable to fix the length and the number of decimal places of the input value; that solves the problem for all four situations at once (as long as no values are negative):
INPROC = $PACK/ $PACK.TEST(GE,0)/ $DECIMAL.ADJ(length,places);
OUTPROC = $PACK.OUT;
LENGTH = length;
and later, in the linkage section:
SRCPROC = $PACK/ $PACK.TEST(GE,0)/ $DECIMAL.ADJ(length,places);
PASSPROC = $PASS.ELEM(elemname,NUMERIC);
where "length" is the length of the packed value in bytes, "places" is the number of decimal places that the packed value will have, and "elemname" is the name of the element being passed to the index.
Notes on the rule strings above:
Though we talk about solving the sort problem, remember that it is solved only if none of the values is negative. If you will have negative values and one or more of the four situations described above arises, the element should probably be a floating point (real) element or binary element rather than packed decimal.
Packed decimal variables are frequently used in protocols, as well as in formats and USERPROCs within file definitions. They do not have the same sorting limitations that packed decimal elements may have, so they are perhaps easier to handle.
The easiest way to get a packed decimal variable is with a LET statement and the $PACK function:
-> let packed = $pack(23.035)
Assuming that #PACKED is not a pre-allocated variable (see below), the length of the stored value (and hence the precision) depends on the given value.
In a compiled vgroup used in a format, USERPROC, or elsewhere, a packed decimal variable always has a fixed internal length in bytes. Hence, the precision of the input value may be changed in order to fit the value into the given length.
In a vgroup definition, a packed variable is declared using the TYPE statement:
VARIABLE = WEEKLY.COSTS;
TYPE = PACKED;
OCC = 1;
LENGTH = 4;
Any value assigned to this variable will be converted to a packed value.
Consider the impact of the variable length on the input value, however. Using the formula discussed earlier for determining precision from the storage length, we can determine how many digits of precision are allowed in a four-byte packed decimal value:
precision = (length * 2) - 3
        5 = (4 * 2) - 3
In other words, the variable WEEKLY.COSTS specified above can have only five places of precision:
-> let weekly.costs = 178.28
-> /* #weekly.costs
 * 178.28
-> let weekly.costs = 2178.28
-> /* #weekly.costs
 * 2178.3
In effect, the $WINDOW function is applied to the value, restricting it to five places of precision. Note that rounding may occur when digits of precision are lost (e.g., 2178.28 to 2178.3).
The default length for a packed decimal static variable is four bytes, which allows only five places of precision. The length can be changed by specifying the LENGTH statement in the vgroup declaration.
For consistency, you may want a variable value to have a set number of decimal places when it is used, but not when it is stored. Coding the DISPLAY-DECIMALS statement in the vgroup declaration gives the variable that many decimal places when it is displayed or processed in arithmetic operations:
VARIABLE = WEEKLY.COSTS;
TYPE = PACKED;
OCC = 1;
LENGTH = 4;
DISPLAY-DECIMALS = 2;
For example,
-> let weekly.costs = 2178.26
-> /* #weekly.costs
 * 2178.30
-> let weekly.costs = 458233920
-> /* #weekly.costs
 * 458230000.00
->
The stored variable only has five places of precision, but for displaying the value with two decimal places, precision is added.
If you want to access the stored value of such a variable without going through the DISPLAY-DECIMALS processing, you could redefine the variable with another. See the manual "SPIRES Protocols", section 4.2.2.1, or EXPLAIN REDEFINE STATEMENT IN VGROUPS.
When you add or subtract packed decimal numbers, SPIRES retains the precision of both operands. Here are some examples:
Expression        Internal Arithmetic                Result
123.456 + 1.2     123456E_3 + 12E_1 = 124656E_3      124.656
235.50 - 1.00     23550E_2 + (-100E_2) = 23450E_2    234.50
100 + 0.00005     100E0 + 5E_5 = 10000005E_5         100.00005
1E20 + 2E15       1E20 + 2E15 = 100002E15            100002E15
This is in contrast to floating-point arithmetic, where you get results such as:
235.50 - 1.00 = 234.5
instead of the packed arithmetic answer of "234.50".
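Python's decimal module follows the same rule for addition and subtraction: the result keeps the smaller (more negative) exponent of the two operands, so trailing zeroes survive, unlike binary floating point. This makes it a convenient way to check the packed-arithmetic examples above:

```python
from decimal import Decimal

# Precision of both operands is retained, as in packed arithmetic:
print(Decimal("123.456") + Decimal("1.2"))   # 124.656
print(Decimal("235.50") - Decimal("1.00"))   # 234.50
print(Decimal("100") + Decimal("0.00005"))   # 100.00005

# Contrast with binary floating point, which loses the trailing zero:
print(235.50 - 1.00)                         # 234.5
```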
If either of the operands is infinity (indicated visually by @@ or -@@), the result will be infinity.
For multiplication, SPIRES returns values with up to 30 significant digits, depending on the number of significant places in the operands. Again, precision is not haphazardly added or subtracted. That means, for example, that multiplying a value by 100 is not the same as multiplying it by 1E2:
100 * 46 = 4600
1E2 * 46 = 46E2
100.00 * 46 = 4600.00
100.00 * 46.00 = 4600.0000
Again, this contrasts with floating-point arithmetic:
100 * 46 = 4600
1E2 * 46 = 4600
100.00 * 46 = 4600
100.00 * 46.00 = 4600
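Decimal multiplication in Python behaves the same way as packed multiplication: exponents add and coefficients multiply, so the precision of the result follows from the precision of the operands (Python's own display of exponential results differs from SPIRES notation, but the arithmetic rule is the same):

```python
from decimal import Decimal

print(Decimal("100") * Decimal("46"))        # 4600
print(Decimal("100.00") * Decimal("46"))     # 4600.00
print(Decimal("100.00") * Decimal("46.00"))  # 4600.0000
```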
Multiplying a value by infinity results in infinity (@@ or -@@).
Division can be slightly more complicated; because the precision of the result may be difficult to predict, the $PRECISION or $WINDOW functions are often used to set the precision of the result, or the $DECIMAL proc is used to limit the number of decimal places in the result. Here are some examples:
1/2 = 0.5
1/3 = 0.333333333333333333333333333
1/9 = 0.111111111111111111111111111
2/3 = 0.666666666666666666666666667
$DECIMAL(2/3,5) = 0.66667
The number of significant digits in the result may be as high as 27.
Unlike the other types of operations, division will not leave zeroes at the right-end of the result past the decimal point. In other words, no zeroes will appear at the end of the integer when the exponent is negative:
120.00000/3 = 40
12.000000/3 = 4
1.2000000/3 = 0.4
The $REMAINDER function may be used to find the remainder in a packed decimal division operation. Consider this sequence of arithmetic:
1/3 = 0.333333333333333333333333333
$REMAINDER(1,3) = 1E_27
(1/3) * 3 = 0.999999999999999999999999999
((1/3) * 3) + $REMAINDER(1,3) = 1.000000000000000000000000000
In other words, the result of a division operation may be an approximation (though it is an approximation with 27 places of precision), and the remainder should be taken into consideration sometimes.
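The 27-digit division and its remainder can be reproduced with Python's decimal module by setting the context precision to 27. The remainder here is modeled as the difference between the dividend and the truncated quotient times the divisor; whether $REMAINDER is computed exactly this way internally is an assumption.

```python
from decimal import Decimal, getcontext

getcontext().prec = 27   # SPIRES division carries up to 27 significant digits

third = Decimal(1) / Decimal(3)
print(third)                        # 0.333333333333333333333333333

remainder = Decimal(1) - third * 3  # models $REMAINDER(1, 3)
print(remainder)                    # 1E-27

# Adding the remainder back recovers the dividend exactly:
print(third * 3 + remainder)
```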
Note that division by 0 will give infinity (@@) as a result.
In if-tests of packed decimal values, the comparison SPIRES does is similar to subtracting the second value from the first one and comparing the result to zero. For example,
-> let x = $pack(1E2)
-> let y = $pack(100.00)
-> if #x = #y then * Values are equal.
 * Values are equal.
-> if #x - #y = 0 then * Values are equal.
 * Values are equal.
->
After assigning values to the two packed variables, they are tested for equality by an if-test. The two if-tests are mathematically equivalent. So if you give SPIRES an if-test similar to the first one above, where two mathematical values are being compared, SPIRES treats it similarly to the second. While 1E2 and 100.00 may not be identical values internally or externally, one subtracted from the other does indeed equal zero.
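Python's decimal module compares values the same way: equality is numeric, as if one value were subtracted from the other and the difference compared to zero, even though the internal representations differ.

```python
from decimal import Decimal

x = Decimal("1E2")      # exponent 2, precision 1
y = Decimal("100.00")   # exponent -2, precision 5

print(x == y)             # True: compared numerically
print(x - y == 0)         # True: mathematically equivalent test
print(str(x) == str(y))   # False: the representations differ
```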
Packed decimals in SPIRES are different from packed decimals in other computer languages, as pointed out in previous sections. [See 18.] Occasionally you may need to convert the SPIRES form into the standard form, for use in other programs. This can be done in a file definition or format (file or global) with the following OUTPROC string:
OUTPROC = $DECIMAL(places)/ $MAX.LEN(1,DROP.LAST,D)/ $ADJUST(length);
where "places" is the number of decimal places to be "overlaid" onto the standard packed value, and "length" is the desired length in bytes for the standard packed value. If all of the values in the SPIRES packed form already have the correct number of decimal places, the $DECIMAL proc is not needed. Similarly, the $ADJUST proc is not needed if the output values would end up being the proper length anyway (which is not likely unless they were padded on input and stored in fixed length).
Remember that a SPIRES packed value is just a standard packed value with an extra byte on the end to keep track of the decimal point. The $MAX.LEN proc drops that last byte. The $DECIMAL proc insures that the decimal point is in the same place for all values (i.e., that the last byte is the same for all values). The $ADJUST proc right-adjusts the value, using hex '00' as the padding character.
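For reference, standard packed decimal (without the SPIRES exponent byte) stores two digits per byte with a sign nibble in the low half of the last byte. The sketch below uses the conventional C (positive) and D (negative) sign nibbles; it is illustrative Python, not part of SPIRES.

```python
def to_standard_packed(digits: str, negative: bool = False, length=None) -> bytes:
    """Encode a digit string as standard packed decimal (BCD):
    two digits per byte, sign nibble (C or D) last; optionally
    right-adjusted with hex '00' padding, like $ADJUST."""
    nibbles = [int(d) for d in digits] + [0xD if negative else 0xC]
    if len(nibbles) % 2:
        nibbles.insert(0, 0)           # leading zero to fill the first byte
    out = bytes((nibbles[i] << 4) | nibbles[i + 1]
                for i in range(0, len(nibbles), 2))
    if length is not None:
        out = out.rjust(length, b"\x00")
    return out

print(to_standard_packed("12345").hex())   # 12345c
```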
An application using other programs (e.g., MARK-IV) may generate "zoned" numeric values for input to SPIRES. The following set of system procs will convert such input to character form, making acceptable to other SPIRES procs for input (such as $INT, $PACK, $DECIMAL, etc.):
INPROC = SQU/
         $INSERT(' ',END)/
         $CHANGE.LIST('#C040#,0+, #C140#,1+, #C240#,2+, #C340#,3+, #C440#,4+,
                       #C540#,5+, #C640#,6+, #C740#,7+, #C840#,8+, #C940#,9+')/
         $CHANGE.LIST('#D040#,0-, #D140#,1-, #D240#,2-, #D340#,3-, #D440#,4-,
                       #D540#,5-, #D640#,6-, #D740#,7-, #D840#,8-, #D940#,9-')/
         $SHIFT(1,RIGHT);
The effect is that zoned input is accepted and changed to character form, and character form is left unchanged. This makes it possible for you to use normal character form in WHERE-clauses or ALSO commands. For example, if "123J" is input, the result will be "-1231", since the "J" indicates that the last digit is "1" and the value is negative. If the input were "-1231", the result would be " -1231", and the leading blank is stripped by numeric conversions.
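The zoned-to-character conversion can be sketched in plain Python. The last byte of a zoned value carries both the final digit and the sign; the letter mappings below ("{" and A-I for positive, "}" and J-R for negative) are the usual EBCDIC zone-overpunch convention, assumed here to correspond to the #C040#-#C940# and #D040#-#D940# codes in the rule string above.

```python
POSITIVE = {"{": "0", "A": "1", "B": "2", "C": "3", "D": "4",
            "E": "5", "F": "6", "G": "7", "H": "8", "I": "9"}
NEGATIVE = {"}": "0", "J": "1", "K": "2", "L": "3", "M": "4",
            "N": "5", "O": "6", "P": "7", "Q": "8", "R": "9"}

def unzone(value: str) -> str:
    """Turn a zoned value into plain character form; leave
    already-plain values unchanged."""
    last = value[-1]
    if last in POSITIVE:
        return value[:-1] + POSITIVE[last]
    if last in NEGATIVE:
        return "-" + value[:-1] + NEGATIVE[last]
    return value   # already in character form

print(unzone("123J"))   # -1231
print(unzone("123"))    # 123
```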
If you wish to construct your own proc definition, something like the following is equivalent to the above set of procs:
PROC = ZONED.INPUT;
RULE = A40/
       A36,' ',0/
       A48, #C040#,0+, #C140#,1+, #C240#,2+, #C340#,3+, #C440#,4+,
            #C540#,5+, #C640#,6+, #C740#,7+, #C840#,8+, #C940#,9+,
            #D040#,0-, #D140#,1-, #D240#,2-, #D340#,3-, #D440#,4-,
            #D540#,5-, #D640#,6-, #D740#,7-, #D840#,8-, #D940#,9-/
       A55:1,1;
Then, you would refer to the proc by something like:
INPROC = ZONED.INPUT/ $PACK ... etc.
   or
INPROC = ZONED.INPUT/ $INT ... etc.
Edit masks provide a facility for formatting numeric data into a fixed length field. They are sometimes called picture masks because they are specified by creating a picture of the way the output should appear, using symbols or actual characters to represent how the data should be positioned within a field. Value formatting with masks is available in SPIRES through a function ($EDIT -- EXPLAIN $EDIT FUNCTION) and an action (A85 -- EXPLAIN A85 RULE, or system proc $EDIT -- EXPLAIN $EDIT PROC).
SPIRES edit masks are similar to the picture masks found in COBOL and PL/I. They operate on numeric values (integer, packed, or real) and the picture expresses the number as an integer or decimal quantity. They cannot be used to express character strings or exponential values.
The value returned from any EDIT operation is always exactly the size of the mask itself. This may include leading blanks if the mask is longer than is required to display the value. If the value is too long for the mask (that is, if the number of significant digits in the value exceeds the number of digit selectors in the mask), then the left-most, or most significant, places are truncated. If the mask allows fewer decimal places (i.e., places to the right of the decimal point) than there are in the value, then the right-most, or least significant, decimal places are discarded as necessary; note that the value is not rounded. (Use the $DECIMAL function or system proc for rounding. EXPLAIN $DECIMAL FUNCTION or EXPLAIN $DECIMAL PROC.)
Alpha characters used to make up a mask may be in upper or lower case. An improperly constructed mask, or one that contains undefined characters, causes a null value to be returned from the edit operation. A mask may be used to format up to 15 decimal places and up to 31 digits total. The mask itself may be longer if sign or insertion characters are used.
The characters "9", "Z", and "*" are "digit specifier" characters -- they are used for representing the numeric portion of a value. The character "9" is always replaced by a digit from 0 to 9. Using the "9" allows values to be displayed with leading zeros.
The characters "Z" and "*" are "zero suppression" characters. They behave just like the digit specifier "9", except that leading zeros in the value are replaced by spaces (for "Z") or by asterisks (for "*"). The asterisk is also known as a check protection symbol.
A mask may use "Z" or "*" for zero suppression, but not both. One or more may be used to represent the left-most portion of the value, followed by as many "9"s as needed to complete the value. Or the entire value may be represented by all "Z"s or all "*"s to cause a value of zero to be displayed as all blanks or all asterisks.
Examples:
Number    '99999'    'ZZZZZ'    'ZZ999'    '*****'    '**999'
------    -------    -------    -------    -------    -------
123456    '23456'    '23456'    '23456'    '23456'    '23456'
 12345    '12345'    '12345'    '12345'    '12345'    '12345'
    12    '00012'    '   12'    '  012'    '***12'    '**012'
     0    '00000'    '     '    '  000'    '*****'    '**000'
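The behavior of the three digit-specifier characters can be sketched in Python. This handles only "9", "Z", and "*" masks of the well-formed kind described above (suppression characters on the left, "9"s on the right), including left truncation when the value is too long; sign, decimal, and insertion characters are not modeled.

```python
def edit(value: int, mask: str) -> str:
    """Apply a digit-specifier-only edit mask ('9', 'Z', '*')."""
    # Pad with leading zeros to the mask length; truncate on the left
    # if the value has more digits than the mask has specifiers.
    digits = str(abs(value)).rjust(len(mask), "0")[-len(mask):]
    out = []
    leading = True
    for m, d in zip(mask, digits):
        if d != "0":
            leading = False          # zero suppression stops at a non-zero digit
        if leading and m in "Z*" and d == "0":
            out.append(" " if m == "Z" else "*")
        else:
            out.append(d)
    return "".join(out)

print(repr(edit(12, "ZZ999")))    # '  012'
print(repr(edit(123456, "99999")))  # '23456'
```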
A period is used in a mask to indicate the position of the decimal point in a value. Unlike PL/I, which has a separate decimal alignment character ("V"), SPIRES uses the implied decimal place from the numeric value and aligns the value in the mask to express a proper magnitude. A third parameter is defined for the $EDIT function and action which allows you to do implied multiplication or division by powers of ten. This has the effect of moving the assumed decimal place either to the right (multiplication) or to the left (division):
$EDIT(123,'ZZZZZ.ZZ',0)  yields "  123.00"
$EDIT(123,'ZZZZZ.ZZ',2)  yields "    1.23"
$EDIT(123,'ZZZZZ.ZZ',-2) yields "12300.00"
When zero suppression characters are used in a mask, they normally operate only up to the decimal point. That is, for a value less than one, the leading decimal point and zeros past the decimal point are retained. An exception is made for the character "Z" when the value is zero. The value returned in this case is all blanks.
Number   'ZZZ.ZZ'   '***.**'   'ZZZ.99'   '***.99'
------   --------   --------   --------   --------
  12.3   ' 12.30'   '*12.30'   ' 12.30'   '*12.30'
   .07   '   .07'   '***.07'   '   .07'   '***.07'
     0   '      '   '***.**'   '   .00'   '***.00'
Without sign information, an edit mask displays the absolute value of a number only. Sign information can be expressed two ways, with the sign characters "+" or "-", or with trailing sign indicators "CR" or "DB". If a value becomes zero after decimal truncation, then it is treated in the mask as if the original value were positive.
A "+" causes either a plus- or minus-sign to be placed at that position in the edited value, depending on whether the value is positive or negative (this is like COBOL; PL/I uses the "S" character to indicate that a sign is always to be displayed). A "-" used this way displays a minus-sign for negative values, but leaves the position blank for positive values.
The symbols "CR" and "DB" are used for credit and debit indication. They are placed at the end of the mask. The specified string, CR or DB, appears last in the edited result whenever the value is negative. Otherwise the value ends with two blanks.
The dollar-sign currency symbol ("$") can be placed at the start of the mask, either before or after a leading sign, to cause that character to be placed at that position in the edited value.
Number   '+ZZZ'   '-ZZZ'   'ZZZCR'   'ZZZDB'   '$+ZZZ'
------   ------   ------   -------   -------   -------
   123   '+123'   ' 123'   '123  '   '123  '   '$+123'
  -123   '-123'   '-123'   '123CR'   '123DB'   '$-123'
     1   '+  1'   '   1'   '  1  '   '  1  '   '$+  1'
    -1   '-  1'   '-  1'   '  1CR'   '  1DB'   '$-  1'
     0   '    '   '    '   '     '   '     '   '     '
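The static-sign cases in the table above can be modeled with a short Python sketch (illustrative only; edit_signed is an invented name, and it handles only a single leading "$", a single static sign, all-"Z" digit positions, and a trailing "CR" or "DB"):

```python
def edit_signed(value, mask):
    """Model of static '$', '+', '-', and trailing 'CR'/'DB' editing
    for whole numbers in an all-'Z' mask."""
    prefix, body = ('$', mask[1:]) if mask.startswith('$') else ('', mask)
    sign_char = ''
    if body and body[0] in '+-':
        sign_char, body = body[0], body[1:]
    trailer = ''
    if body.endswith(('CR', 'DB')):
        trailer, body = body[-2:], body[:-2]
    if value == 0:
        # With no '9' in the mask, a zero value edits to all blanks.
        return ' ' * len(mask)
    digits = str(abs(value)).rjust(len(body))
    if value < 0:
        sign, tail = '-' * len(sign_char), trailer
    else:
        sign = ('+' if sign_char == '+' else ' ') * len(sign_char)
        tail = ' ' * len(trailer)
    return prefix + sign + digits + tail
```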
The plus ("+"), minus ("-"), and dollar ("$") symbols, if used singly, occupy the same column in the edited value as they do in the mask. Used this way they are static symbols. However, these symbols may be made to "float" to the right by specifying a string of one of the symbols, repeated for as many places as you would like the symbol to float. Except for insertion characters, the floating field must consist of contiguous occurrences of the float character in the mask, and it must begin with at least two contiguous occurrences of that character. If any but the leftmost occurrence of the symbol occupies a position that contains a digit from the value, then it acts like a digit specifier. Otherwise the single character represented by the floating field is placed as far to the right as possible, up to a decimal point, according to the length of the float in the mask.
Floating characters may not be used in combination with the zero suppression characters "Z" or "*". Remember, however, that in determining the position of a floating symbol, leading zeros are discarded, so that zero suppression is taking place within the floating field.
As with the zero suppression characters "Z" and "*", floating characters may be carried beyond the decimal point. The symbol itself only floats up to the decimal point, but carrying the float field to the end causes the mask to return all blanks when the value is zero:
Number   '+++.++'   '+++.99'   '$---.--'   '-$$$.$$'
------   --------   --------   ---------   ---------
   .01   '  +.01'   '  +.01'   '$   .01'   '   $.01'
  -.01   '  -.01'   '  -.01'   '$  -.01'   '-  $.01'
     0   '      '   '  +.00'   '       '   '       '
The following characters can be specified in the mask and are used as literal values to be placed at that position in the edited result:
Comma (",") Blank (" ") Zero ("0")
The character "B" can also be used to represent a blank.
When zero-suppression characters ("Z" and "*") are used, both leading zeros AND leading insertion characters imbedded within the numeric portion of the mask are changed to blanks or asterisks (depending on the zero-suppression symbol in use). Remember that zero suppression operates only up to a decimal point.
Number   '++,+++.++'   '99 999 999'   '$$$,$$$.$$ -'   '$ZZZZ00'
------   -----------   ------------   --------------   ---------
1234.5   '+1,234.50'   '00 001 234'   ' $1,234.50  '   '$123400'
  -123   '  -123.00'   '00 000 123'   '   $123.00 -'   '$ 12300'
SPIRES treats the edit mask as an ordered set of fields:
- (1) Leading dollar (single $)
- (2) Leading sign (single + or -)
- (3) Leading dollar (single $)
- (4) Fixed insertion characters (blank, comma, zero).
- (5) This field begins with Z, *, ++, --, $$, or 9 symbol. The first character defines a floating character. The field can continue with more occurrences of this floating character. The last character of the starting symbol and all subsequent occurrences of the floating character define digit positions. Floating characters can't be mixed, except that 9's can follow the others. Others can't follow 9's. Once a 9 occurs, it becomes the allowed floating character. Fixed insertion characters (blank, comma, zero) may be imbedded anywhere in this field.
- (6) Decimal point (.) optionally followed by the floating character that ended field 5 or 9's. If field 5 didn't occur, 9's are the only allowed floating character in this field. Floating characters cannot be mixed in this field. Insertion characters (blank, comma, zero) may be imbedded anywhere in this field.
- (7) Fixed insertion characters (blank, comma, zero).
- (8) Trailing sign (+ or - or CR or DB)
Digit specification takes place in fields 5 and 6 only. Until significant digits are encountered, or until a decimal point, all characters in field 5 are overlaid by a "pad" character, which is * for *'s, 0 for 9's, and blank for the others. 9's begin significance with the first 9's character, so fixed insertion characters are not overlaid following the first 9's position. When field 5 begins with $$, ++, or -- the appropriate sign character replaces the blank preceding the first non-blank character in the edited result of this field.
Signs (leading, trailing, or floating) are replaced as follows:

Symbol-->    $      +       -       CR       DB
Negative     $      -       -       CR       DB
Positive     $      +     blank   blanks   blanks
The input value is adjusted for decimal point alignment to yield the value to be edited. If this value is zero AND there are no 9's in the mask, the entire edit mask is replaced by blanks except when floating * is specified, in which case all characters of the mask are replaced by *'s except the decimal point (if any).
The mask may not begin with a comma. The minimum allowed mask is either field 2 alone, or some combination of fields including field 5 and/or 6. Fields 1 and 3 may not both occur. Fields 2 and 8 may not both occur.
A pictorial view of allowed edit fields looks like this:
| 1 | 2   | 3 |   4   |          5 . 6          |   7   |  8  |
|---|-----|---|-------|-------------------------|-------|-----|
| $ | + - | $ |       | ++ -- $$ Z * 9 . f or 9 |       | + - |
|   |     |   | , 0 B |   with imbedded , 0 B   | , 0 B |CR DB|
The "f" in field 6 means the floating character from field 5. The "B" represents a blank; either a B or an actual space may be used in the mask.
This chapter explains "dynamic elements", a special type of element that is created by the user as needed, instead of by the file definer in the file definition. Dynamic elements are created by the DEFINE ELEMENT or DECLARE ELEMENT command, and are eliminated by the CLEAR DYNAMIC ELEMENT(S) command.
The first sections of this chapter explain these commands. [See 20.1, 20.2, 20.3.] The later sections show examples of how dynamic elements might be used. They can be used, for example, with record displays in the standard SPIRES format. [See 20.4.] They may also be used in formats [See 20.5.] and in WHERE clauses in Global FOR mode. [See 20.6.]
When a subfile is selected, you can create record elements "dynamically", that is, elements whose values are created as a record is processed. Dynamic elements are similar in many ways to virtual elements: for example, neither type has values that are stored in the data base, but instead, the values are created as the elements are accessed. However, unlike virtual elements, which must be defined in the file definition by the file owner, dynamic elements may be created by anyone who can select the subfile. (Other similarities and differences are discussed later in this chapter.)
A dynamic element's value may be derived from one or more stored elements (or even virtual elements) or it may be derived from non-data-base values, such as the current date or time. It is usually tied to an element defined in the file definition, however; whenever that element has an occurrence, an occurrence of the dynamic element will also exist. (If it is not tied to a specific element, it is treated as if it were an element at the record level that occurred a single time.)
A dynamic element is usually defined by the DEFINE ELEMENT command:
DEFINE ELEMENT elem [FOR primary|IN structure] [TYPE type] ...
... AS expression
The various pieces of an expression in a dynamic element definition (the #variable values, the strings, the accessed element values, etc.) are each truncated to 255 characters if necessary before the expression is evaluated. When the expression is evaluated (i.e., when the dynamic element is accessed), the resulting value is truncated to 255 characters if necessary. This is not true for DECLARE ELEMENT. [See 20.3.]
Syntax note: To define a dynamic phantom structure, the syntax of the DEFINE ELEMENT command is somewhat different. [See 23.3.]
You can also create a dynamic element (including dynamic phantom structures) by using the DECLARE ELEMENT command within a protocol. That feature gives you most of the capabilities of FILEDEF's element definition language, including the ability to have Userprocs. [See 20.3.]
Other commands useful with dynamic elements are:
CLEAR DYNAMIC ELEMENTS
which eliminates all defined dynamic elements (The CLEAR SELECT or SELECT commands will do the same);
CLEAR DYNAMIC ELEMENT elem
which eliminates only the named "elem" as a dynamic element (prior to this command, you cleared a single dynamic element with the command "DEFINE ELEMENT elem AS $ZAP"); and
SHOW DYNAMIC ELEMENTS
which displays the definitions of all dynamic elements currently defined.
Both the SHOW and CLEAR commands may use the word DEFINED or DECLARED in place of DYNAMIC, but they are just aliases which affect both DECLARE and DEFINE elements.
Up to 56 dynamic elements may be defined for a selected subfile. Examples of and information about their use appear later in this chapter. [See 20.4.]
A dynamic element may combine the occurrences of several different elements so that they are all treated as occurrences of that dynamic element. To request this feature, you must use the "FOR primary" option on the DEFINE ELEMENT command as follows:
DEFINE ELEMENT elem [FOR primary, elem2, elem3, elem4, elem5] ...
... [TYPE type] AS expression
where "primary" is the name or alias of the primary element, and "elem2", etc., are the names of secondary elements. The elements must be separated by commas. In processing commands that "retrieve" the dynamic element (e.g., DISPLAY), SPIRES will retrieve all the values of the primary element and process the "expression" for each of them, and will then retrieve the occurrences of "elem2" and process the expression for each of them, and so forth. Up to four secondary elements may be requested. The other options and parameters were discussed earlier. [See 20.1.]
Consider the following example:
-> select families
-> show elements
 Subfile FAMILIES
   WORKER.NAME, WORKER
   SPOUSE.NAME, SPOUSE
   CHILD.NAME, CHILD
   ...
-> define element person for worker, spouse, child as @@worker
-> set element worker, spouse, child, person
-> display 34
WORKER.NAME = Nelson, Ozzie;
SPOUSE.NAME = Nelson, Harriet;
CHILD.NAME = Nelson, David;
CHILD.NAME = Nelson, Willie;
PERSON = Nelson, Ozzie;
PERSON = Nelson, Harriet;
PERSON = Nelson, David;
PERSON = Nelson, Willie;
->
A dynamic element such as the one shown above could be useful in several situations. For example, you could use it in a WHERE clause in Global FOR: FOR SUBFILE WHERE PERSON OCCURS > 4 could be used to find families with more than four members. Or, in the $REPORT format, the PERSON element could be used to place all the family members in a single column of the report.
The secondary elements must be in the same structure as the primary (or else all the elements must be record-level ones), and they should be of the same element type as the primary. Because SPIRES treats the secondary elements as extra occurrences of the primary one, the expression should be constructed for the primary, and does not need to include the secondaries. For instance, the DEFINE ELEMENT command in the example above does not need to mention the secondary elements in the "AS @@worker" part of the command. (If the given expression were "AS @@worker @@spouse @@child", the result would be the same as above except that all occurrences of PERSON would have "Nelson, HarrietNelson, David" appended to them, since SPIRES would retrieve the first occurrence of the SPOUSE and CHILD elements for the evaluation of the expression each time.)
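The retrieval order just described -- all occurrences of the primary, then the occurrences of each secondary in turn -- behaves like simple list concatenation. A Python analogy using the FAMILIES data (illustrative only, not how SPIRES stores records):

```python
record = {
    'WORKER': ['Nelson, Ozzie'],
    'SPOUSE': ['Nelson, Harriet'],
    'CHILD':  ['Nelson, David', 'Nelson, Willie'],
}

# PERSON = every occurrence of the primary (WORKER), then every
# occurrence of each secondary (SPOUSE, CHILD), in the order named.
person = record['WORKER'] + record['SPOUSE'] + record['CHILD']
```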
Filters may be applied to the primary, secondary or dynamic elements as desired. That means that filters applied to the primary and/or to secondary elements will affect the occurrences of the dynamic element. For example, if a filter on a secondary element limits the element to only one occurrence, only one occurrence of it will be processed as an occurrence of the dynamic element.
The DECLARE ELEMENT command, which can be issued only from within a protocol, gives you most of the capabilities of the element definition language to create a dynamic element. Unlike dynamic elements created with the DEFINE ELEMENT command, "declared" dynamic elements can have their own Inproc and Outproc rule strings, element information packets, and Userprocs.
The DECLARE ELEMENT command is followed on the next line of the protocol by one or more lines of the element definition, which is followed on a final line by the statement ENDDECLARE:
DECLARE ELEMENT elemname ...
... [FOR primary [OCC n|,secondary...]|IN structure]
  element-definition statements
ENDDECLARE
The "FOR primary" option can be specified if you want to tie the occurrence of the declared element to another element in the record-type. It works almost the same way as it does with defined dynamic elements except that here it actually provides the primary element's values as the internal values for the declared element. [See 20.1.] The REDEFINES statement, allowed in the element definition statements, will have the same effect. REDEFINES and the "FOR primary" option cannot both appear in the element declaration.
With the OCC option, you can request that a specific occurrence of the source element is to be used in generating the dynamic element. For the first occurrence, you specify "n" as 1; for the second, "n" as 2, etc.
Alternatively, with the "secondary" option added to the FOR option, you can request that additional elements in the goal record also be treated as occurrences of the primary to feed to the declared element. [See 20.2.]
You can specify "IN structure" if you want to tie the occurrence of the element to occurrences of the structure. The element will occur once for each occurrence of the structure specified. Again, REDEFINES and the "IN structure" option cannot both appear in the element declaration.
Below are the element definition statements allowed in element declarations; they are explained fully in the manual "SPIRES File Definition" (or in "SPIRES Technical Notes" in the case of phantom structures). They are all optional (though some may be required if others are present). The asterisks indicate those for which additional information appears following the list.
  * OCC = VARIABLE;
  * TYPE = elem-type;
    INPROC = processing-rule string;
    OUTPROC = processing-rule string;
  * REDEFINES = element-name;
    COMMENTS = comments;
    ELEMINFO-DEF;
      any element-information statements desired
  * PHANTOM;
      SUBGOAL = subgoal-record-name;
      SUBFILE = subfile-name;
      VIA = {LCTR|INTKEY|EXTKEY|TRANS};
    USERDEFS;
      VARIABLES;
        any variable definitions for Userprocs
      USERPROCS;
        any Userprocs desired
      EXTDEF-ID = gg.uuu.extdef.rec;   - for externally defined procs
If you are defining a phantom element, then you must include a "TYPE=STR;" statement; that is the only allowed kind of structure you can define with DECLARE ELEMENT. You might also want to declare the type if no Inproc rule string (which could implicitly set the type) is included. For instance, if you redefine an integer element, you might want to include either a "TYPE = INT;" or an "INPROC = $INT;" statement; otherwise, SPIRES will convert the value to string (the default element type) before processing it through any Outprocs, which might not be what you or the Outproc rules were expecting. The basic types are: STRING, PACKED, REAL, INTEGER, HEX.
You may code an OCC statement, but only with the value VARIABLE, as shown above. This means the element has either one occurrence or no occurrence. The element occurs only if a SET VALUE Uproc is issued in a Userproc during the generation of the element. Otherwise, without this statement, the element always occurs once. Note these two details: First, the OCC = VARIABLE statement cannot be used when the REDEFINES statement is also coded for the element. Second, the OCC = VARIABLE statement here for a declared element is not as versatile as its use for a virtual element; in that case, the virtual element may occur more than once, whereas here it is limited to at most one occurrence.
Note some of the statements that are NOT allowed: ALIAS and LENGTH. In an element definition, LENGTH has meaning only for elements that are physically stored in a file.
Here is a very simple example of a declared dynamic element, but one which takes advantage of benefits that declared dynamic elements have that are not shared with defined ones. This element concatenates an author and title element, and capitalizes the entire value:
DECLARE ELEMENT AUTHOR.TITLE
  OUTPROC = $CALL(ASSEMBLE);
  USERDEFS;
    USERPROC = ASSEMBLE;
      UPROC = SET VAL=$CAP($GETXVAL(AUTH) ': ' $GETXVAL(TITLE));
  ELEMINFO;
    HEADING = "Author: Title";
ENDDECLARE
If the AUTHOR element's value were "Joyce, James", and the TITLE was "Ulysses", the value of the dynamic element would be:
JOYCE, JAMES: ULYSSES
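In Python terms, the ASSEMBLE Userproc behaves roughly like the function below (assuming, as the sample output suggests, that $CAP uppercases the entire value; assemble is an invented name for the sketch):

```python
def assemble(auth, title):
    # Concatenate author and title with ': ', then capitalize the
    # whole value, as the UPROC above does with $CAP.
    return f'{auth}: {title}'.upper()
```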
The element information is one added benefit of declared dynamic elements. This declared dynamic element may benefit from the absence of length restrictions that can cause problems for defined dynamic elements that have long text values. (See below.)
In most cases, you can do the same thing with either a dynamic element created by DEFINE ELEMENT or one created with DECLARE ELEMENT. Some of the distinctions between the two have already been pointed out:
- A declared dynamic element can have its own Userprocs, defined dynamically
- A declared dynamic element can have its own Inproc or Outproc rule string
- A declared dynamic element can have an element information packet, which would be useful with $REPORT or $PROMPT, for example
Other important distinctions:
- A declared dynamic element is not fettered by the same length restrictions of a defined one, where each constituent part is truncated to 255 characters, with the entire evaluated expression truncated to 255 characters at the end
- A declared dynamic element can be defined only within a protocol; a defined dynamic element can be defined in a command or within a protocol
- A declared dynamic element may have different internal and external values; the external value is created by executing the Outproc rule string, and the internal value is created by taking the external value and passing it through the Inproc rule string. This is the same method used by virtual elements.
However, because declared dynamic elements are a type of dynamic element, the total limit of 32 dynamic elements includes them as well. You can see how many declared dynamic elements are set, as well as their names, with the SHOW DYNAMIC ELEMENTS command, which also shows the definitions of defined dynamic elements. [See 20.1.]
Lastly, Inproc rules serve mainly to declare the element type. They are executed when $GETIVAL accesses these elements, and when the "FOR element" is retrieved from a record during WHERE-clause processing, similar to virtual element processing. They are NOT, however, executed when the relational expressions of a WHERE clause are initially processed. Instead, the associated values in the WHERE clause are converted by simple TYPE conversion rules, depending upon the TYPE of the declared dynamic element.
Dynamic elements may be used in most any place other elements can be; in particular, they are available in the following situations:
- in element lists on the TYPE and SET ELEMENTS commands
- in WHERE clauses in Global FOR and Partial FOR
- on the lefthand side of the relational operator in WHERE-clause inter-element relationships
- in the ALSO command on the lefthand side of relational expressions
- in the $GETxVAL functions
- in user-defined formats
- in the $REPORT format
- in SET FILTER commands
- in sorting operations using the SEQUENCE command
- in sorting operations using SPISORT
- in display sets
- to retrieve phantom elements
- through subfile paths
They may not be used:
- as input to any record (add, update, merge)
- with the $PROMPT format
- with the $XPLOT format
- on the righthand side of the relational operator in Global FOR WHERE-clause inter-element relationships
The various pieces of an expression in a dynamic element definition (the #variable values, the strings, the accessed element values, etc.) are each truncated to 255 characters if necessary before the expression is evaluated. When the expression is evaluated (i.e., when the dynamic element is accessed), the resulting value is truncated to 255 characters if necessary.
Note that all secondary @element and @@element values are retrieved along with system variables ($varname) only at the start of the structure (or of the record, if the primary is not in a structure) that contains the primary. The expression is then evaluated for each occurrence of the primary until the structure is exhausted. Thus, though secondary elements may occur multiple times within the structure, only their first occurrences are retrieved.
Here are some examples of dynamic elements, created for the RESTAURANT subfile:
-> select restaurant
-> show element names
 ID (key)
 DATE-ADDED, DA, DATEADD, DADD
 DATE-UPDATED, DUP, DU, DATEUP
 NAME, NAM, N
 LOCATION
   CITY, LOC, AR (key)
   STATE
 FOOD-QUALITY, F, FOOD
->    - (Only elements used in the examples below were listed.)
-> define element food.internal as @food
-> define element food.hex as $hex(@food)
-> find name pot au feu
-Result:  1 RESTAURANT
-> type name, food, food.internal, food.hex
NAME = Le Pot Au Feu;
FOOD-QUALITY = Excellent;
FOOD.INTERNAL = 4;
FOOD.HEX = 04;
->
The above example shows a common usage of dynamic elements: to examine the internal, or stored, form of an element. The FOOD-QUALITY element has an A46 ($ENCODE) processing rule that converts allowed values, such as "Excellent", to an integer. Here the two dynamic elements show the stored value for FOOD-QUALITY in two forms: FOOD.INTERNAL shows the stored value converted from integer to string, and FOOD.HEX shows the hexadecimal representation of the stored value. Hence, dynamic elements make it very simple to see the unconverted (i.e., not processed through OUTPROCs) values of an element.
Another common use of dynamic elements is to provide a means of converting the element value into a different form by processing it through SPIRES system functions. Here is another example, continuing our session above with the RESTAURANT subfile:
-> define element foodstars as $leftstr('****',@food)
-> type food, food.internal, foodstars
FOOD-QUALITY = Excellent;
FOOD.INTERNAL = 4;
FOODSTARS = ****;
->
The FOODSTARS element provides a graphic display of the FOOD-QUALITY (alias FOOD) element, which internally is stored as a value from 1 to 4. Specifically, the definition of FOODSTARS requests the first "n" characters of the character string "****", where "n" is the number stored for the element FOOD. (The external value of FOOD, represented by @@FOOD in a dynamic element definition, would not work, since the external form is a character string, such as "Excellent". It is important to know which form of an element to use in a dynamic element definition.)
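A Python analogue of the $LEFTSTR call used in the FOODSTARS definition (the argument order, string first and count second, is taken from the example above; leftstr is an invented name):

```python
def leftstr(s, n):
    # First n characters of s, like $LEFTSTR('****', @FOOD).
    return s[:n]

# FOOD-QUALITY "Excellent" is stored internally as 4:
stars = leftstr('****', 4)
```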
You can create a dynamic element that represents or processes the value in a system variable. For example, you could use the $DATE and $UDATE variables (the converted and unconverted forms of the current date respectively) in conjunction with the DATE-UPDATED element (alias DATEUP) to find out how many days it has been since a record was updated:
-> define element today for id as $date
-> define element days as $days($udate) - $days(@dateup)
-> find cuisine french
-Result:  50 RESTAURANTS
-> for result
+> set elements name, dateup, today, days
+> display 2
NAME = La plume de ma tante;
DATE-UPDATED = MON., MAR. 21, 1982;
TODAY = 03/30/82;
DAYS = 9;

NAME = La boule de la rose;
DATE-UPDATED = 01/01/82;
TODAY = 03/30/82;
DAYS = 88;
+>
The dynamic element TODAY represents the current value of the system variable $DATE. The element DAYS represents the number of days since the record was last updated, using the $DAYS function, which returns an integer representing the number of days the given date is after the arbitrary date of 1/1/0000. (Note that arithmetic is allowed in the "expression" of the dynamic element.)
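The DAYS arithmetic can be reproduced with Python's datetime module. Since $DAYS counts days from the arbitrary base date 1/1/0000, the difference of two $DAYS values is simply the number of days between the two dates (days_since_update is an invented name for this sketch):

```python
from datetime import date

def days_since_update(updated, today):
    # Equivalent of $days($udate) - $days(@dateup): the number of
    # days between the update date and the current date.
    return (today - updated).days
```

For the two records displayed above, the results match: 9 days from March 21 to March 30, and 88 days from January 1 to March 30, 1982.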
Another practical use of dynamic elements is to concatenate element values together. For example, the CITY and STATE elements are joined below, separated by a comma and a blank:
-> define element city.state as @@city ', ' @@state
-> set elements name, city, state, city.state
-> find name pot au feu
-Result:  1 RESTAURANT
-> type
NAME = Le Pot au Feu;
  CITY = Menlo Park;
  STATE = CA;
  CITY.STATE = Menlo Park, CA;
->
Note that the STATE and CITY.STATE elements are indented because they are in the same structure as CITY.
The functions $ELEMTEST and $ELNOTEST may be used with dynamic elements; note that if the TYPE parameter is used, as in $ELEMTEST(DYNELEM,TYPE), the result will always be DYN, regardless of the type specified in the TYPE option of the dynamic element definition.
Dynamic elements may only be used within formats by means of a statement such as:
GETELEM = #variable;
where the #variable contains the name of the dynamic element; the element named in the variable will be accessed during format execution.
The system variables $CVAL and $UVAL will always be the string form of the evaluated expression from the dynamic element definition.
Dynamic elements can be very useful in WHERE clauses within Global FOR mode.
Several forms of WHERE clauses with dynamic elements are supported. In all of those shown below, "dynelem" refers to the name of the previously defined dynamic element:
WHERE dynelem relational-operator value
That is the basic form of a WHERE clause. The value will be converted to the type of the dynamic element, which was specified in the DEFINE ELEMENT command. [See 20.1.] Also, each value of the dynamic element resulting from the evaluation of the dynamic element's expression will be converted to the specified type for comparison to the "value".
The OCCURS or LENGTH operator is also allowed:
WHERE dynelem [OCCURS|LENGTH] relational-operator value
Remember that for OCCURS, the number of occurrences of the primary element in the dynamic element definition controls the number of occurrences of the dynamic element.
Inter-element relationships may also be expressed, as long as the dynamic element precedes the relational operator:
WHERE dynelem relational-operator @elem
where "elem" is the name of the element to which the dynamic element is to be compared. The type of "elem" must be the same as the type of the dynamic element.
In any of the above forms, "same structure" processing of the dynamic element is not allowed. That is, an at-sign (@) preceding "dynelem" is ignored.
Here is a simple example of WHERE clause processing using one of the dynamic elements defined in an earlier section:
-> select restaurant
-> define element foodstars as $leftstr('****',@food)
-> for subfile where foodstars = ****
+> set elements name food foodstars
+> display first
NAME = Les Aiguilles;
FOOD-QUALITY = Excellent;
FOODSTARS = ****;
+>
Since no type was specified in the dynamic element definition, the dynamic element is considered a string element.
Here is an example where the TYPE option on the DEFINE ELEMENT command is necessary if proper WHERE-clause handling is to occur:
-> define element days as $days($udate)-$days(@dateup)
-> for subfile where days > 100
+> set elements name days
+> display
NAME = Chez Porky;
DAYS = 23;
+> def elem days type integer as $days($udate)-$days(@dateup)
+> for subfile where days > 100
+> display
NAME = La Deedah;
DAYS = 114;
+>
The first definition for DAYS did not declare the element as an integer, meaning it was considered a string. The CHEZ PORKY record fit the criteria because the string "23" sorts after the string "100". When the element definition was corrected to specify that the element should be considered an integer value, the integer 23 did not sort after the integer 100, so CHEZ PORKY was skipped.
As the second example shows, the TYPE option on the DEFINE ELEMENT command may not be necessary when you are displaying element values only. However, if you want to use a dynamic element in a WHERE clause, be sure the proper type is specified for comparison.
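The string-versus-integer behavior is easy to reproduce in Python, where the same comparison rules apply:

```python
# Character-by-character string comparison: '2' > '1' decides
# immediately, so "23" sorts after "100" and Chez Porky passes.
string_result = "23" > "100"

# As integers, 23 < 100, and the record is correctly skipped.
integer_result = 23 > 100
```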
Several error messages may result during the definition of a dynamic element. They include:
Other errors may occur during evaluation of a dynamic element, usually when an attempt is made to use a dynamic element in a forbidden situation, as in a $PROMPT format record display. An error message will result.
If an element accessed in the expression does not occur, the value returned for it will be null. In other words, if the element X is defined as @Y @Z and a record has no occurrence of Z, the value of X will consist solely of the internal value of element Y. (Remember that if Y did not occur, then the dynamic element X would not occur, because element Y is the first element in the expression, and is thus the primary.)
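A Python sketch of that rule (illustrative only; concat_values is an invented name, and a plain dictionary stands in for the record):

```python
def concat_values(record, names):
    # An element that does not occur contributes a null (empty)
    # value to the evaluated expression.
    return ''.join(record.get(name, '') for name in names)
```

With a record containing only Y, concatenating Y and Z yields just the value of Y.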
You may change the definition of an already existing dynamic element, but it is not advisable to change the primary element, especially if the new primary element is in a different structure than the old one. If you do this, errors may occur. It is better to clear that dynamic element before redefining it.
When you retrieve data in SPIRES, you nearly always begin by "filtering" the data, generally by suppressing the display of data that you do not want to see. For example, on the subfile level, by using a search command such as FIND, you are in effect asking SPIRES to filter out any records in the data base that fail to match your search criteria.
In much the same way that search commands filter data on a subfile level, the element filters described in this chapter filter data on the record level, letting you display (or scan or, in an input format, retrieve for updating) only element occurrences that fit the criteria you name.
For example, suppose you have a subfile called SUPPLIERS, containing records of companies from which you order equipment. Each record in SUPPLIERS contains the supplier's name, as well as a multiply-occurring structure, where each occurrence describes a particular order. If you retrieve all records with a given order date and display them without filters, you'll see all occurrences of the order structure for each record, including occurrences that you don't care about:
-> find date.ordered 1985
-Result:  1 SUPPLIER
-> type
ID = 10891;
SUPPLIER = Rutland Discount;
ORDER;
  DATE.ORDERED = 1979 Apr 1;   <---(Orders are displayed for
  ITEM = Typewriter;                all dates, not merely 1985)
ORDER;
  DATE.ORDERED = 1985 Apr 9;
  ITEM = Macintosh Keypad;
->
However, by setting a display filter with the SET FILTER command, you can narrow the display to occurrences with a particular date: e.g., you can display orders placed in 1985, filtering out orders placed in other years:
-> find date.ordered 1985
-Result: 1 SUPPLIER
-> set filter for order where date.ordered = 1985
-> type
ID = 10891;
SUPPLIER = Rutland Discount;
ORDER;
   DATE.ORDERED = 1985 Apr 9;    <--(Occurrences for 1979 are
   ITEM = Macintosh Keypad;           filtered out of the display)
->
In addition to limiting the display of element occurrences after a DISPLAY or TYPE request, filters can also be applied to the merging of data in a merge input format, and to the scanning performed by a WHERE clause in Global FOR.
The rest of this chapter discusses the SET FILTER command in more detail:
- Section 21.1 describes the syntax of the SET FILTER command and shows examples of some basic uses for filtering element displays.
- Section 21.2 discusses some basic capabilities and limits of filters.
- Section 21.3 describes commands for determining which filters are set, and for clearing filters.
- Section 21.4 discusses ways to filter by a count of occurrences.
- Section 21.5 shows some advanced uses of filters in conjunction with dynamic elements. [See Chapter 20 for information on dynamic elements.]
The SET FILTER command has the following syntax:
SET FILTER [(type)] FOR elem.name [(occ)] ...
      ... [IN limit] [(SEQUENCE elem.list)] [WHERE clause]
Note: Because the OVERLAY option has such significant impact on the remainder of the syntax, it is treated as if it were part of a separate command, SET FILTER OVERLAY, discussed in the next section. [See 21.1.1.]
The "type" option names the activity in which filtering should take place. Currently there are three possible values for "type", though you can name more than one of them (or all three) as part of a single SET FILTER command:
- DISPLAY, which causes filtering during a display request (this is the default if you leave off the "type" option);
- MERGE, which causes filtering during input through a merge format; and
- SCAN, which causes filtering during WHERE clause scanning in Global FOR.
You should explicitly name multiple "types" -- e.g., SET FILTER (SCAN, DISPLAY)... -- when you want more than one type to be in effect.
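For example, the following hypothetical command (based on the SUPPLIERS subfile used later in this chapter) sets a single filter that applies both to display requests and to WHERE clause scanning in Global FOR:

   -> set filter (display, scan) for order where date.ordered = 1985

With SET FILTER (SCAN) alone, the filter would apply to Global FOR scanning, but displays would remain unfiltered.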
Filters of type MERGE filter the merging of element values into a record, during input through a merge format. [See the manual "SPIRES Formats" for details on this option.] SCAN-type filters ensure that element occurrences retrieved by a Global FOR command are bound together on the same structural path. For more information on this option, see Section 4.4 of the SPIRES manual on Global FOR, or online, [EXPLAIN FILTERS, ON WHERE CLAUSES IN GLOBAL FOR.]
"Elem.name" is the name of the element being filtered; any element can be filtered, including a virtual or dynamic element. [See 20.] The element to be filtered can also be a structure, as in the example in the previous section.
The "(occ)" option specifies which occurrences of the element named should be processed by subsequent commands, after the WHERE clause on the SET FILTER command has finished filtering the record. The "IN limit" option specifies which occurrences should be examined by the SET FILTER command's WHERE clause in the first place. Thus one of these options limits occurrences before the WHERE filtering occurs, while the other option limits occurrences after WHERE filtering has occurred. These two options are discussed in detail in a section of their own. [See 21.4.]
The SEQUENCE option lets you specify how (or whether) the element values that pass filter constraints should be sequenced:
(SEQUENCE elem.name [(D)|(X)|(DX)] [elem.name...] )
If the filtered element is a structure, the SEQUENCE option may name one or more elements within the structure by which the structure should be sequenced. If the filtered element is not a structure, the SEQUENCE clause may name only the element itself. The (D) option causes filtered values to be sequenced in descending order, while the (X) option causes filtered values to be sequenced according to their external form, in those cases where the internal and external forms differ. [Compare the syntax of the SEQUENCE command, described in the "SPIRES Searching and Updating" manual.]
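For instance, assuming the SUPPLIERS subfile introduced earlier, a hypothetical command of this form would sequence the ORDER occurrences that pass the filter by date, in descending order:

   -> set filter for order (sequence date.ordered (d)) where item = keypad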
"WHERE clause" is a clause following the same rules as a WHERE clause in Global FOR, such as the clause "where date.ordered = 1985" in the example in the previous section. [See 21.] [See the SPIRES Global FOR manual for more information on WHERE clauses.] Among other uses, the "WHERE clause" option on the SET FILTER command lets you filter an element's occurrences according to each occurrence's value. Again, if the filtered element is a structure, the WHERE clause may name elements (one or more) within the structure that should be examined to determine the appropriate filtering. If the filtered element is not a structure, only the element itself may be named as the element to be examined in the WHERE clause (although other elements may be named for comparison using inter-element relations; see the Global FOR manual).
Consider again the SUPPLIERS subfile where each goal record contains the name and address of a supplier, as well as information about each ORDER (a structure) placed with it. Unfiltered records in the $REPORT format might look as follows:
-> find date.ordered = 1984
-Result: 3 SUPPLIERS
-> set format $report Name Date.Ordered Item
-> type
MAR. 17, 1984                                                PAGE 1
Name                    Date.Ordered   Item
Fly Pie Shoe Shop       1983 May 28    Electric Shoe Horns
                        1983 Jul 12    Sock Parts
                        1984 Jan 14    Toe Girdles
                        1984 Mar 20    Plaid Shoe Polish
Good Man Hardware       1982 Dec 31    Inflatable Doorknobs
                        1983 Aug 3     Hammer Cleaner
                        1984 Jan 15    Staple Gun Sights
3-D House of Beauty     1982 Dec 11    Pancake Hair Spray
                        1984 Feb 8     Eyelid Wax
                        1984 Feb 10    Dimple Putty
But if you set a filter, you can get rid of the extraneous pre-1984 orders:
-> set filter for order where date.ordered = 1984
-> type
MAR. 17, 1984                                                PAGE 1
Name                    Date.Ordered   Item
Fly Pie Shoe Shop       1984 Jan 14    Toe Girdles
                        1984 Mar 20    Plaid Shoe Polish
Good Man Hardware       1984 Jan 15    Staple Gun Sights
3-D House of Beauty     1984 Feb 8     Eyelid Wax
                        1984 Feb 10    Dimple Putty
Furthermore, with the SEQUENCE option, you can ask SPIRES to sequence occurrences that have passed the filter:
-> set filter for order (sequence item) where date.ordered = 1984
-> type
MAR. 17, 1984                                                PAGE 1
Name                    Date.Ordered   Item
Fly Pie Shoe Shop       1984 Mar 20    Plaid Shoe Polish
                        1984 Jan 14    Toe Girdles
Good Man Hardware       1984 Jan 15    Staple Gun Sights
3-D House of Beauty     1984 Feb 10    Dimple Putty
                        1984 Feb 8     Eyelid Wax
->
Each time the SET FILTER command is issued, it replaces all existing filters for the named element. In some situations, you need to add more filters to those that already are in effect for an element. For example, if a structure already has a display filter in effect, it is possible that you need to set a merge or scan filter that's different from the display filter; yet you don't want to replace the display filter either.
The OVERLAY option on the SET FILTER command gives you that flexibility. If you use the OVERLAY option, several other options on the SET FILTER command cannot be used; for that reason, you may wish to think of the SET FILTER OVERLAY command as a command distinct from SET FILTER. [On the other hand, the two commands are the same in most other ways; general references to SET FILTER in this SPIRES manual and others also apply to SET FILTER OVERLAY, even when it is not mentioned explicitly, which is a good reason to think of OVERLAY as an option on SET FILTER.]
The SET FILTER command with the OVERLAY option has this syntax:
SET FILTER OVERLAY [(type)] FOR elem.name WHERE clause
The "type" option names the activity in which additional filtering should take place. Currently there are three possible values for "type", though you can name more than one of them (or all three) as part of a single SET FILTER command:
- DISPLAY, which causes filtering during a display request (this is the default if you leave off the "type" option);
- MERGE, which causes filtering during input through a merge format; and
- SCAN, which causes filtering during WHERE clause scanning in Global FOR.
You should explicitly name multiple "types" -- e.g., SET FILTER OVERLAY (SCAN, DISPLAY)... -- when you want more than one type to be in effect.
Filters of type MERGE filter the merging of element values into a record, during input through a merge format. [See the manual "SPIRES Formats" for details on this option.] SCAN-type filters ensure that element occurrences retrieved by a Global FOR command are bound together on the same structural path. For more information on this option, see Section 4.4 of the SPIRES manual on Global FOR, or online, [EXPLAIN FILTERS, ON WHERE CLAUSES.]
"Elem.name" is the name of the element being filtered; any element can be filtered, including a virtual or dynamic element. [See 20.] The element to be filtered can also be a structure, which is quite common.
"WHERE clause" is a clause following the same rules as a WHERE clause in Global FOR, such as the clause "where date.ordered = 1985" in the example in a previous section [See 21.] and in the examples that follow. [See the SPIRES Global FOR manual for more information on WHERE clauses.] Among other uses, the "WHERE clause" option on the SET FILTER command lets you filter an element's occurrences according to each occurrence's value.
Note that the "(occ)", SEQUENCE and "IN limit" options of the SET FILTER command are not available when the OVERLAY option is in effect. If you need to use them, they must appear on the first SET FILTER command for the element, not on subsequent SET FILTER OVERLAY commands.
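For example, in this hypothetical pair of commands, the SEQUENCE option appears on the initial SET FILTER command; the subsequent overlay command supplies only a further WHERE clause:

   -> set filter for order (sequence item) where date.ordered = 1984
   -> set filter overlay for order where item string "a"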
Consider again the SUPPLIERS subfile used in the previous sections. Each record contains the name and address of a supplier, as well as information about each ORDER (a structure) placed with it. Let's use the example from the end of the last section as the start of our filter-overlay work:
-> find date.ordered = 1984
-Result: 3 SUPPLIERS
-> set format $report Name Date.Ordered Item
-> type
AUG. 17, 1989                                                PAGE 1
Name                    Date.Ordered   Item
Fly Pie Shoe Shop       1983 May 28    Electric Shoe Horns
                        1983 Jul 12    Sock Parts
                        1984 Jan 14    Toe Girdles
                        1984 Mar 20    Plaid Shoe Polish
Good Man Hardware       1982 Dec 31    Inflatable Doorknobs
                        1983 Aug 3     Hammer Cleaner
                        1984 Jan 15    Staple Gun Sights
3-D House of Beauty     1982 Dec 11    Pancake Hair Spray
                        1984 Feb 8     Eyelid Wax
                        1984 Feb 10    Dimple Putty
Next, set a filter to get rid of the extraneous pre-1984 orders:
-> set filter for order where date.ordered = 1984
-> type
AUG. 17, 1989                                                PAGE 1
Name                    Date.Ordered   Item
Fly Pie Shoe Shop       1984 Jan 14    Toe Girdles
                        1984 Mar 20    Plaid Shoe Polish
Good Man Hardware       1984 Jan 15    Staple Gun Sights
3-D House of Beauty     1984 Feb 8     Eyelid Wax
                        1984 Feb 10    Dimple Putty
Next, set an overlay filter to filter out any orders where the item name contains the letter "a":
-> set filter overlay for order where item string "a"
-> type
AUG. 17, 1989                                                PAGE 1
Name                    Date.Ordered   Item
Fly Pie Shoe Shop       1984 Jan 14    Toe Girdles
Good Man Hardware
3-D House of Beauty     1984 Feb 8     Eyelid Wax
-> show filters
SET FILTER (Display) FOR order where date.ordered = 1984
SET FILTER Overlay (Display) FOR order where item string "a"
->
In the example, SPIRES first processes the ORDER structure through the first filter; any occurrence that passes that one then gets processed through the second.
An important aspect of filters that the example above shows is that they will not remove records from the display; they will only remove occurrences of the element or structure being filtered. Thus, although Good Man Hardware had no occurrences of the ORDER structure that fit the two filter criteria in effect, it still appears in the display, albeit with no occurrences of ORDER shown.
Two other considerations regarding overlay filters: Overlay filters are particularly useful in situations where you are not sure whether the user already has other filters in effect; you can add overlay filters, and later remove them, without interfering with any filters already set. [See 21.3.]
Also, remember that overlay filters may be of a different type from other filters already set for the element (scan instead of display, for example).
All element filters in SPIRES have the following capabilities and limits:
- Up to 32 filters may be set at a time for a given goal record set.
- Filters may be set through a path (e.g., THROUGH 1 SET FILTER...), and up to 32 filters may be set on each alternate subfile path. However, any filters set on an alternate format path will affect the primary path, and vice versa.
- Only one filter may be set for a single element, unless the later filters are overlay filters. (Setting a second filter for an element causes the first filter to be replaced by the newcomer.)
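For example, in this hypothetical session, the second SET FILTER command replaces the first, so only one filter remains in effect for the ORDER structure:

   -> set filter for order where date.ordered = 1984
   -> set filter for order where item = keypad
   -> show filters
   SET FILTER (Display) FOR order where item = keypad
   ->

To keep both filters in effect, the second command would have to use the OVERLAY option. [See 21.1.1.]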
In addition, filters for display affect the following commands: [Warning: in Partial FOR, when the element named in the "FOR element" command is the same as the element named in the SET FILTER command, filtering may fail to take place.]
- the DISPLAY command (including DISPLAY in Partial FOR and in "FOR *" referenced record processing)
- the TYPE command
- the SEQUENCE command
- the GENERATE SET command
- GETELEM and IND-STRUCTURE statements in display formats
- the $GETIVAL and $GETXVAL functions (unless they include the UNFILTERED parameter)
- the $GETCVAL and $GETUVAL functions when these functions include the FILTERED parameter
In general, filters do not affect the following commands:
- the TRANSFER command
- input commands, such as ADD or UPDATE (Exception: filters of type "merge" apply to merging of data into a record via the MERGE command or a merge input format.)
- the REFERENCE command, unless the FILTERED option is specified
To see what filters are currently in effect for the goal records, you can issue the SHOW FILTERS command.
To clear one or more filters, use the CLEAR FILTER command:
CLEAR FILTER[S] [LAST] [OVERLAY] [(type)] [FOR elem-name]
To clear all filters currently in effect, issue the CLEAR FILTERS command.
The LAST option indicates that the last filter set that matches any other criteria in the CLEAR FILTER command should be cleared. For instance, you can clear the last overlay filter set for a particular element, or the last filter set.
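For example, if several overlay filters had been set for the ORDER structure, this hypothetical command would clear only the most recently set of them, leaving the base filter and any earlier overlays in effect:

   -> clear filter last overlay for order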
To clear all overlay filters, issue the command CLEAR FILTERS OVERLAY; any filters set without the OVERLAY option will remain in effect. [See 21.1.1.]
The "type" option, with the values in parentheses, lets you limit the filter clearing to particular types of filters: display, scan or merge. To specify more than one, separate them with commas, as in "(display, scan)". [See 21.1.]
The "FOR elem-name" option lets you specify a particular element with a filter or filters you want to clear.
Any of the options may be used in combination with the others. Note that the "S" option, which changes FILTER to FILTERS, really has no significant effect; CLEAR FILTER is the same as CLEAR FILTERS.
The following segment of a terminal session shows how these three commands could be used:
-> set filter for order where date.ordered ~= 1985
-> set filter for supplier where supplier = ibm
-> set filter overlay (scan) for order where cost > $100.00
-> show filters
SET FILTER (Display) FOR order where date.ordered ~= 1985
SET FILTER (Display) FOR supplier where supplier = ibm
SET FILTER Overlay (Scan) FOR order where cost > $100.00
-> clear filter for supplier
-> show filters
SET FILTER (Display) FOR order where date.ordered ~= 1985
SET FILTER Overlay (Scan) FOR order where cost > $100.00
-> clear filters overlay for order
-> show filters
SET FILTER (Display) FOR order where date.ordered ~= 1985
-> clear filters
-> show filters
-No filters found
->
Two options already mentioned for the SET FILTER command, "IN limit" and "(occ)", cause specific occurrences of an element -- for instance, the fourth and fifth occurrence of an element that occurs five times -- to be filtered. These two options never filter an element occurrence based on the occurrence's value (as a WHERE clause sometimes does, e.g., "where date.ordered = 1985"), but only filter according to the sequential order of the occurrence in the record.
The "IN limit" option names which occurrences of the element should be examined for WHERE clause compatibility. Only those occurrences of the element that are named by the IN clause will be processed any further; in effect, those that are not named are eliminated by this option before the WHERE clause even takes effect.
"Limit" of "IN limit" may specify any one of the following values:
n       - a number, representing the "nth" occurrence
FIRST   - the first occurrence
LAST    - the last occurrence
LAST-n  - the "nth" occurrence from the last
n/m     - the "nth" through "mth" occurrences; "n" or "m" may be
          any of the above forms
ALL     - all occurrences (the same as leaving the option off)
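For instance, since "n" or "m" of the "n/m" form may themselves be FIRST, LAST or LAST-n, this hypothetical command would have the WHERE clause examine only the last two occurrences of the ORDER structure:

   -> set filter for order in last-1/last where item = keypad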
"Occ" of "(Occ)" can have any one of the values shown on the chart above. So, for example, in the command below, the WHERE clause would not even examine the third and following occurrences of the order structure, because "IN 1/2" limits the scan to the first two occurrences only:
-> set filter for order in 1/2 where item = keypad
The option "(occ)" specifies which of the element occurrences that passed the "IN limit" and WHERE filters should be processed. Note that, in contrast to "IN limit", the "(occ)" option takes effect only after the WHERE clause filtering. (Without a WHERE clause, the "IN limit" and "(occ)" options would have the same meaning.)
So, for example, after the WHERE clause below has filtered occurrences of the order structure to find keypad orders, the option "(1/2)" would cause subsequent display requests to show only the first two keypad orders, suppressing any orders beyond these first two:
-> set filter for order (1/2) where item = keypad
One small warning: the order of the syntax on the SET FILTER command does not really reflect the conceptual order in which these options would take place. The "(occ)" option precedes the "IN limit" option and the "WHERE clause" option in the command syntax, even though conceptually it takes place later.
Here are some examples of the use of the "IN limit" and "(occ)" options, using the SUPPLIERS example again. First, we will add the "IN limit" option to the filter set earlier:
-> set filter for order in 1/2 where date.ordered = 1984
-> type
MAR. 17, 1984                                                PAGE 1
Name                    Date.Ordered   Item
Fly Pie Shoe Shop
Good Man Hardware
3-D House of Beauty     1984 Feb 8     Eyelid Wax
->
The example demonstrates that only the first two occurrences of the ORDER structure were examined for DATE.ORDERED values of 1984. The other occurrences were filtered out. Only the record for the 3-D House of Beauty contained a 1984 DATE.ORDERED in the first two occurrences of the ORDER structure. (Note that the other two records are still shown, because it was the ORDER structure that was filtered, not the entire record.)
Compare the above example to the next one, where the "(occ)" option replaces the "IN limit" option:
-> set filter for order (1/2) where date.ordered = 1984
-> type
MAR. 17, 1984                                                PAGE 1
Name                    Date.Ordered   Item
Fly Pie Shoe Shop       1984 Jan 14    Toe Girdles
                        1984 Mar 20    Plaid Shoe Polish
Good Man Hardware       1984 Jan 15    Staple Gun Sights
3-D House of Beauty     1984 Feb 8     Eyelid Wax
                        1984 Feb 10    Dimple Putty
The SET FILTER command above specifies that the first two occurrences of the ORDER structure that have DATE.ORDERED values for 1984 should be processed for each record.
Next, here is an example showing the "(occ)" option used by itself:
-> set filter for order (1/2) (sequence date.ordered(D))
-> type
MAR. 17, 1984                                                PAGE 1
Name                    Date.Ordered   Item
Fly Pie Shoe Shop       1983 Jul 12    Sock Parts
                        1983 May 28    Electric Shoe Horns
Good Man Hardware       1983 Aug 3     Hammer Cleaner
                        1982 Dec 31    Inflatable Doorknobs
3-D House of Beauty     1984 Feb 8     Eyelid Wax
                        1982 Dec 11    Pancake Hair Spray
Only the first two occurrences (1/2) of the ORDER structure in each record were processed. They were arranged in descending order by the DATE.ORDERED element.
The SET FILTER command may be applied to dynamic elements. [See 20.] You can, for example, treat multiple occurrences of an element as if they were separate elements, which can be useful in formats such as $REPORT. This section will demonstrate some of the diverse uses of this combination of SPIRES features. These examples should be helpful when working out similar types of display problems yourself. (Warning: Not all uses of dynamic elements and filters need to be this complicated.)
The example below, using the SUPPLIERS subfile one last time, shows how you could get a report of orders arranged sequentially by date. Extraneous information (i.e., other orders from the company that are not relevant at that point in the report) is filtered out. One effect of this is to make it appear that each DATE.ORDERED element is the "key" to a separate record rather than part of a multiply-occurring structure; that is, it creates the appearance of a flat file.
-> define element xdate for date.ordered type hex as $datein($pathkey)
-> - The above command creates a dynamic element of type hex
-> - that is based on the key of the index record that will
-> - be established under the FOR INDEX processing later.
-> - Next, filter out all ORDERs except the one for the date
-> - specified by that index record.
-> set filter for order where xdate = @date.ordered
-> - The "@" indicates "inter-element relationship" in the
-> - WHERE clause.
-> for index date.given
+> set format $report Date.Ordered Item Name
+> display 6
MAR. 17, 1984                                                PAGE 1
Date.Ordered   Item                    Name
1982 Dec 11    Pancake Hair Spray      3-D House of Pancakes
1982 Dec 31    Inflatable Doorknobs    Good Man Hardware
1983 May 28    Electric Shoe Horns     Fly Pie Shoe Shop
1983 Jul 12    Sock Parts              Fly Pie Shoe Shop
1983 Aug 3     Hammer Cleaner          Good Man Hardware
1984 Jan 14    Toe Girdles             Fly Pie Shoe Shop
+>
As the example above shows, it can sometimes be useful to treat separate occurrences of an element as if they were separate elements. Suppose you are creating a report using the BLOOD DONORS subfile:
-> select blood donors
-> set format $report Name(1,20) Phone.Number(+2)
-> find city = palo alto
-Result: 5 DONORS
-> type
June 16, 1983                                                Page 1
Name                 Phone.Number
-------------------- ---------------------------------------
Harrison, Chevy      212-1905
Weinberger, Jaspar   212-4678 (home)
                     328-0920 ext 93 (work)
Moreblood, Alex      333-3871 (home)
                     854-3300 ext 1111 (work)
Brown, Robert        327-1234 (home)
                     555-1212 ext 345 (work)
Claret, Van E.       271-5678
-> set format $report Name(1,20) Phone.Number(+2,adjust=align)
-> type
June 16, 1983                                                Page 1
Name                 Phone.Number
-------------------- ---------------------------------------
Harrison, Chevy      212-1905
Weinberger, Jaspar   212-4678 (home) 328-0920 ext 93 (work)
Moreblood, Alex      333-3871 (home) 854-3300 ext 1111 (work)
Brown, Robert        327-1234 (home) 555-1212 ext 345 (work)
Claret, Van E.       271-5678
In the first display, records having two occurrences of the PHONE.NUMBER element needed two lines of the display. In the second, with the ALIGN value for the ADJUST parameter in $REPORT, the occurrences were joined together with a blank, but now they appear to be a single occurrence of the PHONE.NUMBER element.
Here is another way, which uses dynamic elements and filters to seemingly create two elements from one:
-> define element phone1 as @@phone.number
-> define element phone2 as @@phone.number
-> set filter for phone1(1)
-> set filter for phone2(2)
-> set format $report Name(1,20) Phone1(+2,21) Phone2(+2,25)
-> type
June 16, 1983                                                Page 1
Name                 Phone1                Phone2
-------------------- --------------------- -------------------------
Harrison, Chevy      212-1905
Weinberger, Jaspar   212-4678 (home)       328-0920 ext 93 (work)
Moreblood, Alex      333-3871 (home)       854-3300 ext 1111 (work)
Brown, Robert        327-1234 (home)       555-1212 ext 345 (work)
Claret, Van E.       271-5678
->
Both dynamic elements have the same definition. However, the two filter commands effectively "redefine" them, telling PHONE1 to filter out all occurrences except the first one, and PHONE2 all but the second.
Handling separate occurrences of an element as separate elements thus allows you to position them independently in a $REPORT format. Combined with the capability of multi-row positioning, you can place separate occurrences of an element practically wherever you want.
In another situation, you might want to graphically display ranges for a given element. Suppose, for example, you want to place restaurants in categories by their average prices, using a low-medium-high division:
-> select restaurant
-> define element low type=integer as @price
-> define element medium type=integer as @price
-> define element high type=integer as @price
-> set filter for low where low < 1000
-> set filter for medium where medium < 2000 and >= 1000
-> set filter for high where high >= 2000
-> set format $report Name(1,30,ind -1) Low(33,5,edit '$$.99/2')
-> set format * + Medium(40,6,edit '$$$.99/2')
-> set format * + High(48,6,edit '$$$.99/2')
-> find city = san francisco
-Result: 45 RESTAURANTS
-> and bar = cocktails
-Result: 13 RESTAURANTS
-> sequence price
-Stack: 13 RESTAURANTS
-> type
June 16, 1983                                                Page 1
Name                           Low    Medium High
------------------------------ -----  ------ ------
Mulcrevy's                     $6.00
Iron Pot                       $6.00
The Neon Chicken               $7.50
Pier 54                        $8.00
Cafe San Marcos                $8.00
Noe Valley Bar & Grill                $10.00
The Squire                            $11.00
Blue Boar Inn                         $12.00
Maxwell's Plum                        $15.00
Alfred's Restaurant                   $15.00
Ivy's                                 $18.00
Canlis'                                      $20.00
Fournou's Ovens                              $22.00
->
The element PRICE is stored as an integer (representing the value in cents). The dynamic elements LOW, MEDIUM and HIGH are defined equivalently to the element PRICE in its internal form -- they are declared to be integers too, for proper handling in the WHERE clause of the SET FILTER commands. The three SET FILTER commands define the range allowed for each dynamic element; if the value of PRICE does not fall into a given range, that dynamic element is filtered out for the given record. (The $REPORT format applies edit masks to the values to display them in the proper dollars-cents format.)
In that example, the element being redefined dynamically was singly occurring: a restaurant has only one average price. However, suppose the element were multiply occurring and you wanted a count of the number of values in each range for each record. Similarly, suppose you had restaurants from several cities, not just San Francisco, and you wanted to know the number of restaurants in each that fell into each of the price categories shown above:
-> select restaurant
-> define element low type=integer as @price
-> define element medium type=integer as @price
-> define element high type=integer as @price
-> set filter for low where low < 1000
-> set filter for medium where medium < 2000 and >= 1000
-> set filter for high where high >= 2000
-> set format $report define value no.medium as count medium
-> set format * define value no.high as count high
-> set format * define value no.low as count low
-> set format * elem City No.High No.Medium No.Low
-> set format * grouped by city
-> set format * option summarize
-> find city menlo park or mountain view or palo alto
-Result: 93 RESTAURANTS
-> sequence city
-Stack: 93 RESTAURANTS
-> type
June 16, 1983                                                Page 1
City           No.High        No.Medium      No.Low
-------------- -------------- -------------- --------------
Menlo Park     1              1              14
Mountain View  0              0              7
Palo Alto      2              15             48
               3              16             69
->
The same dynamic elements and filters are established as before. Three summary elements, which count the number of occurrences of each of the dynamic elements, are established in some of the SET FORMAT commands (e.g., DEFINE VALUE NO.LOW AS COUNT LOW). The search result is sequenced by city and then displayed in summary mode, which eliminates the detail lines and displays only summary information by city. Hence, Menlo Park has one high-priced, one mid-priced, and fourteen low-priced restaurants represented in the RESTAURANT subfile. (The bottom line, displaying column totals, shows that five of the 93 restaurants did not have a value for PRICE.)
Note: This facility is primarily for use by SPIRES system programmers, though it may be useful to SPIRES users in certain applications.
This facility allows you to control whether parts of element values, entire element values or entire records are added to a subfile, using IF-testing of statements within the input.
$IF-testing (a la PL360) is now allowed in FASTBILD, SPIBILD and SPIRES for record input (either formatted or unformatted). This mode of input is enabled by a new command: SET IFTest, and disabled by SET NOIFTest. Flags may be initialized with a command of the form:
SET FLAG [char] [ON|OFF]
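For example, these hypothetical commands turn the '+' flag on and the 'a' flag off:

   -> set flag + on
   -> set flag a off

(As the terminal session below shows, issuing SET FLAG also sets IFTEST.)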
This facility recognizes a set of "directives" in the input record stream (e.g. the ACTIVE file). These directives must be in uppercase and start in column 1 of a separate input line. The following directives are recognized:
$SET a     Sets the 'a' flag.  'a' must be in column 6.  Column 5
           and columns 7 and on are ignored.

$RESET a   Clears the 'a' flag.  'a' must be in column 8.  Column 7
           and columns 9 and on are ignored.

$IFT a b   'a' and 'b' may be any characters but must appear in
$IFF a b   columns 6 and 8, respectively.  Columns 5 and 7, and
           columns 9 and on are ignored.  These directives examine
           the setting of the 'a' flag.  If 'a' is $SET for $IFT,
           or $RESET for $IFF, record input continues normally with
           the next input line.  Otherwise, input lines are read and
           ignored until an $END b directive is found whose 'b'
           character matches the 'b' of the $IFT or $IFF line.  At
           that point normal record input is resumed with the next
           input line.

$END b     'b' must be in column 6.  Column 5 and columns 7 and on
           are ignored.  This directive terminates the $IFT or $IFF
           directives.
Ignored columns may be used for comments. An example follows:
-> - We are in ORVYL/SPIRES
-> select subfile
-> show elements dictionary
 ** Fixed **
REC01 (key)
 ** Optional **
LINE, L
-> use spires.test
-> list
     1. LINE = This line will always be included;
     2. LINE = This line will be included if the '+' flag
     3. $IFT + K
     4.    is $SET;
     5. $END K
     6. $IFF +
     7.    is $RESET;
     8. $END
-> add
-UNKNOWN MNEMONIC: $END
-ERROR AT LINE 7.
-UNRECOGNIZED: $END
-UPDATE ABORT.  CODE=S2.
-> set flag +
-> - Also SETs IFTEST
-> add
-ADDED RECORD 1
-> display 1
REC01 = 1;
LINE = This line will always be included;
LINE = This line will be included if the '+' flag is $SET;
-> set flag + off
-> - IFTest remains set
-> add
-ADDED RECORD 2
-> display 2
REC01 = 2;
LINE = This line will always be included;
LINE = This line will be included if the '+' flag is $RESET;
This facility can also be used to advantage in SPIBILD and FASTBILD to skip or include input lines (even entire records) under certain conditions expressed by the setting of the flags. However, care should be taken if the data is likely to contain the strings "$SET", "$RESET", "$IFT", "$IFF" and "$END" at the start of a line (column 1).
(Note: Most of the material in this chapter will eventually be incorporated into the manual "SPIRES File Definition".)
"Phantom structures" are a major extension of SPIRES' subgoal capabilities. A record-type (call it REC02) containing keys or locators to another record-type (say, REC01) may contain a phantom structure, each occurrence of which is actually a record in REC01 that is pointed to, not just its pointer. The two record types may be in the same or different files.
Phantom structures provide a way of creating subgoal connections in the file definition itself. The goal records in one record-type appear as structures in another. They are tied together as "virtual hierarchies", "virtual" in that the records of one record-type, though accessible from the second, are not redundantly stored there. End users may have the choice of using the record-types together for searching and retrieving data or using them separately to make updating convenient.
The next section will explain the file definition statements necessary for creating phantom structures, including several examples of typical uses. [See 23.1.] The following section will enumerate the capabilities and limitations of phantom structures. [See 23.2.]
Although phantom structures are most commonly and most efficiently created in the file definition itself, they may also be defined dynamically (that is, lasting only as long as the current subfile remains selected) through a special form of the DEFINE ELEMENT command. [See 23.3.]
This section will explain the statements necessary in a file definition to create phantom structures. (Keep in mind that you can also create "dynamic" phantom structures that do not require file definition changes. [See 23.3.]) There are two types of phantom structures: "subgoal" phantom structures, where the connected record-types are both in the same file; and "subfile" phantom structures, where the record-type whose records become the occurrences of the phantom structure is the goal record-type for a subfile, which may or may not be part of the same file.
A subgoal phantom structure is coded as a virtual element that redefines an element whose value points to the subgoal record. It is often coded as simply as this, where the locator element in REC02 is redefined by the virtual element SUBLCTR, the phantom structure:
 (1)  RECORD-NAME = REC02;
 (2)  FIXED;
 (3)  KEY = DATE; INPROC = $DATE; OUTPROC = $DATE.OUT;
 (4)  OPTIONAL;
 (5)  ELEM = POINTER;
 (6)  TYPE = LCTR;
 (7)  VIRTUAL;
 (8)  ELEM = SUBLCTR;
 (9)  TYPE = STR;
(10)  REDEFINE = POINTER;
(11)  STRUCTURE = SUBLCTR;
(12)  PHANTOM;
(13)  SUBGOAL = REC01;
(14)  VIA = LCTR;
The first six lines are a regular index record definition; the last eight are the lines added to create the phantom structure SUBLCTR. The virtual element SUBLCTR redefines the POINTER element, which contains the pointer to REC01. SUBLCTR is defined as a structure (line 9), whose definition is given in lines 11 to 14: each occurrence of POINTER means one occurrence of SUBLCTR, which consists of one record from REC01.
The VIA clause tells SPIRES how to interpret the redefined element value (POINTER, in this case) in order to retrieve the phantom record. Possible values are:
- LCTR -- The redefined element's value is a SPIRES residual locator. This is the default (and thus does not have to be stated) if the type of the redefined element is LCTR (as POINTER is in the above example). When VIA LCTR is in effect, SPIRES will pass the retrieved locator value (the POINTER element in the above example) through the Outprocs and then Inprocs (if any) of the virtual element that redefines it (SUBLCTR in the example). You would rarely define an Inproc or Outproc in this situation, but it can occasionally be useful.
- INTKEY -- The redefined element's value is the phantom record's key in its internal form (i.e., after Inproc rules have been applied). This is the default if the redefined element's type is not LCTR.
- EXTKEY -- The value of the redefined element is in external form; this tells SPIRES to run the value through the phantom record key's Inproc rules in order to fetch it.
- TRANS -- In this special case, for retrieving transaction records from the $TRANSACTIONS record-type, the redefined element's value is ignored; however, that element must occur for the phantom structure to occur, so usually you would choose the key of the current goal record as the redefined element. The subgoal would be the name of the pseudo-record-type that holds $TRANSACTIONS. The phantom structure is then available only under FOR TRANSACTIONS. See the SPIRES manual "File Management" for more information about this feature; online, EXPLAIN $TRANSACTIONS RECORD-TYPE, USING.
How can this facility be used? The simplest way would be to select a subfile for which REC02 is the goal record (or attach it with the ATTACH command), set the virtual element and then display some records:
-> select date index
-> show element names
 DATE
 POINTER
 SUBLCTR
-> set elements date sublctr
-> display 8/29/52
DATE = 8/29/52;
SUBLCTR;
   REC01 = 327;
   NAME = Lynn McRae;
   DATE = 8/29/52;
SUBLCTR;
   REC01 = 217;
   NAME = Harlow P. Whitcomb;
   DATE = 8/29/52;
->
Two records of REC01 become occurrences of the phantom structure SUBLCTR for a particular REC02 record, even though they are not stored in REC02. The redefined POINTER element tells SPIRES which records in REC01 to retrieve for the phantom structure.
Note that the structure SUBLCTR had to be set because it is a virtual element -- virtual elements are not displayed by the standard SPIRES format unless they are explicitly set (or named in a TYPE command). Note too that when you define phantom structures, it is a good idea to choose distinct element names (or add distinct aliases if two elements must share a name) to avoid confusion. For instance, it might be preferable to rename the DATE element in REC02 as INDEXED.DATE, so that it is not confused with the DATE element in REC01.
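The retrieval just described can be pictured as an on-demand join: the pointer values stored in the index record are followed at display time, and nothing from REC01 is copied into REC02. Here is a rough sketch in Python; the dictionaries and the resolve_phantom helper are invented for illustration and are not SPIRES structures:

```python
# Hypothetical data standing in for the REC01 goal records and a
# REC02 index record from the example above.
rec01 = {
    327: {"REC01": 327, "NAME": "Lynn McRae", "DATE": "8/29/52"},
    217: {"REC01": 217, "NAME": "Harlow P. Whitcomb", "DATE": "8/29/52"},
}

rec02 = {"8/29/52": {"DATE": "8/29/52", "POINTER": [327, 217]}}

def resolve_phantom(index_record, goal_records):
    """Follow the redefined POINTER values to fetch the subgoal records.
    Like SUBLCTR, each POINTER occurrence yields one occurrence of the
    phantom structure; the records remain stored only in goal_records."""
    return [goal_records[p] for p in index_record["POINTER"]]

occurrences = resolve_phantom(rec02["8/29/52"], rec01)
for rec in occurrences:
    print(rec["NAME"])
```

The point of the sketch is only that the "structure" occurrences are fetched through the pointers each time, which is why the subgoal records are never redundantly stored in the index record-type.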
Subfile phantom structures let you retrieve the goal records from another subfile through phantom structures if you have the keys to those records in the current goal record-type.
Suppose you have two subfiles, one with records about automobile drivers and the other with records about their cars. The two subfiles have the following elements and indexes:
   CARS subfile              DRIVERS subfile

   AUTO.LICENSE (key)        DRIVERS.LICENSE (key)
  *AUTO.COLOR                DRIVERS.NAME
   DRIVERS.LICENSE           DRIVERS.EYE.COLR

   * = indexed
The goal record definition for the CARS subfile could include a phantom structure using the common element DRIVERS.LICENSE:
RECORD-NAME = CARS;
FIXED;
  KEY = AUTO.LICENSE;
OPTIONAL;
  ELEM = AUTO.COLOR;
  ELEM = DRIVERS.LICENSE;
VIRTUAL;
  ELEM = LINK.TO.DRIVERS;
    REDEFINE = DRIVERS.LICENSE;
    TYPE = STR;
STRUCTURE = LINK.TO.DRIVERS;
  PHANTOM;
  SUBFILE = DRIVERS;
  VIA = INTKEY;
Instead of a SUBGOAL statement in the PHANTOM structure definition, a SUBFILE statement, naming the appropriate subfile, appears. Users trying to use the phantom structure must have access to the named subfile in order to retrieve records from it -- you cannot use phantom structures to circumvent subfile security constraints.
The subfile name given in the SUBFILE statement may be preceded by the "&filename" prefix in case the subfile name is not unique. [See 2.]
The VIA statement has the same possible values as explained above for subgoal phantom structures.
This file construction allows you to search two subfiles simultaneously. For example, you could look for people who have blue eyes and blue cars, as shown below. This example takes advantage of phantom structures in WHERE clauses. [See 23.2.]
-> select cars
-> find auto.color = blue
-> for result where drivers.eye.colr = blue
+> set format $report drivers.name
+> display all
   ...
Subgoal capabilities are also available through formats. Besides allowing the use of phantom structures in WHERE clauses, the advantage of defining the subgoal in the file definition is that it may be used without a custom format having to be created: you can take advantage of phantom structures with the standard SPIRES format, as shown above, and in some of the system formats, such as $REPORT.
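The blue-cars/blue-eyes search above is, in effect, a filter on the primary records combined with a keyed fetch from the second subfile. A loose Python analogy follows; the sample plates, license numbers, names, and the blue_car_blue_eyes helper are all hypothetical:

```python
# Hypothetical CARS and DRIVERS subfiles, keyed as in the text.
cars = {
    "4ABC123": {"AUTO.COLOR": "blue", "DRIVERS.LICENSE": "D100"},
    "5XYZ789": {"AUTO.COLOR": "blue", "DRIVERS.LICENSE": "D200"},
    "6QRS456": {"AUTO.COLOR": "red",  "DRIVERS.LICENSE": "D100"},
}
drivers = {
    "D100": {"DRIVERS.NAME": "Pat Morgan", "DRIVERS.EYE.COLR": "blue"},
    "D200": {"DRIVERS.NAME": "Lee Chu",    "DRIVERS.EYE.COLR": "brown"},
}

def blue_car_blue_eyes(cars, drivers):
    """FIND AUTO.COLOR = BLUE, then a WHERE test against the phantom
    record: fetch each driver through the stored license key and keep
    the ones whose eye color also matches."""
    names = []
    for car in cars.values():
        if car["AUTO.COLOR"] != "blue":
            continue
        driver = drivers[car["DRIVERS.LICENSE"]]  # the phantom fetch
        if driver["DRIVERS.EYE.COLR"] == "blue":
            names.append(driver["DRIVERS.NAME"])
    return names
```

As in SPIRES, the driver records are fetched through the common key at search time rather than being stored in the CARS records.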
Many SPIRES tools can work with phantom structures. This section will discuss some of them, showing how they can be useful. First, however, here is a list of current phantom structure restrictions:
- Phantom structures cannot be used for input.
- Phantom elements may not be passed to other record-types in SPIBILD.
- They may not access other phantom structures.
- The names of phantom elements are not shown by the SHOW ELEMENTS command, although the names of phantom elements that are currently set are shown by the SHOW ELEMENTS SET command.
- Phantom elements can be retrieved by dynamic elements only using the $GETxVAL functions.
- In Partial FOR, the element named in the "FOR element" command cannot be from the phantom structure; it may be the name of the phantom structure, however (e.g., SUBLCTR in the earlier example).
- Phantom elements are not recognized in WHERE clauses in SPIBILD.
Some of these restrictions may well be removed in the future, in response to specific user needs.
Here are some of the areas in which phantom structures are useful:
Phantom elements may be used wherever regular elements can be used in WHERE clauses, including "same structure" processing and "inter-element relations" processing. In the latter case, however, the elements must all be in the same record-type. [See 23.1 for an example of their use with WHERE clauses.] If you need to compare an element in the primary record with one in the phantom structure, you may be able to define a dynamic element that uses $LOOKSUBF or $LOOKSUBG to retrieve the phantom element and then use the dynamic element (only on the left-hand side of the inter-element relationship) in comparison with the primary record's element.
Both the data element being filtered and the elements within any filter's WHERE clause may be phantom structures or elements. The restriction noted above for inter-element relations applies to filter WHERE clauses too.
Here is an example of a valid SET FILTER command:
SET FILTER FOR SUBLCTR WHERE NAME STRING WHITCOMB
You can use the $GETxVAL functions ($GETCVAL, $GETXVAL, $GETUVAL and $GETIVAL, which retrieve "unconverted" and "converted" element occurrences from a referenced record) to retrieve element occurrences from within a phantom structure. However, you cannot use the "structural occurrence map" parm on those functions to navigate within nested structures in the referenced record. Note that to retrieve multiple phantom structure elements it is more efficient to write a custom SPIRES format than to use these functions multiple times. See the manual "SPIRES Protocols" for more information on these functions, or EXPLAIN them online.
The SET ELEMENTS command may be used to set the phantom structure for display in the standard SPIRES format. It may also be used to set individual phantom elements within a phantom structure. Similarly, the entire phantom structure or elements within it may be named in the "element-list" option of the TYPE command. If individual elements of a phantom structure rather than the entire phantom structure are set, the SHOW ELEMENTS SET command will list those elements separately under a heading that names the phantom structure, e.g.,
-> show elements set
 NAME
 ADDRESS
 Phantom: PLEDGE.RECORD
   AMOUNT
In this example, three elements are set: NAME and ADDRESS in the goal record, and AMOUNT in the phantom structure PLEDGE.RECORD.
All other options of the SET ELEMENTS command work with phantom structures too. You can, for example, add or subtract phantom structures or individual elements within them from the set element list with the "SET ELEMENTS + elements" or "SET ELEMENTS -- elements" command. You may also use the "SET ELEMENTS ALL + elements" version of the command, to set all the non-virtual elements of the goal record (the ALL portion of the command) plus any additional virtual elements, including phantom structures or their elements.
If you specify a name that is both the name of an element in the main goal record and an element within the phantom structure, the element in the goal record will be retrieved. To retrieve the element in the phantom structure in such a case, precede the element name with the name of the phantom structure, followed by the "@" sign. For instance, to set the AMOUNT element in the PLEDGE.RECORD phantom structure if there is an AMOUNT element in the goal record as well:
-> set elements pledge.record@amount
Alternatively, of course, you might find that the AMOUNT element in the phantom structure has a unique alias that does not match any in the goal record, which you may use instead of the primary mnemonic.
Records in a search result or stack may be arranged in order by the values of elements within phantom structures, using the SEQUENCE command, e.g.:
-> sequence phantom1 phantom2(d)
The list of element names following the SEQUENCE command verb may contain phantom elements.
Phantom elements may also be used as sorting criteria when the SPISORT procedure is used. That is, phantom elements may be named when you create a SPISORT input file with the DEFINE SET command, e.g.:
-> for result +> define set xyz elements = phantom1 phantom2
A phantom structure or one of its phantom elements may be named in the $ELEMTEST function. Besides the value returned from the function, the system variable $ELEMID will also be set, which can in turn be used, for example, in the $ELIDTEST function. In other words, $ELEMID can exist for phantom structures and their elements.
Formats can also take advantage of phantom structures. Unlike other subgoal processing in formats, phantom-structure subgoal processing does not require you to retrieve the pointer or key and establish it in a VALUE statement in the calling label-group. Instead, you basically treat the phantom structure as you would any other structure.
Complete coding details are provided in the manual "SPIRES Formats".
For the system format $REPORT, individual elements within a phantom structure may be named for display. The same rules discussed earlier under "The SET ELEMENTS Command" apply here when an element name in the phantom structure matches the name of an element in the goal record.
You can also create a phantom structure using a variant on the DEFINE ELEMENT command. The phantom structure is not permanently defined, as it is when you create it in the file definition, but it does last as long as your current subfile is selected.
Here's a situation where dynamic phantom structures are handy. Suppose you are directly examining index records for a subfile:
-> attach 3 of almanac
-> for subfile
+> display
DATE = 05/17/1050;
POINTER = 00002912;
+>
You could next connect to the actual goal record of the subfile identified by the POINTER element by dynamically defining the goal record as a phantom structure:
+> define element phantom.link subfile almanac as @pointer
+> set elements date pointer phantom.link
+> dis *
DATE = 05/17/1050;
POINTER = 00002912;
PHANTOM.LINK;
   ENTRY = 1383;
   DATE = Fri May 17, 1050;
   NAME = Guido-d'Arezzo;
   CLAIM-TO-FAME = famous reformer of musical notation;
   WHAT = Day of Death;
+>
This could be useful, say, in a debugging situation where you need to compare element values in a goal record with what is actually indexed for them. But there are as many reasons for using them as there are for normal phantom structures.
The main advantage of dynamic phantom structures is that you do not need to change and recompile the file definition to create them. That means that people who can't change the file definition can create them for themselves, and even those who can change the file definition can create a phantom structure quickly for one-time use without making permanent changes to the file definition.
To create a dynamic phantom structure, first select the subfile or attach the record-type that has the primary set of records you want to use. Next, issue a DEFINE ELEMENT command, following the special syntax below, that identifies the subgoal or alternate subfile that you want to use as the phantom structure and establishes the linkage to it:
DEFINE ELEMENT elemname {SUBFILE subfile-name|SUBGOAL rec-type} ...
   ... [VIA via-type] [FOR primary] [TYPE type] AS expression
where:
- LOCATOR or LCTR -- the value returned by the expression will be the retrieved goal record's locator in its internal, 4-byte hex form.
- EXTKEY -- the value returned by the expression will be the key of the retrieved goal record, in its external form, i.e., SPIRES will need to run the value through the key's Inproc rules to fetch the record.
- INTKEY -- the value returned will be the key of the goal record, in its internal form. (See TYPE below.)
- TRANS -- In this special case, for retrieving transaction records from the $TRANSACTIONS record-type, the expression's value is ignored; however, the primary element must occur for the phantom structure to occur, so usually you would choose the key of the current goal record as the primary element. (In other words, the expression is, typically, simply "AS @primary", where "primary" is the name of the current goal record key.) The subgoal would be the name of the pseudo-record-type that holds $TRANSACTIONS; the subfile, if you use that instead, would be the name of the subfile whose goal record-type is defined as $TRANSACTIONS. This dynamic phantom structure is then available only under FOR TRANSACTIONS. See the SPIRES manual "File Management" for more information about this feature; online, EXPLAIN $TRANSACTIONS RECORD-TYPE, USING.
Once you have used the DEFINE ELEMENT command to establish the phantom structure, you can use the structure in the same ways you would use a compiled one. [See 23.2.]
A "temporary file" in SPIRES is a file with a single record-type whose data is stored only for the duration of the current SPIRES session. It may be useful as a way for applications to store data about the current session for use during it. Or it may be useful for building smaller subsets of a subfile's goal records for faster manipulation of just the data you need. Or it may be useful for creating utilities that take output from SPIRES or non-SPIRES commands, store the data as individual records, and re-interpret the data using SPIRES tools such as the $REPORT format.
You define a temporary SPIRES file simply by creating a goal record definition, which you add to the RECDEF subfile and compile. See the manual "SPIRES File Definition", chapter C.7.1 for information about RECDEF records; online, [EXPLAIN RECDEF.] It may have all the features of any record definition, except that the file will have no other record-types (meaning, among other things, that the file will have no indexes), so it can contain no same-file subgoal links. [Well, all right, that was a slight exaggeration; in fact, the record definition for the temporary file can be defined with a phantom structure that subgoals to itself. You define the phantom structure like this: "VIRTUAL; ELEMENT = phantom.name; REDEFINES elem.name; TYPE = STRUCTURE; STRUCTURE = phantom.name; SUBGOAL = TEMPF;" The important part is the SUBGOAL = TEMPF statement, which tells SPIRES the phantom structure will subgoal to the current temporary file.]
Once the RECDEF record is compiled, you may name it in either a SELECT command or an ATTACH TEMPORARY FILE command:
SELECT @ORV.gg.uuu.recdef
ATTACH TEMPORARY FILE ORV.gg.uuu.recdef
where "gg.uuu.recdef" is the RECDEF record key. (If it is your own RECDEF, you may replace "ORV.gg.uuu." with a period or asterisk.) As a shortcut, ATTACH TFILE works for ATTACH TEMPORARY FILE.
As soon as it is selected or attached, you may start putting data into your temporary file. You may batch data in, optionally using the WITH FAST option on INPUT BATCH; you may use input formats; you may add the records one by one; you may transfer and update records already in it; you may remove records or dequeue them. You can do just about anything you would normally do to update a SPIRES subfile.
And once the records are in your temporary file, you may use any non-index-based search and display commands and techniques to work with the records, including: SEQUENCE and SPISORT; dynamic and declared elements; dynamic phantom structures; the $REPORT and $PROMPT formats; Global FOR; Partial FOR; etc.
But the entire temporary file vanishes as soon as you leave SPIRES or CLEAR SELECT or select a different subfile. It cannot be recovered.
You may create the temporary file within a path when you have another subfile selected as the primary. Both the SELECT and the ATTACH TEMPORARY FILE commands may include the THROUGH prefix. Again, however, if you clear the primary subfile or if you explicitly clear the temporary file's path, the file's contents will be discarded.
You may also retrieve data from a temporary file by naming it as a subfile subgoal. For example, you can use the $LOOKSUBF function to retrieve data from the temporary subfile when you are working with another subfile:
$LOOKSUBF('9382958',replace,'@*tempidfile',name,extkey)
The above function would look in the temporary file TEMPIDFILE for the record whose external key is 9382958 and return the value of the NAME element found there.
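In spirit, $LOOKSUBF is a keyed lookup that returns one element's value, with a fallback when the record or element is absent. A loose Python sketch follows; look_subf and the sample data are invented, and the real function's argument handling is considerably richer:

```python
# A stand-in for the temporary file TEMPIDFILE, keyed by external key.
tempidfile = {"9382958": {"NAME": "A. Student", "DEPT": "Music"}}

def look_subf(extkey, default, subfile, element):
    """Loose analogy of $LOOKSUBF: return one element's value from the
    record with the given external key, or a fallback value when the
    record or the element is missing."""
    record = subfile.get(extkey, {})
    return record.get(element, default)

value = look_subf("9382958", "replace", tempidfile, "NAME")
```

The fallback argument here only gestures at the second parameter of the real function; the essential behavior is the fetch-one-element-by-key pattern.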
You may also compile format definitions against a temporary file's record definition. To do that, you need to tell the compiler that the format is for a temporary file (whose characteristics, which the compiler needs, are stored in a different place from a normal file's characteristics), by using the FORMAT-OPTIONS statement in the format definition:
FORMAT-OPTIONS = TEMPFILE;
Then you name the key of the RECDEF record in the FILE statement and any value in the RECORD statement of the format:
FILE = *MYTEMPFILE;
RECORD = REC01;   - any value
The MAXVAL setting for temporary files is 4K; as of this writing, there is no way to specify an alternative value.
If you are loading records from another SPIRES subfile into a temporary file, a good method to use is the INPUT LOAD procedure, described in section 2.3.3a of the manual "SPIRES File Management". Online, [EXPLAIN INPUT LOAD PROCEDURE.]
To connect to a temporary file via a phantom structure, be sure to type the name with the "@ORV.gg.uuu.name" form in the phantom element's SUBFILE statement, e.g.:
STRUCTURE = BROWSE.PHANTOM;
  PHANTOM;
  SUBFILE = @ORV.GH.OST.STORIES;
As an alternative to creating and attaching a temporary file by means of a RECDEF record, it is also possible to create them by means of DECLARE FILE through ENDDECLARE. The following sample commands show examples of this facility.
-? declare file
KEY = ID;
ELEMENT = A;
ELEM = B; ALIAS = ELEMB;
endeclare
-Record definition compiled
-? thru next declare file
KEY = EXTID;
ELEM = X;
ELEM = S; TYPE STRUCTURE;
STRUCTURE S;
KEY = T;
ELEM = INT; INPROC = $INT; OUTPROC = $INT.OUT;
endeclare
-Record definition compiled
-Path established: 1
-? thru 1 show elem char

 Record .TEMPF of TEMPORARY

 Sec Occ  Len Type   St/El Element
 --- ---- --- ------ ----- -------
 Req Sing     String 00/00 key EXTID
 Opt Mult     String 00/01     X
 Opt Mult     Struc  00/02     S
 Req Sing     String 01/00 key . T
 Opt Mult     Int    01/01     . INT

-? show subfile information

 Information for Attached Rec-type .TEMPF of TEMPORARY

 Goal Record Information
 Type      Subfile-Name   File-Name   Record Format
 Primary   TEMPORARY      .TEMPF
 Path 1    TEMPORARY      .TEMPF

 Path Information
 Path  Name  Type      Value
 1           Attached  .TEMPF

-? with data add; ID = Record.Id; A = Element.A; B = Element.B;
-? set format $report elem = id, a, b
-? for subfile
-? display all

 May 28, 1992                                                Page 1

 id                   a                    b
 -------------------  -------------------  -------------------
 Record.Id            Element.A            Element.B
As you can see in the above example, you can define a file dynamically with what is essentially a RECDEF definition within the DECLARE / ENDDECLARE pair. SPIRES reads the enclosed data, compiles the file, and leaves it attached either as a primary file or through a path. An ENDDECLARE (abbreviation ENDE) statement must be given to conclude the process.
You can have one DECLARE FILE assigned to any one path, including the primary path, as shown above. DECLARE FILE creates a temporary file which acts like the DEFQ of a normal SPIRES file. [See 24.]
You can use a DECLARE SUBFILE statement to create a dynamic subfile, in the same manner as with the DECLARE FILE statement. [See 24.1.]
The following gives an example of this construct.
Declare Subfile SUBNAME
KEY ID;
ELEM NAME; OCC 1;
ELEM CITY; OCC 1;
ELEM STATE; OCC 1;
Enddeclare

show subfile info

 Information for Selected Subfile @GG.UUU.SUBNAME

 Goal Record Information
 Type      Subfile-Name      File-Name        Record Format
 Primary   @GG.UUU.SUBNAME   GG.UUU.SUBNAME   .TEMPF
Note that the construction of the subfile name is in the same form as that shown in section 24 (@gg.uuu.name), and the name portion, SUBNAME, must meet all the requirements of an element name (16 characters or less, etc.). [See 24.]
The WITH DATA option gives you the opportunity to provide input data to SPIRES in the standard SPIRES format without having to put the data into your active file. In one case, for example, the amount of data you have for merging into a record is just a single element value, and you'd like to save yourself the bother of creating an active file just for the one element. In another case, you are writing a protocol to add some small records to a subfile, but writing an input format or trying to incorporate proper active file control would be extremely tedious.
For those situations or similar problems, you may be able to use the WITH DATA prefix, including the data you want to input on the record-input command (such as ADD or MERGE), following a semicolon.
Here is how the WITH DATA prefix is used:
WITH DATA input-command; elem = value; [elem = value; ...]
where the "input-command" is ADD, UPDATE, MERGE, ADDUPDATE or ADDMERGE, with any other options it may have. Under Global FOR, the command may be MERGE. Under Partial FOR, the command may be ADD, UPDATE or MERGE. After the command comes a semicolon, followed by the input data, in standard SPIRES format. For full details about the standard format, see the manual "SPIRES Searching and Updating"; online, [EXPLAIN STANDARD FORMAT.]
Here are three sample commands using WITH DATA:
WITH DATA MERGE ALL; STATUS = SOLD; SALESPERSON = JNK;

WITH DATA ADD; TEXT = No report today.;

/WITH DATA MERGE REST; REVIEW.DATE = $DATE;
Notice that the data, as it always does in standard format, ends with a semicolon.
Note too that variable substitution, as shown in the last example, may be used. Keep in mind, however, that if you do use the "/" to request variable substitution, any time you want the literal characters "$" or "#", you should double them.
You are of course limited to the maximum command length, which is 160 characters under ORVYL control. However, in a protocol, using continuation lines, you can extend the data into multiple lines:
WITH DATA ADDUPDATE; ID = 09999822; \
   NAME = William Goal; \
   EMAIL.ADDRESS = BGoal@SPITUI;
Leading blanks on each continuation line are stripped off. The maximum length of the input data is 4090 characters.
Any input format that is set is ignored when a command is issued with the WITH DATA option.
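Conceptually, the data following the semicolon is ordinary standard-format input: a run of "elem = value;" clauses. The flat case can be sketched in a few lines of Python; parse_standard is a hypothetical helper, and real standard-format input also covers structures, continuation lines, and the doubled "$" and "#" characters mentioned above:

```python
def parse_standard(data):
    """Minimal sketch of flat standard-format input: split the text
    into 'elem = value' clauses on semicolons, then split each clause
    on its first '=' into an element name and a value."""
    pairs = []
    for clause in data.split(";"):
        clause = clause.strip()
        if not clause:          # the trailing semicolon leaves an empty clause
            continue
        elem, _, value = clause.partition("=")
        pairs.append((elem.strip(), value.strip()))
    return pairs

pairs = parse_standard("STATUS = SOLD; SALESPERSON = JNK;")
```

Note how the required trailing semicolon simply produces a final empty clause, which the parser ignores.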
DECLARE EXTERNAL DATA allows you to process data in external files using device areas, formats, and built-in commands. You do this with records in the $EXTERNAL file, or by means of DECLARE / ENDDECLARE as shown below: [See 8.2.]
DECLARE EXTERNAL DATA
FORMAT = External-format-name;
AREA = External-data-region;
COMMAND = What-to-do;
....   (other options)
ENDECLARE

where
- External-format-name -- the name of an INPUT format which will be used to read the external file's data.

- External-data-region -- the name of a SPIRES Device AREA that you have defined to hold the external file data.
Other input options include PARMS and TRACE which allow you to send PARMS to the format, or watch the input format action with TRACE (it traces ALL).
It is probably easiest to show an example of external file processing.
-? use FILEX.TXT
-? ..list
ID = Record.1; NAME = Bill; CITY = Sunnyvale;;
ID = Record.2; NAME = Jill; CITY = Mountain View;;
ID = Record.3; NAME = John; CITY = Sacramento;;
ID = Record.4; NAME = Jack; CITY = Mount Shasta;;
-? declare external file
KEY = ID;
ELEMENT = NAME;
ELEM = CITY;
endeclare
-Record definition compiled
-? show elem char

 Record TEMPF of EXTERNAL

 Sec Occ  Len Type   St/El Element
 --- ---- --- ------ ----- -------
 Req Sing     String 00/00 key ID
 Opt Mult     String 00/01     NAME
 Opt Mult     String 00/02     CITY

-? define area areax(1,80) on file
-? assign area areax to os file FILEX.TXT input
-? declare external data
format = $input;
area = areax;
command = scan;
endeclare
-? set format $report id name city
-? for subfile where city string mount
-? display all

 May 28, 1992                                                Page 1

 id                   name                 city
 -------------------  -------------------  -------------------
 Record.2             Jill                 Mountain View
 Record.4             Jack                 Mount Shasta

-? endf
-End of global FOR
-? close area areax
-? clear select
As you can see, SPIRES was able to input the data in FILEX.TXT (it could have been the ACTIVE file) using the $INPUT format (which reads standard SPIRES format) by means of the DECLARE EXTERNAL DATA / ENDDECLARE command (scan) when the "display all" command occurred. You can use your own INPUT format to retrieve data from a device area you have defined.
Note that nothing actually goes into the external file. The external file only acts as a conduit for mapping the external data. Processing of external data occurs when you issue some command against the external file, such as the "display all" in the example above. That's why the assigned area (areax) isn't closed until processing is done.
The example above shows command lines (-?) and DECLARE / ENDDECLARE definitions, all of which must be in a protocol for DECLARE / ENDDECLARE to be processed. The $EXTERNAL record-definition (in RECDEF) is used to process DECLARE EXTERNAL DATA.
You could also use path processing, SET DECLARE PATH, and WITH DECLARE commands directly at the terminal, or in a protocol, to get the same effect. [See 6.2.] In that case, you would use your own EXTERNAL DATA definitions in the EXTERNAL subfile.
-? thru ext select EXTERNAL
-? set declare path ext for external data
-? with declare *mydef declare external data
The three commands above would replace the DECLARE EXTERNAL DATA / ENDDECLARE in the original example. Your *mydef record in the EXTERNAL subfile defines the format, area, and command.
The 1990's version of DEFINE TABLE packages a rather complicated technique (delivering multiple tables of output from a single pass over SPIRES hierarchical records) in a relatively simple tool. This chapter describes the packaging at the SPIRES command level, with the use of several new SPIRES commands.
Those commands are:
The syntax of the DEFINE TABLE command is:
DEFINE TABLE name [ON FILE {dsname [(file-options)]|ACTIVE}] ...
   ... FOR [EACH] primary [(table options)] AS element(parms)
where:
- REPLACE (or REP) -- indicates SPIRES can replace the named data set if it already exists. If this is not specified but the data set exists, SPIRES will prompt you "OK to replace?" when you issue the GENERATE TABLE command.
- TEMPORARY (or TEM) -- indicates SPIRES should save the data set on a temporary (scratch) volume.
- TRK=n -- indicates the number of tracks for the primary allocation; the default is 10 tracks.
- TYPE=DELIM|FIXED|BTF|SQL
- DELIM is variable column width separated by a delimiter character (tab is the default delimiter). The first row contains the table name (or title), the second row contains the field names, and successive rows contain data. This is the default type.
- FIXED is fixed-column, non-delimited. Column size is derived from the WIDTH "as element" parm, or from eleminfo as $REPORT uses it, or, in the absence of both, by dividing the fields evenly across the available space. The first row contains field names (without underscores); successive rows contain data.
- BTF is Wylbur's Basic Table Format used in the Forsythe RPC Server. It includes the 100, 101, 103, and 109 lines.
- SQL means SPIRES will generate a "table" of SQL commands, either INSERTs or DELETEs, depending on whether SQL is followed by ADD or REM, as in "TYPE=SQL,ADD". See the information on TYPE=SQL at the end of this section.
- HEADING|NOHEADING -- This option controls the field names occurring in the first row for TYPE=DELIM or FIXED. HEADING is the default. This option has no effect when type is BTF. Field names are derived from the "as element list", not from eleminfo.
- DELIMITER='character' or X'character'. This option changes the default delimiter in types DELIM and BTF. It adds a delimiter character to type FIXED. "X" indicates hex.
- TITLE='string'. This option is used as the name of the table in the first line. If this option is not given, the table "name" is used. If TYPE=SQL, the TITLE value is used as the SQL table name in the INSERT or DELETE statements created; again, if no TITLE option is specified, the table "name" is used.
- EOR='character' or X'character'. This option establishes an end-of-record character to be printed at the end of each row of the table. For style BTF, the EOR character is ";". For other styles, the default is null. "X" indicates hex.
- WIDTH=n. As a table option, this sets the maximum width of each row of the table. The maximum width is 32000 columns, and the aggregate total of the widths of all tables to be generated at one time is 32000. If WIDTH is not specified, the total available space (32000 minus the sum of the widths specified for any other tables) is divided among the tables without an explicit width.
- WIDTH=n. As an element parm, this sets the width of the column for type FIXED and the maximum value length for types DELIM and BTF. If this parm is not specified, the width from eleminfo is used when it is available.
- DEFAULT='value'. Sets a default value if the element does not occur in the SPIRES record.
- BUILD='string' or BUILD=X'string'. This parm concatenates multiple occurrences of the element together; 'string' specifies the separator character(s) to be placed between occurrences and may be up to two bytes in length. X'string' specifies the separator in hex. For example, BUILD=', ' joins multiple occurrences into one value, the occurrences separated by a comma and a space. BUILD without a 'string' concatenates multiple occurrences with no separator character.
- COLUMN='column-name'. This parm names the column; if not specified, this defaults to the element name as it appears in the DEFINE TABLE element list.
- TYPE=type, where "type" is NUMERIC (or NUM). This is useful only when TYPE=SQL has been specified as a table option. By default, SPIRES will treat the value as a string, putting it in quotes; with TYPE=NUMERIC, it will omit them, which is significant for numeric values in SQL.
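To make the TYPE=DELIM layout concrete, here is a minimal sketch (in Python, not SPIRES code) of the row structure described above: a title row, a field-name row, then one delimited data row per record, with a DEFAULT-style fallback for missing elements. The function name `generate_delim_table` and the dictionary representation of records are illustrative assumptions, not part of SPIRES.

```python
# Illustrative sketch only: mimics the TYPE=DELIM table layout
# (title row, field-name row, data rows; tab is the default delimiter).

def generate_delim_table(title, fields, records, delimiter="\t"):
    """Render records as a DELIM-style table string."""
    lines = [title, delimiter.join(fields)]
    for rec in records:
        # DEFAULT='value' analogue: empty string when an element is absent
        lines.append(delimiter.join(str(rec.get(f, "")) for f in fields))
    return "\n".join(lines)
```

Changing the `delimiter` argument corresponds to the DELIMITER='character' table option.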
The following example demonstrates the use of the DEFINE TABLE and GENERATE TABLES [See 27.2.] commands:
-> select reqs
-> restore stack.table.example
-Stack: 3 RECORDS
-> type reqno requestor po.number order.vname item.number po.quantity po.unit
REQNO = M22747;
REQUESTOR = SAM SMITH;
PO.NUMBER = M227470;
ORDER.VNAME = SACRAMENTO MEDICAL FOUNDATION;
REQNO = SA7426;
REQUESTOR = JUDY JONES;
PO.NUMBER = SA74260;
ORDER.VNAME = BOEHRINGER-MANNHEIM BIOCHEMICAL;
ITEM.NUMBER = 1;  PO.QUANTITY = 1;  PO.UNIT = EA;
ITEM.NUMBER = 2;  PO.QUANTITY = 1;  PO.UNIT = EA;
REQNO = RT9621;
REQUESTOR = SAM JONES;
PO.NUMBER = RT96210;
ORDER.VNAME = STANFORD BOOKSTORE;
ITEM.NUMBER = 1;  PO.QUANTITY = 3;  PO.UNIT = EA;
ITEM.NUMBER = 2;  PO.QUANTITY = 1;  PO.UNIT = EA;
-> define table order for each po.number (type=BTF) as PO.Number Rqstr Vend
-> define table items for each item.number (type=BTF) as PO.Number Item.Number PO.Quantity PO.Unit
-> for stack
-> in active generate tables (order,items) all
-> list
100|DATA|ORDER;
101|PO.Number|Rqstr|Vend;
103|7|20|30;
|M227470|SAM SMITH|SACRAMENTO MEDICAL FOUNDATION;
|SA74260|JUDY JONES|BOEHRINGER-MANNHEIM BIOCHEMICAL;
|RT96210|SAM JONES|STANFORD BOOKSTORE;
109|End of table;
100|DATA|ITEMS;
101|PO.Number|Item.Number|PO.Quantity|PO.Unit;
103|7|3|3;
|SA74260|1|1|EA;
|SA74260|2|1|EA;
|RT96210|1|3|EA;
|RT96210|2|1|EA;
109|End of table;
You can use DEFINE TABLE to create SQL INSERT INTO and DELETE FROM statements. To do so, you include the TYPE=SQL table option in the DEFINE TABLE command, following it with either ADD or REMOVE, depending on whether you want to create INSERT INTO or DELETE FROM commands. If you want to do both, because you are doing updates, create two separate tables with separate DEFINE TABLE commands.
You may also want to use the TYPE=NUMERIC element option (that is, you specify it after naming a numeric element); this means SPIRES will not put quotation marks around the numeric element value, which it does by default for non-"TYPE=NUMERIC" elements.
Here is an example of DEFINE TABLE using the TYPE=SQL option:
-> select restaurants
-> define table test1 (type=SQL,REM,title='restaurant_name') for id as id(type=num,col='r_id')
-> define table test2 (type=SQL,ADD,title='restaurant_name') for id as id(type=num) name
-> for subfile
-> in active clear generate table 3
The above commands generate the following output:
delete from restaurant_name where r_id = 3
go
delete from restaurant_name where r_id = 4
go
delete from restaurant_name where r_id = 5
go
insert into restaurant_name values(3,"Gatehouse Restaurant")
go
insert into restaurant_name values(4,"Borel's Restaurant")
go
insert into restaurant_name values(5,"The Oasis Beer Garden")
go
The DELETE statements are created from the SQL,REM table and the INSERT statements are created from the SQL,ADD table. The relational table name "restaurant_name" comes from the TITLE option. The "r_id" column name in the where clause of the delete statement comes from the COL='r_id' element parm. Note that the numeric restaurant id number is not quoted in either the DELETE or INSERT statements, due to the use of the TYPE=NUM element parm. Each INSERT or DELETE statement is followed by a "go" statement so that the DEFINE TABLE output can be used as an input script for ISQL.
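The statement-generation rules above can be sketched in a few lines of Python. This is an illustration of the described behavior, not the SPIRES implementation: the function name `sql_script`, the `numeric` tuple (standing in for TYPE=NUMERIC element parms), and the dictionary row representation are all assumptions for the example.

```python
# Illustrative sketch of the TYPE=SQL output rules: string values are
# quoted, TYPE=NUMERIC values are not, and every statement is followed
# by "go" so the output can be fed to ISQL as a script.

def sql_script(table, rows, mode, numeric=(), key=None):
    """mode='ADD' emits INSERT INTO; mode='REM' emits DELETE FROM on 'key'."""
    out = []
    for row in rows:
        if mode == "ADD":
            vals = ",".join(str(v) if col in numeric else '"%s"' % v
                            for col, v in row.items())
            out.append("insert into %s values(%s)" % (table, vals))
        else:  # REM: delete by the named key column
            val = str(row[key]) if key in numeric else '"%s"' % row[key]
            out.append("delete from %s where %s = %s" % (table, key, val))
        out.append("go")
    return "\n".join(out)
```

For updates, the manual's advice corresponds to calling this twice: once with mode='REM' and once with mode='ADD'.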
The ON FILE option lets you write tables directly to OS files instead of the WYLBUR active file. This is useful in circumventing the 235-characters-per-line limit of WYLBUR data sets.
You can send some tables to OS files and one or more tables to your active file at the same time. For OS files, each table must be assigned to a separate file. If you accidentally assign multiple tables to one file, the last table defined is the one that appears in the file. However, if you assign multiple tables to the active file by using the ON FILE ACTIVE option multiple times, they will all appear in the active file, one after the other.
Note: if you are assigning one or more tables to OS files, do not add the IN ACTIVE option to the GENERATE TABLES command; that will override the ON FILE clause, causing all tables to be generated in the active file.
Below is an example of DEFINE TABLE commands used to define one table in an OS file and another in the active file:
-> select restaurants
-> define table city on file r.cities (temp) for id as id city name
-> define table name on file active for id as id name
-> for subfile
+> generate tables 10
+>
The CITY table is placed in the OS data set called R.CITIES, on a scratch volume. The NAME table is placed in the active file even though the GENERATE TABLES command did not carry the IN ACTIVE prefix; had that prefix been included, both tables would have been placed in the active file, which is not what was intended.
The OS data set is not allocated until the GENERATE TABLES command is issued.
The tables are actually created by the GENERATE TABLES command, which is issued under Global FOR. The syntax of the GENERATE TABLES command is:
[IN ACTIVE [CLEAR|CONTINUE|KEEP]] GENERATE TABLES [(tablenames)] [FIRST|*|NEXT|n|LAST|REST|ALL]
The syntax of the CLEAR TABLES command is:
CLEAR TABLES [name]
This command is used to eliminate the definition of a previously defined table (specified as "name") or all the currently defined tables if no single table is named.
The syntax of the SHOW TABLES command is:
[IN ACTIVE [CLEAR|CONTINUE|KEEP]] SHOW TABLES [ALL]
This command is used to display the definitions of all tables currently defined. Use the IN ACTIVE command with its standard options to put the table definitions into your active file.
The ALL option shows a full listing of all tables currently defined, including tables defined on other paths.
The PERFORM commands are system utilities designed to do a variety of tasks frequently needed by the SPIRES user. Because they are system protocols, they are able to use all the facilities of SPIRES and WYLBUR to do such things as print listings of source code and documentation, and create "standard" file definitions for commonly used applications, such as protocol files. The following PERFORM commands are currently available:
- PERFORM PRINT -- Provides formatted source code listings of file and record definitions (including BACKFILE and BACKRECS records), formats, vgroups, and protocols.
- PERFORM PUBLISH -- Provides printed versions of SPIRES documentation.
- PERFORM BUILD -- Creates file definitions for standard files. Currently only the PROTOCOLS file definition is available.
- PERFORM FILEDEF SUMMARY -- Displays a summary of a file definition record, listing the record-types, linkage information, and subfile information.
- PERFORM FORMAT LIST -- Displays a list of all the compiled formats written for a file, including the names of their source records.
- PERFORM DATA MOVE -- Move data from a subfile to various targets.
- PERFORM TABLE CREATE -- Create Table definitions.
- PERFORM USER LIST -- Displays a list of accounts that may have subfiles of a file currently selected.
All but the last three are described in this chapter; the last three are described in the "SPIRES File Management" manual. Online, issue the EXPLAIN command, followed by the specific PERFORM command, for more information, or issue "PERFORM ?" to get a list.
Note that the PERFORM command can have a sub-system option immediately following the word PERFORM. These options are: PROduction, PREprod, TESt, or DEVelopment. The default sub-system is PROduction. These sub-system options cause P,N,T,D versions of other macros to be executed. For example: PERFORM DEV TABLE CREATE ... will execute the .DPERFORM.TABLE.CREATE macro instead of the normal .PPERFORM.TABLE.CREATE version.
The PERFORM PRINT command creates source code listings formatted such that the structure of the code is easily discernible, thus making the listing easier to work with. By default, the PERFORM PRINT command prints listings on GBAR forms, with selected lines in boldfaced type. However, you can change the default formatting if, for instance, you want the listing to go to the self-service printer or a local printer, or you want different paper or character sets. If there are multiple records being printed (ALL, RESULT or STACK option), each new record will begin on a new sheet of paper.
You must select the file containing the source code before issuing the PERFORM PRINT command to obtain a listing. The files that may be selected when the PERFORM PRINT command is issued are:
FILEDEF
FORMATS
RECDEF
BACKFILE
BACKRECS
VGROUPS
SYS PROTO
protocols files
For protocol files, it is assumed that the file was generated by one of the system utilities (i.e., ..BUILD.PROTOCOLS.FILE or PERFORM BUILD PROTOCOL). If not, the PERFORM PRINT command may not work properly for these files. [See 28.1.3.]
The syntax of the PERFORM PRINT command is:
PERFORM PRINT [(parameters)] [record.key|ALL|RESULT|STACK] [print options]
The parameter list is optional (though as shown, it must be enclosed in parentheses) and provides a way of explicitly altering the default print parameters or invoking other options. You may include as many of the possible parameters as you wish, separated by commas, and in any order. The possible parameters are:
- LENGTH=n -- This parameter specifies a line length for the listing. If this parameter is not included, the default line length is determined by the $LENGTH value, which is normally 72, but can be changed by issuing a SET LENGTH command before the PERFORM PRINT command if you want to print several listings with the same length value. Note that the default line length is different for protocol files. [See 28.1.3.]
- [NO]BOLD -- BOLD is the default. NOBOLD suppresses the boldfacing of selected lines of the listing.
- [NO]EJECT -- EJECT is the default. NOEJECT suppresses page ejects between records in a multiple record listing. This conserves pages, particularly when each record is short.
- [NO]IMPACT -- NOIMPACT is the default. IMPACT generates "overstrike" lines for impact printers to boldface selected lines of the listing.
- VGROUP -- For format definitions only. The definition(s) of any global vgroups named in ALLOCATE statements in the format definition(s) will be included in the listing.
- XREF -- For format and vgroup definitions only. For vgroup definitions, this option generates an alphabetical list of all variables defined and all variables referenced (e.g., in an INDEXED-BY statement). For format definitions, this option generates cross-reference listings for each format definition, including a listing of the variables, labels and frames used, with the locations of their primary references (e.g., the location of a label group) and of secondary references (e.g., JUMP or XEQ PROC Uprocs that name the label group). If multiple format definitions are being printed, a table of the definitions will also be included. If the VGROUP option is included, then a cross-reference table of vgroup usage will also be printed.
The only required option of the PERFORM PRINT command is the specification of the record(s) you want printed. You may print a single record by specifying the record key, all the records in a RESULT or STACK, or ALL the records available to your account. A record-key value of ALL, STACK or RESULT (or an abbreviation) must be enclosed in quotes or apostrophes so that it will not be confused with these options. Multiple word keys (as some protocols have) must also be enclosed in quotes or apostrophes.
Print options are any valid WYLBUR print options (COPIES, BIN, etc.). Basic print defaults are FORMS GBAR and CHARS(TN12,BD12). You may also specify DUPLEX if the output is going to a printer that prints on both sides of the sheet of paper; the output will be printed on both sides, with each new record starting on an odd-numbered page.
The PERFORM PRINT command places the listing at the end of the active file and removes it after submitting the print job, so you will not be prompted "Ok to Clear?". If you add the IN ACTIVE prefix to the PERFORM PRINT command, the listing will be placed in your active file so that you may save it for future reference. In this case, you must then issue a PRINT command to receive a printed listing.
You may encounter one of several error messages when using the PERFORM PRINT utility. Most of these messages are self-explanatory, such as "No subfile selected" or "Requested record does not exist".
However, if you receive the message "PRINT unsuccessful", it could mean a number of things. In some cases, you might have specified WYLBUR print options incorrectly, or SPIRES might have encountered a CORE EXHAUSTED condition. Sometimes, SPIRES cannot write data within the line length you have set. When you receive the "PRINT unsuccessful" message, you will also see the WYLBUR or SPIRES diagnostic message concerning the error. If you cannot interpret the diagnostic message, see your SPIRES consultant.
The PERFORM PRINT command can be used to print protocol listings from a private protocols file if that file was created with the PERFORM BUILD command or the BUILD.PROTOCOLS.FILE public protocol. [See 28.3.] These files use the $PROTOCOL format for input and display of protocol records; this format allows you to imbed formatting instructions in the protocol that can be used by the PERFORM PRINT command. These instructions are -$PAGE, -$FRONTPAGE, -$TITLE, -$FORMAT, -$AUTHOR, and -$SUBJECT. [See 28.3.3 for details about the $PROTOCOL format.]
The listings from a protocols file differ slightly from those from the other files described above. They will have a page header with the name of the protocol, the name of the file, and the date and time the listing was generated. If there is a $TITLE statement in the protocol, its value will also be included in the header. A page footer will be generated, including the protocol name and the page number.
The line length of the listing is determined by the printer and forms used. The line numbers, however, will remain accurate; that is, the line numbers of the listing will correspond with the line numbers of the protocol as transferred from the file, even though some lines might wrap to multiple lines of the printed listing.
A WYLBUR command, PFORMAT, can be used to cause protocols to be formatted attractively for easy reading. Put the protocol into your active file and issue the PFORMAT command. When you use the formatted active file as your input record, the $PROTOCOL format will retain the formatting.
WYLBUR will indent lines like this:
....v....1....v....2
* protocol.name
commands
- comments
++label
!  command
/command
*command
/*command
THEN command
ELSE command
IF $YES command
IF $NO command
IF $ELSE command
IF ~$ELSE command
When PERFORM PRINT is used to print multiple listings of protocols (i.e., with the ALL, RESULT, or STACK options), a table of contents is generated at the end of the listing.
The PERFORM PUBLISH command prints copies of SPIRES documents. It may or may not be available on the computer where you use SPIRES; contact your SPIRES consultant for further information. (At Stanford, for instance, it is not available; however, you may instead use the PUBLISH command to print documents there.)
The PERFORM BUILD command initiates the creation of a file definition for a predefined "standard" SPIRES application. Currently, only a protocols file can be built using this command, but future possibilities include a bibliographic file and a mail list file.
The syntax for the command is
PERFORM BUILD option
where "option" is a single term designating the kind of file you wish to generate. The command may be preceded by an IN ACTIVE prefix, in which case the file definition will be placed in your active file, but not added to FILEDEF or compiled.
See Section 28.3.2 for information about the structure of the file definition created by this utility.
The PERFORM BUILD PROTOCOLS command will generate a protocols file based on the standard definition described below in Section 28.3.2. After you issue the command, you will be given instructions and asked several questions about what you wish the file to be named and which accounts you wish to give access to the file. The protocol will generate the file definition, add it to FILEDEF, and compile it for you. Following is a sample session with instructions:
-> PERFORM BUILD PROTOCOLS
-Do you need instructions? yes

 This protocol will create a protocols file for you, under your
 account.  A File Definition will be added to the FILEDEF subfile.
 The file definition that is generated is a standard one based on
 a common record definition stored in the RECDEF subfile.  It is
 designed to be used with the system format $PROTOCOL and the
 PERFORM PRINT facility.

 You may provide your own subfile and file names for your protocols
 subfile.  If you want others to be able to use the subfile, you
 must list the individual accounts or account groups in response to
 the prompts below.  "PUBLIC" is also permitted as a response.

-What subfile name would you like (return=RLG PROTOCOLS)? RLG PROTOXX
-What file name (return=GQ.RLG.RLG.PROTOXX)? .PROTOXX

 Please list account numbers (other than your own) that should have
 update privileges to your protocols subfile.  Separate the account
 numbers by commas.  You must specify account numbers in the form
 GG.UUU if you wish to permit your subfile to an individual, or as
 G or GG if you wish to permit your subfile to a group of accounts.
 You will be asked separately for accounts to be allowed update
 privileges and for those which require read access only.

-Other accounts permitted to read and update? GQ.WCK, GG
-Accounts permitted to read only? GA
-Record GQ.RLG.PROTOXX added to FILEDEF
-COMPILE GQ.RLG.PROTOXX
-File definition compiled
The file definition for the protocols file created by the PERFORM BUILD PROTOCOLS command contains the following elements:
A single record definition for the goal record is kept as GG.SPI.PROTOCOL in the RECDEF file. This enables the SPIRES group to upgrade the protocols file definition without requiring every user to recompile their file definition.
The protocols file contains one goal record-type and two indexes -- MODDATE and SUBJECT. The former is a standard SPIRES date index, the latter is a word index built from the SUBJECT element. The searchterms are:
Simple Index:  K, KEY, KEYWORD, S, SUB, SUBJECT
Simple Index:  D, DATE, MD, MODDATE
A formats statement is included in the subfile section to set the system format $PROTOCOL whenever you use the file. [See 28.3.3.]
The $PROTOCOL format works with the protocols files created with the PERFORM BUILD PROTOCOLS command and those created by the older ..BUILD.PROTOCOLS.FILE public protocol. It handles all output (display and transfer) requests and all input (ADD, UPDATE, MERGE) requests. In addition, there are frames for multiple record processing under report mode when using the PERFORM PRINT utility to print several protocols. [See 28.1.3 for details about the PERFORM PRINT utility.]
There is one form of record display used for both display requests and transfers, and this form is also used for input. Leading blanks are not stripped from lines on input, so you can indent your commands as you please, or you can allow the PERFORM PRINT utility to do all the formatting for you.
The first line of the display starts with an asterisk, followed by the protocol name, then DEFDATE, MODDATE, MODTIME, and MODACCT in parentheses:
* PROTOCOL.NAME (99/99/99, Upd 99/99/99 at hh:mm by GG.UUU)
The remainder of the display consists of the protocol commands, one per line.
Imbedded formatting instructions may alter the way text is formatted by the PERFORM PRINT utility. A formatting instruction begins with '-$' or '- $' followed by a keyword. The keyword may be in upper or lower case and may be abbreviated to three characters. It may be followed by additional information; if no value is defined for the keyword, anything after it is stored but does not affect the display, and if the keyword takes an option, anything after the option is ignored.
For regular displays and transfers, the lines with formatting instructions are output simply as lines of data. Under the PERFORM PRINT utility, the leading '-$keyword' never prints, though data following might. The AUTHOR and SUBJECT statements always print.
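The recognition rules above (the '-$' or '- $' prefix, case-insensitive keywords, and three-character abbreviations) can be sketched as follows. This is a hypothetical illustration, not SPIRES code; the keyword list is taken from the $PROTOCOL instructions named earlier in this chapter.

```python
# Illustrative sketch of $PROTOCOL formatting-instruction recognition.
# Keywords come from the instructions listed for the $PROTOCOL format:
# PAGE, FRONTPAGE, TITLE, FORMAT, AUTHOR, SUBJECT.

KEYWORDS = ("PAGE", "FRONTPAGE", "TITLE", "FORMAT", "AUTHOR", "SUBJECT")

def parse_instruction(line):
    """Return (keyword, rest-of-line) or None if not an instruction."""
    for prefix in ("-$", "- $"):
        if line.startswith(prefix):
            body = line[len(prefix):]
            word, _, rest = body.partition(" ")
            for kw in KEYWORDS:
                # abbreviations of three or more characters are accepted
                if len(word) >= 3 and kw.startswith(word.upper()):
                    return kw, rest.strip()
            return None
    return None
```

Note that "FRO" and "FOR" remain unambiguous three-character abbreviations for FRONTPAGE and FORMAT.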
The formatting instructions are:
The "-$" line prefix also identifies two other special cases -- the SUBJECT and AUTHOR fields:
The PERFORM FILEDEF SUMMARY command produces a summary presentation of a file definition record, showing information about each of the three major sections of the file definition:
- record-type definitions,
- linkage section definitions, and
- subfile section definitions.
The command can be issued only by the file owner. Since the summary is of the file definition, rather than the file, this command can be used whether the file is compiled or not.
The syntax for the command is
PERFORM FILEDEF SUMMARY [filename]
where, if specified, "filename" designates the name of the file for which the summary information is desired. If "filename" is not specified, the file containing the currently selected subfile is assumed, if that file belongs to you. If "filename" is not specified and there is no subfile currently selected, you will be asked to supply a filename.
The information in the display includes the following:
- the number and name of each record-type defined for your file;
- the key of that record type;
- whether or not the record-type is a slot record-type; and
- the name of the RECDEF entry that describes the record-type, if applicable.
- each GOALREC-NAME for which there is a linkage defined,
- the indexes to which that goal record-type passes,
- whether immediate indexing is used on any index,
- the search terms for each index, and
- whether the $NOPASS proc is coded for any index (indicated as "[NP]").
- each SUBFILE-NAME for which there is a subfile defined, and
- the GOAL-RECORD, ACCOUNTS, and SECURE-SWITCHES for each.
Here is a simple example that demonstrates some of the command's features. Note that the sample display is condensed horizontally so that it fits the printed page better.
-> select blood donors
-> perform filedef summary

File: GQ.JNK.BLOOD.DONORS

   Record  Key            Defined-by
1. REC01   ID             [Slot]
2. ZIN02   NAME
3. ZIN03   CITY
4. ZIN04   CAN.BE.CALLED
5. ZIN05   BLOOD.TYPE
6. ZIN06   DATE
7. ZIN07   LOCATION
8. ZIN08   TOTAL.PINTS

Goalrec-Name  Index-Names  Immediate  Searchterms
REC01         ZIN02                   NAME, NAM, DONOR.NAME
              ZIN03                   CITY, CIT
              ZIN04                   CAN.BE.CALLED, CAN, CAL
              ZIN05                   BLOOD.TYPE, BLO, BLOOD
              ZIN06                   DATE, DAT, DATE.GIVEN
              ZIN07                   LOCATION, WHERE.GIVEN
              ZIN08                   TOTAL.PINTS, PINTS.GIVEN

Subfile-name  Goal-record  Accounts  Secure-switches
BLOOD DONORS  REC01        PUBLIC
The PERFORM FORMAT LIST command produces a list of all of the formats that have been written for a file. The information displayed contains the FORMAT-NAME, the record-type for which the format is written, the source record key of the format, and the date that the format was last compiled. The information this command displays can be very useful when the file owner needs to move or copy a file to another account or get rid of the file completely, since it shows what formats would need to be copied or "zapped".
Only the file owner can see all the formats; individual users will see the information for only those formats, if any, that belong to them; that is, only those formats whose definition record keys begin with the user's account.
The syntax for the command is
PERFORM FORMAT LIST [filename]
where, if specified, "filename" designates the name of the file for which the format list is desired. If "filename" is not specified, the file containing the currently selected subfile is assumed, if that file belongs to you. If "filename" is not specified and there is no subfile currently selected, you will be asked to supply a filename.
You may precede the command with the IN ACTIVE prefix if you want to direct the output to your active file.
Here is an example that demonstrates the PERFORM FORMAT LIST command. Note that the display here has been condensed horizontally so that it fits better on the printed page.
-> select blood donors
-> perform format list

Formats defined for file 'GQ.JNK.BLOOD.DONORS'

Format-Name    Record  Source-Id                     Date Compiled
INPUT          REC01   GQ.JNK.DONORS.INPUT           01/20/83
PHONE.LIST     REC01   GQ.JNK.DONORS.REPT            01/05/84
PROMPT.INPUT   REC01   GQ.DOC.DONORS.PROMPT          02/04/83
ADDRESS LABEL  REC01   GQ.JNK.BLOOD.DONORS.ADDRESS   02/05/83
Caution: The listed formats do not include general-file formats (those that have the GEN-FILE statement in their definitions), even if they are defined for the selected file.
The PERFORM SYSTEM PRINT command is a SPIRES interface for the original mainframe PRINT command. By handling PRINT in SPIRES, it is possible to transform mainframe PRINT into whatever equivalent facility exists on the platform where SPIRES is running. On the mainframe, PERFORM SYSTEM PRINT simply becomes QUIETLY TRY PRINT. On Unix-SPIRES, PRINT becomes PERFORM SYSTEM PRINT. This description is for Unix-SPIRES.
The syntax of this command is:
PERform SYStem PRInt options
In the option description below, the following terms are used:
Option words are shown in UPPER-lower case: the UPPER-case portion is required and the lower-case portion is optional. For example, DUPlex can be specified as DUPLEX, DUPLE, DUPL, or DUP, in any mixture of case, such as DuPleX.
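The UPPER-lower abbreviation rule can be sketched in a few lines. This is a hypothetical illustration of the rule as stated above, not the PERFORM SYSTEM PRINT parser; the function name `option_matches` is an assumption for the example.

```python
# Illustrative sketch of the UPPER-lower option rule: the upper-case
# portion of the spec is required, the lower-case tail is optional,
# and the user's input may be in any mixture of case.

def option_matches(spec, word):
    """True if 'word' is a valid abbreviation of option word 'spec'."""
    required = ""
    for ch in spec:              # leading upper-case run = required part
        if ch.isupper():
            required += ch
        else:
            break
    w = word.upper()
    return spec.upper().startswith(w) and len(w) >= len(required)
```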
The following options are recognized, and acted upon:
FROm <dataset>
CHArs <equal> <chars>
COPies <equal> <num>
TITle <equal> <string>
HEAder <equal> <string>
DEStination <equal> <string>
INDent <equal> <num>
Forms <equal> <word>
LANdscape
PORtrait
DUPlex
NODuplex
NOSeparator
INTeger
NUMbered
NONumbered
UNNumbered
NOCc
CC
TRC
POStscript or PS
NOPOStscript or NOPS
FILe <equal> <word>
HTMl
TRUncate
OPTion <equal> <string>
NODEBUG
DEBUG
COLOR
NOPCL
PCL = <string>
PCL
LPRINT
The following options, although recognized, are ignored:
EXEcute
MAIl <equal> <string>
DOCument
MEMo
LISt
NOList
NOText
HEX
MIXed
UNFormatted
COLumns <equal> <num> [/num]
MARker <equal> <string>
DOUble
TRIple
EJEct <equal> <num>
NOEject
BACk
DARk <equal> <num>
DARk
JOBname <equal> <word>
PREfix <equal> <string>
UPLow
FIChe
SYSout <equal> <word>
PRIority <equal> <word>
BLOck <equal> <word>
BIN <equal> <word>
FLAsh <equal> <word>
PFOrmat <equal> <word>
FCB <equal> <word>
ID <equal> <string>
TO <equal> <string>
MANual
FMAnual
HOLd
KEEp
MC
NOMc
QUIet
The following descriptions detail the options that are acted upon:
1 = page eject
0 = double space
- = triple space
+ = overprint
blank or any other character = normal line
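The carriage-control legend above can be sketched as a small interpreter. This is a hypothetical illustration of the legend, not SPIRES code; overprint ('+') is simplified here to replacing the previous line rather than merging the two, and the function name `apply_cc` is an assumption.

```python
# Illustrative sketch: interpret the carriage-control character in
# column 1 of each line (the CC option), per the legend above.

def apply_cc(lines):
    """Expand cc-prefixed lines into plain output lines."""
    out = []
    for line in lines:
        cc, text = line[:1], line[1:]
        if cc == "1":
            out.append("\f")        # page eject before the line
        elif cc == "0":
            out.append("")          # one blank line: double spacing
        elif cc == "-":
            out.extend(["", ""])    # two blank lines: triple spacing
        elif cc == "+":
            if out:
                out.pop()           # simplified overprint: replace prior line
        out.append(text)
    return out
```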
0  Courier
1  Courier-Bold
2  Courier-Oblique
3  Courier-BoldOblique
[See 28.6.1.]
OPT --highlight-bar-gray:gray, OPT -H4
OPT '--highlight-bar-gray:gray -H4'
The following is a CHARS-to-FONT translation table used on Unix SPIRES. The column of two-character codes on the left represents the first two characters of the CHARS code. CHARS codes can be three or four characters long and begin with one of these two-character codes. The last one or two characters of a CHARS code are either a pair of digits, such as 10, 12, or 15, or a single digit from 1 through 9; thus, the CHARS code TN8 is interpreted as TN08. The CHARS codes given in a PERFORM SYSTEM PRINT command are translated using this table. For example:
CHARS = TN12 becomes Courier12
CHARS = CU10 becomes Courier12
The FONT names are in the second column. They replace the first two characters of the CHARS code and are combined with a pointsize calculated from the last two characters of the CHARS code, becoming: FontnamePointsize. For all CHARS codes except those beginning with "C", nn already represents the Pointsize value. For "Cx" CHARS codes, nn specifies the number of characters per inch, which is converted to a Pointsize by the formula (121/nn). Thus, CU10 has a Pointsize of 12, but TN10 has a Pointsize of 10, even though both codes specify the "Courier" font.
NOTE: You may also supply a FONT name and size directly as the CHARS value, such as CHARS=Symbol9.
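The translation rule above can be sketched as follows. This is a hypothetical illustration, not the Unix SPIRES implementation: the dictionary holds only a small excerpt of the translation table, the function name `chars_to_font` is an assumption, and the manual's (121/nn) formula is rendered here as integer division.

```python
# Illustrative sketch of CHARS-to-FONT translation.  "Cx" codes give
# characters per inch, converted by the manual's (121/nn) formula
# (integer division assumed); all other codes carry the pointsize
# directly.  A single trailing digit is zero-padded (TN8 -> TN08).

CHARS_TO_FONT = {  # small excerpt of the full table below
    "TN": "Courier", "TR": "Times-Roman", "CU": "Courier",
    "BD": "Courier-Bold", "HE": "Helvetica", "SY": "Symbol",
}

def chars_to_font(code):
    """Translate a CHARS code such as 'CU10' to 'FontnamePointsize'."""
    prefix, digits = code[:2].upper(), code[2:]
    if len(digits) == 1:            # TN8 is interpreted as TN08
        digits = "0" + digits
    nn = int(digits)
    size = 121 // nn if prefix.startswith("C") else nn
    return "%s%d" % (CHARS_TO_FONT[prefix], size)
```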
CHARS  FONT
AB     AvantGarde-Book
AD     AvantGarde-Demi
AL     Bookman-Demi
AO     AvantGarde-BookOblique
AV     AvantGarde-DemiOblique
BD     Courier-Bold
BI     Courier-BoldOblique
BK     Bookman-Demi
BL     Bookman-Light
BO     Bookman-DemiItalic
BS     Bookman-LightItalic
BT     Courier-Bold
CB     Courier-Bold
CL     Courier
CO     Courier-Oblique
CU     Courier
CX     Courier-BoldOblique
DB     Palatino-Bold
DI     Palatino-Italic
DR     Palatino-Roman
FM     Symbol
HA     Helvetica-Narrow-Bold
HB     Helvetica-Bold
HC     Helvetica-Condensed
HD     Helvetica-Narrow-Oblique
HE     Helvetica
HI     Helvetica-Oblique
HK     Helvetica-Condensed-Oblique
HN     Helvetica-Narrow
HO     Helvetica-Oblique
HS     Helvetica-Condensed-Bold
HV     Helvetica
HX     Helvetica-BoldOblique
HZ     Helvetica-Condensed-BoldObl
IB     Courier-BoldOblique
IL     Courier-Oblique
IT     Courier-Oblique
NB     NewCenturySchlbk-Bold
ND     NewCenturySchlbk-BoldItalic
NI     NewCenturySchlbk-Italic
NR     NewCenturySchlbk-Roman
NS     Helvetica-Narrow-BoldOblique
PB     Palatino-Bold
PD     Palatino-BoldItalic
PE     Courier
PI     Palatino-Italic
PR     Palatino-Roman
SY     Symbol
TB     Times-Bold
TD     Times-BoldItalic
TI     Times-Italic
TN     Courier
TR     Times-Roman
ZC     ZapfChancery-MediumItalic
ZD     ZapfDingbats
The default fonts for the font-selectors are:
0  TN -> Courier
1  BD -> Courier-Bold
2  IT -> Courier-Oblique
3  BI -> Courier-BoldOblique
All default fonts have a Pointsize of 10.
Fonts are selected by font-controls in column 1 when there is no carriage-control in column 1, or by font-controls in column 2 when there is carriage-control in column 1, or by imbedded font selectors (bracket surrounded codes like <.b+> or <.b->). Use Wylbur's HELP FORMAT command to learn more about imbedded selectors.
Specifying the CC option indicates carriage-control in column 1. Specifying multiple fonts, as in chars=(font0,font1,font2,font3), signifies font-controls in column 1 or 2 (depending upon NOCC or CC). If fewer than four CHARS values are specified, the remaining fonts are taken from the default fonts list. Specifying the TRC option also signifies font-controls, with or without CHARS. [See 28.6.]
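The column rules above can be sketched as follows. This is a hypothetical illustration, not SPIRES code: it assumes the font-control byte is one of the selector digits 0-3 from the default-fonts list, sitting in column 1 under NOCC or column 2 under CC, and the function name `line_font` is an assumption.

```python
# Illustrative sketch: pick the font for one output line.  With CC,
# carriage-control occupies column 1 and the font-control column 2;
# with NOCC the font-control occupies column 1.  Fonts not supplied
# via CHARS fall back to the default fonts listed above.

DEFAULTS = ["Courier", "Courier-Bold",
            "Courier-Oblique", "Courier-BoldOblique"]

def line_font(line, cc=False, chars=()):
    """Return the font selected by a line's font-control digit (0-3)."""
    fonts = list(chars) + DEFAULTS[len(chars):]
    selector = line[1] if cc else line[0]
    return fonts[int(selector)]
```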
The following are two formatting-character tables used on Unix SPIRES. The first is the <.font-selector> table; font-selectors may be given in UPPER, lower, or Mixed case. The second is the <:special-symbols> table, all of whose entries are case sensitive, each followed by what the symbol translates to in the print output.
Font-selector Table

<.b+>      turn on BOLD
<.b->      turn off BOLD
<.i+>      turn on ITALIC (OBLIQUE)
<.i->      turn off ITALIC (OBLIQUE)
<.u+>      turn on underline (____)
<.u->      turn off underline (____)
<.normal>  equivalent to <.b-><.i-><.u->
<.page>    page-eject this line
The following <.font-selectors(plus or minus)> are stripped:
For example: <.F+> or <.f-> or <.BELL> are all stripped. (Note: If not LPRINT, the colors are converted to Postscript if the COLOR option is given in the PRINT command; otherwise colors are stripped.)
Special-symbols Table          (each symbol is followed by its print output)

<:l">        "            <:r">        "            <:l'>        '
<:r'>        '            <:bullet>    *            <:sm_bullet> -
<:en_dash>   -            <:em_dash>   --           <:...>       ...
<:!~>        !            <:?~>        ?            <:l"euro>    <<
<:r"euro>    >>           <:l'euro>    <            <:r'euro>    >
<:base">     ,,           <:base'>     ,            <:not>       ~
<:div>       /            <:~=>        ~=           <:le>        <=
<:ge>        >=           <:+->        +/-          <:~~>        ~~
<:fraction>  /            <:infinity>  <infinity>   <:deg>       <degree>
<:Sigma>     <Sigma>      <:Delta>     <Delta>      <:radical>   <radical>
<:integral>  <integral>   <:f>         f            <:pi>        <pi>
<:Pi>        <Pi>         <:curly_d>   <curly_d>    <:oe>        oe
<:OE>        OE           <:ae>        ae           <:AE>        AE
<:fi>        fi           <:fl>        fl           <:tm>        (tm)
<:regmark>   (R)          <:copyright> (c)          <:cent>      <cent>
<:ukpound>   <UKpound>    <:yen>       <yen>        <:currency>  <currency>
<:%%>        <per 1000>   <:section>   <section>    <:para>      <paragraph>
<:dagger>    <dagger>     <:dbl_dagger> <dbl_dagger>
<:a'>        a            <:A'>        A            <:a`>        a
<:A`>        A            <:a^>        a            <:A^>        A
<:a:>        a            <:A:>        A            <:ao>        a
<:Ao>        A            <:a~>        a            <:A~>        A
<:c,>        c            <:C,>        C            <:e'>        e
<:E'>        E            <:e`>        e            <:E`>        E
<:e^>        e            <:E^>        E            <:e:>        e
<:E:>        E            <:i'>        i            <:I'>        I
<:i`>        i            <:I`>        I            <:i^>        i
<:I^>        I            <:i:>        i            <:I:>        I
<:dotless_i> i            <:n~>        n            <:N~>        N
<:o'>        o            <:O'>        O            <:o`>        o
<:O`>        O            <:o^>        o            <:O^>        O
<:o:>        o            <:O:>        O            <:o~>        o
<:O~>        O            <:o/>        o            <:O/>        O
<:ss>        ss           <:u'>        u            <:U'>        U
<:u`>        u            <:U`>        U            <:u^>        u
<:U^>        U            <:u:>        u            <:U:>        U
<:y:>        y            <:Y:>        Y            <:'>         '
<:umlaut>    <umlaut>     <:^>         ^            <:~>         ~
<:ring>      <ring>       <:dot>       <dot>        <:macron>    -
<:breve>     <breve>      <:Omega>     <Omega>      <:mu>        <mu>
<:lozenge>   <>           <:a_>        a            <:o_>        o
<:Apple>     <Apple>      <:blotch>    <blotch>     <:1/2>       1/2
<:1/4>       1/4
[See 28.6.]
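A minimal sketch of how a translation table like the one above can be applied to text (illustrative Python only, with a small subset of the symbols; not the actual SPIRES formatter):

```python
import re

# A few entries from the special-symbols table above (illustrative subset).
SPECIAL = {
    "bullet": "*", "em_dash": "--", "en_dash": "-",
    "copyright": "(c)", "tm": "(tm)", "1/2": "1/2",
}

def translate_specials(text):
    """Replace <:symbol> escapes with their plain-text output,
    leaving unrecognized symbols untouched."""
    def sub(match):
        symbol = match.group(1)
        return SPECIAL.get(symbol, match.group(0))
    return re.sub(r"<:([^>]+)>", sub, text)
```

For example, "See <:copyright> notice" would come out as "See (c) notice", while an unknown escape is passed through unchanged.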
If the DEStination option is supplied with the word "attached", then output is meant to be sent to your attached printer, which is usually the printer directly connected to your desktop computer (acting as a terminal). The printer name, "attached", must be given exactly that way, all lower case, no abbreviation.
This option signals that the destination printer accepts Postscript. By default, that is what "env:printer" and "Department Printer" use. The "attached" printer is assumed to be non-Postscript, or it is assumed that the terminal program you are using does not pass the Postscript through. But if your "attached" printer does handle Postscript, and your terminal program passes the file through (or you configure it to do so), then add the PS option to dest=attached to send Postscript to your attached printer.
Handy for an "env:printer" or "Department Printer" that does NOT handle Postscript. Using this option should produce non-Postscript output, just as for attached printers (without the PS option).
If you use the "html" option, output is always NOPS. If you use the "file" option, data is sent to that file, either as Postscript, html, or plain text depending upon choice of "env:printer", "Department Printer", or "Attached Printer", and the PS or NOPS options (or defaults).
On the mainframe, if you logged on using Samson and set your terminal to "samson", there could be a dialog between Samson and the mainframe informing the mainframe of your attached printer type. This resulted in the automatic setting of the LPRINT option on the mainframe (SHOW LPRINT). But there is no such dialog with Unix systems, even for Samson. Therefore, the type of attached printer is NOT known to Unix software. That is why you might have to indicate PS or NOPS, so print output can be sent to your attached printer in a form it can handle. [See 28.6.]
The PERFORM SYSTEM SEND command is a SPIRES interface for the original mainframe SEND command. By handling SEND in SPIRES, it is possible to transform mainframe SEND into whatever equivalent facility exists on the platform where SPIRES is running. On the mainframe, PERFORM SYSTEM SEND simply becomes QUIETLY TRY SEND. This description is for Unix SPIRES.
The syntax is as follows:
PERform SYStem SENd <user-list> message
Where <user-list> can be the following:
<user> | (<user> [,<user>])
And <user> can be any of these:
1. username  - The Unix logon name of the user.
2. gg.uuu    - The group-user code for which there is a translation in the .empath file.
3. user@host - Standard fully-qualified email address.
4. * or .    - Single special character which translates to the currently logged-on username.
5. $         - Single special character which translates to the "system" administrator's username.
For example:
The PERFORM SYSTEM MAIL command is a SPIRES interface for the original mainframe MAIL command. By handling MAIL in SPIRES, it is possible to transform mainframe MAIL into whatever equivalent facility exists on the platform where SPIRES is running. On the mainframe, PERFORM SYSTEM MAIL simply becomes QUIETLY TRY MAIL. This description is for Unix SPIRES.
The syntax is as follows:
PERform SYStem MAIl [TO] <user-list> options
Where <user-list> can be the following:
<user> | (<user> [,<user>])
And <user> can be any of these:
1. username  - The Unix logon name of the user.
2. gg.uuu    - The group-user code for which there is a translation in the .empath file.
3. user@host - Standard fully-qualified email address.
4. * or .    - Single special character which translates to the currently logged-on username.
5. $         - Single special character which translates to the "system" administrator's username.
For example:
In the option description below, the following terms are used:
Option words are shown in UPPER-lower case, with the UPPER-case portion required and the lower-case portion optional. For example, SILent can be specified as SILENT, SILEN, SILE, or SIL, in mixed case such as SiLenT.
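The UPPER-lower abbreviation rule can be sketched in Python (an illustrative sketch with a hypothetical function name, not the SPIRES parser):

```python
def matches_option(word, pattern):
    """True if `word` is a valid abbreviation of an UPPER-lower option
    pattern such as 'SILent': the UPPER portion is required, the lower
    portion is optional, and matching is case-insensitive."""
    required = "".join(c for c in pattern if c.isupper())
    full = pattern.upper()
    w = word.upper()
    # The word must be a prefix of the full option, at least as long
    # as the required (UPPER) portion.
    return full.startswith(w) and len(w) >= len(required)
```

So SILENT, SIL, and SiLenT all match the SILent pattern, while SI (too short) and SILENTLY (not a prefix) do not.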
The following options are recognized, and acted upon:
The following options, although recognized, are ignored:
SILent NONotify QUIet DRYrun
The word TO following MAIl is optional.
The basic syntax of PERform CHNGREF is as follows:
PERform CHNGREF "this" to "that" [IN "these"] [FOR element]
The word "to" between this and that is optional, and "this", "that", and "these" may be enclosed in quotes or apostrophes, or left undelimited if just a single word. "that" can be a null string, in which case you must give it in delimiters. Likewise, any value containing blanks must be delimited. A value with a leading semi-colon must be delimited to distinguish it from a comment.
Here are some examples:
1. PERform CHNGREF this to that ; in all
2. PERform CHNGREF "this" 'that' in these
3. PERform CHNGREF label ''
4. PERform CHNGREF "this" to "that" in 5
5. PERform CHNGREF "%title%" to "The true Title" for EXP
Here are more verbose versions of the above:
1. PERform CHNGREF "this" TO "that" IN "this"
2. PERform CHNGREF "this" TO "that" IN "these"
3. PERform CHNGREF "label" TO "" IN "label"
4. PERform CHNGREF "this" TO "that" in 5
5. PERform CHNGREF "%title%" TO "The true Title" IN "%title%" FOR EXP
Number 3 changes the "label" string to null. Number 4 makes the change in only the 5th occurrence of the element. Number 5 makes the changes to the EXP element.
If there is no ending [FOR element], then the 2nd element of the record-type is assumed. That normally is the element immediately following the KEY of the record, like COMMAND in protocols. But you can specify any record-level element with: FOR element.
When [IN "these"] occurs, only elements that contain the string "these" are chosen for making the "this" to "that" change. When [IN "these"] is not specified, "this" takes its place. Be aware that 'in all' does NOT mean in all occurrences of the element; it means in all occurrences with the string "all".
You can specify a positive-non-zero number in place of "these" following the "IN" keyword. When given that way, only that specific occurrence is chosen for the "this" to "that" change. The first occurrence is 1.
The actual change is case-sensitive, for both "this" and "that". The search for "these" (or default "this") is case-insensitive.
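The selection and case-sensitivity rules above can be sketched in Python. This is an illustrative model of the rules only (hypothetical function name, element occurrences modeled as a list of strings), not the SPIRES implementation:

```python
def chngref(values, this, that, these=None, occurrence=None):
    """Model of the CHNGREF rules described above:
    - the search for `these` (default: `this`) is case-insensitive;
    - the `this` -> `that` replacement itself is case-sensitive;
    - with `occurrence`, only that one occurrence (1-based) is changed."""
    target = (these if these is not None else this).lower()
    out = []
    for i, value in enumerate(values, start=1):
        if occurrence is not None:
            chosen = (i == occurrence)          # "IN n" form
        else:
            chosen = target in value.lower()    # case-insensitive search
        out.append(value.replace(this, that) if chosen else value)
    return out
```

Note how both rules interact: an occurrence containing "find" is selected even when searching for "FIND", but the replacement itself only touches the exact string "FIND".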
You may get serious diagnostics for things like syntax errors, no subfile selected, no referenced record, etc. You will also get an informational diagnostic ("Nothing found to change") when there is no n-th element occurrence associated with "IN n", or the search for "these" (or the default "this") found nothing. All diagnostics are controlled by standard settings.
The purpose of this command is to make it easy for you to change a referenced record, as long as you are dealing with record-level.
-> select explain
-> reference "FIND COMMAND, SIMPLE"
-> perform chngref "FIND" to "look"
-> for *
-> display
The element being changed is EXP, which is the 2nd element. Note that if you had referenced "FIND COMMAND" instead, nothing would have been changed, because that record doesn't contain the actual explanation; it contains a pointer to the actual record.
A handy all-purpose debugging tool when you need a snapshot of the current session is the PERFORM SYSDUMP command. PERFORM SYSDUMP places the output from the most commonly used commands for showing the status of the current session (e.g., SHOW SUBFILE INFORMATION, SHOW SUBFILE MAP) into your active file, along with the values of all dynamic, static, and system variables.
The syntax of the command is:
[IN ACTIVE [CONTINUE|CLEAR|KEEP]] PERFORM SYSDUMP
The IN ACTIVE prefix by itself adds nothing new to the command, since PERFORM SYSDUMP always directs its output to the active file. However, it is worth using if the active file is not empty, since you can include the CONTINUE, CLEAR or KEEP option so that you are not prompted "OK to clear?".
The output from the command starts with a line displaying the date and time that the dump began. It then displays the following information or output from commands:
Dynamic variable values
Static variable values by static array
System variables
SHOW SUBFILE INFORMATION
Information about the current search and/or stack
SHOW DYNAMIC ELEMENTS
SHOW FILTERS
Information about the referenced record, if any
Information from any subfile paths that are set, including:
  - SHOW DYNAMIC ELEMENTS
  - SHOW FILTERS
  - referenced records
SHOW SUBFILE MAP
SHOW FILES ATTACHED
SHOW FREECORE
PERFORM SYSDUMP can be used by application developers in Prism after using the SPIRES command to break out of Prism's full-screen environment.
There are many occasions when you want to take data from a SPIRES file and use it as input for another program. For example, you might want to retrieve a set of records and do statistical calculations using SAS, or move data to a microcomputer for use with Lotus 1-2-3 or a data base management system such as Dataease.
The Exporter extracts and formats data from a SPIRES file so that it can be used by a number of other programs, either on the Data Center mainframe or on a microcomputer (usually, an IBM PC). After using the Exporter you will have an active file with data that you can transfer to a microcomputer or input to another package on the mainframe. Note, however, that Exporter only prepares the data in the proper format -- it does not actually place the data in the other package.
The programs currently supported are:
Lotus 1-2-3
Dataease
10-Base
Knowledge Manager
MultiMate
SAS
SPSSx
SPIRES
Other Programs
Lotus 1-2-3, Dataease, 10-Base, Knowledge Manager, and MultiMate are popular programs used on microcomputers. SAS and SPSSx are statistical programs available on the Data Center mainframe.
The SPIRES option allows you to convert SPIRES data to a "flat file" while keeping it in standard SPIRES format to place in another SPIRES subfile. [See 29.1.2 for an explanation of a flat file.] The "Other Programs" option allows you to create a tabular display of your data for a variety of reasons, such as creating input for your own programs or converting your data to a flat file for a report, or moving data to a package other than those listed.
When you first enter the Exporter, you are placed in a full-screen environment with a series of input screens for you to fill in. On these screens, you indicate the system to which you will be downloading the data, which elements from the subfile you want to be retrieved, and how you want the data formatted. For each system, the Exporter picks appropriate defaults for formatting, although you can override them. You then are placed in the Exporter command environment where you define the set of records from the subfile using standard SPIRES commands. The EXPORT command places the data from the selected records in your active file in the appropriate format.
To enter the Exporter:
- 1) SET TERMINAL "terminal-type" to tell the system what type of terminal you are using. (Type HELP TERMINALS for more information about the SET TERMINAL command.)
- 2) CALL SPIRES.
- 3) SELECT a subfile.
- 4) Issue the command ENTER EXPORTER.
For example,
> set terminal vt100
> call spires
-> select inventory
-> enter exporter
Notes:
1) Your terminal must be able to support full-screen display for you to use Exporter. (Again, type HELP TERMINALS for information.)
2) The SET TERMINAL command is not necessary on an IBM 3270-style terminal.
Across the top of each input screen is a banner that shows the name of the screen and the current date and time. To move around each screen use the cursor keys or tab key, filling in appropriate blanks. Across the bottom of each screen is the command line. Your command choices are listed here. You can either use the function keys listed next to each choice (e.g., F1 or F2), or type the command on this banner line to the right of the prompt marked "Your choice:".
The following commands are available from the Exporter input screens:
- HELP (F1) shows you more information about the current screen of input and then returns you to it; use this any time you don't understand what is on the screen, or what to do next;
- CONTINUE (F2 or RETURN) checks your input for errors and then usually takes you to the next screen;
- DONE (F3) returns you to the Exporter command environment;
- BACKUP (F4 -- available on all input screens except the first one) moves you to the previous screen, allowing you to modify your format definition;
- SHOWELEM (F5 -- available on the Element Specification screen only) shows you a list of your subfile's elements, then returns you to the Element Specification screen.
Note that pressing a function key automatically issues the command associated with it so that you never have to type the command -- pressing F1 accomplishes exactly the same thing as typing the word "HELP" on the command line. If the function keys do not work the way you expect them to, try pressing the escape key and a numeral instead (e.g., <escape> 3 instead of F3).
The first screen you will see is the Target System Selection screen. This is where you indicate which system you intend to export the SPIRES data to. This screen has a display like this:
+-----------------------------------------------------------------+
|                          TARGET SYSTEM                          |
|                     ------------------------                    |
|                     - Lotus 1-2-3                               |
|                     - Dataease                                  |
|                     - 10-Base                                   |
|                     - Knowledge Manager                         |
|                     - MultiMate                                 |
|                     - SAS                                       |
|                     - SPSSX                                     |
|                     - SPIRES                                    |
|                     - Other Programs                            |
+-----------------------------------------------------------------+
You must select one and only one target system. Move the cursor to the area to the left of the desired system, type an "X", and press the F2 key.
After you select your target system, for all target systems except Dataease, 10-Base, and Knowledge Manager, you will move on to the Data Structure Selection screen. The system you selected appears in reverse video and a data structure will be pre-selected for you. This screen has a display like this:
+-----------------------------------------------------------------+
|      TARGET SYSTEM              DATA STRUCTURE                  |
|      -----------------------    --------------                  |
|      Lotus 1-2-3                - Hierarchical                  |
|      Dataease                   - Flat File                     |
|      10-Base                                                    |
|      Knowledge Manager                                          |
|      MultiMate                                                  |
|      SAS                                                        |
|      SPSSX                                                      |
|      SPIRES                                                     |
|      Other Programs                                             |
+-----------------------------------------------------------------+
Put simply, the difference between hierarchical data and data in a flat file is that data in hierarchical form can occur in structures and data in a flat file cannot. (A flat file is also known as being in "first normal form".) In other words, in a flat file a separate record must be created for each possible combination of related elements. For example,
Hierarchical                      Flat File

Distributor    Item               Distributor    Item
-----------    ----               -----------    ----
Distco         Soap               Distco         Soap
               Rubber             Distco         Rubber
               Fish               Distco         Fish
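The conversion from hierarchical data to first normal form can be sketched in Python (an illustrative sketch with hypothetical names; the Exporter's actual processing is internal to SPIRES):

```python
def flatten(distributor_items):
    """Convert hierarchical records (one distributor, many items) into
    first normal form: one (distributor, item) row per combination."""
    rows = []
    for distributor, items in distributor_items:
        for item in items:
            # The parent value is repeated on every row of the flat file.
            rows.append((distributor, item))
    return rows
```

This is exactly the repetition visible in the Distco example above: the distributor name appears once in the hierarchical form but on every row of the flat file.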
Note that data for Dataease, 10-Base, and Knowledge Manager, the three microcomputer data base management systems supported, must always be structured in a flat file. If you chose one of these systems for your target, Exporter will pass over the Data Structure screen and take you directly to the Element Specification screen. [See 29.1.3.] For all other systems, you may change the default data structure if you wish.
After choosing the data structure, continue to the next screen by pressing F2.
This screen is where you specify which elements you want to be extracted from your SPIRES subfile. You can request up to ten elements, and they should be listed in the order they will be read by the target system. It includes a display like this:
+------------------------------------------------------------------+
| ELEMENT 1   ELEMENT 2   ELEMENT 3     ELEMENT 4     ELEMENT 5    |
|                                                                  |
| ___________ ___________ _____________ _____________ ____________ |
|                                                                  |
| ELEMENT 6   ELEMENT 7   ELEMENT 8     ELEMENT 9     ELEMENT 10   |
| ___________ ___________ _____________ _____________ ____________ |
|                                                                  |
+------------------------------------------------------------------+
Type the name of each element, using the TAB key or the arrow keys to move across the screen. For instance, if you wanted to see data for the elements ITEM, QUANTITY, and PRICE, type the element names as shown here:
+------------------------------------------------------------------+
| ELEMENT 1   ELEMENT 2   ELEMENT 3     ELEMENT 4     ELEMENT 5    |
|                                                                  |
| item_______ quantity___ price________ _____________ ____________ |
|                                                                  |
| ELEMENT 6   ELEMENT 7   ELEMENT 8     ELEMENT 9     ELEMENT 10   |
| ___________ ___________ _____________ _____________ ____________ |
|                                                                  |
+------------------------------------------------------------------+
If you do not know the names of all the elements, issue the SHOWELEM command (or press the F5 key) to see a list of the elements in your subfile. You may move back and forth between the Element Specification screen and the element list display until you get all the elements typed on the screen.
When you have finished filling in the elements, issue the CONTINUE command or press the F2 key, to move on to the next screen.
This screen allows you to change the default column width and element type for each element. For MultiMate, SPIRES, and Other Programs, no element type choice is given.
The default width is taken from element information in the file definition or calculated by the Exporter; the total width can be up to 235 characters for all fields combined. (Note that the 235 characters includes the blank spaces between each column, so the actual limit will depend on how many columns you define.) For all systems except MultiMate, values that exceed this width will be truncated. With MultiMate, the values are wrapped within the column dimensions.
Elements can be one of two types: TEXT or NUMERIC. If the file's definition has an element information value for VALUE-TYPE that is NUMERIC, the element type will be NUMERIC in the Exporter; otherwise, it will default to TEXT. Specify the type of an element to be numeric if the values of that element are numeric and you will want to perform arithmetic calculations on them. (Note: if the file's definition has an edit mask defined in an element information packet for an element declared to be numeric, dollar signs and commas will be stripped from the values.)
The field layout display looks like this:
+------------------------------------------------------+
| ELEMENT                        WIDTH    TYPE         |
| ----------------------------   -----    ----         |
| ITEM                           30       TEXT         |
| QUANTITY                       20       TEXT         |
| PRICE                          20       NUMERIC      |
+------------------------------------------------------+
Elements specified as text will be left-justified within the column; any missing values will be blank. Elements that are numeric are right-justified in the column. For Lotus 1-2-3, a missing value is listed as ""; for 10-Base and Knowledge Manager, missing values appear as 0; for all other systems, missing values are left blank.
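The justification and truncation rules above can be sketched in Python. This is an illustrative sketch only (hypothetical function name, single-blank column separator assumed); the per-system missing-value conventions described above are omitted:

```python
def format_row(values, widths, types):
    """Format one output row: TEXT values are left-justified and
    truncated to the column width; NUMERIC values are right-justified.
    Columns are separated by one blank."""
    cols = []
    for value, width, typ in zip(values, widths, types):
        v = str(value)[:width]  # truncate values that exceed the width
        cols.append(v.rjust(width) if typ == "NUMERIC" else v.ljust(width))
    return " ".join(cols)
```

For example, a TEXT item padded to 6 columns followed by a NUMERIC quantity in 4 columns yields left- and right-aligned fields respectively.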
The TYPE column does not appear if you have selected MultiMate or Other Programs. This screen does not appear at all if you have selected SPIRES.
After you leave the input screens by either completing the last screen, or issuing the DONE command, the system places you in a command environment as opposed to a full-screen environment. The command prompt begins with a colon (":->") to remind you that you are still in Exporter and have access to commands that are unavailable in other parts of SPIRES. All SPIRES and WYLBUR commands are once again available, but special options associated with Exporter input, such as BACKUP, can no longer be used. Thus for help you would now type a "?" or the word "HELP" instead of pressing the F1 key.
At any time while you are in the Exporter command environment, you can issue any of the following commands:
HELP          describes the commands that are available to you at the time;
DEFINE        returns you to the first screen in input mode where you can modify the options you have specified for the data so far;
DEFINE *      returns you to the input screen that you most recently modified;
DEFINE CLEAR  returns you to the first input screen to begin a brand new formatting definition; the previous definition is erased;
RETURN        returns you to SPIRES.
After you have defined the format for your data, you must specify the group of records you want retrieved from the subfile. You do this by specifying a Global FOR class. Global FOR allows you to define a set of records based on criteria other than record keys or indexed values, such as all records in the subfile, all records in the current search result, or all records added today. (For more information on Global FOR, see the manual "Sequential Record Processing in SPIRES.")
Below are two examples using the most commonly used Global FOR classes:
> set terminal vt100
> call spires
-Welcome to SPIRES-3 ... if in trouble, try HELP
-> select inventory
-> enter test exporter
   .
   .               <--- input screens filled in
   .
:-> find company distco
RESULT: 45 RECORDS
:-> for result     <--- indicates that you want to use all the records
                        in the current search result
:+> export
   .
   .               <--- records placed in active file
   .
:-> save result    <--- active file saved as data set
RESULT saved on INT001
:-> for subfile    <--- indicates that you want to use all the records
                        in the subfile
:+> export
   .
   .               <--- records placed in active file
   .
:->
The EXPORT command places the data from the records in the specified class into the active file.
The complete syntax of the EXPORT command is:
EXPORT [FIRST|*|n|REST|LAST|ALL] [CLEAR]
The options allow you to process only a portion of the class of records, as with other commands under Global FOR. For example, the command EXPORT 5 will place five records from the class of records in the active file. The default is ALL, so if you issue the command EXPORT without any options, all the records in the class will be retrieved. It is often a good idea to export only a few records the first time with the "EXPORT n" command to ensure that your formatting instructions produce the desired result.
The CLEAR option indicates that it is all right to clear your active file before placing data in it. If you do not include it and there is data in your active file, you will be asked if it is OK to clear your active file.
Once you issue the EXPORT command, your Global FOR class is cleared. Your formatting definition is still in effect, however, and you can establish another class of records and export them according to the same formatting definition, as shown in the above example.
Once the Exporter has placed the data in your active file, you must import the data to your selected system by following the procedures for that particular system. The Exporter provides brief instructions on how to import the data and where to look in that system's documentation for more information.
If you are moving the data to a system on a microcomputer, you will have to use a file transfer program such as Samson, the ITS file transfer program for the IBM PC, to download the data from the Data Center mainframe to the microcomputer. (Type HELP SAMSON for more information about the Samson program.)
The following sections describe the procedures for each system.
If you need help importing the data into Lotus 1-2-3, see the section in the Lotus 1-2-3 documentation called "Importing disk files,/File Import (/FI)." The following information will help you with the importing process:
* Use the "Numbers format" when importing.
* When saving your file with Samson, use the suffix .PRN (e.g., MYFILE.PRN).
If you need help importing the data into Dataease, see the section in the Dataease documentation called "Data Import Facility." The following information will help you with the importing process:
* The Source File Format is: Variable Length
* The Field Separator is: &   <-- some other character may be used for your data
* The Record Separator is: a new line
* The Source File Organization is: by field name
If you need help importing the data into 10-Base, see the section in the 10-Base documentation called "Reformatting and Loading Data From External Files -- The Bridge Option." (Note: When you are ready to import this data to 10-Base, when asked for "Source TYPE", you should specify "DELIMITED ASCII.")
If you need help importing the data into Knowledge Manager, see the section in the Knowledge Manager documentation called "How to Attach the Records of a File to a Table."
If you need help importing the data into MultiMate, see the section in the MultiMate documentation called "FILE CONVERSION UTILITY". (Note: when you are actually ready to import the data you will select the option "ASCII To MultiMate" from the File Conversion Menu.)
The Exporter adds an INPUT statement to the top of the data, which you may use in your SAS program. It must be deleted from the data itself before the data can be used.
The Exporter adds a DATA LIST statement to the top of the data, which you may use in your SPSSx program. It must be deleted from the data itself before the data can be used.
Your data is placed in the active file in native SPIRES format. You may then build/update your subfile via the BATCH command in SPIBILD.
Your data is placed in the active file in a tabular format without delimiters for you to use as you see fit.
Below are examples of data formatted by the Exporter for four different systems. In each case, default values were accepted as appropriate; element width and type were changed to prevent truncation of long text values and to declare as numeric the quantity and price values.
The first sample shows the data in standard SPIRES format. The second is for Lotus 1-2-3 in hierarchical format. The third is for Dataease in first normal form. The fourth is for SAS in first normal form.
ADDUPD;
NAME = Fly Pie Shoe Shop;
PART = Electric Shoe Horns;
QUANTITY = 23;
PRICE = 1.25;
PART = Sock Parts;
QUANTITY = 9;
PRICE = 0.95;
PART = Toe Girdles;
QUANTITY = 44;
PRICE = 1.50;
PART = Plaid Shoe Polish;
QUANTITY = 3;
PRICE = 2.98;
;
ADDUPD;
NAME = Good Man Hardware;
PART = Inflatable Doorknobs;
QUANTITY = 12;
PRICE = 1.79;
PART = Hammer Cleaner;
QUANTITY = 12;
PRICE = 1.00;
PART = Staple Gun Sights;
QUANTITY = 15;
PRICE = 1.15;
;
ADDUPD;
NAME = 3-D House of Beauty;
PART = Pancake Hair Spray;
QUANTITY = 32;
PRICE = 3.89;
PART = Eyelid Wax;
QUANTITY = 15;
PRICE = 2.50;
PART = Dimple Putty;
QUANTITY = 4;
PRICE = 3.50;
;
"Fly Pie Shoe Shop " "Electric Shoe Horns " 23 1.25 " " "Sock Parts " 9 0.95 " " "Toe Girdles " 44 1.50 " " "Plaid Shoe Polish " 3 2.98 "Good Man Hardware " "Inflatable Doorknobs " 12 1.79 " " "Hammer Cleaner " 12 1.00 " " "Staple Gun Sights " 15 1.15 "3-D House of Beauty " "Pancake Hair Spray " 32 3.89 " " "Eyelid Wax " 15 2.50 " " "Dimple Putty " 4 3.50
Fly Pie Shoe Shop        ,Electric Shoe Horns      ,  23,  1.25
Fly Pie Shoe Shop        ,Sock Parts               ,   9,  0.95
Fly Pie Shoe Shop        ,Toe Girdles              ,  44,  1.50
Fly Pie Shoe Shop        ,Plaid Shoe Polish        ,   3,  2.98
Good Man Hardware        ,Inflatable Doorknobs     ,  12,  1.79
Good Man Hardware        ,Hammer Cleaner           ,  12,  1.00
Good Man Hardware        ,Staple Gun Sights        ,  15,  1.15
3-D House of Beauty      ,Pancake Hair Spray       ,  32,  3.89
3-D House of Beauty      ,Eyelid Wax               ,  15,  2.50
3-D House of Beauty      ,Dimple Putty             ,   4,  3.50
INPUT name 1-25 part 27-51 quantity 53-62 price 64-73;
Fly Pie Shoe Shop         Electric Shoe Horns               23       1.25
Fly Pie Shoe Shop         Sock Parts                         9       0.95
Fly Pie Shoe Shop         Toe Girdles                       44       1.50
Fly Pie Shoe Shop         Plaid Shoe Polish                  3       2.98
Good Man Hardware         Inflatable Doorknobs              12       1.79
Good Man Hardware         Hammer Cleaner                    12       1.00
Good Man Hardware         Staple Gun Sights                 15       1.15
3-D House of Beauty       Pancake Hair Spray                32       3.89
3-D House of Beauty       Eyelid Wax                        15       2.50
3-D House of Beauty       Dimple Putty                       4       3.50
EMS is the Electronic Mail System used on the mainframe at Stanford. EMS organizes your mail in records of a SPIRES database. The database is called: MSGFILE.
EMS has been ported to Desktop and Laptop systems where you can migrate your MSGFILE before the mainframe is decommissioned. There is a special format available that allows you to convert your MSGFILE records into a form readable by Eudora.
EMS> attach .msgfile
EMS> set format **ORV.GQ.EMS.PEMS.EUDORA
EMS> find ...                ; criteria to find records
EMS> in active clean type
The Active File created by the "in active clean type" command will contain the set of records found by your search request. You could also use Global-FOR and DISplay commands to obtain a set of records. Whatever method you use, the Active File can be saved as a file that is understood by Eudora as a "mailbox".
EMS> save mymail.mbx
You should move the saved Active File to your Mail Folder inside your Eudora Folder.
Key(s)       Sequence
Bsp/delete   CTRL-H    <- Backspace
DEL          CTRL-K    <- Del (07F)
DnArrow      CTRL-D    <- KEY_DOWN
End          ESC E
Esc          CTRL-[    Esc (01B)
F1           ESC 1     <- KEY_A1
F2           ESC 2     <- KEY_B2
F3           ESC 3     <- KEY_A3
F4           ESC 4     <- KEY_F(5)
F5           ESC 5     <- KEY_F(6)
F6           ESC 6     <- KEY_F(7)
F7           ESC 7     <- KEY_F(9)
F8           ESC 8     <- KEY_F(10)
F9           ESC 9     <- KEY_F(0)
F10          ESC 0     <- KEY_C1
Home         CTRL-A
Ins          ESC I     <- Toggle INSERT
Kpd .        DEL C     <- KEY_C3
Kpd +        DEL W     <- KEY_F(8)
Kpd *        CTRL-E
Kpd -        ESC I     <- Toggle INSERT
Kpd /        CTRL-F
Kpd =        CTRL-N
Kpd clear    CTRL-P
Kpd enter    CTRL-C
Kpd 1        ESC 1     <- KEY_A1
Kpd 2        ESC 2     <- KEY_B2
Kpd 3        ESC 3     <- KEY_A3
Kpd 4        ESC 4     <- KEY_F(5)
Kpd 5        ESC 5     <- KEY_F(6)
Kpd 6        ESC 6     <- KEY_F(7)
Kpd 7        ESC 7     <- KEY_F(9)
Kpd 8        ESC 8     <- KEY_F(10)
Kpd 9        ESC 9     <- KEY_F(0)
Kpd 0        ESC 0     <- KEY_C1
LfArrow      CTRL-L    <- KEY_LEFT
Linefeed     CTRL-J    <- like CTRL-D
PgDn         ESC N
PgUp         ESC P
Return       CTRL-M
RtArrow      CTRL-R    <- KEY_RIGHT
TAB          CTRL-I
UpArrow      CTRL-U    <- KEY_UP
Full-screen Reference Sheet
Sequence Function
CTRL-A (HOME)    Move cursor to/from command line
CTRL-B           Move to bottom line of screen
CTRL-C           Attention/Interrupt
CTRL-D           Move down one line
CTRL-E           Move to end of field (right,down)
CTRL-F           Move to start of field (left,up)
CTRL-G (BELL)    Ring bell
CTRL-H           Backspace
CTRL-I (TAB)     Move to next tab setting
CTRL-J           Linefeed, same as CTRL-D
CTRL-K (DEL)     Alternate for DEL
CTRL-L           Move left one character
CTRL-M (RETURN)  Return
CTRL-N           Move to next word
CTRL-O           (Not used)
CTRL-P           Move to previous word
CTRL-Q           Ignore
CTRL-R           Move right one character
CTRL-S           Ignore
CTRL-T           Move to top line of screen
CTRL-U           Move up one line
CTRL-V (INSERT)  Toggle Insert mode
CTRL-W           Move to previous tab setting
CTRL-X           Ignore
CTRL-[           This is ESC, see ESC below
DEL C            Delete character
DEL E            Delete current-to-end of line
DEL F            Delete first-to-current of line
DEL N            Delete current-to-end in word
DEL P            Delete first-to-current in word
DEL W            Delete current word (DEL P, DEL N)
ESC [              See ESC-[ table
ESC TAB            Move to previous tab setting
ESC D              Move to last field
ESC H (or h)       Home
ESC I (or i)       Toggle Insert mode
ESC O (upper only) See ESC-O table
ESC U              Move to first field
ESC V (or v)       Refresh screen
ESC DEL            Ignore
ESC-O Table:
ESC O A            Move up one line
ESC O B            Move down one line
ESC O C            Move right one character
ESC O D            Move left one character
ESC O M            Attention/Interrupt
ESC O P            Move to previous word
ESC O Q            Move to next word
ESC O R            Move to start of field (left,up)
ESC O S            Move to end of field (right,down)
ESC O l (lower L)  Same as DEL
ESC O m            Toggle Insert mode
ESC O n            Delete character
ESC O {p->y}       Same as ESC {0->9} respectively
ESC-[ Table:
ESC [ {CTRL-key}   Same as CTRL-key
ESC [ DEL          Same as DEL
ESC [ 2 ~          Toggle Insert mode
ESC [ 3 ~          Same as DEL
ESC [ 5 ~          Move to previous word
ESC [ 6 ~          Move to next word
ESC [ {11-15} ~    F1-F5
ESC [ {17-21} ~    F6-F10
ESC [ A            Move up one line
ESC [ B            Move down one line
ESC [ C            Move right one character
ESC [ D            Move left one character
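To make the ESC-[ table concrete, here is a hedged sketch in Python of how a terminal-side program might map these sequences to their functions. This is an illustration only, not SPIRES code: the function and table names are invented, and only the sequences listed in the table above are handled.

```python
ESC = "\x1b"

CSI_FINAL = {                 # ESC [ <letter>
    "A": "Move up one line",
    "B": "Move down one line",
    "C": "Move right one character",
    "D": "Move left one character",
}
CSI_TILDE = {                 # ESC [ <number> ~
    "2": "Toggle Insert mode",
    "3": "Same as DEL",
    "5": "Move to previous word",
    "6": "Move to next word",
}

def decode_csi(seq):
    """Map one ESC [ ... sequence to the function named in the table."""
    assert seq.startswith(ESC + "["), "not an ESC [ sequence"
    body = seq[2:]
    if body.endswith("~") and body[:-1].isdigit():
        num = body[:-1]
        if num in CSI_TILDE:
            return CSI_TILDE[num]
        n = int(num)
        if 11 <= n <= 15:
            return "F%d" % (n - 10)   # ESC [ {11-15} ~ -> F1-F5
        if 17 <= n <= 21:
            return "F%d" % (n - 11)   # ESC [ {17-21} ~ -> F6-F10
    return CSI_FINAL.get(body, "unknown")

print(decode_csi(ESC + "[A"))    # Move up one line
print(decode_csi(ESC + "[11~"))  # F1
```

Note the gap between 15 and 17 in the function-key numbering; that gap is in the table itself and is preserved here.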
The SPIMSG command is a master-mode command used by the system administrator to set the "message of the day", which is kept at the very beginning of the COMZ file. The command has three basic forms:
1. SPIMSG "Any string with interior quotes doubled"
2. SPIMSG Any string without quotes, apostrophes or semi-colons.
3. SPIMSG
The last form, with no string, clears any existing message. You must be in master-mode to issue this command. You can check the result with the DUMP COMZ command: if there is no message, the first byte contains hexadecimal 00; otherwise, the first byte contains the length of the message, followed by the message text.
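As an illustration of the length-prefixed layout just described, here is a hedged Python sketch of decoding such a message. The function name is invented, and the sketch decodes the text as ASCII purely for demonstration; the actual COMZ file on the mainframe would hold EBCDIC text.

```python
from typing import Optional

def decode_motd(comz_start: bytes) -> Optional[str]:
    """Decode a message stored as a length-prefixed string: the first
    byte is the length (hex 00 means no message), followed by that
    many bytes of message text."""
    length = comz_start[0]
    if length == 0:
        return None                  # no message of the day is set
    # Illustration only: real COMZ text would be EBCDIC, not ASCII.
    return comz_start[1:1 + length].decode("ascii")

print(decode_motd(b"\x00"))              # None
print(decode_motd(b"\x05Hello, world"))  # Hello
```

The second call shows that any bytes beyond the stated length are simply ignored, which matches the description of DUMP COMZ output above.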
Most versions of SPIRES allow command-line editing and provide a command-line history. As you enter a command line (in line-by-line mode, not full-screen mode), you may edit that line before completing it. Your input text is captured in a buffer. Typing printable characters generally inserts new text into the buffer (unless you are in overwrite mode; see below); other special keys modify the text in the buffer. In the key descriptions below, ^n means Control-n, that is, holding the CONTROL key down while pressing "n". Errors ring the terminal bell.
^A/^E   : Move cursor to beginning/end of the line.
^F/^B   : Move cursor forward/backward one character.
ESC-F   : Move cursor forward one word.
ESC-B   : Move cursor backward one word.
^D      : Delete the character under the cursor.
^H, DEL : Delete the character to the left of the cursor.
^K      : Kill from the cursor to the end of the line.
^L      : Redraw the current line.
^O      : Toggle overwrite/insert mode. The editor starts in insert
          mode. Text added in overwrite mode (including yanks)
          overwrites existing text; text added in insert mode does not.
^P/^N   : Move to the previous/next item on the history list.
^R/^S   : Perform an incremental reverse/forward search for a string
          on the history list. Typing normal characters adds to the
          current search string and searches for a match. Typing
          ^R/^S marks the start of a new search and moves on to the
          next match. Typing ^H or DEL deletes the last character
          from the search string and searches from the starting
          location of the last search; therefore, repeated DELs
          appear to unwind to the match nearest the point at which
          the last ^R or ^S was typed. If ^H or DEL is repeated
          until the search string is empty, the search location
          begins again from the start of the history list. Typing
          ESC or any other editing character accepts the current
          match, loads it into the buffer, and terminates the search.
^T      : Transpose (swap) the characters under and to the left of
          the cursor.
^U      : Delete the entire line.
^Y      : Yank previously killed text back at the current location.
          Note that this will overwrite or insert, depending on the
          current mode.
TAB     : By default, adds spaces to the buffer to reach the next
          tab stop (just after every 8th column).
NL, CR  : Return the current buffer to the program.
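The incremental reverse search (^R) described above can be sketched as a simple backward scan over the history list. This is an illustration only, not SPIRES code; the history entries and function name are invented.

```python
# A plain Python list stands in for the command history, newest last.
history = ["find topic spires", "display all", "find author smith"]

def reverse_search(history, search_string, start):
    """Scan backward from index `start` for an entry containing
    `search_string`; return its index, or None if there is no match."""
    for i in range(start, -1, -1):
        if search_string in history[i]:
            return i
    return None

# Typing "fi" after ^R matches the newest entry containing "fi":
print(history[reverse_search(history, "fi", len(history) - 1)])
# A second ^R continues from just before that match, finding the
# next older one:
print(history[reverse_search(history, "fi", 1)])
```

Each additional ^R simply restarts the scan just above the previous match, which is what makes the search "incremental" over the history list.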
DOS and ANSI terminal arrow key sequences are recognized, and act like:
up    : same as ^P
down  : same as ^N
left  : same as ^B
right : same as ^F
The following PC keyboard mappings also take place:
HOME   : same as ^A
END    : same as ^E
DELETE : same as ^H
INSERT : same as ^O
RETURN : same as NL
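The insert/overwrite distinction toggled by ^O can be sketched as follows. This is a minimal illustration of the buffer behavior described above, not the SPIRES implementation; the class and method names are invented.

```python
class LineBuffer:
    """A command-line buffer with a cursor and an insert/overwrite flag."""

    def __init__(self):
        self.text = ""
        self.cursor = 0
        self.overwrite = False       # the editor starts in insert mode

    def toggle_mode(self):           # ^O
        self.overwrite = not self.overwrite

    def type_char(self, ch):
        if self.overwrite and self.cursor < len(self.text):
            # overwrite mode replaces the character under the cursor
            self.text = (self.text[:self.cursor] + ch
                         + self.text[self.cursor + 1:])
        else:
            # insert mode pushes the existing text to the right
            self.text = (self.text[:self.cursor] + ch
                         + self.text[self.cursor:])
        self.cursor += 1

buf = LineBuffer()
for ch in "shw":
    buf.type_char(ch)
buf.cursor = 2            # as if ^B moved back over the "w"
buf.type_char("o")        # insert mode: pushes the "w" right
print(buf.text)           # show
buf.toggle_mode()         # ^O switches to overwrite mode
buf.cursor = 0
buf.type_char("S")        # overwrite mode: replaces the "s"
print(buf.text)           # Show
```

A yank (^Y) follows the same rule: the killed text is re-added through the same insert-or-overwrite path, which is why its effect depends on the current mode.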
(The following documents are not SPIRES documents per se, but describe utilities and programs that may be useful in developing SPIRES applications.)
The above documents (except any marked "in preparation") may be obtained through the PUBLISH command on the Forsythe computer at Stanford University. If you do not use SPIRES at Stanford, contact your local system administrator to find out how SPIRES documents are made available there.
SPIRES manuals are updated regularly as changes are made to the system. This does not mean that all manuals are out of date with each new version of SPIRES. The changes to the documentation match those made to SPIRES: they are usually minor and/or transparent. Not having the most current version of a manual may mean you do not have all the most recent information about all the latest features, but the information you do have will usually be accurate.
A public subfile, SPIRES DOC NOTES, contains information about changes to SPIRES manuals. Using this subfile, you can determine whether the manual you have has been updated and if so, how significant those updates are. You need to know the date your manual was published, which is printed at the top of each page. For details on the procedure, issue the command SHOW SUBFILE DESCRIPTION SPIRES DOC NOTES.