|
|
||
![]() |
||
|
The ANALYZE command is used to extract terms, excluding stopwords, from an L-number and to provide statistical analysis of the terms. |
|
This diagram illustrates extraction of author names from an answer set using ANALYZE. |

|
To use the ANALYZE command, follow this sample format: | |||||
|
|
|
|
|
|
||
|
|
|
|
|
|||
|
|
|
Use the guidelines below to enter command line information: |
|
|
|
|
|
|
Type: ANALYZE | |
|
|
Type:
Default: The last L-number created |
|
|
|
What you type in Field 3 depends on what you type in Field 2. | |
|
|
|
|
| An answer set L-number |
Type up to 5 extraction field codes. Default: The default extraction field of the file |
|
| An L-number created with ANALYZE | Do not type anything | |
|
|
What you type in Field 4 depends on what you type in Field 2. | |
|
|
|
|
| An answer set L-number |
Type: Example: A single answer number 3 Multiple answer numbers 1 5 7 A range of answer numbers 1-10 A combination of these 1-10 17 All the answers 1- Default: All answers |
|
| An L-number created with ANALYZE |
Type: Example: A single term number 1 Multiple term numbers 2,4 A range of term numbers 1-10 A combination of these 1-10, 15 All the answers 1- A range criteria OGT 100 Range Criteria Definitions: TOP n = the first n terms based on current ordering OGT n = the terms with occurrence counts greater than n DGT n = the terms with document counts greater than n PGT n = the terms with percent of document counts greater than n Default: All terms |
|
| Note: An L-number can only be ANALYZEd in the same file in which it was created. |
|
You may use ANALYZE to extract terms only if they meet certain criteria. |
|
|
|
|
| Terms that contain a 1-20 character string | WITH followed by the character string in quotes | => ANALYZE L5 AU 1-10 WITH "BROWN" |
| Terms that do not contain a 1-20 character string | NOT followed by the character string in quotes | => ANALYZE L9 TI 1- NOT "METHANE" |
| The first n characters of each term | LEN n | => ANALYZE L6 1- LEN 10 |
|
Rules:
|
||
|
Results obtained with the ANALYZE command are placed in a new L-number. Use the DISPLAY command with the ANALYZE L-number to view:
Specialized displays of ANALYZE results are available. The ANALYZE L-number may be used:
|
|
The following example shows use of ANALYZE to extract and perform statistical analysis on authors from answer set L1. |
=> FILE GEOREF
=> S VENUS AND (VOLCAN? OR OUTGAS? OR DEGAS?)
L1 917 VENUS AND (VOLCAN? OR OUTGAS? OR DEGAS?)
=> ANALYZE L1 AU 1-
L2 ANALYZE L1 1- AU : 815 TERMS
=> D L2
L2 ANALYZE L1 1- AU : 815 TERMS
TERM # # OCC # DOC % DOC AU
------ ------- ------ ------ ---------------
1 91 91 9.92 HEAD, JAMES W.
2 57 57 6.22 HEAD, J. W.
3 34 34 3.71 SOLOMON, SEAN C.
4 29 29 3.16 CRUMPLER, L. S.
5 25 25 2.73 BASILEVSKY, A. T.
6 24 24 2.62 HEAD, JAMES W., III
7 24 24 2.62 SAUNDERS, R. STEPHEN
8 23 23 2.51 STOFAN, ELLEN R.
9 21 21 2.29 GUEST, J. E.
10 21 21 2.29 WILSON, LIONEL
|
|
50,000 terms may be extracted with ANALYZE. There is a limit of 1000 terms per answer. |