PCTFULL Reload Offers More Content and New Numeric Property Search Feature
The PCTFULL database on STN is completely reloaded as of February 26, 2011, and contains much more content, such as names, addresses and full-text in various languages (e.g. English translations of German, French, Spanish or Russian full-texts), plus a new feature for numeric property search in full-texts.
This reload has been implemented in order to offer improved identification of the claims and detailed description chapters in the full-text documents, based on better OCR quality.
Database content and new update procedure
The general document coverage (WIPO/PCT published application WOA1 and WOA2 documents) as well as the document key will remain unchanged compared with the replaced PCTFULL database. This allows saved answer sets and SDI histories to be retained from the previous database (note that saved answer sets may nonetheless show the documents with a new look and feel). A new set-up by the database producer now provides regular updates for the PCTFULL records (i.e. WO documents). The Entry Date (/ED) marks the first entry of a document; an additional Entry Date for the full-text (/EDTX) designates the first entry of a document's full-text (i.e. detailed description and/or claims). Each update of a document will replace the complete STN record and will trigger an Update Date (/UP). Two extra fields, Data Entry Date (/DED) and Data Update Date (/DUPD), indicate the first entry and most recent update dates of the WO documents within the database producer's proprietary data repository. On the release date the complete reloaded database will comprise entry dates from 20101130 up to 20110224. Please see HELP UPDATE for available update codes for manual and automatic current awareness searches.
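The relationship between /ED, /EDTX and /UP described above can be illustrated with a small sketch. This is not the producer's or STN's actual loading logic; the `Record` class, the `load` function and the document key used in the usage lines are hypothetical, and it is an assumption that the very first entry of a document does not set an Update Date. The producer-side dates /DED and /DUPD are not modeled.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class Record:
    """Hypothetical, simplified view of one record and its date fields."""
    key: str                # document key
    content: str            # complete record content
    has_full_text: bool     # True if description and/or claims are present
    ed: str                 # Entry Date (/ED), YYYYMMDD
    edtx: Optional[str]     # Entry Date for the full-text (/EDTX)
    up: Optional[str]       # Update Date (/UP)


def load(db: Dict[str, Record], key: str, content: str,
         has_full_text: bool, date: str) -> Record:
    """Enter a new document or replace the complete record of an existing one."""
    old = db.get(key)
    if old is None:
        # First entry: ED is set; EDTX only if full-text is already present.
        rec = Record(key, content, has_full_text,
                     ed=date,
                     edtx=date if has_full_text else None,
                     up=None)  # assumption: no UP on first entry
    else:
        # Update: the whole record is replaced, ED and an existing EDTX are
        # kept, and UP is set to the date of this update.
        rec = Record(key, content, has_full_text,
                     ed=old.ed,
                     edtx=old.edtx or (date if has_full_text else None),
                     up=date)
    db[key] = rec
    return rec
```

A document first entered without full-text and later updated with it would keep its original /ED and receive /EDTX and /UP on the later date:

```python
db: Dict[str, Record] = {}
load(db, "WO2011000001", "front page only", False, "20101130")
rec = load(db, "WO2011000001", "front page + full-text", True, "20110110")
print(rec.ed, rec.edtx, rec.up)
```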
The reloaded database comprises about 1.8 million documents and around 1.3 million images (predominantly drawings from the front page).
New numeric property search feature
A new numeric search capability has been installed which allows numeric property search within the English text of all documents. Searchable properties and their respective base units comprise e.g. /SAR (Surface Area, m²), /CMOL (Molar Concentration, mol/L), /DEN (Mass Density, kg/m³), /MFL (Mass Flow, kg/s), /MW (Molecular Weight, g/mol), /VOL (Volume, m³) and many more. The /PHP index contains the complete list of codes and related text for all available properties. Please note that EXPAND is not available for the property fields of the numeric search in full-texts.
A search with the respective field codes will be carried out in all fields with English text (English title, abstract, description and claims). A search for a physical property value will not only find hits in the text showing the exact value but will also include hits where the value falls within a given range (e.g. => s 10/cmol will find "at least 0.008 mol/l" or => s 45/deg will find "0 to 80 degrees"). In addition, specific related units will be converted to SI base units in order to encompass a wider range of possible values (e.g. a search in Kelvin will also generate hits where the respective value or range is given in degrees Celsius or degrees Fahrenheit, and a search in square meters will also find values or ranges given in square inches or square feet).
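The matching behavior described above (exact values, ranges, open-ended ranges and unit conversion) can be sketched as follows. This is only an illustration of the principle, not STN's implementation: the unit labels, the `TextRange` class and the `matches` function are hypothetical, and the sketch does not check that the query unit and the text unit belong to the same property.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional

# Converters from a unit to the base unit of its property (hypothetical labels).
TO_BASE: Dict[str, Callable[[float], float]] = {
    "K": lambda v: v,                                   # Kelvin is the base unit
    "degC": lambda v: v + 273.15,                       # Celsius -> Kelvin
    "degF": lambda v: (v - 32.0) * 5.0 / 9.0 + 273.15,  # Fahrenheit -> Kelvin
    "mol/L": lambda v: v,                               # base unit of /CMOL
}


@dataclass
class TextRange:
    """A value or range found in the text; None marks an open bound."""
    low: Optional[float]
    high: Optional[float]
    unit: str


def matches(query_value: float, query_unit: str, rng: TextRange) -> bool:
    """True if the query value lies inside the range after conversion."""
    q = TO_BASE[query_unit](query_value)
    low = TO_BASE[rng.unit](rng.low) if rng.low is not None else None
    high = TO_BASE[rng.unit](rng.high) if rng.high is not None else None
    return (low is None or q >= low) and (high is None or q <= high)
```

With these definitions, a query for 10 mol/L matches a text passage such as "at least 0.008 mol/l", and a query for 300 K matches a range stated as 20 to 30 degrees Celsius:

```python
print(matches(10, "mol/L", TextRange(0.008, None, "mol/L")))
print(matches(300, "K", TextRange(20, 30, "degC")))
```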
As with general numeric search on STN, you may specify a tolerance for the numeric search with SET TOLERANCE. Search examples for the different properties will be included in the PCTFULL Summary Sheet. The online help message HELP NUMERIC PROPERTY SEARCH or HELP NPS gives a complete list of properties plus a detailed description of the new numeric search capability in the PCTFULL database.
Improved application and priority number formats
For the reloaded PCTFULL database a new standardization procedure for application and priority numbers has been established. This new procedure standardizes such numbers to STN standard more reliably, resulting in a greater number of standardized formats in the /AP, /PRN and /APPS fields and in better FSORT and cross-over behavior of PCTFULL application and priority numbers.
New content and fields
The reloaded PCTFULL database will introduce new fields as well as some additional content in established fields in order to mirror the extended availability of information in different languages:
- Fields for "Other Languages", i.e. languages other than the four main languages English, French, Spanish and German, have been introduced for the text fields and end with OL (e.g. TIOL, ABOL, DETDOL, CLMOL, and MCLMOL). Such Other Languages may comprise, for instance, Portuguese, Finnish, Swedish, Italian etc. The Other Language fields are individual search and display fields. In addition to these individual language related fields (e.g. /TIEN, /TIFR, /TIDE, /TIES, /TIOL), all languages which may be represented in Roman characters are indexed in the general STN fields /BI (Basic Index), /TI, /AB, /CLM, and /MCLM.
- Text containing national special characters (accents, umlauts, Asian characters etc.) will be available for display in Original Language individual display fields ending with OR (e.g. INOR, PAOR, AGOR, TIOR, ABOR, DETDOR, CLMOR, MCLMOR) and will also be displayable in the new pre-defined display formats ALLOR and MAXOR.
- The Original Language fields may be available for Roman character based languages such as German, French, Spanish and Portuguese as well as for non-Roman character based languages such as Asian or Cyrillic languages. For all Roman character based languages the general fields (e.g. IN, PA, TI, AB etc.) contain the content according to STN standard without the original accents and umlauts (e.g. ü as ue, ä as ae, é as e, à as a, etc.) whereas the respective Original Language fields (INOR, TIOR, ABOR etc.) display the same content including national special characters when available. For all non-Roman character based languages (e.g. Chinese, Japanese, Korean, Russian) the general fields will normally contain an English transliteration of names or text whereas the Original Language fields show the original national character based names or text. Thus, original non-Roman character based content is only available for display in the respective Original Language fields and is neither searchable nor selectable.
- The availability of Original Language fields may be checked with the field availability index (/FA) and will be designated in the FA display field (included in the free-of-charge pre-defined TRIAL format) for each document. An additional annotation designates the language of the Original Language field in /FA (e.g. TIOR.ES, ABOR.RU etc.).
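The difference between the general fields and the Original Language fields for Roman character based languages can be illustrated with a minimal sketch of how accents and umlauts might be reduced to plain Roman characters. This is not the STN standardization routine: the function name `to_general_field` is hypothetical, and the mappings for ö, ß and the upper-case umlauts are assumptions added for completeness.

```python
import unicodedata

# Umlauts are expanded to two letters; the entries for "ö", "ß" and the
# upper-case forms are assumptions, not taken from the announcement.
EXPANSIONS = {
    "ä": "ae", "ö": "oe", "ü": "ue",
    "Ä": "Ae", "Ö": "Oe", "Ü": "Ue",
    "ß": "ss",
}


def to_general_field(text: str) -> str:
    """Return text with umlauts expanded and other accents removed."""
    # Compose first so that "u" + combining diaeresis is seen as "ü".
    text = unicodedata.normalize("NFC", text)
    out = []
    for ch in text:
        if ch in EXPANSIONS:
            out.append(EXPANSIONS[ch])
        else:
            # Decompose the character and drop the combining accent marks.
            decomposed = unicodedata.normalize("NFD", ch)
            out.append("".join(c for c in decomposed
                               if not unicodedata.combining(c)))
    return "".join(out)
```

Under these assumptions an inventor name shown as "Müller" in an Original Language field such as INOR would correspond to "Mueller" in the general field IN:

```python
print(to_general_field("Müller"))
print(to_general_field("Général"))
```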
Please note that the assignment to the different STN language fields is based on the designation of the respective language within the input data as provided by the database producer. An incorrect language designation in the input data may thus lead to an incorrect assignment to the respective language specific fields on STN. For a comprehensive search, using the general fields /BI, /TI, /AB, /CLM etc. is strongly recommended.
Address information now searchable: In response to frequent customer requests, we now provide searchable inventor, patent assignee and legal representative addresses in the fields /INA, /PAA and /AGA (with implied (s) proximity). In addition, the complete information pertaining to one person may be searched in the fields /IN.T (Inventor, Total), /PA.T (Patent Assignee, Total) and /AG.T (Agent, Total).
Main claim added to BRIEF formats: The PCTFULL pre-defined display formats BRIEF, IBRIEF, BRIEFG and IBRIEFG now contain the main claim (MCLM) when available. These PCTFULL display formats are thereby in sync with the formats in other full-text databases such as EPFULL, FRFULL and GBFULL. Please see the adjusted costs with => HELP COST. Pricing of other individual fields or pre-defined display formats is not affected.