|
TERUSB III: A system description and some observations concerning the functions of terminological data banks (TDB'S)
Bolores Raventós de Castro
Ivan Centeno Román
Universidad Simón Bolívar
Venezuela
Resumo
The TERUSB III is a fourth generation Terminological Information Sytem (TIS), developed specificaily for a microcomputer environment and Third World context. It is an upgrade from our TERUSB II system. It has two access levels: one complete, whereby the Terminologist (or System's Manager) has full control of the system and can make any necessary changes; another, a limited access, whereby the user can access data but cannot change it. It is aimed at information professionals for the processing, analysis, consulíation and validation of multilinguai terminological data.
The basic philosophy of the system is its user-friendliness and requires a minimum of training. The user navigates through sounds, about important messages and can select the comands at high speed. Starting with the first session, the user determines the source and eventual target language(s). If the intended source language is not yet present in the system, it can be incorporated ínto it so that the screen comands can also appear in the said language after running an initial program installation program. The system will be described through some of its screens. Some observations will be made of the different types and functions of TDB's. Convergence and divergence of functions and applications. Sizes of setups and hardware-software requirements.
I. Background
A. The TERUSB emerges as a progression of the BTUSB (Terminological Data Bank of the Universidad Simón Bolívar), to satisfy the need to create an environment at a microcomputer level for information professionals [1], so that they may process, analyze and retrieve mulíilingual terminological data and validate the same [2].
II. Chronology of Systems and Experiences
A. We could mention seven phases-software experiences which disembarked in the development of the TERUSBIH:
1. TERUSBI (1976): program written in COBOL for a DEC10 environment; trilingual but in upper case; diacritics had to be incorporated by hand.
2. TERUSB 1.1 (1978-81): bilingual, semi-automatic; fírst ventures into a new more formal record structure following the recomendations of DUBUC [3] and Rondeau [4].
3. TERUSB 1.2 (1984): adaptation of the TERUSB I to a new subject field; stül confronting the limitation of upper cases and diacritics.
4. SITUSBI (1986): fírst venture into a microcomputer environment where we tested trilingual possibilities, using dBaseIII without programming but with the Applications Generator.
5. SITUSBII (1987-88): multilinguai processing in dBaseIII+, with programming, as a basis for the TERUSBII.
6. TERUSBII (1988): multilinguai in dBaseIII + and Foxbase +, with programming, over 90 screens - programs, to navigate through all the activities [5].
7. TERUSBIE (1989-90): an upgrade of the TERUSBII, multilinguai in dBaselV, which is considered to be essentially our final objective-system.
III. Terminological Information Systems (TIS's): four leves of implementation
A. Definition We defined a TIS as a "system which consiste of a series of elements or subsystems, whose functions are destined to converge or disseminate terminological information processes related to general or specific aspects of a given subject field". [5]
These systems can be developed adequately at four implementation levels, depending ou the number of users, subject fields covered and the processing equipment available.
1. First level: l user, l subject field, l workstation. Minimum hardware (workstation) configuration: l MB RAM, 1.2 MB disk drive, 40 MB Hard Disk. (Note: up to TERUSBII, we were contemplating the minimum hardware configuration at an IBM-XT level: 640 K, 20 MB Hard Disk level, but given the characteristics of the new DBMS - software packages, we had to increase-upgrade this configuration to an AT level, whether it be a compatible, MCA or EISA configuration). At this level, data can be processed adequately without programming, by using the "Application Generator" that comes with the software package, and the resulting data can be integrated manually with some word processors.
2. Second level: l or more users, severa! subject fields, one workstation. Minimum hardware configuration: the same as level one, but contemplating an average of 10-20 MgB per subject field. This level requires a certain amount of programming.
3. Third level: several users, severa! subject fields, several workstations. Minimum configuration: 4-6 MB RAM, 286 or 386 processor, multiuser or Local Área Network (LAN), and at present we are observing the possibility of interaction with compact disks beyond the read-only capacity.
4. Fourth level: the same as the third but at a mainframe level. Given the present trend of 386 and 486 super-micros, it becomes feasible to develop at a super-micro level what was required previously to do at a minicomputer or mainframe level.
B. Access and Control aspects The access and control by the user decrease proportionately as the systems grow. The access protocols are imposed by the managers of the systems (starting with the second level), and in the cases of minicomputers, these have to submit to and comply with higher controls.
C. Third World context For Third World applications, it is believed that the third level would be the most convenient, as it would offer the best of both extremes.
IV. The TERUSBIII as a TIS of the BTUSB (Conceptual Description).
A. Institutional Context Within our institutional context, the TERUSBn turns out to be the TIS that activates the function of the BTUSB.
The said system consists of eight subsystems through the incorporation of two additional systems to the TERUSBII [6].
1. Subsystem 1: SITER (Sistema de Información Terminológica = Terminological Information System) Constitutes the nucleus of the TERUSBIQ as it is the subsystem that processes the terms, while concentrating and codifying the information that is amplified in the other supporting subsystems.
2. Subsystem 2: SIAT (Sistema de Información de Áreas Temáticas = Information System for Subject Fields). This system acts as a processing module for the subject field information which shall serve as a basis for the final classification of the subject fields to be developed within the same. It reflects four nesting levels. The reason for our selecting such an option is that when we analyzed several classification systems for adaptation and incorporation into the TERUSBIII, such as the Lenoch, Tennium, UNESCO, etc., none of them were able to fully satisfy certain of our needs. We thus opted for setting on the way this open four level system. This approach is justified as a converging mechanism for the different prevailing classification systems, paving the way toward a more complete clas-sification, serving as a catalyst for interdisciplinary aspects in the search for and mating of decisions related to classifications. In this stage we shall be able to contemplate the dynamics of certain new sciences, such as Bioelectronics or Geophysics, and set down raeir terminological evolution.
3. Subsystem 3: SIDOC (Sistema de Información Documental = Documentary Information System). This system covers three types of reference sources: documente, bibliographies and standards. In a context-infrastnicture of advanced automation, it could be merged with a Micro-Marc or Micro-Isis, but bearing in mind that in some Third World situations one would not necessarily have access to any such options, tbis subsystem could be used to solve the most essential and relevant process.
4. Subsystem 4: SIUGE (Sistema de loformación de Uso Geográfico = Information System for Geographic Use). This system would contain and process information pertaining to linguistic variations within each language family.
5. Subsystem 5: SIET (Sistema de Información de Especialistas Temáticos-Terminológicos = Information System for Subject Specialists and Terminologists). This system stores and processes information to give access to the specialists when needed for consultation and publication purposes.
6. Subsystem 6: SIBEM (Sistema de Información de Bancos Emisores = Information System on Emitting Data Banks). This system accesses the code and amplifies the data of the EDB, such as the necessary information for data exchange.
7. Subsystem 7: SILEN (Sistema de Información de Lenguas = Information System on Languages). This system stores and processes the information of the different languages present in the system and facilitates the incorporation of new languages so that the screen commands could appear in the said languages when needed.
8. Subsystem 8: SIEST (Sistema de Información Estadística = Statistical Information System). Tbis system generates control statistics about the system, such as: the amount of terms, frequency of consultation and use.
V. TERUSBIII, version 1.0: Description through some screens.
A. Generalities The basic philosophy of the TERUSB is its userfriendliness and ease of use so as to reduce to a irrinimum any necessary training. The user navigates through the system with the cursor, being warned through sounds of important steps or messages. He is always inf ormed on the screen about the process, action or menu where ne is located. As of the first session, he determines his source language and can handle several target languages. Should his language not be present in the system, it can be incorporated into it so that the screen commands can appear in the said chosen language.
(NOTE: as the system has over 90 screens, depending on the combinations of options requested, we shall only dwell on a couple to give an idea of the gist of the system)
The Main Menu comprises three types of major actions:
1. Options: these are the main usage alternatives, such as the choice of source and target languages, the selection of subject field, the transfer of data to other programs and the determination of usage level (system manager or general user). Should he choose the user level, the person will have the capacity to consult the data, but will not be able to modify them. As the system manager, the terminologist, after password clearance, has full access to any part of the system and complete control of the same.
2. Maintenance: In tbis section, we have the backup, restore and file reconstruction options.
3. Exit: This Section contains the corresponding option to leave the TERUSB System either temporarily into the OS or definitely.
VI. Some aspects contemplated for our next version.
Until now, given our hardware limitations, we have always catered toward the individual microcomputer. That is not to exdude the possibility of amplifying the system for a LAN application.
Another aspect that will be considered in the next couple of years is the interaction with Compact Disks, beyond the ROM capacity.
A third aspect would be the graphic presentation of tenns in tree structures.
VII. Applications of this System.
A. Even though at this presentation we are contemplating the system from a ter-minological point of view, we should not forget that given our institutional characteristics our system and applications transcend towards other activities involving different kinds of information professionals. That is why this system is intended:
1. to facilitate the work of information professionals;
2. to simplify the classification processes of subject fields;
3. to reinforce the devellopment of documentary and controlled languages;
4. to eventually facilitate the development of multilingual spell-checkers for word processors;
5. to facilitate the determination of key descriptors in the search of on-line data bases;
6. to consolidate the basis of an institutional thesaurus; and
7. to create an information service environment.
VIII. A wish list for "an ideal system" and how do we rate.
Vasconcellos and Melby [6] established that an ideal system should: assure compatibility; offer a flexible record layout; easily handle diacritics; facilitate on-line search; offer Utilities for file handling; incorporate options for data exchange through predeter-mined formats; have a capacity to function as a network; have a sufficiently rapid answer mode; be able to interact with word processors; offer simple menus for beginners in the field; have the backing of terminologists; and finally should be of a reasonable price.
The BTUSB and TERUSB are at present at a 65% level of compliance with the above and we are working toward 100%.
IX. Institutional Projections.
A. A system such as this one could have national and interaational projections. We shall mention three national and two international projections.
1. Universidad Simón Bolívar (USB) The BTUSB and the TERUSB are going to actively back the activities of the new academic library in its automation and incorporation of new information transfer techniques. It is also serving as a Laboratory for the Graduate Program of Information Studies.
2. Universidad de Los Andes (ULA) A TDB and system such as ours could serve as a basis of agreement with other national institutions such as the ULA, where a TDB is being developed. We could share terminologies common to both institutions and use the same processing system.
3. USB + CONICIT + COVENIN On a national level, a TDB and system such as this one could serve as an element of convergence for the automation of terminologies of interest to universities, to the Ministry for Science and Technology (CONICIT) and the National Standards Institute (COVENIN).
4. Canada and Sweden Given the fact that the essence of our record is based on the Termium structure and that we have some twenty subject fields in common with the Canadian Government Tenninology Bank, it should not be difficult in some near-mid-term basis to interact with their CD format. We would have to wait for the CD juke-box contemplated for our library and study the possibility of completing their data in Spanish in our common fields. In the same manner, the vocabularies we processed-completed for the TNC-Sweden glossaries, which for hardware limitations remained on cards, could be incorporated to the CD of the TNC. At present, both are at a CD-ROM basis. We would work toward a possibility of wriing in those CDs.
X. Global perception of other systems.
One of the aspects of our reaching a certain maturity was when we decided to stop keeping up with any of the big systems such as those of the Canadian Government, Quebec, Eurodicautom, etc. We reached a point where it was either submitting and sacrifiting our needs and record structure to duplicate theirs, or catering our needs and establishing a record structure to satisfy our requirements. We opted for the latter. Nevertheless, we must state that given the present trends of conversions between different systems and programs, we envisage that the data exchange should not be as uphill as it was in the past. Ever since the minimum terminological data elements were agreed upon in 1981, we intend to at least exchange at that level, all other data being considered strictly for our local application and perhaps of little use to other setups. Data exchange for the time being and near future will have to be limited to bilateral agreements.
XI. Conclusions and future trends for the TERUSB.
A. The main conclusion that we can derive is that we have come a long way in the past ten years and have reached quite valid solutions for our environment and needs. We have made the most of our small size without hindering or closing our comniunication options with bigger systems.
B. Short-, mid- and long-term perspectives. On a short-term level, we shall be giving the final touches to the system and manuais and registering it for eventual marketing.
On a mid-term level, we shall be setting it up on a more powerful micro, with the incorporation of additional resident subject fields.
On a long-term level, we should be able to interact with and write on Compact Disks, which with our magnitude, would overcome all problems of size and limitations.

[1] Raventós de Castro (1988)

[2] Raventôs de Castro (1989)

[3] Dubuc (1978)

[4] Rondeau (1981)

[5] Op. cit. see 1.

[6] Vasconcellos (1988)

Bibliographies
DUBUC, Robert (1978): Manuel Pratique de Terminologie, Montréal, Linguatech.
RAVENTÓS DE CASTRO, Dolores (1988): "Sistemas de Información Terminológica (SITs): Cuatro Niveles de Solución y Aplicación", Caracas, I Simposio Latinoamericano de Terminología.
RAVENTÓS DE CASTRO, Dolores (1989): Desarrollo e implantación de una Opción de Bancos de Datos Terminológicos Multilingüe. A Nível de Microcomputadora para Aplicaciones en el Tercer Mundo, Caracas, Universidad Simón Bolívar.
RONDEAU, Guy (1981): Introduction à la Terminologie, Québec, Centre Educatif et Culturel, Inc.
VASCONCELLOS, Muriel, Editor, (1988): Technology as Translation Strategy, New York, American Translators Association - Scholarly Monograph Series, vol II, State University of New York.
|