|
|||||
| MAKING PLANS FOR NIGEL: Defining interfaces between computational representations of linguistic structure and output systems: Adding intonation, punctuation and typography systems to the PENMAN system. | |||||
6.0 OutlineThe structure of the conclusion is as follows. Firstly, the main points of the discussion so far are summarised (6.1). There are two main streams to this summary: those that concern the state of development of the PENMAN system; and those that concern the theoretical position taken in the thesis. Following this the discussion is split into two parts: section 6.2 is about the NIGEL system; presenting an agenda for further development. The last part (6.3) concerns the theoretical aspects of the work. 6.1 SummaryThe introduction is mainly concerned with identifying the “problems” with NIGEL that become apparent when intonation and a general graphology are considered. These problems lie mainly in the system architecture. (i) NIGEL’s output system - the punctuation system - is not sufficiently modular with respect to the resource - the grammar. (ii) Nor is the architecture of NIGEL sufficiently general to expand to implement systems for intonation.
Chapter two has two main parts. The first part develops an interface code for intonation specification based on Halliday’s (1967, 1969, 1985.a,b) phonological system - and summarises the theoretical background of the speech synthesiser. The second part is concerned with how the systems for generating this code might be implemented in NIGEL: new parts of the lexico-grammar (Key and information systems), the semantic level (new choosers and inquiries), and the environment (discourse and interactant models) - all need to be developed. Similarly, the choice of the interface code must be compatible with the speech synthesis module - the speech synthesiser was developed with Halliday’s system as its input. Thus the specification of the interface code is the first step in a developmental path that extends both upwards and downwards. Looking upwards, it is possible to install some version of the Key network in NIGEL quite simply, by adopting Halliday’s network which includes Key and Mood (cf Halliday 1967). However, it is less easy to generate the Information Unit independently of the clause because the lexico-grammatical rank-scale is ‘hard-wired’ into the system; it was never designed to have two such rank-scales. Higher up the stratal model further development of discourse and interactant models will be essential to the information structuring and Key systems respectively. Looking downwards: the speech synthesiser is already developing along the right lines. It is pointed out in chapter two that the phonological system - the system of Tones - is essentially redundant. If the Key system is distributed through the Mood network then the realisation of Key in Tone is simply automatically passed on to the output program. This redundancy is very helpful; it is not necessary actually to add such an extra stratum to NIGEL. However there is an alternative model in which the phonology would have take over the role of Key and and IU systems. When the text-to-speech method of speech synthesis is replaced with a more ‘natural’ phonological model then such an autonomous phonology may be introduced. Chapter three takes a view different from that of chapter two. There was no model of graphology to simply adopt: so a graphological system is sketched. Instead of focussing on lexico-grammatical systems and higher level systems the focus is on the graphological system - analogous to the system of Tone, rather than Key. As well, the three ways that strata interact to make meaning are discussed.
Chapters two and three are complementary. Chapter two is rather tied to the NIGEL system and extensions that will be made to it; while chapter three is more ambitious in its scope - aiming to ‘push back’ the boundary between paralinguistic and linguistic systems. Intonation is not inherently discrete; implying that a grammatical treatment of intonation represents a compromise (cf Halliday 1967 section 1) between the discreteness demanded by formal linguistic models - like system networks - and actual intonation. In contrast to intonation, computational typographic systems are discrete. Thus, given a discourse model with the relevant systems, NIGEL could be made to output graphological paragraphs in the interface code specified here. Trying, however, to work out what the phonological version of the graphological paragraph is much harder. The prosodic paragraph discussed by Silverman (1987 ch.6) is not well defined or obviously delineated - intonation is much more complex than the typographic use of space. The complementarity of chapters two and three lies in this difference in discreteness. While the phonological model is better developed for short-range supra-segmental phenomena - Tone contours - the graphological model makes description of larger units possible. Chapters four and five simply describe the output systems. They both represent exercises in programming and engineering - converting the interface codes into sound or graphics. By virtue of the three principles for a clean one-way interface the output programs can be developed independently of the PENMAN system. Chapter four, in discussing the GEKKO program, also defines the graphological interface code. Since it is designed to be as general as possible the numbered bracketting system is used. Typically, the bracketting indices are not needed to describe typographic texts. However, it is not known to what extent they would be necessary in describing spoken texts. The indexed brackets make it unnecessary to have a multi-tiered model of graphology or phonology - as many structural devices as necessary can be described. This generality is desirable - not undesirable - because it means that computational designs using this set of formalisms will not require rebuilding in order to implement new things. In any case, having a multi-tiered model requires some prediction of how many tiers there are; such work for graphology has not yet been done. The tactics of English graphology (including punctuation and macro-punctuation) have not been described sufficiently formally to decide just how many tiers would be needed (cf Waller 1980). Chapter five gives an overview of the way that the speech synthesiser works, in producing melodic speech. The question of long text pieces has not yet been attacked by King and Vonwiller, work has so far concentrated on the same domain as the grammatical system of Key described in chapter two: single tonics (and pre- and post-tonics.). 6.2 Agenda for further developmentThe following section identifies the short-term direction that the work described here will take. The immediate agenda essentially concentrates on clause-based grammar: the first effort will be towards making the existing NIGEL grammar output melodic speech, using Vonwiller and King’s speech synthesis package; and towards ‘rehabilitating’ NIGEL’s punctuation system. i. Above the interface
ii. Below the interface
In the longer term, those aspects of phonology and graphology which are not clause-based will be pursued. This will include re-design of some parts of the system to cope with IUs that are not necessarily congruent with the clause structure generated by the lexico-grammar. As well, the meaningful specification of typography as macro-punctuation will be attempted. 6.3 Theoretical conclusionsThis thesis has used a very general descriptive paradigm, consistent with the systemic theory that informed the design of NIGEL. For example, it would have been a simple matter to start implementing systems for Key in NIGEL without worrying about the broader picture. However, by ‘stepping back’ and considering graphology as well as phonology, looking at the overall organisation of NIGEL, an agenda presents itself in which work on implementing Key systems is only one part. The discreteness of typography means that it is possible to see clearly how the generation process in general might be modelled - the section in chapter three about ‘ways of making meaning’ is an example. It is much more difficult to think about phonological spans of text greater than one cause, because it is not immediately obvious just what such units there are. For some linguistic endeavours ‘constraint’ in representation is valued over generality but the opposite applies in this work. One aspect of a very general descriptive paradigm is the way that it often generates more than one way of describing something . For many formal theorists this is anathema. However, for a practical project like the PENMAN system it is very valuable indeed to be able to make alternative analyses and to choose the one that works best - instead of being forced by theoretical pre-conceptions into an inappropriate model. Further, different analyses can be used for different purposes. An example of this is the graphological network presented in chapter three - it shows the tactical organisation of graphology. One alternative network would show meta-functional diversity. Another alternative would be a distributed system, like the ‘Key in Mood’ system (figure 2.6). In the chapter on graphology (3) the discussion was based on literature and advertising; it is unlikely that the NIGEL grammar will be called upon to generate such texts. However, there is much to be learnt by looking at language that pushes at the boundaries of sytematicity - like the Bukoswki story with its odd punctuation, and the “Jungle Groove” text with its staccato punctuation of four sentences for one clause-complex - and at texts which define their own systems - like this thesis itself, referring to computer programs in all-caps. By taking such a view the danger of building limitations into systemic theory and into NIGEL is reduced. Building limitations into a system is a way of creating problems which then need to be solved. An example of this is the inadequacies with NIGEL’s architecture cited in the introduction: the punctuation system. Of course, the designers knew the limitations of the system and it was never meant to be extended to produce intonation but given the opportunity and the resources it is better to build extensible systems in the first place.
|
|||||
|
|||||