Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
SYSTEM
Gunter Neumann
Deutsches Forschungszentrum fur Kunstliche Intelligenz
Stuhlsatzenhausweg 3
D-6600 Saarbrucken 11, Germany
neumann@dfki.uni-sb.de
To appear in: Proceedings of the TWLT 5, Twente, Netherlands, 1993.
abstract
with respect to new applications. The development of the cosma system will serve
In this paper we introduce the basic design as an example of this claim.
principles of the disco system, a Natural
Language analysis and generation system.
In particular we describe the disco de- 1 introduction
velopment shell, the basic tool for the
Today's natural language systems are
integration of natural language compo- large
software products. They consist of
nents in the disco system, and its appli- serveral
mutually connected components
cation in the cosma (COoperative Sched- of dierent
kinds, each developed by difule MAnagement Agent) system.
ferent researchers often placed on dierFollowing an object oriented architec- ent locations. The integration of these
tural model we introduce a two{step ap- components has therefore become a softproach, where in the rst phase the archi- ware engineering and mangement probtecture is developed independently of spe- lem. We will consider the project discic components to be used and of a par- co (DIalogue Systems for COoperating
ticular
ow of control. In the second phase agents) from this perspective.
the \frame system" is instantiated by the disco's primary goal is the processintegration of existing components as well ing of multiagent natural language diaas by dening the particular
ow of con- logue. Multiagent capabilities make it
trol between these components. Because an appropriate front end for autonomous
of the object{oriented paradigm it is easy cooperative agents, which will be exemto augment the frame system, which in- plied by the cosma system (COoperacreases the
exibility of the whole system tive Scheduling MAnagement system) described in section 4.
The research underlying this paper was supThe project disco as a whole is a fourported by a research grant, FKZ ITW 9002 0, year, eight-person research eort fundfrom the German Bundesministerium fur
Forschung und Technologie to the DFKI project ed by the German Ministry for Research
disco.
and Technology. disco is completing its
1
fourth year. The rst task of the development of the disco system was to provide
a uniform core formalism based on unication of feature structures. The second
was the construction of a modular architecture orthogonal to the representation
formalism, as a platform for experimentation. Third, research and construction
of dialogue components is ongoing, as is
(fourth) investigation of the interface between dialogue components and multiagent systems.
We will emphasize in this paper the second phase, i.e. the description of the underlying architecture. The solutions to the
other tasks will be reported brie
y in the
next section.
SAR
SAP.
DISCO
Grammar
(+ Lexicon)
Parser
Generator
TDL
Udine
X2morf
Scanning
Utterance
3
333
6
6
6
6
6
6
77
6
6
6
4N +57
6
6
77
HEAD
MAJ
4
5
6
6
6
6
77
V
6
6
6
6
77
6
SYN
6
LOCAL
6
6
77
6
6
6
6
77
6
6
SUBCAT
6
4
57
6
4
5
6
6
6
LEXICAL +
6
6
6
6
2
3
6
6CAT6
6
PRED
time
6
6
7
6
6SEM 6
6HOUR 1 7
6
6
4
5
6
6
6
6
MIN
2
6
6
6
32
32
2
6
6
*
6
6
STEM
uhr
STEM
cardinal
6
7 6
7 6STEM cardinal
6
6MORPH 6
6
;
5
4
5;4
4
4
4
HEAD NUM sg
HEAD VALUE 2
HEAD VALUE 1
2
hi
(1)
np
33
77
77
77
77
77
77
77
77
77
77
77
77
77
77
77
77
77
77
77
7
7
3 7
+7
77
7
7 7
7
5; 7
57
5
Figure 2: An example of a multi-word lexeme (taken from [Netter, 93]) that covers
time expressions such as \14 Uhr 30" (2.30 p.m.).
predicate-argument-structure
thematic roles of arguments
the modication relation
sortal and selectional restrictions on
predicate-argument structures and
modication. These have proved
to be very ecient means for
disambiguation especially of PPattachment ambiguities (see [Kasper,
1993]).
a great deal of power, the disco system also provides a logical form module
which facilitates translation into various
types of back-end systems. NLL is a
representation of standard predicate logic,
with lambda abstraction and several features that support representation of natural language constructions [Laubsch and
Nerbonne, 1991]. These include
Plural terms
(exist ?x shipper(inst/i:?x) =
(arg1:+ Jim,John arg2:?x))
Location terms
located(th:John
loc:reg-X SB,on-fn(Saar) )
Restricted parameters
Generalized quantiers
Complex determiners
lowing we describe brie
y the main processing components of the linguistic kernel
(cf. g. 1).
Scanning The scanner is mainly re- strategies, as well as giving control over
Generation The surface generator curMorphology The morphological com- rently in use is a variant of the
Parsing The parser is a bidirectional bottom-up chart parser which operates on a context-free backbone implicitly contained in the grammar. The parser provides parameterized general parsing
or makes it possible to organize programs language components. All current protopoorly; it is just a tool that helps us to cols are dened using the same structure
as shown in g. 4.
achieve such modular systems.
The controller uses the generic function
call-component
to activate an individDISCO's Class Hierachy The disco development shell consists of the ual language component instance, specialclass hierarchy and the specication of ized to the appropriate subclass. Conclass specic methods. Every type of com- trol
ow is determined by the sequence of
ponent and its specializations are dened call-component invocations. Between
as CLOS classes. Figure 3 shows a portion calls, output is veried and converted to
the following component's input format by
of the current hierarchy.
module is the most general class. All calling the generic function check-andother classes inherit its data structure and transform. This mechanism is very imassociated methods. The class language portant to support robustness especially
component subsumes all modules of the during the development phase of the syscurrent system which are responsible for tem. Specic methods are dened for each
language processing. A module that is ac- module that indicate what to do if a modtually used in the system is an instance of ule does not come up with a correct result. These methods are activated by the
one of the classes.
New modules are added to the sys- controller during the call of check-andtem by associating a class with them. transform. In the current version of
CLOS supports dynamic extension of the the system further processing is then inclass hierarchy so that new types can be terrupted and the user is informed about
added even at run{time. For example, the problem that occured.
if we wanted to add a new parser mod- For example, the output of morule we would either use the already exist- phology denes the input to parser
ing class parser or dene a new class, and so on. By calling (check-andsay alternative-parser. In the rst transform controller morpholocase we assume that we only need the gy parser) the controller checks whether
methods that have already been dened the morphology yielded a valid output and
for the parser class. In the second eventually transforms the output for the
case we would have to add new methods parser. If morphology detects an unor could specialize some of the methods known word X further processing is inthat alternative-parser inherits from terrupted and the user receives a message
parser. In principle it could also happen notifying him that X is unknown to the
that the new parser shares many proper- morphological component.
ties with disco-parser. In that case we
would have to rene the parser subnet in Some Remarks If two adjacent comorder to avoid redundancy.
ponents have been proven to work together without problems check-andProtocols The
ow of control between transform need not be called for them
a set of components is mediated by means as it is the case for the components
of protocols. Protocols are methods de- component2 and component3 in the
ned for the class controller. They specify example shown in g. 5.
the set of language components to be used Input and output for the whole system
and the input/output relation between the is specifed using general communication
Module
Tool
Trace Handler
Language Component
Lisp Scanner
Morphology Lexicon
Parser
Controller
...
Command-Shell
NLL
Yacc Scanner
X2morf
Morphix
Disco Parser
debugging it is possible to run serveral subsystems (called standalone applications), which are activated via the command level. For example, one might want
to run the parser and generator without
morphology or only the set of components
necessary during analysis aso. In each
case the same functionality is available as
well as the same set of tools. In principle
it would be possible for a user to dene
protocols himself e.g., to test self{written
modules because the integration of modules and the denition of protocols takes
place in a standardized fashion.
Output
Input
Component-1
Input
Component-2
Output
Input/Output
Input/Output from/to general device (e.g., email)
ControllerUnit
Input
Input
Component-4
Output
Component-3
Output
e{mail software has the advantage that Short Example Figure 6 gives an
scheduling of appointments can be done overview of a conguration where three
participants, a human (Tick) and two
in a distributed and asychronous way.
cosmas (Trick, Track) are involved.
Natural language (NL) comes into play A possible appointment scheduling is as
because we allow humans to participate follows (abstracting away from details):
who have no calendar assistent available. Track to Trick and Tick:
The only restriction is that they have elec- arrange(meeting, 21.10.1992,1p.m.)
tronic mail available. Such a (poor) per- Trick to Track:
son is responsible for mantaining an old{ accept(meeting)
fashioned calendar but is allowed to use Tick to Track:
natural language during appoinment nego- Ich bin mit dem Termin einverstanden. (I
tiation. Consequently, each cosma sys- accept the appointment).
tem needs to be able to process natural Track to Trick and Tick:
language, either to understand a NL dia- conrm(meeting)
logue contribution or to produce one.
In words: Track wanted to arrange a
meeting and sends this request to Trick
Each user of a cosma system has ac- and Tick (in form of an internal planning
cess to the calendar data{base by means expression). Trick automatically sends an
of a graphical user interface. The graphi- acception. Because there are no con
ictcal user interface | developed by Stephen ing entries in his calendar data{base, Tick
Spackman who named the tool dui | sends an acception using NL. Track will
is used to display and update existing update its calendar while sending a conitems and enter new items into the data{ rmation to the two participants notifybase. The intelligent calendar manager ing them that all participants accepted the
maintains the calendar database. The appointment.
current version (developed by the AKA{ The current version of the system is able
MOD group of the DFKI) consists of time to handle more complex dialogs, e.g., approcessing functions, a nite-state proto- pointment scheduling initiated by a noncol for arranging appointments, and an cosma user, cancellation and modicaaction memory storing the protocol state tion of already set up appointments.
and original e{mail for each arrangement
in progress. The principle task of the disco system is to extract that information 5 conclusion
from an natural language expression that
can be used by the calendar manager. dis- In this paper we have given an overview
co is also responsible for the production of the disco system and a detailed deof natural language text from the inter- scription of the disco development
nal representation of scheduling informa- shell. The disco development shell
tion computed by the calendar manager. has been proven very useful in setting up
The produced text is sent in addition to the cosma system. It was possible to
the internal structure of scheduling ex- integrate the new modules independentpressions to the participants via e{mail. ly from the rest of the system. Existing
[Busemann, 1993] describes the current modules have been exchanged by new ones
approach for generating natural language without the need of adding new methods. Because dierent researchers were
expressions in cosma in more detail.
TICK
TRACK
Planner
e-mail
DISCO
Emailer
DUI
TRICK
Human
(traditional)
e-mail
Planner
Human
(loves AI)
DISCO
DUI
References
[Busemann and Harbusch, 1993]
Stephan Busemann and Karin Harbusch, editors. Proc. DFKI Workshop
on Natural Language Systems: Modularity and Re-usability, Saarbrucken,
germany, 1993.
[Busemann, 1993] S. Busemann. Towards
the conguration of generation systems:
Some initial ideas. In Busemann and
Harbusch [1993].
[Hinkelman and Spackman, 1992]
Elizabeth A. Hinkelman and Stephen P.
Spackman. Abductive speech act recognition, corporate agents and the cosma system. In W. J. Black, G. Sabah,
and T. J. Wachtel, editors, Abduction,
Beliefs and Context: Proceedings of
the second ESPRIT PLUS workshop in
computational pragmatics, 1992.
[Inc., 1990] Reasoning Systems Inc. Rene user's guide. Technical report, Palo
Alto, CA, 1990.
[Kasper, 1993] Walter Kasper. Integration of syntax and semantics in feature