Finitestate Methods And Natural Language Processing 5th International Workshop Fsmnlp 2005 Helsinki Finland September 12 2005 Revised Papers 1st Edition Tero Harju Auth

Finitestate Methods And Natural Language
Processing 5th International Workshop Fsmnlp
2005 Helsinki Finland September 12 2005 Revised
Papers 1st Edition Tero Harju Auth download
https://ebookbell.com/product/finitestate-methods-and-natural-
language-processing-5th-international-workshop-
fsmnlp-2005-helsinki-finland-september-12-2005-revised-
papers-1st-edition-tero-harju-auth-2039526
Explore and download more ebooks at ebookbell.com

Here are some recommended products that we believe you will be
interested in. You can click the link to download.
Finitestate Methods And Natural Language Processing Postproceedings Of
The 7th International Workshop Fsmnlp 2008 J Piskorski
https://ebookbell.com/product/finitestate-methods-and-natural-
language-processing-postproceedings-of-the-7th-international-workshop-
fsmnlp-2008-j-piskorski-2531074
Finitestate Methods And Natural Language Processing 6th International
Workshop Fsmnlp 2007 Revised Papers Thomas Hanneforth
https://ebookbell.com/product/finitestate-methods-and-natural-
language-processing-6th-international-workshop-fsmnlp-2007-revised-
papers-thomas-hanneforth-2545780
Finitestate Methods And Natural Language Processing 8th International
Workshop Fsmnlp 2009 Pretoria South Africa July 2124 2009 Revised
Selected Papers 1st Edition Colin De La Higuera Auth
https://ebookbell.com/product/finitestate-methods-and-natural-
language-processing-8th-international-workshop-fsmnlp-2009-pretoria-
south-africa-july-2124-2009-revised-selected-papers-1st-edition-colin-
de-la-higuera-auth-4141722
Finite Difference Methods For Ordinary And Partial Differential
Equations Steadystate And Timedependent Problems Randall Jleveque
https://ebookbell.com/product/finite-difference-methods-for-ordinary-
and-partial-differential-equations-steadystate-and-timedependent-
problems-randall-jleveque-34998058

Finitestate Computational Morphology An Analyzer And Generator For
Georgian Irina Lobzhanidze
https://ebookbell.com/product/finitestate-computational-morphology-an-
analyzer-and-generator-for-georgian-irina-lobzhanidze-46668320
Finite State Machines In Hardware Unknown
https://ebookbell.com/product/finite-state-machines-in-hardware-
unknown-56400976
Finitestate Text Processing Kyle Gorman Richard Sproat
https://ebookbell.com/product/finitestate-text-processing-kyle-gorman-
richard-sproat-57414662
Finite State Machine Logic Synthesis For Complex Programmable Logic
Devices 1st Edition Robert Czerwinski
https://ebookbell.com/product/finite-state-machine-logic-synthesis-
for-complex-programmable-logic-devices-1st-edition-robert-
czerwinski-4230232
Finitestate Techniques Automata Transducers And Bimachines Stoyan
Mihov
https://ebookbell.com/product/finitestate-techniques-automata-
transducers-and-bimachines-stoyan-mihov-10666670

Lecture Notes inArtificial Intelligence 4002
Edited by J. G. Carbonell and J. Siekmann
Subseries of Lecture Notes in Computer Science

AnssiYli-Jyrä Lauri Karttunen
Juhani Karhumäki (Eds.
Finite-StateMethods
andNaturalLanguage
Processing
5th International Workshop, FSMNLP 2005
Helsinki, Finland, September 1-2, 2005
Revised Papers
13

Series Editors
Jaime G. Carbonell, Carnegie Mellon University, Pittsburgh, PA, USA
Jörg Siekmann, University of Saarland, Saarbrücken, Germany
Volume Editors
Anssi Yli-Jyrä
Scientiﬁc Computing Ltd.
P.O. Box 405, 02101 Espoo, Finland
E-mail: ylijyra@csc.ﬁ
Lauri Karttunen
Palo Alto Research Center
3333 Coyote Hill Rd, Palo Alto, CA 94304, USA
E-mail: [email protected]
Juhani Karhumäki
University of Turku
Department of Mathematics
20014 Turku, Finland
E-mail: karhumak@utu.ﬁ
Library of Congress Control Number: 2006937535
CR Subject Classiﬁcation (1998
LNCS Sublibrary: SL 7 – Artiﬁcial Intelligence
ISSN 0302-9743
ISBN-10 3-540-35467-0 Springer Berlin Heidelberg New York
ISBN-13 978-3-540-35467-3 Springer Berlin Heidelberg New York
This work is subject to copyright. All rights are reserved, whether the whole or part of the material is
concerned, speciﬁcally the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting,
reproduction on microﬁlms or in any other way, and storage in data banks. Duplication of this publication
or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965,
in its current version, and permission for use must always be obtained from Springer. Violations are liable
to prosecution under the German Copyright Law.
Springer is a part of Springer Science+Business Media
springer.com
© Springer-Verlag Berlin Heidelberg 2006
Printed in Germany
Typesetting: Camera-ready by author, data conversion by Scientiﬁc Publishing Services, Chennai, India
Printed on acid-free paper SPIN: 11780885 06/3142 543210

Preface
These proceedings contain the revised versions of the papers presented at the 5th
International Workshop of Finite-State Methods and Natural Language Process-
ing, FSMNLP 2005. The book includes also the extended abstracts of a number
of poster papers and software demos accepted to this conference-like workshop.
FSMNLP 2005 was held in Helsinki, Finland, on September 1–2, 2005. The
event was the ﬁfth instance in the series of FSMNLP workshops, and the ﬁrst that
was arranged as a stand-alone event, with two satellite events of its own: the Two-
Level Morphology Day (TWOLDAY) and a national workshop on Automata,
Words and Logic (AWL). The earlier FSMNLP workshops have been mainly
arranged in conjunction with a bigger event such as an ECAI, ESSLLI or EACL
workshop, and this practice may still be favored in the future.
The collocation of the three events promoted a multidisciplinary atmosphere.
For this reason, the focus of FSMNLP 2005 covered a variety of topics related
but not restricted to ﬁnite-state methods in natural language processing.
The 24 regular papers and 7 poster papers were selected from 50 submissions
to the workshop. Each submitted regular paper was evaluated by at least three
Program Committee members, with the help of external referees. In addition
to the submitted papers and two invited lectures, six software demos were pre-
sented. The authors of the papers and extended abstracts come from Canada,
Denmark, Finland, France, Germany, India, Ireland, Israel, Japan, The Nether-
lands, Norway, Spain, South Africa, Sweden, Turkey, and the USA.
It is a pleasure to thank the members of the Program Committee and the
external referees for reviewing the papers and maintaining the high standard
of the FSMNLP workshops. Naturally, we owe many thanks to every single
conference participant for his or her contributions to the conference and for
making FSMNLP 2005 a successful scientiﬁc event.
FSMNLP 2005 was co-organized by the Department of General Linguistics
at the University of Helsinki (host) and CSC, the Finnish IT center for science
(co-ordination). We thank the members of the Steering Committees for their
kind support in the early stage of the project and Antti Arppe, Sari Hyv¨arinen
and Hanna Westerlund for helping with the local arrangements. Last but not
least, we thank the conference sponsors for their ﬁnancial support.
August 2005 A. Yli-Jyr¨ a
L. Karttunen
J. Karhum¨aki

Organization
FSMNLP 2005 was organized by the Department of General Linguistics, Uni-
versity of Helsinki in cooperation with CSC, the Finnish IT center for science.
Invited Speakers
Tero Harju University of Turku, Finland
Lauri Karttunen Palo Alto Research Center,
Stanford University, USA
Program Committee
Steven Bird University of Melbourne, Australia
Francisco Casacuberta Universitat Polit`ecnica de Val`encia, Spain
Jean-Marc Champarnaud Universit´ edeRouen,France
Jan Daciuk Gdansk University of Technology, Poland
Jason Eisner Johns Hopkins University, USA
Tero Harju University of Turku, Finland
Arvi Hurskainen IAAS, University of Helsinki, Finland
Juhani Karhum¨aki,Co-chairUniversity of Turku, Finland
Lauri Karttunen,Co-chair PARC and Stanford University, USA
Andr´e Kempe Xerox Research Centre Europe, France
George Anton Kiraz Beth Mardutho: The Syriac Institute, USA
Andr´as Kornai Budapest Institute of Technology, Hungary
D. Terence Langendoen University of Arizona, USA
Eric Laporte Universit´ e de Marne-la-Vall´ee, France
Mike Maxwell Linguistic Data Consortium, USA
Mark-Jan Nederhof University of Groningen, The Netherlands
Gertjan van Noord University of Groningen, The Netherlands
Kemal Oﬂazer Sabanci University, Turkey
Jean-Eric Pin CNRS/University Paris 7, France
James Rogers Earlham College, USA
Giorgio Satta University of Padua, Italy
Jacques Sakarovitch CNRS/ENST, France
Richard Sproat University of Illinois at Urbana-Champaign,
USA
Nathan Vaillette University of T¨ ubingen, Germany
Atro Voutilainen Connexor Oy, Finland
Bruce W. Watson University of Pretoria, South Africa
Shuly Wintner University of Haifa, Israel

VIII Organization
Sheng Yu University of Western Ontario, Canada
Lynette van Zijl Stellenbosch University, South Africa
Organizing Committee
Anssi Yli-Jyr¨a,Chair University of Helsinki and CSC, Finland
Hanna-Maria Westerlund University of Helsinki, Finland
Sari Hyv¨arinen University of Helsinki, Finland
Antti Arppe University of Helsinki, Finland
Steering Committee I (FSMNLP Traditions)
Lauri Karttunen PARC and Stanford University, USA
Kimmo Koskenniemi University of Helsinki, Finland
Gertjan van Noord University of Groningen, The Netherlands
Kemal Oﬂazer Sabanci University, Turkey
Steering Committee II (Local Advisory Group)
Lauri Carlson University of Helsinki, Finland
Tero Harju University of Turku, Finland
Lauri Hella University of Tampere, Finland
Arvi Hurskainen University of Helsinki, Finland
Fred Karlsson University of Helsinki, Finland
Krista Lagus Helsinki University of Technology,
Finland
Kerkko Luosto University of Helsinki, Finland
Matti Nyk¨anen University of Helsinki, Finland
Additional Referees
Rafael C. Carrasco Universitat d’Alacant, Spain
Loek Cleophas Technische Universiteit Eindhoven,
The Netherlands
Yvon Francois GET/ENST and LTCI, France
Ernest Ketcha Ngassam University of South Africa and
University of Pretoria, South Africa
Ines Klimann Universite Paris 7, France
Sylvain Lombardy Universite Paris 7, France
David Pic´o-Vila Universidad Polit´ ecnica de Valencia, Spain
Enrique Vidal Universidad Polit´ ecnica de Valencia, Spain
Juan Miguel Vilar Universitat Jaume I, Spain
M. In´es Torres Universidad Pa´ ıs Vasco, Spain
Anssi Yli-Jyr¨a University of Helsinki and CSC, Finland

Organization IX
Sponsoring Institutions
CSC - Scientiﬁc Computing Ltd., Finland
University of Helsinki, Finland
The KIT Network, Finland
Academy of Finland
Connexor Ltd., Finland
Lingsoft Ltd., Finland

Table of Contents
Invited Lectures
Characterizations of Regularity
Tero Harju....................................................1
Finnish Optimality-Theoretic Prosody
Lauri Karttunen...............................................9
Contributed Papers
Partitioning Multitape Transducers
Fran¸cois Barth´elemy...........................................11
Squeezing the Inﬁnite into the Finite: Handling the OT Candidate Set
with Finite State Technology
Tam´as B´ır´o...................................................21
A Novel Approach to Computer-Assisted Translation Based
on Finite-State Transducers
Jorge Civera, Juan M. Vilar, Elsa Cubel, Antonio L. Lagarda,
Sergio Barrachina, Francisco Casacuberta, Enrique Vidal...........32
Finite-State Registered Automata and Their Uses in Natural Languages
Yael Cohen-Sygal, Shuly Wintner................................43
TAGH: A Complete Morphology for German Based on Weighted Finite
State Automata
Alexander Geyken, Thomas Hanneforth...........................55
Klex: A Finite-State Transducer Lexicon of Korean
Na-Rae Han..................................................67
Longest-Match Pattern Matching with Weighted Finite State Automata
Thomas Hanneforth............................................78
Finite-State Syllabiﬁcation
Mans Hulden..................................................86
Algorithms for Minimum Risk Chunking
Martin Jansche................................................97

XII Table of Contents
Collapsing-Loops in Weighted Finite-State Machines
J. Howard Johnson............................................110
WFSM Auto-intersection and Join Algorithms
Andr´e Kempe, Jean-Marc Champarnaud, F. Guingne,
Florent Nicart.................................................120
Further Results on Syntactic Ambiguity of Internal Contextual
Grammars
Lakshmanan Kuppusamy........................................132
Error-Driven Learning with Bracketing Constraints
Takashi Miyata, Kˆoiti Hasida...................................144
Parsing with Lexicalized Probabilistic Recursive Transition Networks
Alexis Nasr, Owen Rambow.....................................156
Integrating a POS Tagger and a Chunker Implemented as Weighted
Finite State Machines
Alexis Nasr, Alexandra Volanschi................................167
Modelling the Semantics of Calendar Expressions as Extended Regular
Expressions
Jyrki Niemi, Lauri Carlson.....................................179
Using Finite State Technology in a Tool for Linguistic Exploration
Kemal Oﬂazer, Mehmet Din¸cer Erba¸s, M¨uge Erdoˇgmu¸s.............191
Applying a Finite Automata Acquisition Algorithm to Named Entity
Recognition
Muntsa Padr´o, Llu´ıs Padr´o.....................................203
Principles, Implementation Strategies, and Evaluation of a Corpus
Query System
Ulrik Petersen.................................................215
On Compact Storage Models for Gazetteers
Jakub Piskorski................................................227
German Compound Analysis withwfsc
Anne Schiller.................................................239
Scaling an Irish FST Morphology Engine for Use on Unrestricted Text
Elaine
´
U´ı Dhonnchadha, Josef Van Genabith......................247

Table of Contents XIII
Improving Inter-level Communicationin Cascaded Finite-State Partial
Parsers
Sebastian van Delden, Fernando Gomez..........................259
Pivotal Synchronization Languages: A Framework for Alignments
Anssi Yli-Jyr¨a, Jyrki Niemi.....................................271
Abstracts of Interactive Presentations
A Complete FS Model for Amharic Morphographemics
Saba Amsalu, Dafydd Gibbon....................................283
Tagging with Delayed Disambiguation
Jos´eM.Casta˜no, James Pustejovsky.............................285
A New Algorithm for Unsupervised Induction of Concatenative
Morphology
Harald Hammarstr¨om..........................................288
Morphological Parsing of Tone: An Experiment with Two-Level
Morphology on the Ha Language
Lotta Harjula..................................................290
Describing Verbs in Disjoining Writing Systems
Arvi Hurskainen, Louis Louwrens, George Poulos..................292
An FST Grammar for Verb Chain Transfer in a Spanish-Basque MT
System
I˜naki Alegria, Arantza D´ıaz de Ilarraza, Gorka Labaka,
Mikel Lersundi, Aingeru Mayor, Kepa Sarasola....................295
Finite State Transducers Based on k-TSS Grammars for Speech
Translation
Alic´ıcial P´erez, F. Casacuberta, In´es Torre, V. Guijarrubia.........297
Abstracts of Software Demos
Unsupervised Morphology Induction Using Morfessor
Mathias Creutz, Krista Lagus, Sami Virpioja......................300
SProUT – A General-Purpose NLP Framework Integrating Finite-State
and Uniﬁcation-Based Grammar Formalisms
Witold Dro˙zd˙zy´nski, Hans-Ulrich Krieger, Jakub Piskorski,
Ulrich Sch¨afer.................................................302

XIV Table of Contents
Tool Demonstration: Functional Morphology
Markus Forsberg, Aarne Ranta..................................304
From Xerox to Aspell: A First Prototype of a North S´ami Speller Based
on TWOL Technology
Børre Gaup, Sjur Moshagen, Thomas Omma, Maaren Palismaa,
Tomi Pieski, Trond Trosterud...................................306
A Programming Language for Finite State Transducers
Helmut Schmid................................................308
FIRE Station
Bruce Watson.................................................310
Author Index...................................................311

Characterizations of Regularity
Tero Harju
Department of Mathematics, University of Turku, Finland
Abstract.Regular languages have many diﬀerent characterizations in
terms of automata, congruences, semigroupsetc. We have a look at
some more recent results, obtained mostly during the last two decades,
namely characterizations using morphic compositions, equality sets and
well orderings.
1 Introduction
We do not intend to give a full survey on regular languages but rather a short
overview of some of the topics that have surfaced during the last two decades.
Customarily regular languages are deﬁned either as languages accepted by
ﬁnite automata, represented by regular expressions, or generated by right linear
grammars. The mostcommon approach is by acceptance using deterministic
ﬁnite automata, or aDFAfor short. A DFA can be described as a ‘concrete
machine’ with a read-only input tape from which the head of the automaton
reads one square at a time from the left end to the right end. A DFAAcan be
conveniently presented as a 5-tuple
A=(Q, A, δ, q0,F),
whereQis the set of initial states,Ais the alphabet of the inputs, and the
transition functionδ:Q×A→Qdescribes theactionofAsuch thatδ(q, a)=p
means that while reading the symbolain stateq, the automaton changes to
statepand starts consuming the next input symbol. The stateq0is the initial
state ofA,andF⊆Qis the set of its ﬁnal states. The action of the automaton
Aisoftenwrittenintheformqa=pinstead ofδ(q, a)=p. The transition
functionδextends to words by settingδ(q, wa)=δ(δ(q, w),a). Thus for each
wordw,δ(q, w) is the state where the automaton enters when started in the
stateqand after exhaustingw.Ifw=ε, the empty word, thenδ(q, ε)=qfor
all statesq.
More pictorially a ﬁnite automaton can be described as a directedgraph,
where nodes represent the states of the automaton and each labelled edgeq
a
−→p
corresponds to the transitionδ(q, a)=p.Thenδ(q, w) is the state that is reached
fromqby traversing the edges labelled by the letters ofw.
A languageL⊆Aisregularif it is accepted by a DFA,L=L(A), where
L(A)={w∈A
∗
|δ(q0,w)∈F}.
A. Yli-Jyr¨a, L. Karttunen, and J. Karhum¨aki (Eds.): FSMNLP 2005, LNAI 4002, pp. 1–8, 2006.
cεSpringer-Verlag Berlin Heidelberg 2006

2T.Harju
The family of regular languages is a neat family in the sense that it is closed
under many natural operations of languages: ifLandKare regular languages,
then so are
–L∪K,L∩K,L\K, catenationL·K, Kleene closuresL
∗
andL
+
,shuﬄe
KsL, quotientsL
−1
KandLK
−1
, complementA
∗
\L, morphic (and the in-
verse morphic) imagesh(L)(andh
−1
(L)), as well as the reversalL
R
(mirror
image).
This list could be continued much further.
Instead of deterministic ﬁnite automata one can also employ other ﬁnite mod-
els of automata for regular languages. For instance, a languageLis regular if
it is accepted by anondeterministicFA where the transitions are given by a
relation instead of a function.
We can extend the transition function (or relation) in several ways, say by
attaching conditions to the transitions that change the design of the states. As
an example, each state can have a sign, + or−, and the transitions can depend
on the signs and change them.
Also, one can expand the way howﬁnite automata accept words. Analter-
natingﬁnite automaton is a nondeterministic FA where the states are divided
into existential and universal states, and acceptance depends on the global tree
of behaviour.
By allowing ﬁnite automata to read the input word both to the left and right
does not inﬂuence the family of accepted languages, i.e., a 2-way FA accepts
only regular languages.
Decision problems for regular languages are, as a rule, decidable. However,
many algorithmic problems are hard for them. For instance, one can prove that
the problem of ﬁnding a nondeterministic ﬁnite automaton with the smallest
number of states accepting a regular languageLis truly hard. The problem is
PSPACE-complete.
The syntactic characterizations of regular languages are originally due to
Myhill [1] and Nerode [2] as well as to Rabin and Scott [3] at the end of the 1950s.
These characterizations follow from analyzing the behaviour and structures of
ﬁnite automata.
For a languageL⊆A
∗
deﬁne the relation∼Lby
u∼Lv⇐⇒u
−1
L=v
−1
L,
whereu
−1
L={w|uw∈L}. This relation is an equivalence relation onA
∗
,and
thusA
∗
is divided into equivalence classes w.r.t.∼L.
Theorem 1.A languageLis regular if and only if∼Lis of ﬁnite index, i.e.,
there are only ﬁnitely many equivalence classes w.r.t.∼L.
The idea behind Theorem 1 is that the setu
−1
Lcorresponds to the stateδ(q0,u)
of the DFA acceptingL. As an example, consider the languageL={a
n
b
n
|n≥
0}which is well known to be nonregular. We notice that the setsu
−1
i
Lare all

Characterizations of Regularity 3
diﬀerent for the wordsui=a
i
,i≥1. Since there are inﬁnitely many setsu
−1
L,
we deduce that, indeed, the languageLis not regular.
Let
u
∼
=Lv:xuy∈L⇐⇒xvy∈L
be thesyntactic congruenceofL⊆A
∗
.
Theorem 2.A languageLis regular if and only if the syntactic congruence of
Lhas ﬁnite index.
Using syntactic congruences one can study the ﬁne structure of regular languages
more deeply. This approach leads toalgebraic theoryof regular languages. For
instance, Sch¨utzenberger [4] showed that a languageLis star-free if and only if
its syntactic monoid is aperiodic, i.e., contains only trivial subgroups. Here we
say thatLisstar-freeif it can be represented by a generalized regular expression
allowing complementationL
c
but disallowing stars∗. For instance,A
∗
=∅
c
,and
(ab)
∗
=1+a∅
c
∩∅
c
b∩(∅
c
aa∅
c
)
c
∩(∅
c
bb∅
c
)
c
.
We also state an algebraic characterization of regular languages that is related
to syntactic congruences.
Theorem 3.A languageLis regular if and only if it is recognized by a ﬁnite
monoidM, i.e., there is a ﬁnite monoidMsuch thatF⊆Mand
L=ϕ
−1
(F)
for a monoid morphismϕ:A
∗
→MontoM.
We can restate this theorem as follows:
Theorem 4.A languageLis regular if and only if there exists a ﬁnite monoid
Msuch that
L=ϕ
−1
ϕ(L)
for a monoid morphismϕ:A
∗
→M.
Regular languages can also be described by matrices. The following theorem is
due to Sch¨utzenberger.
Theorem 5.For each regular languageL, there are0,1-vectorsuandv,anda
matrixM(of ﬁnite sets) such that
L=u
T
M
∗
v.
Regular languages have had connections to logic since the studied made by
B¨uchi [6], Elgot [7], and McNaughton and Papert [5].
Theorem 6.A languageLis regular if and only ifLdeﬁnable in the monadic
second order logic (which allows comparisons of positions of letters in words and
quantiﬁers over sets of positions).

4T.Harju
2 Morphic Characterizations
The topic of morphic characterizations of regular languages was was initiated
by Culik, Fich, and Salomaa [8] in 1982, and it was continued by several people
during the following years.
Recall that a mappingh:A
∗
→B
∗
is amorphismif
h(uv)=h(u)h(v)
for all wordsu, v.Theinverse morphismis the many-valued mapping
h
−1
(v)={u|h(u)=v}.
In the theorems that follow the morphismshi,fori=1,2,..., are between
suitable alphabets. Culik, Fich, and Salomaa [8] proved that
Theorem 7.A languageLis regular if and only if there are morphismshisuch
that
L=h4h
−1
3
h2h
−1
1
(a
∗
b).
This result was improved by Latteux and Leguy[9] in 1983:
Theorem 8.A languageLis regular if and only if there are morphismshisuch
that
L=h3h
−1
2
h1(a
∗
b).
We shall sketch the idea behind the proof of this theorem.
In the other direction the claim follows from the fact that regular languages
are closed under taking morphic images and inverse morphic images, and the
starting languagea
∗
bin Theorem 8 is certainly regular.
Let thenLbe a regular language and letAbe a DFA acceptingL. Assume
that the states ofAare
Q={q0,q1,...,qm},
whereq0is the initial state. Let
Γ={[qi,x,qj]|δ(qi,x)=qj}
be an alphabet that encodes the transitions ofA,andleta, banddbe three
special symbols. Deﬁne our ﬁrst morphismh1:{a, b}→{a, b, d}
∗
by
h1(a)=ad
m
andh1(b)=bd
m
,
Henceh1(a
n
b)=(ad
m
)
n
·bd
m
for each powern.
Let thenh2:Γ
∗
→{a, b, d}
∗
be deﬁned by
h2([qi,x,qj]) =
ﬀ
d
i
ad
m−j
ifj=m,
d
i
bd
m
ifj=m.

Characterizations of Regularity 5
Hence
u∈h
−1
2
h1(a
n
b)⇐⇒ucodes the accepting computation ofAofa1a2...an.
Finally, leth3:Γ
∗
→A
∗
be deﬁned by
h3([q, x, p]) =x.
ThenL(A)=h3h
−1
2
h1(a
∗
b).
Even a simpler variant was shown to hold by Latteux and Leguy [9]:
Theorem 9.A languageLis regular if and only if there are morphismshisuch
that
L=h
−1
3
h2h
−1
1
(b).
The special case of regular star languages has especially appealing
characterization.
Theorem 10.For any languageL,thelanguageL
∗
is regular if and only if
there exists a (uniform) morphismhand a ﬁnite setFof words such that
L
∗
=h
−1
(F
∗
).
The morphic characterizations of regular languages extend partly to transduc-
tions, i.e., to many-valued mappingsτ:A
∗
→B
∗
computed by ﬁnite transducers.
The following is due to Turakainen [10], Karhum¨aki and Linna [11].
Theorem 11.LetRbe a given regular language. Then for all languagesL,
L∩R=h3h
−1
2
h1μ(L),
whereμ:A
∗
→A
∗
dis a marking deﬁned byμ(w)=wdfor a special symbold.
Latteux, Leguy, and Turakainen [9, 12] showed
Theorem 12.Each rational transductions has the forms
h4h
−1
3
h2h
−1
1
μandh
−1
4
h3h
−1
2
h1μ,
whereμis a marking.
The following theorem of Harju and Kleijn [13] shows that there is no algorithm
to decide whether the markingμis needed.
Theorem 13.Iy is undecidable whether or not a transduction has a represen-
tation without endmarkerμ.

6T.Harju
3 Equality Sets
In the Post Correspondence Problem,PCPfor short, the problem instances are
pairs (g, h) of morphismsg, h:A
∗
→B
∗
, and the problem asks to determine
whether there exists a nonempty wordwsuch thatg(w)=h(w). It was shown
by Post in 1947 that the PCP is undecidable in general, that is, there does not
exist an algorithm for its solution.
The set of all solutions of an instanceg, h:A
∗
→B
∗
is called theequality set
ofgandh.Itistheset
E(g, h)={w∈A
∗
|g(w)=h(w)}.
Choﬀrut and Karhum¨aki [14] have shown that the equality setE(g, h)isreg-
ular for a special class of morphisms, called bounded delay morphisms. However,
for these morphisms the problem whether or notE(h, g) contains a nonempty
word remains undecidable! This means that there is no eﬀective construction of
a ﬁnite automatonAaccepting the regular languageE(g, h) when the instance
g, hconsisting of bounded delay morphisms is given.
A morphismh:A
∗
→B
∗
is called apreﬁx morphism, if for all diﬀerent letters
a, b∈A, the imageh(a) is not a preﬁx of the imageh(b).
IfAandBare alphabets such thatA⊆B, then the morphismπA:B
∗
→A
∗
,
deﬁned by
πA(a)=
ﬀ
aifa∈A,
εifa∈B\A,
is theprojectionofB
∗
ontoA
∗
.
The next result is due to Halava, Harju, and Latteux [15, 16].
Theorem 14.A star languageL=L
∗
⊆A
∗
is regular if and only if
L=πA(E(g, h))
for preﬁx morphismsg, hand the projectionπAontoA
∗
.
A morphismf:A
∗
→B
∗
is acoding, if it maps letters to letters.
Theorem 15.A star languageL=L
∗
⊆A
∗
is regular if and only if
L=f(E(g, h))
for preﬁx morphismsg, handacodingf.
4 Well Quasi-orders
Aquasi-orderρ⊆X×Xon a setXis a reﬂexive and transitive order relation:
xρxand
xρy
yρz
ε
=⇒xρz.
Moreover,ρis awell quasi-order,wqofor short, if every nonempty subsetY⊆X
has at least one minimal element but only ﬁnite number of (non-equivalent)
minimal elements. In the below instead ofρwe use≤for an order relation.

Characterizations of Regularity 7
Lemma 1.The following are equivalent for a relation≤to be a wqo:
(1) Every inﬁnite sequence of elements of S has an inﬁnite ascending subse-
quence.
(2) Ifx1,x2,...is an inﬁnite sequence of elements, thenxi≤xjfor somei<j.
(3) There are no inﬁnite strictly descending sequences, nor inﬁnite sets of pair-
wise incomparable elements.
The following result is a special case of Higman’s [17] theorem:
Theorem 16.The set of wordsA
∗
is well quasi-ordered by the subsequence
order deﬁned by
x=x1x2...xn
y=y1x1y2x2...ynxnyn+1
ε
=⇒x≤y.
The idea behind the proof of this theorem is gratifying: Letw1,w2,...be an
inﬁnite sequence of words such thatwiδwjfor alli<j, and that the sequence
is ‘length minimal’ in the sense that the length ofwkis as short as possible so
thatw1,w2,...,wksatisﬁeswiδwjfor alli<j≤k. There is a letterathat
starts inﬁnitely manywi,say
wi1=av1,wi2=av2,...
Now,w1,w2,...,wi1−1,v1,v2,...is a ‘smaller’ sequence satisfying the require-
ments; a contradiction.
For the theorem of Ehrenfeucht, Haussler, Rozenberg [18] we say that a quasi-
order≤ismonotoneif
x1≤y1
x2≤y2
ε
=⇒x1x2≤y1y2.
A languageLisupwards closedclosed w.r.t. the ordering≤, if for allw∈L,
w≤vimplies that alsov∈L.
Theorem 17.A languageLis regular if and only if it is upwards closed w.r.t.
some monotone wqo onA
∗
.
The proof in the direction (=⇒) uses the Myhill–Nerode characterization of
regularity: a languageLis regular if and only ifLis a union of equivalence
classes of some congruence of ﬁnite index. Note that a monotone equivalence
relation is a congruence.
References
1. Myhill, J.: Finite automata and the representation of events. Technical Report
WADD TR-57-624, Wright Patterson Air Force Base, Ohio (1957)
2. Nerode, A.: Linear automaton transformations. Proc. Amer. Math. Soc.9(1958)
541–544

8T.Harju
3. Rabin, M.O., Scott, D.: Finite automata and their decision problems. IBM Journal
3(1959) 115–125
4. Sch¨utzenberger, M.P.: On ﬁnite monoids having only trivial subgroups. Inf. Control
8(1965) 190–194
5. McNaughton, R., Papert, S.: Counter-free automata. The M.I.T. Press, Cambridge,
Mass.-London (1971) With an appendix by William Henneman, M.I.T. Research
Monograph, No. 65.
6. B¨uchi, J.R.: Weak second-order arithmetic and ﬁnite automata. Z. Math. Logik
Grundlagen Math.6(1960) 66–92
7. Elgot, C.C.: Decision problems of ﬁnite automata design and related arithmetics.
Trans. Amer. Math. Soc.98(1961) 21–51
8. Culik II, K., Fich, F., Salomaa, A.: A homomorphic characterization of regular
languages. Discrete Appl. Math.4(1982) 149–152
9. Latteux, M., Leguy, J.: On the composition of morphisms and inverse morphisms.
In: Lecture Notes in Comput. Sci. Volume 154. (1983) 420–432
10. Turakainen, P.: A machine-oriented approach to composition of morphisms and
inverse morphisms. Bulletin of the EATCS20(1983) 162–166
11. Karhum¨aki, J., Linna, M.: A note on morphic characterization of languages. Dis-
crete Appl. Math.5(1983) 243–246
12. Latteux, M., Turakainen, P.: A new normal form for the compositions of morphisms
and inverse morphisms. Math. Systems Theory20(1987) 261–271
13. Harju, T., Kleijn, H.C.M.: Decidability problems for unary output sequential trans-
ducers. Discrete Appl. Math.32(1991) 131–140
14. Choﬀrut, C., Karhum¨aki, J.: Test sets for morphisms with bounded delay. Discrete
Appl. Math.12(1985) 93–101
15. Halava, V., Harju, T., Latteux, M.: Representation of regular languages by equality
sets. Bulletin of the EATCS86(2005) 224–228
16. Halava, V., Harju, T., Latteux, M.: Equality sets of preﬁx morphisms and regular
star languages. Inf. Process. Lett.94(2005) 151–154
17. Higman, G.: Ordering by divisibility in abstract algebras. Proc. London Math.
Soc.3(1952)
18. Ehrenfeucht, A., Haussler, D., Rozenberg, G.: On regularity of context-free lan-
guages. Theoret. Comput. Sci.27(1983) 311–332

Finnish Optimality-Theoretic Prosody
Lauri Karttunen
Palo Alto Research Center, Stanford University, USA
A well-known phenomenon in Finnish prosody is the alternation of binary and
ternary feet. In native Finnish words, the primary stress falls on the ﬁrst syllable.
Secondary stress generally fallson every second syllable: (v´oi.mis).(t`e.li).(j`oi.ta)
’gymnasts’ creating a sequence of trochaic binary feet. However, secondary stress
skips a light syllable that is followed by a heavy syllable. In (v´oi.mis.te).(l`em.me)
’we are doing gymnastics’, the ﬁrst foot is ternary, a dactyl.
Within the context of Optimality Theory (OT, [5]), it has been argued that
prosodic phenomena are best explained in terms of universal metric constraints.
OT constraints can be violated; no word can satisfy all of them. A language-
speciﬁc ranking of the constraints makes some violations less important than
others. In her 1999 dissertation [2], A uniﬁed account of binary and ternary
stress, Nine Elenbaas gives an analysis of Finnish in which the alternation be-
tween binary and ternary feet follows asa side eﬀect of the ordering of two par-
ticular constraints, *Lapse and *(L’. H) The *Lapse constraint stipulates that
an unstressed syllable must be adjacent to a stressed syllable or to word edge.
The *(L’. H) constraint prohibits feet such as (t`e.lem) where a light stressed syl-
lable is followed by a heavy unstressed syllable. The latter constraint of course
is outranked by the constraint that requires initial stress on the ﬁrst syllable in
Finnish regardless of the its weight. In his 2003 article on Finnish Noun Inﬂec-
tion [4], Paul Kiparsky gives essentially the same account of the binary/ternary
alternation except that he replacesthe *(L’.H
StressToWeight constraint.
Although OT constraints themselves can be expressed in ﬁnite-state terms,
Optimality Theory as a whole is not a ﬁnite-state model if it involves un-
bounded counting of constraint violations [3]. With that limitation OT anal-
yses can be modelled with ﬁnite-state tools. In this paper we will give a full
computational implementation of the Elenbaas and Kiparsky analyses using the
extended regular expression calculus from the 2003 Beesley & Karttunen book
on Finite State Morphology [1]. Surprisingly, it turns out that Elenbaas and
Kiparsky both make some incorrect predictions. For example, according to their
accounts a word such as kalasteleminen ’ﬁshing’ should begin with a ternary foot:
(k´a.las.te).(l`e.mi).nen. The correct footing is (k´a.las).(t`e.le).(m`ı.nen). There may
of course be some ranking of OT constraints under which the binary/ternary al-
ternation in Finnish comes “for free”. It does not emerge from the Elenbaas and
Kiparsky analyses.
This case study illustrates a more general point: Optimality Theory is com-
putationally diﬃcult and OT theorists are much in the need of computational
help.
A. Yli-Jyr¨a, L. Karttunen, and J. Karhum¨aki (Eds.): FSMNLP 2005, LNAI 4002, pp. 9–10, 2006.
cﬀSpringer-Verlag Berlin Heidelberg 2006

10 L. Karttunen
References
1. K. R. Beesley and L. Karttunen.Finite State Morphology. CSLI Studies in Com-
putational Linguistics. CSLI Publications, Stanford, CA, 2003.
2. N. Elenbaas.A uniﬁed account of binary and ternary stress: considerations from
Sentani and Finnish. PhD thesis, Utrecht University, the Netherlands, 1999.
3. R. Frank and G. Satta. Optimality theory and the generative complexity of con-
straint violability.Computational Linguistics, 24:307–315, 1998.
4. P. Kiparsky. Finnish noun inﬂection. In D. Nelson and S. Manninen, editors,Gen-
erative Approaches to Finnic Linguistics. CSLI Publications, Stanford, CA, 2003.
5. A. Prince and P. Smolensky. Optimality theory: Constraint interaction in generative
grammar. Technical Report RuCCS TR-2, Rutgers University Center for Cognitive
Science, New Brunswick, NJ, 1993.

Partitioning Multitape Transducers
Fran¸cois Barth´elemy
CNAM-C´edric, 292 rue Saint-Martin, F-75003 Paris, France
and INRIA, domaine de Voluceau, F-78153 Rocquencourt cedex, France
[email protected]
Abstract.In this paper, we deﬁne a class of transducers closed under
intersection and complementation, which are the operations used for
contextual rule compilation. This class of transducers is not theoreti-
cally more powerful than the Epsilon-Free Letter Transducers most com-
monly used. But they are more convenient for morphological description
whenever the correspondence between lexical and surface forms is not a
symbol-to-symbol matching.
A complete set of operations on transducers is deﬁned, including some
operations (projection and join) which change the number of tapes of the
transducers.
1 Introduction
Finite State morphology use contextual rules (rewrite rules or two-level rules)
which are compiled into ﬁnite-state transducers. The key idea of compilation, due
to Kaplan and Kay [3] is subtractive: ﬁrst, compute a superset of the speciﬁed
language, namely the concatenation closure of the rule centers; then retract what
is not allowed by the contexts speciﬁed in the rules. At the operational level, the
fundamental operations are complementation and subtraction.
Finite State Transducers are not closed under complementation and subtrac-
tion in general. The subclass of transducers where the transitions are labeled by
exactly one symbol on each tape is closed under complementation, subtraction
and intersection. It is calledEpsilon-Free Letter Transducer[5]. This subclass is
used for Finite State morphology. A special symbol 0 is artiﬁcially introduced to
represent the empty string using an ordinary symbol. Introduction and removal
of 0s are done as pre and post-processing when executing a transducer.
In this paper, we present another subclass of transducers that are closed under
complementation, intersection and subtraction, called the Partitioning Multi-
tape Transducers (PMTs). The relations deﬁned are not necessarily same-length
relations. The transitions are labeled by an independent regular expression for
each tape. The same-length constraint does not apply on symbols but on parti-
tion components, which are possibly empty symbol sequences (i.e. strings). Each
transition deﬁnes a partition.
PMTs have the same expressive power as Epsilon-Free Letter Transducer
and regular languages. We give a compilation algorithm which compiles a PMT
into a Finite State Automaton. We also deﬁne a computation framework with
operations which change the number of tapes of the transducers.
A. Yli-Jyr¨a, L. Karttunen, and J. Karhum¨aki (Eds.): FSMNLP 2005, LNAI 4002, pp. 11–20, 2006.
cﬃSpringer-Verlag Berlin Heidelberg 2006

12 F. Barth´elemy
PMTs are convenient for deﬁning correspondences between strings which sym-
bols are not related. For example a feature structure associated to a morpheme
although there is no symbol-to-symbol relation between them. Another example:
words and their graphical representation in an ideographic writing system.
We ﬁrst deﬁne the Partition Multitape Transducers and then, we propose a
compilation algorithm for one of the most sophisticated two-level formalisms:
the Partition-Based Two-Level Grammars by Grimley-Evans and al. [2].
2 Partitioning Multitape Transducers
2.1 Introducing the Concept
Epsilon-Free Letter Transducers generated by two-level compilers are two tape
transducers whose transitions are labeled by exactly one symbol on each tape.
We would like to extend the formalism in two ways:
–allowing the use of more than two tapes, with operations on transducers that
increase or decrease the number of tapes.
–labeling transitions with a string, or even a regular set of strings on each
tape.
We know that such transducers are not closed under intersection with respect
to the usual semantics of n-relations, where such a n-tape transducer deﬁnes a
set of string n-tuples.
Instead, we propose a new semantics based on the notion of partition. Any
success path in the transducer is labeled by a sequence of transition labels, i.e.
a sequence of regular expressions tuples. N-relations semantics assumes that the
concatenation operation is distributive with respect to the tuple construction.
The partition-based semantics does not.
For instance, the two paths (aaa, ﬃ)(a, bb)and(aa, b)(aa, b)aretwowaysof
denoting the relation (aaaa, bb) with the usual relation semantics. They are dif-
ferent elements for the partition basedsemantics. The transducer intersection
has diﬀerent meanings in the two semantics. Transducers are closed under inter-
section using the second kind of semantics.
2.2 Formal Deﬁnition
Deﬁnition 1.Partitioning Multitape Transducer (PMT)
A Partitioning n-Tape Transducer is a 5-tupleA=(Σ,Q,i,F,T)where:
–Σis a ﬁnite set of symbols called the alphabet.
–Qis a ﬁnite set of states
–i∈Qis the initial state
–F⊆Qis the set of ﬁnal states
–T⊆{(q
1,e1,...,en,q2)|q1∈Q, q2∈Q, e1,...enare regular sets overΣ
∗
}.
We callnthe arity of the transducer.

Partitioning Multitape Transducers 13
This is a standard deﬁnition of transducers, very similar to the one in [5],
except that it does not distinguish a separate alphabet for each tape, with no
loss of generality. The speciﬁcity of Partitioning Multitape Transducers lies in
their semantics.
Deﬁnition 2.N-tuple sequences deﬁned by a n-tape transducer
Lets=(w
1
1
,...,w
n
1
)...(w
1
k
,...,w
n
k
)a possibly empty sequence of string n-tuples
wherew
j
i
∈Σ
∗
,andA=(Σ,Q,i,F,T)a n-tape transducer. The sequences
belongs toAif there exists inAa success pathi
t1
→q1
t2
→ ···
tk
→qkwithqk∈F
such that∀i,1≤i≤k, ti=(qi−1,e
1
i
,...e
n
i
,qi)and
w
1
i
∈e
1
i
,...w
n
i
∈e
n
i
.
We callpartitioned n-relationa set of string n-tuples. Aregular partitioned
n-relationis a partitioned n-relation deﬁned by a Partitioning Multitape Trans-
ducer. An example of transducer with some of the n-tuple sequences it deﬁnes
is given in ﬁgure 1.
x :a:αα*
yy:ε:β
0 12
z:cc:ε
(∈, a, ⊆⊆)(yy, ∈, →)
(x, a, αα)(yy, ∈, →)
(xx, a, αα)(yy, ∈, →)
(∈, a, ⊆⊆)(yy, ∈, →)(z, cc, ﬃ)(yy, ∈, →)
Fig. 1.An example of PMT and some of its n-tuple sequences
Deﬁnition 3.String recognized by an n-tape transducer
Letwbe a string ofΣ
∗
andA=(Σ,Q,i,F,T)a n-tape transducer. The string
wis recognized byAon tapejif there exists an n-tuple sequence
s=(w
1
1,...,w
n
1)...(w
1
k
,...,w
1
k
)ofAsuch thatw=w
j
1
.....w
j
k
.
We now brieﬂy present a few operations on Partitioning Multitape Transducers.
The formal deﬁnitions are straightforward, so we do not detail them.
–tape reordering: changes the order of the tapes in the transitions. The op-
eration is deﬁned given a transducer and a permutation upon tapes ranks.
Notation:reorder2,1,3(A).
–projection: keeps the speciﬁed tapesand removes the others. Notations:
project
{1,3}
(A) which keeps tapes 1 and 3, andproject
{ 2}
(A)whichremoves
tape 2 and keeps all the other tapes.
–union, concatenation, with the usual notations.

14 F. Barth´elemy
Deﬁnition 4.Join
The join onktapes of two regular partitioned relationsr
1andr 2having respec-
tively aritynandmis notedr
1ﬃ∗kr2anddeﬁnedasfollows:
r
1ﬃ∗kr2={(s
1
1
,...s
n+m−k
1
)...(s
1
p
,...s
n+m−k
p
)|
(s
1
1
,...s
n
1
)...(s
1
p
,...,s
n
p
)⊆r 1,
(s
n−k+1
1
,...,s
n+m−k
1
)...(s
n−k+1
p
,...,s
n+m−k
p
)⊆r 2}
This operation is very general. It combines two transducers having possibly dif-
ferent arities. We restrict here to joins where the lastktapes of the ﬁrst operand
are identiﬁed with thekﬁrst tapes of the second operand. This restriction is
made only to simplify the deﬁnition. Thanks to thereorderoperation, however
there is no loss of generality.
The intersection is a join of two n-tapes PMT on all the tapes (r
1∩r2=
r
1ﬃ∗nr2). This is a direct consequence of the join deﬁnition.
Relation composition may be described as the sequence of a join and a pro-
jection which forgets the tapes on which the join operates.
This join operation was described in [4] for n-relations. Here we adapt the
deﬁnition to partitioned n-relations. The name was chosen by analogy with the
join operation of relational databases.
Deﬁnition 5.Transducers join
LetA
1=(Σ,Q 1,i1,F1,T1)and
A
2=(Σ,Q 2,i2,F2,T2)be two transducers deﬁning the partitioned relationsr 1
andr 2. The following transducerAdeﬁnesr 1ﬃ∗kr2.
A=(Σ,Q
1×Q2,(i1,i2),F1×F2,T)where
T={((q
i1,qi2),e1,...en,en+1...en+m−k ,(qf1,qf2))|
∃(q
i1,e
1
1
,...e
1
n
,qf1)∈T 1,∃(qi2,e
2
1
,...e
2
m
,qf2)∈T 2and
e
1=e
1
1
,...en−k=e
1
n−k
,en−k+1 =e
1
n−k+1
∩e
2
1
,...,en=e
1
n
∩e
2
k
,
e
n+1=e
2
k+1
,...,en+m−k =e
2
m
}
x :a:αα*
yy:ε:β
z:cc:ε
yy:ε:β:2*
x :a:αα*
0,3 1,4
:11
2,4
:2*
34
a :*α*:11
b :β
+*
10 2
Fig. 2.An example of transducer join

Partitioning Multitape Transducers 15
2
Regular expression: xxxy z**y*
yy:ε:β
3
xxx:a:αα
4
yy:ε:β
5
z:cc:ε
6
yy:ε:β
7
z:cc:ε
0
x :a:αα*
1
Fig. 3.An example of partitioning composition
Deﬁnition 6.Partitioning Composition
Letpartition(α)be the partitioned relation{(a
1,...,ak)such that
a
1...ak∈α}. The partitioning composition on tapeiof a regular expressionα
andaPMTtis deﬁned by:
α◦
it= reorder2,...i−1,1,i...n (partition(α)ﬃ∗ 1reorderi,1,...i−1,i+1,...n (t))
The partitioning composition is the operation of recognition of a regular expres-
sion on a tape. An example of partitioning composition is given in ﬁgure 3.
3 Compiling Partitioning Multitape Transducers
Partitioning Multitape transducers are not convenient operational devices be-
cause the application of their transition is rather complex. It involves regular
expression matching, which may be achieved by ﬁnite state automata execution.
We propose a translation into a ﬁnite state automaton.
Conceptually, all the tapes are read in parallel with no synchronization within
a given partition. For instance, in the string 2-tuple sequence (xx, a)(y, bc)(ﬃ, cc),
–there is a total order within each tape (we write as a subscript the occurrence
of symbols):x
1<x2<yanda<b<c 1<c2<c3
–there is a partial order between symbols of diﬀerent tapes:x 2<b,a<y,
y
2<c2
–there is no order between symbols in the same partition: for instance,ais
notbeforeorafterx
1orx2
Any letter n-tape transducer will encode a stronger partial order relation,
since at most one symbol of each tape is read at once. For instance, ifaandx
1
are read by the same transition, thenais read beforex 2. There are several ways
of encoding the signiﬁcant partial order using an epsilon-free letter transducer.
For instance, the ﬁrst partition of the sequence namely (xx, a) may be recognized
by any of the following transition sequences:
x:ﬃ
→
x:ﬃ
→
ﬃ:a
→
x:ﬃ
→
ﬃ:a
→
x:ﬃ
→
ﬃ:a
→
x:ﬃ
→
x:ﬃ
→
x:a
→
x:ﬃ
→
x:ﬃ
→
x:a
→
We propose to choose an arbitrary canonical representation. The basic idea of
the compilation is to read on the tapes sequentially, one after the other, following

16 F. Barth´elemy
the rank order. Each transition reads one symbol on one tape and nothing on
the other tapes.
Now, one can ﬂatten all the tapes in just one tape, the limits of tape being
marked by two special symbols<and>not inΣ
1
. Similarly, the partitions
limits are marked by two special symbolsand,notinΣ.
Deﬁnition 7.Compilation algorithm
LetA=(Σ,Q,i,F,T)be a partitioning n-tape transducer. For each transition
t=(q0,w1,...,wn,q1)inT, compile the regular expression<w1>···<
wn>into a ﬁnite state automaton andinsertit between statesq0andq1.
Note that it is always possible to compile the given regular expression in a ﬁnite
state automaton having exactly one ﬁnal state, thanks to the ﬁnal transition
on.
Property 1.Property of the compiled automaton
We notenboccs(p) the number of occurrences of the symbolsin the pathp.
1. For all pathpstarting in the initial state, (nboccα(p)−1)∗n≤nboccπ(p)∗
n≤nbocc>(p)≤nbocc<(p)≤nboccα(p)∗n
2. All the strings recognized by the automaton begin withand end by an
occurrence of.
The property is obtained by construction, because each transition inserts one
occurrence ofandandnoccurrences of<and>.
Corollary.On each looplin the automaton,nbocc<(l)=nbocc>(l)=n∗
nboccα(l)=n∗nboccπ(l).
Lemma 1.Every ﬁnite state automaton fulﬁlling the property 1 encodes a Par-
titioning Multitape Transducer.
To retrieve the PMT from an automaton, one has to replace each path from a
to aby a single transition labeled by a regular expression n-tuple. The
only diﬃculty is that there must be a ﬁnite number of such paths to ensure the
termination of the process and the ﬁniteness of the transducer.
The most import property of the automata fulﬁlling the property 1 is that
either a loop has no<or>symbol, or it traverses one or several partitions. In
the ﬁrst case, the loop concerns only one tape, it is translated with a star in the
regular expression recognized on this tape. In the second case, the star comes at
the partition level. It never happens that a loop contains a<or a>within a
given partition. Therefore, the number of transitions of the transducer is ﬁnite.
In the following, we callPartitioning Automatonand we writecomp(t)aFinite
State Automaton encoding a Partitioning Multitape Transducert.
We have now to implement transducer operations on the compiled form. For
this purpose, we use a few operations on ﬁnite state machines deﬁned in [3]
and [2].
1
A single symbol marking the separation between tapes would be suﬃcient, but some
deﬁnitions are made simpler with two surrounding symbols.

Partitioning Multitape Transducers 17
–The identityidis a 2-tape transducer which maps any symbol to itself.
–insertfreely inserts a regular expressioneinto a given automaton. It is a 2-tape
transducer where the empty string maps the inserted regular expressione.
–replaceis an operation which replaces all the occurrences of a given regular
expression by another one in a regular expression.
–projectis the projection on the ﬁrst (project
1
) or the second (project
2
)tape
of the transducer.
The union (resp. the intersection, the concatenation) of two partitioning
n-tape transducers is performed using the union (resp. intersection, concate-
nation) of the two corresponding compiled automata.
The proof is straightforward using property 1.
The complementation is slightly more diﬃcult: the property 1 does not hold
for the standard ﬁnite state automaton complement of a Partitioning Automa-
ton. In order to except all the strings in the complement which are not string n-
tuple sequences, the complement has to be intersected with the set of all the valid
string n-tuple sequences, which is the compiled counterpart of (Σ
∗
,...,Σ
∗
)
∗
,
namely (<Σ
∗
>···<Σ
∗
>)
∗
:
comp( t)=comp(t)∩(<Σ
∗
>···<Σ
∗
>)
∗
The join operation of two PMTt1(arityn)andt2(aritym)onktapes is
performed as follows:
a1=project
{1}
(replace
>π,>(<Σ
∗
>)
m−k
π
(comp(t1)))
a2=project
{2}(replace
α<,α(<Σ
∗
>)
n−k
<(comp(t2)))
comp(t1ﬃΣkt2)=a1∩a2
Thereplaceoperations are used to insert new tapes at the end of the ﬁrst au-
tomaton and at the beginning of the second one in order to obtain two automata
having n+m-k tapes. The inserted tapes recognize the free languageΣ
∗
.
The partitioning composition on tapekof a PMTtand a regular expression
eis obtained by the following:
pbnd=(<Σ
∗
>)
n−k+1
(<Σ
∗
>)
k−1
preﬁx=(<Σ
∗
>)
k−1
suﬃx=(<Σ
∗
>)
n−k+1

comp(e◦kt)=comp(t)∩(preﬁx insert
pbnd
(e)suﬃx)
The left-hand part of the intersection is the union of all the PMTs which
recognizeeon tapekandΣ
∗
on the other tapes. It is obtained by inserting
freely partition boundaries surrounded by the irrelevant tapes, i.e. all but the
tapek. The expressionspreﬁxandsuﬃxtake care respectively of the ﬁrst and
last partition boundaries, the only ones which are not surrounded on both sides
by tape boundaries>and<.
In the special case where the multitape transducer is a letter transducer, that
is, in any partition, at most one symbol is read on each tape, a more eﬃcient
compilation algorithm may be used. The tape boundaries<and>are no more
needed. A new symbol 0, not inΣ, is inserted to represent the empty string

18 F. Barth´elemy
on tapes which read no symbols. Then, one knows that thei
th
symbol after a
partition limitis on tapei. This kind of compilation for letter transducers is
proposed by Ganchev and al. [1].
4 Using Partitioning Multitape Transducers for
Two-Level Morphology
In this section, we show how a partition-based two-level formalism may be com-
piled into Partitioning Multitape Transducers. This formalism was deﬁned by
Grimley-Evans, Kiraz and Pulman [2]. We ﬁrst recall the deﬁnition and then
describe the compilation.
4.1 Partition-Based Two-Level Formalism
LetΣbe a ﬁnite alphabet of symbols.
Deﬁnition 8.A Context Restriction Rule is a triple(l, c, r)wherel, candrare
three n-tuples of regular expressions overΣ. Furthermore, the regular expressions
ofcare ﬁnite.
The three n-tuples are called respectively the left context, the center and the
right context of the rule.
Deﬁnition 9.A Surface Coercion rule is a 4-tuple(l, cl, cs, r)wherelandr
are n-tuples of regular expressions overΣ,clandcsare respectively a k-tuple
and an m-tuple of regular expressions such thatk+m=n.
Here again,landrare called the contexts,clandcsare the lexical and sur-
face centers. A Two-Level Grammar is a set of Context Restriction and Surface
Coercion rules.
Deﬁnition 10.A Two-Level Grammar accepts a string tuple P partitioned as
P
1,...,Pqif and only if:
1.∀i,∃(l, c, r)a CR rule such thatP
1...Pi−1∈l, Pi∈c, Pi+1...Pq∈r.
2.∀i, j, i≤j,there is no SC rule(l, cl, cs, r)such thatP
1...Pi−1∈l,
P
j...Pq∈r,project
{1...k}
(Pi...Pj−1)∈cland
project
{k+1...n} (Pi...Pj−1)/∈ls.
Note that in this deﬁnition, the center of a SC rule applies on a possibly empty
sequence of consecutive partitions whereas centers of CR rules match exactly one
partition. This strange deﬁnition is designed to handle adequately epenthetic SC
rules, where the lexical center is a cross product of empty stringﬃ
k
.
4.2 Compilation into Partitioning Multitape Transducer
Partitioning Multitape Transducers are a natural operational device to compile
a partition-based formalism since they have the same notion of partition.

Partitioning Multitape Transducers 19
Compiling the center of a CR rule is straightforward: the center is the descrip-
tion of exactly one partition. A two-state transducer with a transition labeled by
the center implements it. Compiling contexts is slightly more diﬃcult because
the contexts may match a number of partitions. A diﬀerent regular expression is
given for each tape, but all the tapes must be divided in the same number of par-
titions. Inserting freely partition boundaries everywhere would not ensure that
the result fulﬁlls the property 1. The solution consists in cascading application
of the partitioning composition ofΣ
∗
,...,Σ
∗
(noccurrences) on each regular
expression in the context.
The same idea works also for Surface Coercion rule lexical and surface centers
which also match a sequence of partitions. Furthermore, the lexical and surface
centers must match the same number of partitions: they have to be partitioning
composed to the same free partition sequence.
comp((c1,...,cn)) = ((((Σ
∗
,...,Σ
∗
ﬃ Σαπ
noccurrences
)
∗
◦1c1)...)◦ncn)
Each part of a rule being compiled, we adapt the algorithm of [3] to the
semantics of the partition-based formalism. This algorithm is subtractive: we
ﬁrst compute the closure under concatenationπ
∗
of the set of CR rules centers
π. Then we remove from this language the partitioned strings which violate at
least one of the rules of the grammarG.
π=
∈
(l,c,r)∈G
c
violate
(l,c,r)=(comp(l)cπ
∗
)∪(π
∗
ccomp(r))
violate
(l,cl,cs,r)=comp(l)comp(cl×cs)comp(r)
system=π
∗
−
∈
(l,c,r)∈G
violate
(l,c,r)−
∈
(l,cl,cs,r)∈G
violate
(l,cl,cs,r)
4.3 Improving the Formalism
There is a restriction in the Two-Level Grammar deﬁnition which is related to
the compilation algorithm given by Grimley-Evans and al.: the centers of Context
Restriction Rules are compiled into letter n-tape transducers, by padding ends
of strings with 0s. Combined with the condition of independence of the tapes
within a given partition, it is a restriction to ﬁnite languages. In general, for a
given relationR, there is no same-length regular relation such that the elements
are the ones ofRpossibly lengthened with 0s at the end of strings.
Using the compilation into Partitioning Multitape Transducers, one can relax
the restriction. The center of a CR rule may be any regular expression n-tuple.
The contexts of two-level rules are speciﬁed separately on each tape. No syn-
chronization between tapes is expressible using the formalism. It is possible to
change this and allow for synchronization at partition boundaries.
For instance, (Σ
∗
,Σ
∗
)(C, C) is a valid context since it may be rewritten
(Σ
∗
C, Σ
∗
C), but (Σ
∗
,Σ
∗
)(C, C)
∗
is not a valid context because no pair of
independent regular expressions denotes the same language. There is a strong

20 F. Barth´elemy
synchronization between the two tapes: there are the same number ofCson
both tapes. The deﬁnition by Grimley-Evans and al. does not allow such a syn-
chronization. Compilation into PMT is possible if and only if the loops begin
and end on a partition frontier. Contexts would be similar to the ones of Koske-
niemi’s original formalism: they would be regular expression over feasible pairs,
the pairs being now pairs of regular expressions.
5Conclusion
Partitioning Multitape Transducers are the right operational device to imple-
ment a partition-based two-level formalism. There is still work to make such
formalism really attractive. First, the use of the notion of partition for morphol-
ogy and phonology is not completely clear. We believe that some phenomena are
well described in terms of partition whereas others are better described using
single symbols. So the two degrees of granularity have to coexist. Secondly, the
conﬂicts between two-level rules is a major diﬃculty in practice.
We are not aware of any attempt to deﬁne a partition-based formalism with
rewrite rules and we are not sure if the idea is pertinent at all, since the one-to-
one correspondence between pre and post rule application does not seem essential
to this approach.
By construction, the regular expressions characterizing the diﬀerent tapes
within a partition are not related. It is a strong restriction, especially when the
lexical and surface representation areclosely related, when they are basically
identical with only small local changes. It would be interesting to deﬁne a less
restrictive deﬁnition of partition, while preserving the uniqueness of canonical
representation which is essential for intersection.
Another interesting topic deals with composition of transducers having diﬀer-
ent partitionings, without loss of information. At the moment, only partitioning
composition would work, and one of the two partitioning would be lost in the
operation. There are applications where forms are partitioned in a diﬀerent way
using respectively a morphological or a graphemic point of view.
References
1. H. Ganchev, S. Mihov, and K. U. Schulz. One-letter automata: How to reduce k
tapes to one. Technical Report IS-Bericht-03-133, Centrum f¨ur Informations- und
Sprachverarbeitung, Universit¨at M¨unchen, Munich (Germany), 2003.
2. E. Grimley-Evans, G. Kiraz, and S. Pulman. Compiling a partition-based two-level
formalism. InCOLING, pages 454-459, Copenhagen, Denmark, 1996.
3. R. M. Kaplan and M. Kay. Regular models of phonological rule systems.Computa-
tional Linguistics, 20:3:331-378, 1994.
4. A. Kempe, F. Guingne, and F. Nicart. Algorithms for weighted multi-tape automata.
Technical Report 2004/031, XRCE, Grenoble, France, 2004.
5. E. Roche and Y. Schabes, editors.Finite-State Language Processing.Bradford Book.
MIT Press, Cambridge, Massachusetts, USA, 1997.

Squeezing the Inﬁnite into the Finite:
Handling the OT Candidate Set with Finite State
Technology
Tam´as B´ır´o
Humanities Computing, University of Groningen
[email protected]
Abstract.Finite State approaches to Optimality Theory have had two
goals. The earlier and less ambitious one was to compute the optimal
output by compiling a ﬁnite state automaton for each underlying repre-
sentation. Newer approaches aimed at realizing the OT-systems as FS
transducers mapping any underlying representation to the corresponding
surface form. After reviewing why the second one fails for most linguis-
tically interesting cases, we use its ideas to accomplish the ﬁrst goal.
Finally, we present how this approach could be used in the future as
a—hopefully cognitively adequate—model of the mental lexicon.
1 Introduction
Although very popular in linguistics,Optimality Theoryby Prince and Smolen-
sky (OT, [17], [18]) poses a serious problem for being computationally very
complex. This fact could question the relevance of much contemporary linguistic
work both for cognitive research, and language technology. Is our brain doing
such a hard computation? Could language technology make us of OT models?
Fortunately, things are not so bad.
Figure 1 presents the architecture of an Optimality Theoretical grammar,
which consists of two modules,GenandEval. The input—the underlying repre-
sentation (UR)—is mapped by the universal Gen onto aset of candidates.The
candidate set, or a subset of it, reﬂects language typology: for each language,
the language-speciﬁc Eval chooses the element (or elements) that appears as the
surface form SR. Eval is a function assigning a harmony value to each candidate,
and the most harmonic one will surface. Alternatively, Eval can also be seen as
a pipeline in which the constraints ﬁlter out the sub-harmonic candidates. This
second approach is most often used in practice, and the ﬁnite state realizations
presented in this paper are also based on this vision of an OT system.
In many models advanced by theoretical linguists, the set of candidates is inﬁ-
nite, leading to serious questions. How could our brain process an inﬁnite set? How
could language technology make use of a model involving an inﬁnite set?
Diﬀerent approaches have been, then, proposed in order to handle an inﬁnite
candidate set. Chart parsing (dynamic programming) is probably the best known
among them (chapter 8 in [19] for syllabiﬁcation; [16] for implementing it to OT
A. Yli-Jyr¨a, L. Karttunen, and J. Karhum¨aki (Eds.): FSMNLP 2005, LNAI 4002, pp. 21–31, 2006.
cﬀSpringer-Verlag Berlin Heidelberg 2006

22 T. B´ır´o
Tam´as B´ır´o
UR GEN
set
of
candi-
dates
Eval
Con1 Con2 Con3 SR
Fig. 1.The architecture of an OT grammar, which maps the underlying representation
onto the surface representation. Gen is followed by the Eval module: the latter is a series
of constraints, acting as ﬁlters.
LFG). It presupposes on the one hand that applying a recursive rule (usually
insertion) incurs some constraint violation; and on the other, that “all constraints
are structural descriptions denoting bounded structures”. The interplay of these
two assumptions guarantees that the algorithm may stop applying the recursive
rule after a ﬁnite number of steps, for no hope is left to ﬁnd better candidates
by more insertions.
Alternatives include using heuristic optimization techniques. Genetic algo-
rithms were proposed by Turkel [20], whereas simulated annealing by B´ır´o[5].
Such approaches involve only relatively low computational cost; nonetheless,
they do not guarantee ﬁnding the optimal candidate. Simulated annealing, for
instance, returns a “near-optimal” form within constant time, but you cannot
know if the algorithm has found the good solution. Even though B´ır´o [4] argues
that this algorithm models language production, one may still wish to have a
perfectly working algorithm for language technology.
The present paper proposes an alternative: determining the optimal candidate
by usingﬁnite statetechnologies. We ﬁrst present the previous approaches to
Finites State Optimality Theory (FS OT) in section 2, with an emphasis on
thematching approach. This is followed by a new proposal in section 3, further
developed into a model of the lexicon in section 4.
2 Finite State Optimality Theory
The idea of computing the optimal candidate of an OT system by building a
ﬁnite state (FS) automaton goes back to Ellison [10].
1
He requires the set of
candidates for a given input be a regular expression, and realizes the constraints
as transducers (weighted automata) assigning violation marks. The number of
violation marks assigned by a constraint to a candidate is reﬂected by the sum of
the weights along the path representingthe given candidate. Ellison subsequently
proposes a series of algorithms resulting in an automaton in which “the only
paths from the initial state to the ﬁnal state will be optimal and deﬁne optimal
candidates.” This approach builds a new automaton for each input.
1
Eisner [9] summarizes existing work on FS OT, and proposes a framework very
similartotheonetobepresentedhere.

Squeezing the Inﬁnite into the Finite 23
Later work in FS OT aims at realizing the OT-system as a regular relation
mapping any correct UR to the corresponding surface form. By compiling such
a transducer, one would enjoy all advantages of ﬁnite state techniques, such
as robustness, speed and relatively low memory requirementsin both directions
(production and parsing). This approach includes Frank and Satta [11] and Kart-
tunen [15], on the one hand (thecounting approach), as well as Gerdemann and
van Noord [12], generalized by J¨ager [13], on the other (thematching approach).
The hope for a regular mapping from the UR to the SR goes back to Douglas
Johnson [14].
In short, ﬁnite state approaches to OT require Gen, as well as each of the con-
straints be realizable—in some sense—as a regular expression or transduction.
In many linguistic theories, Gen produces a regular set, as exempliﬁed by syllabi-
ﬁcation [12] or metrical stress assignment [3]. However, many further examples,
such as reduplicative morphology or numerous phenomena in syntax, are not
ﬁnite state-friendly.
2
Concerning Eval, Eisner [7] [8] and B´ır´o [3] discuss what
constraints can be realized as ﬁnite transducers. Eisner’s Primitive Optimality
Theory (OTP) [7] launches a challenging research program the goal of which is
to model as many phonological phenomena as possible by restricting ourselves
to ﬁnite state-friendly constraints.
Nonetheless, the more ambitious program of FS OT to create a transducer
mapping any underlying representation to any surface representation cannot be
fully carried out. Even with a FS representation of Gen and of the constraints at
hand, ﬁltering is not always possible. Frank and Satta [11] (following Smolensky
and Hiller) show a simple counter-example:
Example 1:Let Gen map stringa
n
b
m
to the candidate set{a
n
b
m
,b
n
a
m
},and
let the only constraint penalize each occurrence ofa. The resulting language is
{a
n
b
m
|n≤m}∪{b
n
a
m
|n≥m}, which is clearly not regular. And yet, Gen is
regular, similarly to the proposed constraint.
3
AlthoughExample 1might look very artiﬁcial, itsconstraint, actually, is a pro-
totype form many constraints used in linguistics. In Syllable Structure Theory
[17], each segment may be parsed or underparsed, and constraintParsepunishes
underparsing. Metrical stress may be assigned to each syllable, and constraint
WSP (“Weight-to-Stress Principle”) requires each heavy syllable to be stressed
(cf. e.g. [19]). In fact, most constraints in phonology penalize each occurrence
of some local substructurea(underparsed, unstressed,...), and prefer its alter-
native, substructureb(parsed, stressed,...). The above example shows that all
these constraints could realize a non-regular language with some speciﬁc input
(output of Gen and the previous constraints) [8].
2
Albro [1] shows how to combine a non-ﬁnite state Gen with ﬁnite state constraints.
3
Authors diﬀer in what is meant by a “regular constraint”. For Frank and Satta [11],
the set of strings incurring exactlykviolation marks should form a regular set,
for allk. Gerdemann and van Noord [12] use transducers inserting violation mark
characters into the candidate strings. The given counter-example satisﬁes both of
these deﬁnitions, unlike that of J¨ager [13].

24 T. B´ır´o
Thecounting approachproposed by [11] and [15] requires an upper bound on
the number of violations a given candidate can incur. Thematching approachis
closer to the model proposed by linguists: it can, in theory, distinguish between
any number of level of violations. Yet, in many cases, only an approximation
is possible, as we shall soon see. By supposing that the length of candidates is
bounded due to restrictions in the working memory, both approaches can be used
in applications. Nevertheless, not only do we lose here the linguistic “point”, but
the size of the automata also increases quickly.
2.1 The Matching Approach
Both thecounting approachand thematching approachshare the agenda com-
posed of the following three steps:
–ﬁrst, formulate a ﬁnite state transducerGen;
–then, formulate an optimality operatoroo,whichmakesuseof
–the ﬁnite state realizationsCon-iof the constraintsCon-i.
Once we have all these at hand, the grammar is realized by the ﬁnite state
transducer obtained after having compiled the following expression:
ﬀ
ﬃ
Gen ooCon-1

ooCon-2

...... ooCon-N (1)
From now on, we suppose that a FS transducerGenis given. The task is to formu-
late the optimality operatoroo; the latter will then determine what realization
Con-iof each constraintCon-i is required.
The key idea of thematching approachis to build a setWorse(Input,Con)
that includes all sub-harmonic candidates of the inputInputwith respect to
the constraintCon, as well as possibly other strings; but excludes all harmonic
candidates. This set will then serve as a ﬁltering set in the deﬁnition of the
optimality operatoroo:
InputooCon:=InputoId
Worse(Input,Con)
(2)
Here, the identity transduction ﬁlters out the elements ofWorse(Input,Con),
only the elements of its complement may become outputs. This approach is a
straightforward implementation of the idea behind OT—supposing that the set
Worse(Input,Con) can be constructed.
Without referring to violation marks, J¨ager [13] proposes to realize a con-
straintConwith a transducerConJthat directly will create the ﬁltering set,
a superset of the sub-harmonic candidates. The candidatewis mapped onto a
set containing: (1) all candidates ofGen(Gen
−1
(w)) that are less harmonic than
w; and (2) possibly strings not belonging toGen(Gen
−1
(w)). Now, the range
ofInputo ConJcontains all sub-harmonic elements ofInputbut no harmonic
ones. Hence, it can serve as the ﬁlter in (2):
InputooCon:=InputoId
Ran(Inputo ConJ)
(3)
The draw-back of J¨ager’s approach is the diﬃculty of deﬁning the required
transducers corresponding to constraints in linguistics. Even worse, it is not

Squeezing the Inﬁnite into the Finite 25
possible very often—otherwise a ﬁniteautomaton could accept a non-regular
language in Example 1. A constraint that is ﬁnite-state in J¨ager’s sense would
lead automatically to a ﬁnite-state realization of OT.
It is more fruitful to realize constraints with transducers assigning violation
marks to the strings, as done by Ellison [10], Gerdemann & v. Noord [12] and
Eisner [9]. One can simply construct a ﬁnite state transducer that inserts a spe-
cial violation mark symbol after each disfavored substructure—supposing that
the latter are simple enough to be recognized with an FSA, which is usually the
case. In this sense, most constraints are ﬁnite-state.
4
Canwemakeanyuseof
these transducers?
Suppose that for constraintCon, a transducerConexists that introduces
the required number of violation marks into any candidate string. Importantly,
we only know that the output ofConincludes the correct number of violation
mark symbols, but we do not know how these symbols are dispersed in the
string. Furthermore, letremove violdenote the transducer removing all viola-
tion mark symbols from its input. The only task now is to deﬁne the transducer
makeworse, and then we can rewrite deﬁnition (2) as follows:
InputooCon:=Inputo Con o
oId
Ran(Inputo Con o makeworse)
o removeviol (4)
Now, we have to deﬁnemakeworse. Imagine that the set of candidatesInputenter-
ing the constraint ﬁlterConincludes only candidates that are assignedNviolation
marks, or more. Let us add at least one violation mark to each of them: we thus ob-
tain a set of strings with not less thanN+ 1 violation marks. If we ignored the
characters other than the violation marks, this latter set could simply be used for
ﬁltering, because only the candidates of the input set with the least (namely,N)
violation marks are not element of the ﬁltering set thus constructed. Consequently,
the ﬁnite state transducermakeworsewill have to add any positive number of ex-
tra violation marks to the input, using a ﬁnite state transduceraddviol.
Nevertheless, we cannot ignore the characters in the candidate strings. The
ﬁltering set will not yet include all the sub-harmonic candidates, because the
candidate strings vary not only in the number of violation marks. The diﬀerent
elements of the candidate set have to diverge from each other, for instance, in
the position of parsing brackets. Most probably, the violation marks should also
be permuted around the segments of the strings.
Therefore, we redeﬁnemakeworse: besides adding extra violation marks
(addviol), it will delete all characters that are not violation marks using the
simple transducerdeletechar, and then insert any new characters (transducer
insertchar) (cf. (4) and (5) to the formalism in Eisner [9]):
makeworse:=deletechar o addviol o insertchar (5)
The range ofInputo Con o makeworseis now the set ofallstrings with more
violation marks than the minimal in the range ofInputo Con: all candidates
4
Quadratic alignment constraints assigning a number of violation marks growing
faster than the length of the string, are not regular even in that sense [7] [8] [3].

26 T. B´ır´o
to be ﬁltered out, further uninteresting strings, but no candidates to be left in.
This fact guarantees that the harmonic candidates, and only they will survive
the ﬁltering in deﬁnition (4). Or almost.
Yes, we still face a problem. Suppose that underlying representationW1is
mapped byInpo Conto candidates involving at leastN1violation marks, and
w1is an optimal one. Further, suppose that URW2is mapped to candidates
containingN2violation marks or more, with an optimalw2. Suppose also that
N1<N2. BecauseW1is in the domain ofInpo Con o make worse, the latter’s
range will include all strings with more thanN1violation marks,w2among them.
Consequently, all candidates corresponding toW2will be ﬁltered out, andW2is
predicted to be ineﬀable, without any corresponding output.
Gerdemann and van Noord [12], therefore, deﬁnemakeworsesuch a way
that it will keep the underlying material unchanged. Suppose that what Gen
does is nothing but to add some extra material, like parsing brackets. In such a
case, deleting and reintroducing only the brackets introduced originally by Gen
ensures that diﬀerent underlying representations cannot interfere:
makeworse=addviol o delbrackets o insbrackets (6)
Nonetheless, a new problem arises! Let the underlying representationababyield
two candidates, namelya[b]aband[a]b[a]b. Let the constraint insert a vi-
olation mark@after each closing bracket, so the set enteringmakeworseis
{a[b]@ab,[a]@b[a]@b}. By applying the operationmakeworseas deﬁned in (6),
we get among others the strings[a]b@a[b]@or[a]@b@[a]b; but not[a]@b[a]@b,
the candidate to be ﬁltered out. An extra operation is, therefore, required that will
permute the violation marks: in our case, we need to remove the@between the
ﬁrstband the seconda, and simultaneously insert a@following one of thea’s; the
second violation mark will be inserted byaddviolafter the othera.
The real problem arises when one hasto compare two candidates, such that
the ﬁrst one may have an unbounded number of violation marks in its ﬁrst
part, while the second one any number of violation marks in its last part. This
happens in Example 1, and in the many analogous linguistic applications. Then,
the transducer should have to keep track of the unbounded number of violation
marks deleted at the beginning of the string, before it reaches the end of the
string and re-inserts them. That is clearly a non-ﬁnite state task.
If the transducer permuting the markspermis able to move one violation
at the same time, then the following deﬁnition ofmakeworseyields an exact
OT-system only for the case where not more thannviolation marks should be
moved at once:
5
5
The same transducer can move a second violation mark after having accomplished its
task with the ﬁrst one. Note that such a ﬁnite-state friendly case can theoretically
result from the interplay of Gen and the previously ranked constraints; and not
only from restricting the number of violation marks assigned, due, for instance, to
a bound in the length of the candidates. Further research should reveal whether the
linguistically relevant cases are indeed ﬁnite-state, or languages do produce extreme
candidate sets, such as the one in Example 1. See the research line launched by
Eisner’s OTP [8].

Squeezing the Inﬁnite into the Finite 27
make worse:=addviol o delbrackets
o insbrackets o perm1o perm2o...o permn (7)
Thus, we have run into the “permute marker problem”: only an approximation is
oﬀered by Gerdemann and van Noord for the general case. Besides, introducing
permntimes makes the automaton enormous.
3 Planting the Input into the FST
The matching approach, as proposed by Gerdemann and van Noord [12], has
two main advantages over its competitors. First, it does not require the number
of levels of violations to be ﬁnite, as opposed to the counting approach. Sec-
ond, it makes use of transducers assigning violation marks to the strings, which
is much easier to realize than the transducers in J¨ager’s generalized matching
approach.
Example 1 has shown that there is no hope for solving the “permute marker
problem” in general. Can we still bring out the most of the the counting ap-
proach? Maybe by stepping back to the lesser goal of Ellison: compiling an
automaton to each word, instead of creating a general transducer mapping any
underlying representation to the corresponding surface form? This is bad news
for people believing in FS OT (despite Example 1), and yet, it opens the way to
a new model of the mental lexicon.
We have seen that the radical deﬁnition ofmakeworsein (5) creates a prob-
lem: the candidates corresponding to some underlying representation may dis-
cardallcandidates of another underlying representation. The solution by [12],
that is, to modifymakeworseas (6) and (7), led to the “permute marker prob-
lem”. Another solution is to keep the more radicalmakeworsetransducer, as
deﬁned in (5), for the deﬁnition (4
neously, to introduce a ﬁlter at the beginning of the pipeline (or, into Gen,
as Ellison did). By restricting the domain of the transduction, this ﬁlter—an
identity transduction on a singleton—ensures that no other input disturbs the
computation. So, for hierarchyCon-1Con-2...Con-N,andforeach
underlying representationWwe have to compile the following regular expression:

≤
∪
Id
{W}o Gen
≥
ooCon-1

ooCon-2

... ooCon-N (8)
Let us prove the correctness of this approach:
Theorem:Let all constraintsCon-i be represented by a transducerCon-iin-
serting violation marks, and let
makeworse:=deletechar o addviol o insertchar
InputooCon:=Inputo Con o
oId
Ran(Inputo Con o makeworse)
o removeviol

28 T. B´ır´o
If for hierarchyH=Con-1Con-2...Con-Nand underlying
representationW,
OT0:=Id
{W}o Gen OTi:=OTi−1oo Con-i
then the range ofOTNis the set of outputs with respect to underlying repre-
sentationWand rankingH.
Proof:By induction on the number of constraintsN.ForN= 0: by deﬁnition,
the range ofOT0is the candidate set corresponding toW.
ForN=k>0: We suppose that the range ofOTk−1is the set of candi-
dates returned by the part of the pipe-line before constraintCon-k. We have
to show thatOTkis the optimal subset ofOTk−1with respect to constraint
Con-k.
Letmdenote the number of violation marks assigned by constraintCon-
k to the most harmonic candidates in the range ofOTk−1. By the deﬁnition
ofmake worse, the range ofOTk−1o Con-k o makeworseincludes all strings
(words and non-words) with more thanmviolation marks, and only them. Thus,
the identity transduction in the deﬁnition of the optimality operatorootrans-
duces all strings with no more thanmmarks, and only them. Consequently, the
range ofOTkwill include exactly those elements of the range ofOTk−1that
violateCon-kmtimes. ≤
We have stepped back to the less ambitious proposal of Ellison [10]: we com-
pile a regular expression for each input. One ﬁrst formulates a ﬁnite transducer
realizing Gen, as well as transducer adding each candidate string the same num-
ber of violation marks as the constraints of the linguistic model do. Then, the
range of the regular expression (8) has to be compiled and read. Compilation—
even if it is a computationally complex task, primarily due to the set comple-
ment operation—can be done automatically, with any package handling regular
expressions.
Is stepping back to a ten-year old result something worth writing a paper on?
The good news, however, is that the approach proposed opens new perspectives
about a ﬁnite-state model of the lexicon.
4 Modeling a Complex Lexicon
Many linguistic phenomena can be described by using “co-phonologies”, by re-
ferring to exceptions or to “minor rules”.The discussion about the interaction
between morphology and phonology (here we just refer to the well-known “past
tense debate”) has also aﬀected OT [6]. On-going and further research shall
analyze whether a ﬁnite-state Optimality Theoretical approach has something
interesting to say about the issue. In the remaining space of the present paper,
we shall present the possible ﬁrst steps of such a research line.
Equation (8) allows for generalization. IfSLis a subset of the lexicon, the
following expression will deﬁne the surface representation of elements ofSL:

Squeezing the Inﬁnite into the Finite 29

≤
∪
IdSLo Gen
≥
ooCon-1

ooCon-2

... ooCon-N (9)
Note that elements ofSLmay “extinguish” each other: if a candidatew1corre-
sponding to some elementW1∈SLincurs less violation marks than the optimal
candidate corresponding to anotherW2, then no output is returned forW2.
Therefore,SLshould be the set of “analogous” words in the language.
The phonology of the language is then modeled thus:
∈
i

≤
∪
IdSLi
o Gen
≥
ooConi1

ooConi2

... ooConiN (10)
The lexicon is composed of subsets of words. Each subsetSLiis associated
with a hierarchyConi1Coni2...ConiN. Diﬀerent subsets may be
associated with the same hierarchy, but cannot be uniﬁed, unless some words
are erased from the language, as explained. Yet, once we have this structure,
nothing prohibits us to associate diﬀerent subsets with diﬀerent hierarchies.
6
Until now, if the sub-lexicons are ﬁnite, the complex expression in (10) is com-
piled into a simple ﬁnite set of UR-SR pairs. Yet, we claim that expression (10)
together with linguistically motivated constraints have a cognitive explanatory
value by restricting the possible lexicons:whatUR-SR mappings are thinkable?
Additionally, our on-going research tries to introduce some sort ofgeneraliza-
tioninto the sub-lexicons. Let thehasha#ofanelementaof the alphabet be
the following concatenation:
a#:=pc* |{a, pc}| pc*, (11)
wherepcis apunished change: whatever character followed by a specialpunish-
ment symbol.Thus,thehashof a character is its generalization: you can replace
it, you can add anything before and after it, but whatever change introduced
is marked by apunishment symbol. In the next step, the generalizationW#
of a memorized wordWis the concatenation of the hash of its characters in
the corresponding order. Last, we propose that if a learned (memorized) word
W∈SLi,thenalsoW#∈SLi.
With this generalization, the input of the grammar model (10) can also be
an unseen word—yet, notanyunseen word. The punishment symbols mea-
sure the “distance” of the input from previously memorized, “similar” words,
in terms of letter changes. An input may match the hash of several learnt lexical
items, possibly in diﬀerent sublexicons, in which case more outputs are generated
6
One can speculate about how co-phonologies have emerged in languages. Decom-
posing the lexicon into sub-lexicons is necessary, otherwise some words would be
ineﬀable,i.e., unpronounceable. Thus, an acquisition model should be able to open
new sub-lexicons. Then, as the constraint pipe-line is connected to each sub-lexicon
independently, nothing prohibits constraint re-ranking for certain sub-lexicons. A
prediction is that language varieties diﬀer in the constraint ranking corresponding
exactly to these sub-lexicons, which reminds us the similar proposal of [2].

30 T. B´ır´o
simultaneously in various pipe-lines.
7
These symbols are preserved during the
transduction, and the output with the minimal number of punishment symbols
is the predicted form. We can use a FS OT-style ﬁlter on the punishment sym-
bols, and we obtain a sort of memory-based learning. The consequences of this
proposal, learnability issues and its possible cognitive relevance are subject to
future research.
5Conclusion
In the introduction, we have raised the problem of how one can handle the in-
ﬁnite set of OT candidates appearing in contemporary linguistic work within
the framework of a plausible psycholinguistic model or a working language tech-
nology application. In this paper, we have proposed a new way of using ﬁnite
state technology in order to solve that problem. We have reviewed why it is
not possible to create a FS transducer realizing an OT-system in general, even
if Gen is a regular relation, and constraints are also regular (at least in some
sense). Subsequently, we have proposed to make thematching approachexact
by planting a ﬁlter before Gen.
This way we have obtained an alternative to Ellison’s algorithm [10]. By com-
piling (8) for each input separately, we can calculate the optimal element of
the possibly inﬁnite candidate set. Finally, we have shown how this result can
be generalized into a model of the lexicon, yet further research has to prove
the cognitive adequateness of such a model. For instance, does it account for ob-
served morpho-phonological minor rules? Preliminary results show that diﬀerent
hierarchies are compiled in signiﬁcantly diﬀerent time. If so, do less frequently
attested language typologies correspond to rankings more diﬃcult to compile?
The present paper hope to have paved the way for such future research.
Acknowledgments
I wish to acknowledge the support of the University of Groningen’s Program
for High-Performance Computing. I also would like to thank Gosse Bouma and
Gertjan van Noord for valuable discussions.
References
1. D. M. Albro. Taking primitive Optimality Theory beyond the ﬁnite state. In
Eisner,J.L.KarttunenandA.Th´eriault (eds.): Finite-State Phonology: Proc. of
the 5th Workshop of SIGPHON, pages 57–67, Luxembourg, 2000.
7
The input may turn out to be ineﬀable in some—or all—of the pipe-lines. Importantly,
constraints and the deﬁnition of the hash operation should be such thatW#maynotren-
derWineﬀable in its own subset. Many, yet not all constraints assign an equal or higher
number of violation marks to the best candidate of a longer input. This is also the reason
why apunished changedoes not include an empty string, allowing for shortening—at
any rate, it is quite rare that longer forms inﬂuence shorter forms by analogy.

Squeezing the Inﬁnite into the Finite 31
2. A. Anttila and Y. Cho. Variation and change in optimality theory.Lingua, 104(1-
2):31–56, 1998.
3. T. B´ır´o. Quadratic alignment constraints and ﬁnite state Optimality Theory. In
Proc. of the Workshop on FSMNLP, at EACL-03, Budapest, pages 119–126, also:
ROA-600,
8
2003.
4. T. B´ır´o. When the hothead speaks: Simulated Annealing Optimality Theory for
Dutch fast speech. presented at CLIN 2004, Leiden, 2004.
5. T. B´ır´o. How to deﬁne simulated annealing for optimality theory? InProc. of
the 10th Conference on Formal Grammar and the 9th Meeting on Mathematics of
Language, Edinburgh, August 2005.
6. L. Burzio. Missing players: Phonology and the past-tense debate.Lingua,
112:157–199, 2002.
7. J. Eisner. Eﬃcient generation in primitive Optimality Theory. InProc. of ACL
1997 and EACL-8, Madrid, pages 313–320, 1997.
8. J. Eisner. Directional constraint evaluation in Optimality Theory. InProc. of
COLING 2000, Saarbr¨ucken, 2000.
9. J. Eisner. Comprehension and compilation in Optimality Theory. InProc. of ACL
2002, Philadelphia, 2002.
10. T. M. Ellison. Phonological derivation in Optimality Theory. InCOLING-94,
Kyoto, pages 1007–1013, also: ROA-75, 1994.
11. R. Frank and G. Satta. Optimality Theory and the generative complexity of con-
straint violability.Computational Ling., 24(2):307–315, 1998.
12. D. Gerdemann and G. van Noord. Approximation and exactness in ﬁnite state
Optimality Theory. InJ. Eisner, L. Karttunen, A. Thriault (eds): SIGPHON
2000, Finite State Phonology, 2000.
13. G. J¨ager. Gradient constraints in ﬁnite state OT: The unidirectional and the
bidirectional case.ROA-479, 2002.
14. D. C. Johnson.Formal Aspects of Phonological Description. Mouton, The Hague
[etc.], 1972.
15. L. Karttunen. The proper treatment of Optimality Theory in computational
phonology. InFinite-state Methods in NLP, pages 1–12, Ankara, 1998.
16. J. Kuhn. Processing optimality-theoretic syntax by interleaved chart parsing and
generation. InProc. of ACL-2000, Hongkong, pages 360–367, 2000.
17. A. Prince and P. Smolensky. Optimality Theory, constraint interaction in genera-
tive grammar. RuCCS-TR-2, ROA Version: 8/2002, 1993.
18. A. Prince and P. Smolensky.Optimality Theory: Constraint Interaction in Gener-
ative Grammar. Blackwell, Malden, MA, etc., 2004.
19. B. Tesar and P. Smolensky.Learnability in Optimality Theory. The MIT Press,
Cambridge, MA - London, England, 2000.
20. B. Turkel. The acquisition of optimality theoretic systems. m.s., ROA-11, 1994.
8
ROA stands forRutgers Optimality Archiveathttp://roa.rutgers.edu/.

A Novel Approach to Computer-Assisted
Translation Based on Finite-State Transducers
ﬀ
Jorge Civera
1
, Juan M. Vilar
2
,ElsaCubel
1
, Antonio L. Lagarda
1
,
Sergio Barrachina
2
, Francisco Casacuberta
1
,andEnriqueVidal
1
1
Departamento de Sistemas Inform´aticos y Computaci´on
Universitat Polit`ecnica de Val`encia
Instituto Tecnol´ogico de Inform´atica, E-46071 Val`encia, Spain
[email protected]
2
Departamento de Lenguajes y Sistemas Inform´aticos
Universitat Jaume I, E-12071 Castell´on de la Plana, Spain
Abstract.Computer-Assisted Translation (CAT) is an alternative ap-
proach to Machine Translation, that integrates human expertise into the
automatic translation process. In this framework, a human translator in-
teracts with a translation system that dynamically oﬀers a list of trans-
lations that best completes the part of the sentence already translated.
Stochastic ﬁnite-state transducer technology is proposed to support this
CAT system. The system was assessed on two real tasks of diﬀerent
complexity in several languages.
1 Introduction
State-of-the-art Machine Translation (MT
ing high quality translations. This drawback leads us to introduce an alternative
approach to the translation problem that brings human expertise into the MT
scenario. This idea was proposed in [13] and can be illustrated as follows. Initially,
the human translator is provided with a possible translation for the sentence to
be translated. Unfortunately, in most cases, this translation is far from being
perfect, so the translator amends it and asks for a translation of the part of the
sentence still to be translated (completion). This latter interaction is repeated
as many times as needed until the ﬁnal translation is achieved.
The scenario described in the previous paragraph can be seen as an iterative
reﬁnement of the translations oﬀered by the translation system, that while not
having the desired quality, can help the translator to increase his/her produc-
tivity. Nowadays, this lack of translation excellence is a common characteristic
in all Machine Translation systems. Therefore, the human-machine synergy rep-
resented by the Computer-Assisted Translation (CAT) paradigm seems to be
more promising than fully-automatic translation in the near future.
The CAT approach has two important aspects: the models need to provide ad-
equate completions and they have to do so eﬃciently under usability constrains.
ﬀ
This work has been supported by the European Union under the IST Programme
(IST-2001-32091) and the Spanish project TIC2003-08681-C02.
A. Yli-Jyr¨a, L. Karttunen, and J. Karhum¨aki (Eds.): FSMNLP 2005, LNAI 4002, pp. 32–42, 2006.
cﬀSpringer-Verlag Berlin Heidelberg 2006

A Novel Approach to Computer-Assisted Translation 33
To fulﬁll these two requirements, Stochastic Finite-State Transducers (SFST)
have been selected since they have proved to be able to provide adequate trans-
lations [12, 1, 3]. In addition, eﬃcient parsing algorithms can be easily adapted
in order to provide completions.
The rest of the paper is structured as follows. Next section introduces the
general setting for MT and ﬁnite-state models. In Section 3, the search procedure
for interactive translation is explained. Experimental results are presented in
Section 4. Finally, some conclusions and future work are exposed in Section 5.
2 Machine Translation with Finite-State Transducers
In a probabilistic framework, given a source sentences, the goal of MT is to ﬁnd
a target sentenceˆtthat:
ˆt=argmax
t
Pr(t|s) = argmax
t
Pr(t,s). (1)
It should be noted that the maximisation problem stated above is NP-hard [10].
The joint distribution Pr(t,s) can be modelled by a SFSTT[15]:
ˆt=argmax
t
Pr(t,s)≈argmax
t
PrT(t,s). (2)
ASFSTTis deﬁned as a tupleﬃΣ,Δ,Q,q0,F,δ,p,fωwhereΣandΔare ﬁnite
sets of source and target symbols respectively,Qis a ﬁnite set of states,q0is the
initial state,F⊆Qis the set of ﬁnal states,δ⊆Q×Σ×Δ
ﬀ
×Qis the set of
transitions,p:Q×Σ×Δ
ﬀ
×Q→[0,1] is a transition probability function and
f:Q→[0,1] is the state probability function. The functionspandfsatisfy:
f(q)+
ﬀ
q
ﬀ
∈Q, a∈Σ, ω∈Δ
ﬀ
p(q, a, ω, q
ω
)=1∀q∈Q. (3)
SFSTs have been successfully applied into many translation tasks [1, 5]. Fur-
thermore, there exist eﬃcient search algorithms like Viterbi [16] for the best path
and the Recursive Enumeration Algorithm (REA) [11] for then-best paths.
A possible way of inferring SFSTs is the Grammatical Inference and Align-
ments for Transducer Inference (GIATI) technique [7]. Given a ﬁnite sample of
string pairs, it works in three steps:
1. Building training strings. Each training pair is transformed into a single
string from an extended alphabet to obtain a new sample of strings. The
“extended alphabet” contains words or substrings from source and target
sentences coming from training pairs.
2. Inferring a (stochastic) regular grammar. Typically, a smoothedn-gram is
inferred from the sample of strings obtained in the previous step.
3. Transforming the inferred regular grammar into a transducer. The sym-
bols associated to the grammar rulesare adequately transformed into
source/target symbols, thereby transforming the grammar inferred in the
previous step into a transducer.

34 J. Civera et al.
The transformation of a parallel corpus into a corpus of single sentences is
performed with the help of statistical alignments: each word is joined with its
translation in the output sentence, creating an “extended word”. This joining is
done taking care not to invert the order of the output words. The third step is
trivial with this arrangement. In our experiments, the alignments are obtained
using the GIZA++ software [14], which implements IBM statistical models [4].
3 Interactive Search
The concept of interactive search is closely related to the CAT paradigm. This
paradigm introduces a new factortpinto the general MT equation (Eq. 1).tp
represents a preﬁx of the target sentence obtained as a result of the interaction
between the human translator and the MT system.
An example of this interaction is shown in Fig. 1. In each iteration, a preﬁx
(tp) of the target sentence has somehowbeen ﬁxed by the human translator
in the previous iteration and the CAT system computes its best (orn-best)
translation suﬃx hypothesis (ˆts) to complete this preﬁx.
ITER-0 (tp)(
ITER-1
(ˆts) (Haga clic para cerrar el di´alogo de impresi´on)
(a) (Haga clic)
(k) (en)
(tp) (Haga clic en)
ITER-2
(ˆts) (ACEPTAR para cerrar el di´alogo de impresi´on)
(a) (ACEPTAR para cerrar el)
(k) (cuadro)
(tp) (Haga clic en ACEPTAR para cerrar el cuadro)
FINAL
(ˆts) (de di´alogo de impresi´on)
(a) (de di´alogo de impresi´on)
(k) (#)
(tp≡t)(Haga clic enACEPTAR para cerrar el cuadrode di´alogo de impresi´on)
Fig. 1.Example of a CAT system interaction to translate into Spanish the English
sentence“Click OK to close the print dialog”extracted from a printer manual. Each step
starts with a previously ﬁxed target language preﬁxtp, from which the system suggests
asuﬃxˆts. Then the user accepts part of this suﬃx (a)andtypessomekeystrokes(k),
in order to amend the remaining part ofts. This produces a new preﬁx, composed
by the preﬁx from the previous iteration and the accepted and typed text, (a)(k),
to be used astpin the next step. The process ends when the user enters the special
keystroke ”#”. In the ﬁnal translation,t, all the text that has been typed by the user
is underlined.
Giventp
ˆts, the CAT cycle proceeds by letting the user establish a new,
longer acceptable preﬁx. To this end, he or she has to accept a part (a)oftp
ˆts
(or, more typically, just a preﬁx ofˆts). After this point, the user may type some
keystrokes (k) in order to amend some remaining incorrect parts. Therefore,
the new preﬁx (typically) encompassestpfollowed by the accepted part of the

A Novel Approach to Computer-Assisted Translation 35
system suggestion,a,plusthetext,k, entered by the user. Now this preﬁx,
tpak, becomes a newtp, thereby starting a new CAT prediction cycle.
Ergonomics and user preferences dictate exactly when the system can start
its new cycle, but typically, it is started after each user-entered word or even
after each new user keystroke.
Perhaps the simplest formalization of the process of hypothesis suggestion of
a CAT system is as follows. Given a source textsand a user validatedpreﬁxof
the target sentencetp,searchforasuﬃxof the target sentence that maximises
thea posterioriprobability over all possible suﬃxes:
ˆts=argmax
ts
Pr(ts|s,tp). (4)
Taking into account that Pr(tp|s) does not depend onts,wecanwrite:
ˆts=argmax
ts
Pr(tpts|s), (5)
wheretptsis the concatenation of the given preﬁxtpand a suﬃxts.Eq.5is
similar to Eq. 1, but here the maximisation is carried out over a set of suﬃxes,
rather than full sentences as in Eq. 1. This joint distribution can be adequately
modeled by means of SFSTs [9].
The solution to this maximisation problem has been devised in two phases.
The ﬁrst one copes with the extraction of a word graphWfrom a SFSTTgiven
asourcesentences. In a second phase, the search of the best translation suﬃx
(or suﬃxes) according to the Viterbi approach [16] is performed over the word
graphWgiven a preﬁxtpof the target sentence.
3.1 Word Graph Derivation
A word graph is a compact representation of all the possible translations that
aSFSTTcan produce from a given source sentences[9, 8]. In fact, the word
graph could be seen as a kind of weighted ﬁnite-state automaton in which the
probabilities are not normalized.
Formally, given a SFSTT=ﬃΣ,Δ,Q,q0,F,δ,p,fωand a source sentence
s=s1,···,si,···s
|s|, the constructed word graph is deﬁned as a tupleW=
ﬃΔ, Q
ω
,q
ω
0
,F
ω
,δ
ω
,p,fω:
Q
ω
=Q×i:0≤i≤|s|
δ
ω
={((q, i−1),t,(q
ω
,i))|(q, si,t,q
ω
)∈δ}
q
ω
0
=(q0,0)
F
ω
=
ﬃ
(q
ω
,|s|)|((q,s
|s|,t,q
ω
)∈δ)∧(q
ω
∈F)
ω
There are a couple of minor issues to deal with in this construction. On the
one hand, the output symbol for a given transition could contain more than
one word. In this case, auxiliary states were created to assign only one word
for each transition and simplify the posterior search procedure. On the other

36 J. Civera et al.
hand, it is possible to have words in the input sentence that do not belong to
the input vocabulary in the SFST. This problem is solved with the introduction
of a special generic “unknown word” in the input vocabulary of the SFST.
Intuitively, the word graph generated retains those transitions in the SFST
that were compatible with the source sentence along with their transition prob-
ability and output symbol(s
parsing process of the source sentence, over the SFST, are considered ﬁnal states
(as well as those states reachable withλ-transitions from them).
Once the word graph is constructed, it can be used to ﬁnd the best completions
for the part of the translation typed by the human translator. Note that the word
graph depends only on the input sentence, so it is used repeatedly for ﬁnding
the completions of all the diﬀerent preﬁxes provided by the user.
3.2 Search forN-best Translations Given a Preﬁx of the Target
Sentence
Ideally, the search problem consists in ﬁnding the target suﬃxtsthat maximises
thea posterioriprobability given a preﬁxtpof the target sentence and the input
sentences, as described in Eq. 5. To simplify this search, it will be divided into
two steps or phases. The ﬁrst one would deal with the parsing oftpover the
word graphW. This parsing procedure would end reaching a set of statesQ
ω
p
that deﬁne paths from the initial state whose associated translations include
tp. To clarify this point, it is important to note that each stateqin the word
graph deﬁnes a set of translation preﬁxesPq. This set of translation preﬁxes is
obtained from the concatenation of the output symbols of the diﬀerent paths
that reach this stateqfrom the initial state. Therefore, the setPqof each state
inQ
ω
p
includestp. The second phase would be the search of the most probable
translation suﬃx from any of the states inQ
ω
p. Finally, the complete search
procedure extracts a translation from the word graph whose preﬁx istpand its
remaining suﬃx is the resulting translation suﬃxts.
Error-Correcting Parsing.In practice, however, it may happen thattpis
not present in the word graphW. The solution is not to usetpbut a preﬁxt
ω
p
that is themost similartotpin some string distance metric. The metric that
will be employed is the well-known minimum edit distance based on three basic
edit operations: insertion, substitutionand deletion. Therefore, the ﬁrst phase
introduced in the previous paragraph needs to be redeﬁned in terms of the search
of those states inWwhose setPqcontainst
ω
p, that is, the set of statesQ
ω
p.It
should be remarked thatt
ω
p
is not unique, but there exist a set of preﬁxes inW
whose minimum edit distance totpis the same and the lowest possible.
Given a translation preﬁxtp, the computation ofQ
ω
p
is eﬃciently carried out
by applying an adapted version of the error-correcting algorithm for regular
grammars over the word graphW. This algorithm returns the minimum edit
costc(q) with respect totpfor each stateqinW.Tobemoreprecise,this
minimum edit cost is the lowest minimum edit cost betweentpand the set of
preﬁxesPqof each stateq. Finally,Q
ω
pis deﬁned as:

A Novel Approach to Computer-Assisted Translation 37
Q
ω
p
=argmin
q∈Q
ﬀ
c(q)(6)
The asymptotic cost of this algorithm isO(|tp|· |Q
ω
|·B), whereBis the
(average) branching factor of the word graphW.
The implementation of the error-correcting parsing is further improved by
visiting the states inWin topological order, and incorporating beam-search
techniques to discard those states whose minimum edit cost is worse than the
best minimum edit cost at the current stage of the parsing by a given constant.
Moreover, given the incremental nature oftp, the error-correcting algorithm
takes advantage of this peculiarity to parse only the new suﬃx oftpprovided
by the user in the last interaction, that is, the concatenation ofaandk.
As mentioned before, once the setQ
ω
p
has been computed, the search of the
most probable translation suﬃx could be calculated from any of the states in
Q
ω
p. In practice, only one stateqpfromQ
ω
pis selected to ﬁnd the suﬃxts.
This selected stateqpmaximises thea posterioriprobability of the word-graph
preﬁxt
ω
pdeﬁned during the error-correcting parsing process. This maximisation
is performed according to the Viterbi approximation [16].
N-best Search.The actual implementation of this CAT system is able to
provide a set of diﬀerent translation suﬃxes, instead of a single suggestion. To
this purpose, an algorithm that searches for then-best translation suﬃxes in a
word graph is required. Among then-best algorithms available, the Recursive
Enumeration Algorithm (REA) described in [11] was selected. The main two
reasons that support this decision are its simplicity to calculate best paths on
demand and its smooth integration with the error-correcting parsing algorithm.
Basically, the interaction between these two algorithms, error-correcting and
n-best, consists in the supplement of the stateqpby the former, so that the
n-best translation suﬃxes can be calculated from this state by the latter.
The version of REA included in the CAT system, which is being described,
stores for each stateqinW, the sorted list of current best paths (in the form of
next state in the best path) fromqto any ﬁnal state. The length of this sorted
list depends on the number of transitions leavingq. During the initialisation
of REA, the initial sorted list of best paths for each state is calculated start-
ing from the ﬁnal states and visiting the rest of states in backward topological
order. This last condition imposes a total order inQ
ω
that favours the eﬃcient
calculation of the sorted list of best paths. This is so because each state is visited
only once, and once the best paths of the preceding states have already been
computed.
Then, given a stateqpfrom which then-best translation suﬃxes need to be
calculated, REA ﬁrst extractsthe 1-best path from the stateqp, since it was
precomputed during REA initialisation. Ifn>1, then the next best path from
qpwill be obtained. The next best path at stateqpcan be found among the
candidate paths still left in the sorted list of this state and the second best
path through the transition traversed in the 1-best path just extracted. This
fact requires the recursive calculationof the second best path (whenever exists)
through the states visited in the 1-best path. This same rationale is applied to

38 J. Civera et al.
the calculation of subsequent best paths untiln-best diﬀerent translation suﬃxes
have been obtained or no more best paths can be found.
4 Experimental Framework and Results
The SFST models introduced in the previous sections were assessed through
some series of experiments with two diﬀerent corpora that were acquired and
preprocessed in the framework of the TransType2 (TT2
section, these corpora, the assessment metrics and the results are presented.
4.1 XRCE and EU Corpora
Two bilingual corpora extracted from diﬀerent semantic domains were used in
the evaluation of the CAT system described. The language pairs involved in the
assessment were English/Spanish, English/French and English/German.
The ﬁrst corpus, namelyXRCEcorpus, was obtained from a miscellaneous
set of printer user manuals. Some statistics of this corpus are shown in Table 1.
Table 1.The “XRCE” and “EU” corpora English(En) to/from Spanish(Sp
man(Ge) and French(Fr). Trigrams models were used to compute the test perplexity.
(Kdenotes×1.000, andMdenotes×1.000.000).
XRCE EU
En/SpEn/GeEn/FrEn/SpEn/GeEn/Fr
Train
Sent. pairs (K56 49 53 214223215
Run. words (M0.6/0.70.6/0.50.6/0.75.9/6.66.5/6.16.0/6.6
Vocabulary (K)26/3025/2725/3784/9787/15385/91
Test
Sentences (K)1.11.0 1.0 0.80.80.8
Run. words (K)8/99/1011/1020/2320/1920/23
Perplexity 107/6093/169193/13596/7295/15397/71
It is important to remark that the English manuals are diﬀerent in each pair
of languages. The size of the vocabulary in the training set is about 25.000 words
in most of the language pairs that can be considered to be a broad lexicon. In
the test set, even though all test sets have similar size, the perplexity varies
abruptly over the diﬀerent language pairs.
The second dataset was compiled from the Bulletin of the European Union,
which exists in the 11 oﬃcial languages of the European Union. This dataset is
known as theEUcorpus and is publicly available on the Internet. A summary
ofitsfeaturesispresentedinTable1.
The size of the vocabulary of this corpus is at least three times larger than
that of theXRCEcorpus. These ﬁgures together with the amount of running
words and sentences reﬂect the challenging nature of this task. However, the
perplexity of theEUtest set is similar to that of theXRCE. This phenomenon
can be intuitively explained through the more uniform grammatical structure of
the sentences in theEUcorpus.

A Novel Approach to Computer-Assisted Translation 39
4.2 Translation Quality Evaluation
The assessment of the CAT system has been carried out based on two measures:
1.Translation Word Error Rate(TWER). It is deﬁned as the minimum number
of word substitution, deletion and insertion operations required to convert
the target sentence provided by the translation system into the reference
translation, divided by the number of words of the reference translation. It
can also be seen as the ratio of the minimum edit distance between the system
and the reference translation, and the number of words of the reference
translation [1, 6].
This metric is employed to evaluate the quality of the complete transla-
tions oﬀered by the system when no preﬁx is taken into consideration, that
is, no interaction with the user is assumed.
2.Key-Stroke Ratio(KSR). Number of interactions, as the sum of mouse ac-
tions (to selecta) and keystrokes (to typek), that are necessary to achieve
the reference translation plus the ﬁnal translation-acceptance keystroke di-
vided by the number of characters of the reference translation [9, 8].
KSR reﬂects the ratio between the number of interactions of a ﬁctitious
user when translating a given text using a CAT system compared to the
number of interactions, which this user would need, to translate the same
text without using a CAT system. Thus, this measure gives a clear idea of
the amount of work that a translator would be saving when translating using
a CAT system.
4.3 Experimental Results
These experimental results were obtained with GIATI transducers based on
smoothed trigram language models for theXRCEcorpus and smoothed 5-gram
language models for theEUcorpus (see Table 2).
The translation metrics presented in the previous section were calculated on
the test set for all the pairs of languages and both directions, translating from
English to a non-English language and from a non-English language to English,
as shown on the left-most column of Table 2. Moreover, the results were obtained
assuming two possible cases, the CAT system only oﬀers the best translation or
the 5-best translations. In the latter case, the calculation of a given assessment
metric was conducted considering that translation out of the ﬁve suggested trans-
lations that most minimises the corresponding error measure. As expected, there
is a notable improvement when comparing 1 to 5-best translation error measures.
Analysing the results accomplished in theXRCEcorpus, it is observed that
the TWER and KSR rates for English/Spanish language pairs are substantially
lower than those obtained in the rest of language pairs. A possible reason be-
hind the error rate discrepancies betweenEnglish/Spanish pairs with respect to
English/German and English/French pairs could be found in the perplexity dif-
ferences shown in Table 1. The Spanish test perplexity is signiﬁcantly lower than
that of the rest of languages and this fact is transformed into better translation
results. Another reason for the outperforming results of the English/Spanish

Other documents randomly have
different content

VIII.
A NAPEVŐ.
– Színhely: ismét a nagy szalón. A déli látogató megint: liptai
Thurzóné. Elza hosszan várakoztat magára; végre megjelenik. –
Eäza. – Bocsáss meg, kedves Pálma…
Páäma (ijedten). – Istenem, csak nem az ágyból keltél fel
miattam?!… Olyan zilált vagy!…
Eäza. – Dehogy! Már régen fenn vagyok és pompásan érzem
magam…
Páäma. – Annál jobb, annál jobb… bár tartok tőle, hogy csak meg
akarsz nyugtatni. Hadd lássalak! Jesszusom, hiszen nem is sápadt
vagy, hanem zöld!
Eäza. – Rosszul látsz, mert a szoba egy kicsit homályos. Úgy
nézek ki, mint máskor.
Páäma. – Az igaz, hogy egy idő óta mindig igen rossz színben
vagy!… szinte megöregedtél. A multkor meg is kérdeztem Alfrédtől:
»Mondja, mit csinál maga a menyasszonyával?! Hiszen nem lehet
ráismerni!«
Eäza. – Nagyon szép, édes Pálma, hogy ennyire érdeklődöl, de
biztosítalak…
Páäma. – Nem, nem, hiába beszélsz… és én kétségbe vagyok
esve, mert nem gondolhatok egyebet, mint, hogy ezt én okoztam, a
fecsegésemmel és ezekkel az átkozott levelekkel… Mondtam is a
multkor Alfrédnak, hogy már keservesen megbántam, a mit tettem…

Eäza. – Ez érdekes, hogy szépen megbeszéled vele a különböző
démarche-okat, a mikkel azon fáradozol, hogy őt kontrakarirozd!…
És ő, mit szól mindezekhez ő?… A mi közös barátunk, a ki annyit
diskurál veled a menyasszonyáról és a ki a menyasszonyának azt
mondja rólad, hogy téged kár szabadlábon hagyni, mert köz- és
önveszélyes őrült vagy?…
Páäma. – A kis kedves!… Te, Elza, esküszöm neked, hogy a föld
még nem hordott a hátán ilyen gazembert! Igen, eleinte azzal
fenyegetőzött, hogy be fog csukatni az őrültek házába. De ez nem
sült el, mert jól néznénk ki, ha minden asszonyt be lehetne vinni a
Svarczerhez, csak azért, mert nem jól él az urával!… Aztán rám akart
ijeszteni, hogy elárul a férjemnek, hogy elmond neki mindent… és ő
képes is erre, mert nincs az a nemtelenség, a mire ne volna képes,
ha két krajczár haszna van belőle… De ez se hatott, mert jól tudja,
hogy a férjemmel azt hitetem el, a mit akarok, és semmi szín alatt se
fogja azt hinni, a mi rá nézve kellemetlen… Végre azt eszelte ki,
hogy be fog mártani előtted, és nekem a legnagyobb czinizmussal
csak annyit mondott, hogy csak beszéljek bátran és ne felejtsek el
semmit, mert jobb, ha a menyasszonya előre tisztában lesz
mindennel s a házasság után nem pattanhat ki semmi, a mit én már
előbb nem szellőztettem… Azt mondta: tulajdonképpen szolgálatot
teszek neki, és azóta arczátlan nyugalommal kérdezősködik, hogy
micsoda eredményeket értem el a leleplezéseimmel…
Eäza. – Au fond, igaza van, mert jobb, ha idejekorán tisztázzuk az
eszméket.
Páäma. – Te még azt hiszed, hogy tisztázhatod őket?
Eäza. – Miért ne?
Páäma. – Elolvastad a leveleket?
Eäza. – No, majd nem olvasom el az ilyen leveleket!
Páäma. – Hát mit szólsz a vőlegényedhez?
É

Eäza. – Édes istenem, úgy látszik, igazad volt, és a mi közös
barátunk csakugyan nem kifogástalan ember. De hát hol vannak a
Lohengrinek?
Páäma. – Mindössze ennyi a megjegyzésed?
Eäza. – Miért simfeljek a vőlegényemre? Én simfeljek rá? Eleget
simfelik a politikában, és szomorú, hogy nem ok nélkül. De ha már
benne vagyok a szószban, hát mit csináljak?
Páäma. – Nem értelek. Mindezek után, trocz alledém, férjhez
akarsz menni hozzá?! Ahhoz az emberhez, a kit én… megfizettem!
Eäza. – Pardon, csak kölcsön adtál neki. Lehet, hogy soha se
akarta visszafizetni, de csak kölcsönöket fogadott el. A mi igaz, az
igaz. És lehetséges, hogy igazságtalanul ítéljük meg, a mikor
gyanusítjuk, hogy soha se akart fizetni. Talán éppen azért kereste
folyton a nagy partit, hogy az adósságait visszafizethesse. Nekem
őszintén megmondta, hogy nem venne el, ha szegény leány volnék.
Na hát a kölcsönt vissza fogja fizetni kamatostul, sőt a kamatok
kamatjával, ha akarod. Ez a kimagyarázkodás egy kicsit
kellemetlen… rád nézve is… mind a kettőnkre nézve… de essünk túl
rajta. Nem valami díszes história, az igaz. És én meglehetősen
különösnek találom a világ berendezését, de úgy látszik, előfordul,
hogy a vőlegénynek nem csak a jövőjét kell megvennünk, hanem
meg kell vásárolnunk, vissza kell vásárolnunk a multját is… ki kell őt
váltani… Úgy látszik, előfordul, hogy a legjelesebb férfiú is, ha
másképpen nem lehet, egy darabig hitelbe élvezi az életet, a
pezsgőt, a kocsit, a czigányozást s a többit, aztán szivének trónjára
beültet egy ártatlan leányt, a ki készségesen ki fogja fizetni, a mit a
szívkirály előbb hitelbe élvezett, a pezsgő-kontót, a kocsitartozást,
toute la lyre, s hálájának kifejezésébe bele fogja foglalni az összes,
régebben szerepelt kisasszonyokat is… Ez egy kicsit furcsa, hogy a
menyasszonynak kell mindezekről gondoskodnia, visszamenőleg…
de, úgy látszik, a házasság, a maga czikornyátlanságában,
voltaképpen csak afféle konverzionális művelet… az egyik fél kielégíti
az összes apró hitelezőket, annak fejében, hogy a másik fél leköti

magát… és meg is esküszik rá, hogy ezentúl csak egyfelé fogja
fizetni a kamatokat… Ez nem annyira vidám, mint inkább undorító,
de éppen nem ritkaság… A formák gondos betartásával, a
valóságnak ügyes és szép felöltöztetésével gavalléroknál, grófoknál
is előjön… mondhatni: gang und gébe. Na hát én a világ rendjét
nem fogom megváltoztatni… és nem várom a Lohengrint, a ki
elvenne egy garas nélkül is… mert meglehet, hogy ha lecseppenne
az égből, az is mindjárt a brabanti uradalom után tudakozódnék…
hanem egyszerűen beleilleszkedem a társadalmi rendbe. Szóval, légy
nyugodt, a mult ki lesz egyenlítve, és a mi téged illet, nem
panaszkodhatol, mert a pénz nem veszett el, s neked a többi: a
szebb emlékek, a vidám órák, a bókok, vagyis, »a szív zenéje«, und
so weiter, a holdvilág, a csalogány dala, mind ingyen maradt!
Páäma. – Hallatlan leány vagy! Azért vágsz a fejemhez ilyen
rettenetes szotizokat, mert meg akarlak menteni ettől a
Schinderhannestől?!… Nem értelek. Én a legnagyobb nyíltsággal
szólok hozzád; megalázkodom előtted, s a mi ennél is több,
elmondom neked a legszégyenletesebb titkaimat, csakhogy
megóvjalak egy nagy veszedelemtől. S te a helyett, hogy kiadnád az
útját ennek a sviháknak, és nekem köszönetet mondanál, hogy
megszabadítottalak tőle, cserében a jóakaratomért, malicziákkal
felelsz, és azzal bünteted meg magadat, hogy juszt is férjhez mégy
ahhoz, a kit nem szeretsz, és a ki nem méltó hozzád!
Eäza. – Kedves Pálma, te szives voltál elismerni, és valóban a
legnagyobb nyíltsággal, hogy a jóakaratod nem egészen önzéstelen.
Elmondtad, hogy meg akarod tartani az emberedet, akárminők az
erkölcsi fogyatkozásai. Nem méltó hozzám, de neked megfelelő…
Páäma. – Az más. Én szeretem ezt az embert, akármilyen
kucséber… De én asszony vagyok, és te ezt nem érted. Egy leány
nem szerethet; egy leány csak álmodozhatik.
Eäza. – Eh, kedves Pálma, te erre már nem emlékszel!… annak,
hogy te leány voltál, már jó ideje!… De ne vitatkozzunk erről… Annyi
bizonyos, hogy egyelőre én se szándékozom lemondani róla… Pedig

elhiszem, hogy éppen nem az az ember, a kiről álmodozni lehet… De
úgy látszik, hogy ez nem határoz, és úgy látszik, hogy én is
szeretem.
Páäma. – Ezt beszéld el a Blaschkenének; az talán elhiszi. Az egész
világ tudja, hogy mind a kettőtöktől távol van minden érzelgősség…
Alfréd soha se csinált titkot belőle, hogy csak a pénzedért vesz el… s
a multkor magad is elismerted, hogy csak rezonból méssz hozzá,
mint olyan fiatal emberhez, a kinek nagy jövője van, a kinek az
oldalán a legmagasabb körökbe s ott is vezető szerephez juthatsz…
a mi ugyan nem igaz, de ez a gazember a legokosabb lányokkal is el
tudja hitetni ezt a szamárságot…
Eäza. – A multkor is mondtam, hogy már kezdem szeretni…
Páäma. – No, ha csak akkor fogtál hozzá!… A mi azóta történt,
aligha tett valami nagyon szerelmessé…
Eäza. – A mi azóta történt?… Nem, kedvesem, ez nem távolított el
tőle, sőt ellenkezőleg. Meglehet, hogy igazán csak a belátás hozott
össze bennünket… Belém disputálta, hogy ő rám nézve a képzelhető
legjobb parti, s utóljára elhittem neki… Abban igazad van, hogy
eleinte minden lelkesedés nélkül mentem bele a dologba… De a
mióta akadályokat látok, drámai akadályokat, csunya akadályokat:
kezdek makacsul ragaszkodni az eszméhez… Én már ilyen vagyok!…
És hogy mindent elmondjak: azt hiszem, tulajdonképpen te vagy az,
a ki szorosabbra fűzted ezt a frigyet!
Páäma. – Feláldozod magad, csak azért is?!… Egyedül azért, hogy
elüsd a kezemről?!… Hogy akármilyen keveset ér, ne maradjon az
enyém?!…
Eäza. – Nem, kedves Pálma, erre a triumfusra nem vágyódom…
egyáltalában nem ingerel, hogy veled, vagy akárkivel
versenyezzem… egyszerűen nem törődöm mással, csak a magam
érdekével… De ezt nem vagyok hajlandó feláldozni holmi
szentimentális tekintetekből. És a mióta látom, hogy milyen
makacsul ragaszkodol a vőlegényemhez, én is kezdek ragaszkodni

hozzá, mert arra kell gondolnom: lám, lám, úgy látszik, vannak
érdemei, a melyeket én még nem ismerek!…
Páäma. – Furcsákat beszélsz össze-vissza, leány létedre! És
mondhatom, hamarjában nem is tudom, hogy mit feleljek erre, mert
hátha félreértettelek?!… Látod, kedvesem, én szerencsétlen asszony
vagyok… rossz asszony, ha éppen úgy akarod… de leánykoromban
inkább meghaltam volna, mint hogy ilyen aperszűket tegyek…
Eäza. – Azóta, azt hiszem, egy kicsit bátrabb lettél… Különben
megnyugtathatlak, mert csakugyan félreértettél. Úgy látszik, nagyon
is asszonyosan gondolkozol… szóval nem akartam mondani semmi
különöset. Úgy értettem, hogy végre is régebben és jobban ismered,
mint én, és ha csak a rosszat mondod el róla, bizonyosan van benne
sok kedvesség is, több, mint képzelem, mert különben nem
szeretnéd.
Páäma. – Pardon, de igazán nem tehetek róla, ha félreértettelek…
Te adtad rá az okot, mert a nélkül, hogy ezt provokáltam volna,
olyan nyiltan beszélünk, a mennyire ez… leányokkal… régebben nem
volt szokásos… Leánykorom óta bátrabb lettem, a mint jól mondod…
azt is mondhatnád, hogy elromlottam… de azért minduntalan
meglepnek egy kicsit a mai leányok…
Eäza. – Édes istenem, nyiltabban beszélünk, mert nem akarjuk
eltitkolni, hogy kinyílt a szemünk!… Kinyílt és nem sütjük le; azzal a
szándékkal, hogy letakarjuk vele a gondolatainkat… Azt hiszem, ez
becsületesebb, mint azoknak a lányoknak a hallgatása, a kik csak
azért hallgattak nagyokat, hogy asszonykorukban annál többet
beszélhessenek, annál nagyobbakat kiálthassanak, annál jobban
rikoltozzanak… És ha már erről van szó, igazán nem értem, hogy min
botránkoztál meg? Azon, hogy képmutatás nélkül beszélek? Bocsáss
meg, de úgy tűnsz fel nekem, mint azok az ujságok, a melyek a
vezérczikkben lecsepülik az erkölcstelenséget, a tárczában
felháborodva tiltakoznak az ellen, hogy az írók és a művészek szóvá
tegyék a bűnt, aztán a hírekben, a törvényszéki rovatban és a
vegyesekben csupa erkölcstelenséget ujságolnak, olyan

szörnyűségeket, a melyek túlliczitálják Párizst és az őserdőket, és
végül, a kishirdetésekben, nyugodtan istápolják a feslettségét és a
romlottságot. Azt követelitek, édes Pálma, hogy szó se essék az
erkölcstelenségről? Hogyan? Hát az erkölcstelenséget a
magánéletetek számára akarjátok fentartani?! Ez nektek szent
tulajdonotok, és nekem még tudnom se szabad róla?!
Páäma. – Egy kicsit visszaélsz a bizalmammal, de nem baj. Ezzel
sokat könnyítesz a helyzetemen. Mert ha mindenről beszélhetek
veled, és egyébről is: nem fogok elhallgatni előtted semmit!…
Megkíméltelek volna ezektől a csunyaságoktól, da ha másképpen
nem tudlak kigyógyítani…
Eäza. – Kérlek, csak beszélj bátran! Sokkal többet már úgy se igen
mondhatsz…
Páäma. – Tévedsz. Abból, a mit eddig tudsz, csak az következik,
hogy a vőlegényed eladó. Jól van, ez még nem olyan nagy dolog;
hiszen ma már eladó majdnem minden és majdnem mindenki. De én
bebizonyítom neked, hogy a vőlegényed nem olyan közönséges
csibész, mint a többi; bandita és bandita között is van különbség. A
te vőlegényed a legutolsó szortéból való, mert alávaló, mint egy
istállószolga!
Eäza. – Ah!
Páäma. – Ez az úr tizennyolcz-húsz éves korában olyan szegény,
olyan rongyos és olyan szennyes volt, hogy ezt te el se tudod
képzelni, mert azt hiszem, fogalmad sincsen róla, micsoda alakok
szaladgálnak a kloákák körül…
Eäza. – Nem, azon a tájékon nincs ismeretségem.
Páäma. – Csak színdarabokban látni ilyen elzüllöttséget. Ha azt
mondom, hogy kuglófból, hitelből és a vendéglőből való
megugrásból élt, ezzel keveset mondok.
Eäza. – Szépen kikefélkedett azóta.

Páäma. – Nos hát, ennek a skatulyából kivett úrnak akkortájt igen
rosszul ment. Hogy kabátot és könyvet lopott: ezt esak a mentségéűl
hozom fel. Lopott, mert fázott, és lopott, mert művelődni akart; ez
még szép tőle, és bizonyára imponálni fog neked, hogy a lopott
könyveket, mielőtt eladta az antikváriusnak, előbb elolvasta…
Eäza. – Elég sokra vitte, ha ilyen mélységből indult.
Páäma. – Sokra vitte, de hogyan! Elmondom részletesen. Tehát
nem csupán kuglófból, felirásból és blicczelésből élt, mert jó étvágya
volt, hanem főképpen abból, hogy napot evett könyörületes, szegény
embereknél. Tudod te, mi az, napot enni?
Eäza. – Nem próbáltam, de sejtem. Azt hiszem, hallottam is
valamit erről a szokásról, abban az időben, a mikor még nem
másztunk fel az ugorkafára, s a jólétünk senkinek se okozott
fejgörcsöket.
Páäma. – Szóval: ebédelni és vacsorálni jótékony embereknél
szokott, a hétnek egyik napján itt, a másikon amott. Ezek a jótékony
emberek maguk is szegény ördögök voltak, a kik egész nap a
nyomorult kis üzletükben foglalatoskodtak. Igy mindjárt a legelső
állomásán könnyű volt a kosztját meghálálnia. Azzal hálálta meg,
hogy elcsábította a szegény ördögök tizenhat esztendős leányát.
Eäza. – Ne csábult volna el.
Páäma. – Igen ridegen itélsz. Hidd el, hogy azok közül, a kiknek
sokáig nem nyílik ki a szemök, igen sok leányt veszít el a
tudatlanság, azt se tudják, hogyan. Nem lehet mindenki előre okos
és előre kitanult. Elég az hozzá, a kétségbeesett szegény ördögök,
mikor látták, hogy egy kis napevővel többen vannak, azt képzelték,
hogy majd feleségül adják a lányukat ehhez a csirkefogóhoz, és
hogy valami keresete legyen, kitaníttatják az egyetemen.
Eäza. – Úgy hangzik, mint egy Zola-regény.
Páäma. – A vége inkább Gorkijra emlékeztet. A napevő, a szegény
emberek költségén, megtanulta a juriszprudencziát, leette róluk, a

mijök volt, s a mikor elkészült, ott hagyott csapot, papot, napot,
leányt és gyermeket.
Eäza. – Hány százalékot engedsz ebből a történetből?
Páäma. – Egy betűt sem. Mindez szóról-szóra igaz, és ha Alfréd úr
letagadja, be fogom neked bizonyítani.
Eäza. – És aztán… mi történt a leánynyal?
Páäma. – Ha jól emlékszem: meghalt.
Eäza. – Hát a gyermek?
Páäma. – Az csirkét fog valahol; persze soványabbakat, mint az
apja.
Eäza. – Kedves Pálma! Ha te bebizonyítod nekem, hogy mindez
szóról-szóra igaz: akkor én a föld alól is előteremtem a kis napfiut,
édes anyja leszek neki… Én mindenre képes vagyok, még az
önzéstelenségre is…
Páäma. – Ne siess úgy, mert nem tudod, hogy mire vállalkozol.
Eäza. – Még egy napfiú fog előjönni?
Páäma. – Annyi napfiú fog előjönni, hogy megsokallod. Ez az úr
minden háznál egyforma pénzzel fizetett. Rendszer volt nála, hogy
hálájának jeleivel be fogja népesíteni a világot, és valahányszor
megette az utolsó kakast is, fütyölve állott tovább, mint Mantua
herczege.
Eäza. – A mit mesélsz, az már nem opera; inkább operette.
Páäma. – A hogy vesszük. Alfréd operettenek vette.
Eäza. – Bizonyítékokat, kedvesem, bizonyítékokat!
Páäma. – Képzelheted, hogy a bizonyítékokat nem hordom
magamnál, és nem is olyan könnyű ezeket megszerezni, mert azok a
szegény ördögök meglehetősen szerteszóródtak a világba, már a ki

azóta el nem pusztult közülök… De meg fogom szerezni a
bizonyítékokat, ha biztosítasz…
Eäza. – Biztosítalak, hogy nem fáradsz hiába.
Páäma. – Annál jobb. De, kérlek, kínálj meg egy pohár édességgel.
Egészen elrekedtem.
Eäza. – Kérlek. Kitűnő tokajival szolgálhatok… vagy inkább egy
pohár curaçao-t?
Páäma. – Akármit, csak öblítsük le ezt a sok csunyaságot.
Eäza. – Kérlek szépen, azonnal. (Csönget.)
* * *
– Egy hónappal később. Ugyanott. Elza, ugyanabból az üveg
tokajiból, most már Alfrédnak tölt. –
Aäfréd. – Hát ez nagy csalódás, Elza. Én szentül meg voltam
győződve, hogy maga okosabb, mint Kant Immánuel.
Eäza. – Nem, nem vagyok okosabb, mint Kant Immánuel. Vannak
dolgok, a melyeket soha se fogok megérteni.
Aäfréd. – Mert az élet megkímélte attól, hogy megismerje a
nyomoruságot, és ennek hatását az emberi szervezetre, vagy
mondjuk: a lélekre. Azt azonban megérthetné… a fantáziájával, az
értelmével és főképpen a szívével… hogy én ma már… ha egyébbel
nem: az őszinteségemmel, azzal, hogy megengedhetem magamnak
ezt a fényűzést… fölötte állok a multamnak, és fölötte állok
mindennek. Ez a mult kellemetlen; és én se szeretek rágondolni. De
nagyon alant kezdtem, és a ki a mélységből túrja fel magát, arra
bizony sok sár rátapad. Ez szomoru, de nincs másképpen; az
elementáris hatalmakkal szemben mindenki féreg. Nem kellemetes
visszaemlékezni rá, hogy nem volt más módom, felküzdeni magamat
oda, a hol állok… a mint hogy arra gondolni se kellemes, hogy miket

éltem át, micsoda szennyet kellett tűrnöm, és micsoda
nélkülözéseken kellett keresztül vergődnöm. De ez a mult már csak
kellemetlen emlék; ez a mult már nem létezik. Akárki voltam, ma
más ember vagyok. Hozzáférhetetlen, tekintélyes, nagystilű,
rettegett, imponáló. A bűnös ember volt – nincs, a mint hogy nem
létezik többé az egykori mosdatlan, fésületlen ember. Nyoma sincs
rajtam ennek a régi embernek; tiszta vagyok, mint az arany, és
korrekt, mert már jó ideje módomban van korrektnek lenni. És az
emberek annak látnak, a mi most vagyok; a mi voltam: nem látszik
meg, nincs többé, letörültem magamról. Ezt bizonyos önérzettel
mondhatom el, mert minél nagyobb szennyből kerültem ki, annál
nagyobb a pálya, a melyet befutottam… annál nagyobb az erő,
melyet dokumentálok. Igaz, hogy kiáshatják a multamat, és
turkálhatnak benne. Az ellenségeim ezt hébe-hóba meg is cselekszik,
de én csak nevetek az erőlködésükön. Ez a mult már kiszáradt;
tehetetlen velem szemben, nem ránthat vissza a mélységbe. Az
emberek legnagyobb része el se hiszi, a mit ellenségeim beszélnek.
A ki elhiszi, annál inkább fene legénynek tart. Vannak, a kiknek jól
esik, hogy egykor ronda ficzkó voltam, de ezek is dühösen irigyelnek,
félnek tőlem, és szeretnének a helyemben lenni; megölnének egy
kanál vízben, de annak néznek, a ki vagyok: zseniális és erős
embernek. Ez a mult már nem akadályozhatja meg, hogy végig ne
futhassam a kivételes tehetségemnek megfelelő kivételes carriére-t;
nem állíthat meg utamon, nem hiusíthatja meg magasba törő
terveimet, nem változtathat vissza azzá, a ki már nem vagyok. Akárki
voltam, ez már csak: történelem, mely félig szégyenemre, de félig
dicsőségemre válik; akárki voltam, ma, holnap vagy holnapután az
vagyok, a kinek azelőtt csúfoltak, a ki akartam lenni, a kivé
kifejlődtem az egész világ akarata ellenére: egy magyar Disraeli.
Eäza. – Igen, de még csak az a Disraeli, a ki a carrière-ért eladta
magát. És az eredeti, az csak egyszer, egyszerre adta el magát, egy
törvényes feleségnek; nem sokszor, nem részletekben, nem egy
idegennek, a ki alkalom adtán előáll azzal, hogy egyszerűen
megfizette…

Aäfréd. – Ha megbocsátja magát a tényt, a »hogyan« már
közönbös. A tény lehet dehonesztáló; de az, hogy milyen formák
között történt, már nem számít. Mind a két esetet csak
egyféleképpen lehet megítélni, sőt a körülmények inkább az eredeti
rovására szólnak. Mert az nem volt kénytelen vele; mégis megtette.
Eäza. – Ne vitatkozzunk erről; nem érdemes. A fődolog az, hogy…
eh, minek erről beszélni? Ugy se változtathatunk a dolgon.
Aäfréd. – Csak mondja! Akármilyen bántó, a mit gondol, magától
akczeptálom.
Eäza. – Tehát van egy dolog, a mibe soha se tudnék
belenyugodni. Abba, hogy sok szegény ember szaladgál a világban,
a ki a férjemnek enni adott, a kinek ezt a férjem soha se tudta
megszolgálni, a kinek a férjem csak rosszszal fizetett!
Aäfréd. – Ez csakugyan komisz emlék. De ez a világrend. A
magasabbrendü állatnak hány apró állatot kell megennie, csak azért,
hogy az életét egy darabig meghosszabbíthassa! Hát még a nagy
pályák! Csak holttesteken át haladhatunk előre.
Eäza. – Tudom, hogy filozófiával mindent ki lehet menteni,
mindent ki lehet magyarázni, mindent lehet igazolni. De ez a
mindenre hajlékony, gummi-filozófia előttem nem filozófia, hanem
disznóság. Ne haragudjék, Alfréd. Azért maga mégis imponál nekem.
Talán azért, mert olyan szemtelenül elbizakodott!

IX.
TEOFIL.
– Történik: egy júliusi napon, Gmundenben, a Hôtel Austria első
emeletén. Berzsenyi báró székkel és szivarral kínálja Vrillényi doktort,
a ki ebben a pillanatban érkezett. –
A báró. – Na, ez aztán olyan pontosság, a mely semmi
kivánnivalót nem hagy hátra, és valóban megelégedésemmel
találkozik.
Vráääénóá . – Három órakor kaptam meg a báró úr telegrammját, és
négykor már a gyorsvonaton ültem. Megvallom, igen örültem a
meghivásnak, mert nekem is volna egy szerény kérésem…
A báró. – Na, ezt mint nem fontost hagyjuk egy más alkalomra.
Legközelebb ismét Budapestre utazom, és szemlét tartok a
szerkesztőségben, a midőn az esetleges hiányokat megbeszélhetjük.
De úgy látom, ön árnyékával sincs a sejtelemnek, milyen
nagyjelentőségü az ügy, melyért önt ide rendeltem, egy
hôtelszobába, tanuk nélkül.
Vráääénóá . – Én azt hittem, hogy a báró úr kedves családjával egy
villában lakik…
A báró. – Nem egy villában, hanem a Cumberland herczeg-féle
villában, melyet kibéreltem. De az ügy megkívánja, hogy ez a
kihallgatás, melyet önnek adományozok, titok maradjon, mindenki
előtt, a családomat sem véve ki. Azért rendelkeztem úgy, hogy ön itt
jelentkezzék, és nehogy Gmundenben tartózkodása nekem valami
feltűnést keltsen, már ezúttal figyelmeztetem önt, hogy a

legközelebbi vonattal tovább fog utazni Ischlbe, a hol ön a világ
számára jelenleg is időzik.
Vráääénóá . – Értem.
A báró. – Ön nem volt Gmundenben, ön engem nem látott, nem
hallott, és én azt se tudom, hogy ön létezik, mert ön csupán
Ischlben járt egy tetszés szerinti ügyben, melyet szabadon
választhat.
Vráääénóá . – Tökéletesen értem, báró úr.
A báró. – Ez még nem elég. Én tudom, hogy ön diskrét ember;
egy igazi bizományos, a mint elő van írva; egy ember, egy szó. De
még ez se elég. És ön becsületszavát fogja nálam zálogban hagyni,
hogy egy léleknek sem és egyetlen betűt nem fog kicsevegni abból,
a mi értekezletünkön szóba kerül, sőt még azt sem, hogy ön ezúttal
az én színemet látta.
Vráääénóá . – Becsületszavamat adom, báró úr…
A báró. – Megjegyzem, hogyha netalán ön, teszem fel
ügyetlenségből, valamit elejtene beszélgetésünkből, én kénytelen
volnék ezt mindenütt légből kapott állításnak minősíteni, és önt mint
hazugot visszautasítani.
Vráääénóá . – Ilyen eset nem fordalhat elő.
A báró. – És most hallgasson ide, de jól figyeljen, mert én semmit
se fogok ismételni, és bizonyos dolgokat, melyeket nincs módomban
elmondani, mert ezekre nézve magasabb tekintetek tőlem is
diskrécziót kivánnak, hallgatással fogok mellőzni. Önnek tehát egyet-
mást az összefüggésből kell kiokoskodni, de e tekintetben megbízom
az ön képességeiben, mert egy szerkesztő ott is kitalál, a hol nincs.
Vráääénóá . – Kevés szóból is meg fogom érteni az ügyet.
A báró. – Tehát most jól figyeljen ide. Az én leányom, Blanka
bárókisasszony, egy hónappal ezelőtt azon magas kitüntetésben

részesült, hogy Altenberg-Griesbach grófné szoaréján szerencsés volt
bemutattatni ő fensége Konrád Mária Ruprechtnek, a ki jelenleg itt,
az Ort-kastélyban időzik.
Vráääénóá . – Ah!
A báró. – Ezt pusztán baráti vonzalmamból kifolyólag adtam
tudtára önnek, mert oly okokból, a melyeket önnel nem közölhetek,
ez egyelőre mély titok, és miután tudomásul vette, felszólítom, hogy
azonnal felejtse el az egészet.
Vráääénóá . – Kérem.
A báró. – Miután ön biztosít róla, hogy már elfelejtette, a mit az
előbb mondottam, egy más történetet fogok önnek elbeszélni, nevek
nélkül, mert oly okokból, a melyeket önnel nem közölhetek, a
neveket el kell hallgatnom.
Vráääénóá . – Egészen a tetszése szerint, báró úr.
A báró. – Tehát miután ön nem tud semmit, mert én nem
mondtam semmit, egy mély, sőt legmélyebb családi titkot közlök
önnel, a melynek az én családomból nem szabad kiszivárognia, és
én, ha a fejemet levágnák, se mondanék erről semmit, önnek pedig
hasonlóképpen kell eljárnia, bármi történjék. Ezen titokból csupán
annyit közölhetek önnel, hogy az én leányom, Blanka baronesz
menyasszony, mert a legnagyobb titokban el van jegyezve egy igen
magas állásu úrral, a kinek a nevét nem mondhatom meg. Ezen igen
magas állásu urat a rövidség kedvéért Teofilnak fogjuk nevezni.
Teofil tehát, kinek szivét leányomnak sikerült nőies bájaival
elnyernie, a legeslegmagasabb családok egyikének tagja, és mint
ilyen, nem házasodhatik meg szabad akaratból, mert a legjobb
családokban természetesen semmit se lehet tenni a család többi
tagjainak és kitünően a családfőnek beleegyezése nélkül, a mi csak
természetes.
Vráääénóá . – Értem.

A báró. – Másrészt nem alaptalan az aggodalom, hogy Teofil
egyelőre nem nyerheti meg frigyéhez családjának beleegyezését,
mert a legmagasabb körök, sajnos, még nincsenek áthatva az
egyenlőségi eszméktől, és oly előitéleteket táplálnak, mintha a rang
magában véve, a rangnak huzamosabb gyakorlása nélkül, nem
emelné az embereket a többiek fölé. Teofil tehát, míg lassankint
legyőzheti az előtte tornyosuló akadályokat, eljegyzését titokban
kivánja tartani, nehogy a hiresztelések az akadályokat még
szaporítsák, a miben jövendőbeli vőmnek tökéletesen igazat adok.
Vráääénóá . – Világos.
A báró. – Meg kell azonban önt ismertetnem ezen Teofil fenkölt
gondolkozásával. Helyében mindenki más egy morganatikus
házasságot óhajtana, a mi bebizonyitottan kényelmes módja a
megoldásnak, és a legfelső körökben, mint legalkalmasabb
leereszkedési eszköz, általános kedveltségnek örvend; sőt,
visszapillantással arra, hogy a jobbkézzel való házasságtól csak
lényegtelenül különbözik, az egész világ részéről nagyobb
tiszteletben részesül, mint némely rendes házasság, mely szegény
ördögök között köttetik. De Teofil semmit se akar tudni egy ilyen
morganatikus házasságról; ellenkezőleg, az egyenranguan kötött
házasságot választja, hogy összes jogaiban részesíthesse nejét és
netaláni gyermekeit.
Vráääénóá . – Ez igen szép, csakhogy, azt hiszem, szinte
elháríthatatlan nehézségeket állít a házasság elé.
A báró. – Ön szegen találta a fejet. És én megvallom önnek, bár
ezt az én családi körömben természetesen nem hangsulyozhatom,
hogy ilyen viszonyok között, melyek majdnem minden kilátástól
megfosztanak, hajlandó volnék kiegyezni ötven százalékra, és a
magam részéről bele mennék a morganatikus házasságba is.
Mindent jól mérlegelve, a morganatikus házasság egy teljesen
érvényes frigy, mely a vallás részéről áldásban és mindenkinek
részéről tiszteletben részesül, csupán a feleség nem részesül a férj
jogigényeiben, egész rangjában és összes előnyeiben. De én ezt nem
É

is igénylem. Én a szív és az egyenlőség embere vagyok, minden
nagyravágyás nélkül, az előitéletek megvetésével és igazi nemes
büszkeséggel. Szó sincs róla, gyermekemnek minél messzebb menő
boldogsága nekem is heves óhajtásom, de ha az előitéletekből
kifolyólag leányomat ezentúl se fogom fenségnek szólíthatni, ezért őt
nem fogom kisebbre becsülni. És mint egyszerü, de egyenes
gondolkozásu férfi, a ki alattvalóit nem nézi le, de a legmagasabb
körben se tántorítható el az egyenlőség eszméitől, semmi rosszat se
látok abban, ha például leányom egy morganatikus házasság
kapcsán, teszem fel azt a Traunkirchen grófnő czímet és rangot
kapja. Én nem kívánom unokáimat egy bolgár, román vagy nem
tudom miféle trónon látni, és ha unokáim csak egyszerü
Traunkirchen grófok lesznek, ezért én őket nem kevésbbé fogom
szeretni.
Vráääénóá . – A báró úr az igazi demokraták közül való; azok közül a
legnemesebb demokraták közül, a kik arisztokratikusan
gondolkoznak.
A báró. – Lássa, ez egy egészen jó mondás, mely érdemes a
kinyomtatásra, például egy rövidebb életrajzomban, a legközelebbi
kitüntetésem alkalmával. Ez alkalommal talán már meg lehet írni azt
is, hogy minden esetben elhárítottam magamtól az előmenetelt és a
kitüntetéseket. Így a jövendőbeli vőmet, kit magunk között
egyszerűen Teofilnak fogok nevezni, gondoljon bár ön, a mit akar…
mondom, jövendőbeli vőmet se én szólítottam fel arra, hogy ejtse el
a morganatikus házasság eszméjét; ellenkezőleg, figyelmeztettem
őt, hogy a ki mindent akar, az esetleg sok nehézséggel találkozik. De
Teofil azt czélozza, hogy összes jogai nejére és gyermekeire is
kiterjesztessenek, tehát visszahatólag reám is, és ebben igaza van,
mert őszinte boldogság csak egyenranguak közt lehetséges.
Vráääénóá . – És a jövendőbeli vőnek, a kinek ezúttal Teofil a neve,
van rá valamelyes reménye, hogy tervét magas családjával el tudja
fogadtatni?

A báró. – Ez egy fiatal rajongó, a ki mint szerelmes beszél. Nem
mérlegel, nem számol; csupán nem tud leányom nélkül élni, és azt
hiszi, hogy ez magas családját meg fogja hatni, a mit én nem
remélek, egyedül annyit, hogy a morganatikus házasság igen bevett,
mondhatnám közkedvelt dolog. De Teofil erről nem akar hallani, és
előttem olyan nyilatkozatokat tett, hogy előbb össze kell dőlnie a
világnak, mint ő leányom kezéről lemond, mert ha a magas
beleegyezéseket nem sikerülne kinyerni, lemond czíméről, rangjáról,
minden, és inkább postatiszt lesz. Ezt én természetesen semmi
körülmény esetén nem fogadnám el, mert eltekintve attól, hogy én
nem vagyok az az ember, a ki leányom kezével a legszebb carrière-
eket akarjam tönkre tenni, másrészt mit csináljak én egy
postatiszttel?! Ha leányom szereti őt, ezt én természetesnek találom,
mert a fiatalság bolondság, és a szív kopog, ez ellen mit se lehet
tenni, de egy fiatal leány nem azért szeret egy igen magas állásu
ifjut, hogy aztán egy postatisztnek legyen a felesége. Ha csak
postatisztet akarna, postatisztet találhat annyit, a mennyit akar.
Vráääénóá . – Magától értetődik.
A báró. – Na hát, így állnak a dolgok. Többet nem mondhatok el;
a diskréczió nem engedi, hogy többet mondjak és ha valaki megölne,
akkor se mondanék többet.
Vráääénóá . – Tökéletesen értem a báró úr álláspontját, s már azért
is hálás vagyok, hogy ennyi bizalomra méltatott. Csak még azt az
egy kérdést kegyeskedjék megengedni, hogy miért méltóztatott, a
mi egyáltalán elmondható volt, velem közölni, s miben lehetek báró
úrnak ez alkalommal szolgálatára?
A báró. – Önre, mint értelmes emberre volt ezúttal szükségem, a
ki kevés szóból is méltányolni fogja a helyzetet, s a kinek kevés szó
kell ahhoz is, hogy álláspontommal tökéletesen megismerkedjék.
Erre nézve a következőket közölhetem, se többet, se kevesebbet,
azzal a hozzátétellel, hogy ön hallgatni fog, mint a sír, és szavaim,
melyekben a legszükségesebbre szoríkozom, önnek egyik fülén be s

a másikon ki kell menniök, vagyis én nem mondtam semmit és nem
tudok semmiről.
Vráääénóá . – Úgy lesz, báró úr.
A báró. – Ön azok után, a miket, mint bizományos emberemmel
közölhettem, sejteni fogja, hogy itt valami előmegy, a mit nem
mondhatok el, de a mit ön kitalálhat, mert természetesnek fogja
vélni, hogy Teofil, vagyis azon titokzatos egyén, a kit én Teofilnak
neveztem, lépéseket tesz a magasságban, a szükséges
beleegyezések kieszközlése végett. Már most ez az én helyzetemnek
a kényességét számomra mondhatni kínossá teszi, mert míg nekem
a legteljesebb diskréczióra kell szorítkoznom, és még ön előtt is, kit
szolgálatainál fogva titkáromnak nevezhetek, a legszükségesebbeket
kivéve, mindent elhallgatok, másfelől ki vagyok téve mindenféle
pletykaságoknak, melyek azon szükséges lépésekből kifolyólag,
hallgatózó inasoktól és más egyéb módon keletkezhetnek. Ön be
fogja látni, hogy ez engem nem hagyhat hidegen, mert az esetleges
pletykálkodások még leányom hírnevét is érinthetnék, a minél
sajnosabbat ön egy apára nézve nem fog kitalálhatni.
Vráääénóá . – Annak kieszközlésére, báró úr, hogy erről az ügyről
nálunk egy szót se fognak kinyomtatni, készségesen vállalkozom.
Sőt, ha a báró úr parancsolja, nagyon szívesen teszek lépéseket úgy
Bécsben, mint, a mennyiben összeköttetéseim messzebbre is
terjednek…
A báró. – ön még nem egészen jól fogta fel a helyzetet, s mielőtt
tovább mennék, figyelmeztetem önt, hogy hagyjon engemet
egészen kibeszélni. Ahhoz, hogy a világ összes hirlapjai ezen ügyben
rendelkezésemre álljanak, én nem vagyok elég vagyonos. De, mint
megjegyeztem, a mik apai gondosságomat aggályokkal töltik el, ezek
egyedül az esetleges pletykaságok, és nem a nem tudom hogyan és
miképp napfényre jött igazság. Ezzel szemben álláspontom a
következő. Ha az igazság nem tudom hogyan, de mindenesetre az
én közbenjöttöm nélkül, nem bánom, például jövendőbeli vőm
szolgaszemélyzetének indiskrécziójából, vagy talán, mivel több

értekezés várható s a falaknak is füleik vannak, egyszóval
akárhogyan kiszivárogna, de a forrás, a hír tartalma és más egyebek,
a miket ön nálam jobban tud, feketéről fehérre bebizonyítanák ezen
hírtől távol állásomat, és minden egyes vak látná hogy nekem semmi
közöm az egész históriához: nekem az igazság ellen nemcsak nem
volna kifogásom, de mint tőlem egészen távol álló valaminek
örvendenék is, és annak fentartásával, hogy az igazság nem tőlem
nyerte eredetét, mert én belőlem harapófogóval sem lehetett
egyetlen indiskrécziót is kivenni… a mint ön tanusíthatja… létrejöttét
nem bánnám, mivel nekem nem kell félnem az igazságtól.
Vráääénóá . – Tökéletesen igy gondolkozom.
A báró. – Miért féljek én az igazságtól?! Kettő közül egy: vagy
létre jön ez a házasság, vagy nem jön létre. Ha létre jön: mit bánom
én azt, hogy már előbb beszéli a nép, a mit később úgyis mindenki
megtud?! Ha pedig nem jön létre: akkor nekem a sok
meghurczolásból egyetlenegy kellemes emlékem se maradjon, és a
világ soha se tudja meg, hogy az én leányomat Teofil csakugyan el
akarta venni, nem is csak úgy morganatikusan, hanem a
legkomolyabban, a legszigorúbban, czímének és rangjának
feláldozásával, egészen odáig, hogy postás legyen a leányomért?! Ez
úgy jön nekem elő, mintha valakit legkegyelmesebben bárói rangra
emelnének és aztán azt mondanák neki: »Hallod-e, te, Berzsenyi,
vagy hogy hívnak, te ezentul báró leszel, mert az országnak
szüksége van tekintélyekre és szakértelemre, de a világnak ezt soha
se szabad megtudnia, és érdemrendet is kapsz, de ezt csak éjszaka
szabad viselned, otthon!…« Hát akkor mirevaló az egész?!…
Vráääénóá . – Egészen a báró úr véleményén vagyok, de legyen
szabad még egy kérdést tennem. Hogyan egyeztessem össze az
igazság érdekét azzal az igérettel, a melyet az imént szavammal
pecsételtem meg? Nekem úgy tetszik, hogy mást diktál a
czéltudatosság, és mást a diskréczió.
A báró. – A czéltudatosság nem tévesztendő szem elől, de első a
diskréczió.

Vráääénóá . – Mégis, ha a báró úr kegyes volna valamely irányt
jelölni ki…
A báró. – Hogyan tudjam én azt?! Ha én azt tudnám, akkor
nekem nem volna szükségem egy olyan bizományosra, a ki
szerkesztő! És ha én szerkesztő volnék, akkor én tudnám, hogy mit
diktál a czéltudatosság és mit a diskréczió!
Vráääénóá (meghajlik).
* * *
– Egy hónap mulva Berzsenyiek hirtelen átköltöznek Gmundenból
Ausseeba. Ebenseeig hajón mennek, s Traunkirchennél Blanka a
fedélzetről búcsút int a traunkircheni templomnak. –
Bäanka. – Es wär’ zu schön gewesen!…
Eäza. – Még egy ilyen blamázs és mehetünk kolostorba!
A báróné. – Csak azt tudnám, hogy ki volt az a gazember, a ki az
egészet elpletykálta az ujságoknak! Ez a gazember tett lehetetlenné
mindent!…
A báró (meggyőződés nélkül:) – Ha én ezt az embert megtalálom:
pofon ütöm! (Meggyőződéssel:) – De nem fogom megtalálni!…

X.
A TENGERI KÍGYÓ.
– Alt-Aussee-ban, a tóparton. Elza és Soroksáry Miska gróf, a kik
bicziklin jöttek idáig, egy padon pihennek. –
Máska. – Nézze, Elza, milyen csendes a viz! Olyan csendes, mint a
minisztérium délután fél négykor. És milyen szép színe van! Olyan
szép színe van, mintha nem is volna igazi víz, csak úgy ide volna
kenve, ultramarin-festékkel. Tulajdonképpen egy ronda kis pocsolya
az egész. És mégis szomorúvá teszi az embert, a fene tudja mért.
Csupa romantika, csupa ábránd, csupa szentimentálizmus. Ha sokáig
nézem, még megkergülök tőle. És ez mély; mély, mint egy kút; mély,
mint a nyavalya, hogy vinné az ördög!
Eäza. – Maga ma fel van húzva.
Máska. – Ez onnan van, mert ma egész nap oda vagyok ettől a
komisz szerelemtől, csapna bele a mennykő! Mivé lettem, az
ebugattát! Két héttel ezelőtt még kutya bajom; és egy átkozott
napon összeakadok magukkal… magával, a kit ismerek, mint a rossz
pénzt… és püff neki! Úgy jött ez, mint a hideglelés. Miért is jöttek
ide, fékomadta! Most éjszaka vigan horkolnék, nappal meg járnám a
bolondját. De egyszerre csak elibém áll, mint a Tavasz, Botticellitől,
bozontos hajjal, kipirulva, és én nem ismerek rá, aztán meg nem
ismerek magamra! A teremburáját! Hát kellett ez nekem?! Mit is
jövök ide, a pokolba, a helyett, hogy otthon ülnék, a vaczkomban, a
kutyafáját! De hát mit lehessen tenni róla?! Tudom, hogy ostoba
vagyok; ostoba, mint egy szerelmes víziló, mint egy lóvá tett bika,
mint egy lerészegített fenevad. De azért mégis fáj a szívem, és ha
arra gondolok, hogy maga talán csak ingerkedik velem, kedvem
volna elbődülni. Persze, maga semmit se vesz komolyan. Szív,
É

szerelem, mi az? Smarn. Hát sebaj. Én értem azt, ha valaki mindent
kinevet. Azelőtt én is egy húron pendültem magával. Hanem
egyszerre belém ütött a mennydörgős mennykő, és most ne neked,
szegény fiu! Hát jól van, Elza. Ha magát csak mulattatja az, hogy én
ennyire veszkődöm, lelke rajta, sebaj, úgy kell nekem! Bánja is
maga, hogy mi történik velem! Ha az epedéstől elvisz az ördög, ha a
féltékenységtől meggárgyulok, az magának bliktri. Maga és szeretni!
Olyan nincs! Hogy az életem nem ér egy fakovát, hogy a nyomorékja
vagyok ennek a heccznek, annál jobb, lesz mit nevetni a városban!
Hát jól van, azt a czudar mivoltát ennek a pocsék világnak! Nem
bánom, ne szeressen; de tűrje el, hogy szeretem, mert ha nem, a
nyüstyit!… No jól van, nem mondok többet. Csak egyet mondok. Ha
maga visszaszíjja azt a keveset, a mit ígért… hej, Elza, az áldóját!…
Hát persze, az ember nem csinál szczénákat… mit tegyen?… tovább
vonszolja a nyomorúságát… adta lelke, ezt a rongy életet még
ellökni se érdemes!… de akkor, Elza, akkor nekem lőttek!
Eäza. – Mióta a Vigszinházban megtanult szerelmesen beszélni,
kezdek hinni magának.
Máska. – No, nem veszem észre!
Eäza. – Pedig úgy van. Ha csak fél annyit érez, mint a mennyit
káromkodik, szavamra, meg vagyok elégedve. Lánczos, lobogós, ez
már beszéd! De, hallja, ha csak be akar csapni ezekkel a szép
szavakkal!…
Máska. – Csak mulasson!
Eäza. – Ne mondjam meg, hogy maga kezd nekem tetszeni?
Máska. – Szép, csakhogy egy kicsit lassan megy a dolog.
Eäza. – Egyelőre elégedjék meg azzal, hogy nem szívok vissza
semmit. Egy félesztendeig nem megyek feleségül máshoz, és ha egy
félesztendő mulva el fogom hinni, hogy ez a nagy szerelem nem
Spiegelfechterei…
Máska. – Magát meggyőzni erről!…

Eäza. – No, nem tartozom azok közé, a kik könnyen fognak tüzet,
az bizonyos… De soha sem kell desperálni… és azt képzelem, hogy
ha aztán egyszer sikerül a dolog… Nem gondolja, hogy a hozzám
hasonlók a milyen nehezen hisznek el valamit, olyan erősen hisznek,
ha egyszer beugrottak?!… Ha én férfi volnék, nekem az olyan
leányok tetszenének, a milyen én vagyok. Azt gondolnám, hogy csak
azokat érdemes szeretni, a kik nem könnyen vesztik el az eszöket,
mert ezek, ha mégis elvesztik, nem találják meg egyhamar.
Máska. – Csak ugrasson, csúf kaczér, ugrasson!… Mintha még nem
volnék kellőképpen megőrülve!… Az árgyélussát!…
Eäza. – Pardon, én nem kaczérkodom, én csak filozofálok. Már
csak azért se kaczérkodom, mert, sajnálatomra, semmi egyebet nem
igérhetek, csak azt, a mit maga kevésnek mond… hunczut az, a ki
többet igér, mint a mennyit megtarthat. Sőt, megvallom, semmi
kedvem magát biztatni, mert magának bajosabb ügye van, mint egy
X-nek, a kit még nem ismerek.
Máska. – Ahá!… És ugyan miért?
Eäza. – Mert maga diskreditálva van előttem.
Máska. – Én?
Eäza. – Na, ezt kitalálhatná. Maga végig udvarolta az egész Várai-
Fejér családot, végig udvarolta a Zelmáékat, s végig udvarolta a fél
Lipótvárost, arról nem is beszélve, hogy körüludvarolta a Blankát, és
körüludvarolt engem is, több tournée-ban, hogy hányszor, bizony
nem tudom előszámlálni. Maga időnkint fölmerül a látóhatáron, mint
a tengeri kígyó, mely felbukkan a habokból, aztán eltűnik, hogy
később, időszakonkint, minduntalan visszatérjen, képviselni a dolgok
örök körforgását. Maga édes mindnyájunk állandó vőlegény-jelöltje.
A hányszor visszament egy-egy partija, s a hányszor nekünk ment
vissza egy-egy partink, maga mindannyiszor ujra meg ujra megjelent
nálunk, ujra meg ujra feldobogó szívvel s olyan arczczal, mintha
semmire se emlékeznék, mintha egy héttel előbb semmi se történt
volna. Számítsa ki, hányszor nem ment össze sehogyse a partija,

hány partija ment vissza Blankának, számítsa ehhez az én két
szomoru esetemet, s a végeredmény az, hogy maga, mint kérő, már
jubileumot tarthat.
Máska. – Az más. Akkor nem voltam szerelmes.
Eäza. – És mégis idült házasodási mániában szenvedett.
Máska. – Ez csak azt jelenti, hogy mindig nagy érzékem volt a
családi élet bensősége iránt. Az apám, a nagyapám, a szépapám,
mind megházasodtak, és én örököltem tőlük a házaséletre való
hajlandóságot. De senkinél sem tudtam megállapodni, mert nem
voltam szerelmes. Most pedig szerelmes vagyok, kutyateringettét,
szerelmes, mint egy bivalybornyu, mert ha néha most is vicczelődöm
a Zelma-féle lányokkal, ez csak külszin, forma, társaságos tónus, s
valójában nem vagyok vidám ember, hanem egy összetört szívű
szerencsétlen… egy Asra, a ki meghal, mert szeret… nem a régi
ember, csak multak árnya, elhaló sóhajtás… egy etruszk váza, mely
megrepedt egy legyezőcsapástól, szóval egy nyomorult, azt a fűzfán
fütyülő rézangyalát a csillagmiriádnak!
Eäza. – Azelőtt is mindig a szerelmet emlegette. Csakhogy azelőtt
kellemeztette magát, a régi, most meg káromkodik, az uj divat
szerint.
Máska. – Ha beleugrom ebbe a tóba, elhiszi, hogy szeretem?
Eäza. – Ne ugorjék bele. Először, mert ez nem bizonyiték, és
másodszor, mert ha csúron vizesen viszem haza, azt hiszik, hogy én
is megbolondultam.
Máska. – Legjobban szeretnék leugrani a második emeletről. Ha
kitörném a lábamat, csak azért, hogy bebizonyítsam a szerelmemet,
nem kételkednék többé bennem, a ki lelke van! Csakhogy maga nem
jönne feleségül egy sánta emberhez.
Eäza. – Na hát tegyünk le erről is. Remélem, találunk valami
ügyesebb bizonyítékot.
É

Máska. – Én nem remélem. A ki elítél valakit, egyesegyedül azért,
mert az a boldogtalan, valaha, zöld korában, udvarolgatott erre-
arra…
Eäza. – Ó, nem csupán ez diskreditálta előttem! Ez csak egy
részlet. A fődolog az, hogy az egész embert nem tartom eléggé
megbízhatónak.
Máska. – Például miért nem?
Eäza. – Például azért nem, mert soha se láttam tökéletesebb zsúr-
hiénát. Egy ember, a ki az életét végig teázta! A ki otthon van
minden lányos háznál a király-utczai fasortól a Krisztinavárosig! A ki
a szezon hat hónapjában soha se vacsorál otthon, vagy a
vendéglőben, hanem mindig en ville, s a haute-saisonban egy este
három szinházban és négy családnál fordul meg! A ki könyvet vezet
a meghívásokról, mert különben a teendőit nem tudná
nyilvántartani, a ki, ha zongoráznak vagy énekelnek, titokban alszik
egyet, a ki… de hagyjuk ezt, mert minek boszantsam meg, éppen
akkor, a mikor kezdem kedvesnek találni?!… Szóval, maga és a
komolyság, maguk ketten, Miska, soha se fognak találkozni. A ki
annyira társas lény, mint maga, abból lehet minden, még miniszter
is… bár ha a minisztériumban úgy halad előre, a hogy eddig, akkor
csak százhetven éves korában lesz miniszter… csak egy nem lehet,
Miska, olyan férj, a milyennek én akarom a magamét, ha ugyan
lesz… mert már kezdek felhagyni minden reménynyel.
Máska. – Maga nem tudja, hogy én mire vagyok képes!
Eäza. – Nagyjában el tudom képzelni. Nem az a baj, fiam, hogy
nem ismerem eléggé magát, ellenkezőleg, az a baj, hogy túlságosan
ismerem. Es éppen azért nem bízom benne, hogy jöhet olyan idő is,
a mikor más szemmel nézném magát. A gyöngéit nagyon is tisztán
látom, kár; a jó tulajdonait könyv nélkül tudom, sőt már észre se
veszem, s ez még nagyobb kár. A maga részéről meglepetéseket
nem igen remélhetek, és a meglepetések reménye nélkül nincs
szerelem. Ha abban, a kihez vonzódik az ember, nincs többé semmi

ismeretlen, a langyos barátságnál nem igen megyünk túl. Ez
szomoru, a kutyafáját, de így van. Nézze, ha el tudnám hinni, hogy
csakugyan szeret, és amúgy igazán… böcsületesen, az angyalát!…
akkor én is tudnám szeretni magát, rögvest, abban a minutumban…
nem tudom mért, de így van… annyira sokra tartom a szerelmet…
mert én azt hiszem, hogy ez soha se maradhat egyoldalu… olyan tűz
az, hogy a másik oldalra is jut belőle meleg… De nem tudom elhinni,
lelkem, legalább egyelőre nem, s félek, hogy soha sem, ehhez maga
nekem túlságosan ismerős. Ha mindjárt ilyen kemény káromkodással
kezdte volna, hamarjában talán belébomolhattunk volna egymásba.
De, istenem, maga már harmadszor kezdi ujra a dolgot! És ha most
több buzgalommal lát hozzá, mint előbb, ez egy kicsit későn történik.
Az efféle nem nyer a fölmelegítéssel, mint a káposzta. A ki
elmulasztja a moment psychologique-ot, az elvesztette a partit, és
tartok tőle, Miska, hogy maga a kedvező alkalmat már régen
eludvarolta, nem tudom melyik Zelmánál.
Máska. – Az igazság az, hogy maga egy hideg teremtés, a ki most
csak arra gondol, hogy én nem vagyok valami fényes parti, mert
szegény vagyok.
Eäza. – No hát valami különösen fényes partinak csakugyan nem
tartom, ha éppen erre akar hivatkozni…
Máska. – Mért? Azt hiszem, elég jó családból vagyok…
Eäza. – Mi hasznom volna nekem a maga származásából? Azt
hiszi, hogy azok az ajtók, a melyek most zárva vannak előttem,
azonnal megnyílnának, ha véletlenül Soroksáry grófnévá lennék?!
Ugy-e, nem? Azoknak, ott, a zárt ajtók mögött, én akkor is csak a
Berzsenyi Elza maradnék. Ha még a kalapom mellé tehetném az
őseit!
Máska. – Már engedjen meg…
Eäza. – Csak azt akarom mondani, hogy ezeket az ősöket nem
számítva, valami nagy társadalmi különbség nem igen van
közöttünk. Én nem juthatok bele azokba a zárt körökbe, maga meg

kikopott onnan; tehát kívül vagyunk mind a ketten, ez nem olyan
nagy különbség. Megjegyzem, hogy maga, mialatt szorgalmasan
járta a zsúrokat, a városligeti fasortól a Krisztinavárosig, lakásról
lakásra czipelve a rangját dísznek, déli növénynek, meglehetősen
kivetkőzött grófi jellegéből; a ranggal is úgy van, mint egyebekkel:
ha sokat exponálja az ember, hamar megkopik. Mi pedig
megbecsüljük a rangunkat; egy kicsit uj még, egy kicsit festékszagu:
de megbecsüljük. És mi már nem ereszkedünk le oda, a hol maga
mindennap megfordul.
Máska. – Abban igaza van, hogy meglehetősen kikoptam a
társaságomból. De csak rajtam áll, hogy visszatérjek oda.
Eäza. – Hogyhogy? Carrièret fog csinálni? Negyven éves korára
miniszteri titkár lesz? Ugyan menjen! A hivatal – a hol annyi
szabadságot adnak magának, a mennyit akar, mert a
minisztériumban, akár fogalmaz, akár nem fogalmaz, maga
semmiképpen se numerál – a hivatal, Miska, nem fogja magát lábra
állítani.
Máska. – A hivatal! Ki beszél itt a hivatalról? Törődöm is én a
hivatallal! Nekem partit kell csinálnom, Elza; nekem ez olyan
kötelességem, mint az anyáknak az, hogy katonákat adjanak a
hazának. Ha én nem csinálnék partit, halálos vétket követnék el
egész családom ellen; megcsúfolnám őseimet és megrabolnám
dédunokáimat. Ez meglesz; ennek meg kell lennie. Ha ugyanakkor, a
mikor ezt a kötelességet teljesítem, elnyerhetem annak a kezét, a kit
szeretek, én leszek a legboldogabb ember a világon; de ha maga
nem akar tudni rólam, akkor is megteszem ezt, legfeljebb
boldogtalan leszek egész életemre, örökre, a teremfáját!… Így vagy
úgy: az kétszer kettő, hogy vissza fogok térni a társaságomba, és
hogy úgy térek vissza, mint egy oligarcha! Meglátja, mekkorát növök
huszonnégy óra alatt! Mert mikor a mesében a boszorkánynyá
varázsolt Ilona visszaváltozik tündérré, az nem olyan roppant
színváltozás, mint mikor egy szegény gróf hirtelen milliomos
mágnássá vedlik át. A tulajdon édesapja se ismer rá, úgy megszépül
egyszerre. És ez az átalakulás a mi családunkban már hagyományos.
É

Én partit csinálok, a fiam elherdálja a vagyont, az unokám ismét
partit csinál: így megy ez a mi családunkban Zápolya óta. Bízza rám.
Ha egyéb bajom nem volna, a mindenségit!…
Eäza. – Adok magának egy jó tanácsot. Ne vesztegesse rám ezt a
félesztendőt, hanem siessen vissza a Zelmához, mondja el neki
ugyanezt, a Zelmát ez a kilátás egyszerűen el fogja szédíteni. És
aztán, kérem, hívjanak meg egyszer magukhoz! Nem tagadom, hogy
nagyon szeretnék, legalább egyszer életemben, egy oligarchánál
vacsorálni.
Máska. – Egy félesztendő mulva ennél az oligarchánál fog
vacsorálni, de reggelizni és ebédelni is, mindennap. A szerelem erős,
mint a halál. Akármilyen kemény a szíve, a kitartásom meg fogja
hatni, átkozott kis boszorkány!
Eäza. – Kivánnám, hogy jó próféta legyen, bár nem remélem.
Olyan kellemes volna szerelmesnek lenni!… Még egyszer, utóljára!…
Aztán meg a szegény papát igazán boldoggá tenné, ha
megszerezném neki ezt a kis örömöt. Mondja, nem tragikus sors az
övé? A barátnőjét férjhez adta, és mikor ez a férj elhalálozott,
férjhez adta másodszor is. A leányai pedig, a milliomos, szép és jó
leányok: először… másodszor… senki többet… harmadszor!… hoppon
maradnak! Hej, de fura is az élet, a teremburáját!

XI.
MARCEL, A NAGY FESTŐ.
– Várakozó-terem a nagy festőnél. Fejedelmi fényűzés; minden
egyes tárgyból csak úgy kiabál a művészet. A vendégek körül négy
inas forgolódik; a feketébe öltözött titkár úr, kinek ábrázata komoly,
mint egy operatőré, a feltűnést tapintattal kerülve jegyzi fel egy
pergamen-alakú japáni papirra a jelentkezők nevét s a pihenés
perczeiben logaritmus-tábla segítségével számítja ki, hogy a
jelentkezők minő rangsorban járulhatnak a nagy festő elé. A
várakozó-termet perzsaszőnyeggel borított óriás kapu és két,
gobelinnel fedett ajtó választja el a szentélytől. Déli félegy óra. A
várakozó-terem telidestele: orosz nagyherczegekkel, levantei
herczegnőkkel, száműzött királynékkal, bibornokokkal, herczegek,
grófok, bárók és pénzfejedelmek sokaságával. A száműzött királynék
oly türelmesen várakoznak, mint az özvegy méltóságos asszonyok a
belügyminisztérium folyosóján, mikor kegydíjért folyamodnak…
Megjelennek: Soroksáry Miska gróf és menyasszonya, Berzsenyi Elza
bárókisasszony. A várakozók az ujonnan érkezettekkel,
rangviszonyaikhoz és az ismeretség mértékéhez képest, hosszabb
vagy rövidebb, észrevehető vagy észrevehetetlen pillantást váltanak,
a mely pillantások némelyike köszönésként kíván szerepelni. Az
ujonnan érkezettek elfoglalják a szomszédsággal legkevésbbé terhelt
helyeket. –
Máska (Elzához, halkan). – És mi most itt várakozni fogunk?
Eäza (Miskához, még halkabban). – Csak addig, míg Bianca
Capello ide fuvarozza a szegény mamát. Azaz… persze, hogy
persze!… Floridornak igaza van, a papának is ide adtunk randevút s
a mi még súlyosabb, uraságod kiintrikálta, hogy a szegény angolna

is itt szedjen fel bennünket! Szóval, ezen a helyen kell összeverődni
az egész mispotálynak.
Máska (mint fent). – Na, előre kiváncsi vagyok, hogy az én
csillagkeresztes tántim mit fog szólni ehhez a stílushoz.
Eäza (épp úgy). – Már korrigál? Vigyázzon, még nem melegedett
meg a vőlegénységben! Különben a maga tántija azt fogja mondani:
»Gyermekem, ez Iphigénia Taurisban, csakhogy szebbek a
toilettejei!…«
Máska (mint fent). – Magától kitelik, hogy a képmutatásával le
fogja kányázni a szegény öreg nőt. Különben bánom is én! Vagyok
én olyan csibész, mint maga!
Eäza (tovább is halkan). – Na hát akkor mi baja?
Máska (mint fent). – Semmi. Csakhogy tudnék én okosabb dolgot
is, mint itt ülni és várni egy rossz arczképnek a leleplezését.
Eäza (pianissimo). – Mit akar? Hisz ez egy egész színház! Nézze,
itt vannak a Frányaváry konteszszek is! Le akarják festetni magukat,
hadd tudja meg az utókor is, hogy ők milyen öregek voltak!
Máska (mint fentebb). – Kérem, ne gúnyolja a tiszteletreméltó
multat. Gondolja meg, hogy az ő nagybátyjuk volt az, a ki ezt az
országot elfoglalta mindnyájunk számára.
Eäza (a fogai közül). – No lám, megint az orromhoz dörgöli
Lengyelországot, Mezopotámiát, et toute la lyre!… A fiatal úr
rosszkedvű; nem tetszik neki a kávéház!… Bezzeg, ha egy
körömfeketényi primadonna volna jelen!…
Máska (azonképpen). – Ugyan, mi az ördögöt csinálnék vele?!
Eäza (suttogva). – Magának, a milyen perverz, különös örömet
okozna, ha a menyasszonya háta mögül telegrafálhatna neki!
Máska (ugyanúgy). – Hallja, nem bánom, akármit fog rám, de azt
ne kapja fel, hogy perverz vagyok, mert az ilyesmit mindjárt elhiszik.

Eäza (mint előbb). – Na, és ha a házasságunk kútba esnék, ez
kompromittálná a hölgyek előtt, ugy-e?!…
(Ekközben az egyik inas az ujonnan érkezetteket bejelenti a
titkárnak.)
Az ánas (a ki azelőtt színházi súgó volt a vidéken). – Titkár úr!…
(Tisztelettel teli, megilletődötten komoly ábrázatot vág; halkan). –
Soroksáry Miska és titkos arája, Kund Abigél.
A íáíkár (hivatalos csendben). – Na, és Ő?
Az ánas (mint fentebb). – Miféle Ő?
A íáíkár (hasonlóképpen). – A másik!
Az ánas (ajkait meg nem mozdítva, egy lépésnyire már nem
hallható mély baritonnal). – Bár a nap sárga korongja már jó előre
haladt az égbolt peremén, még nem pillantám meg kis lába havát!
A íáíkár (hasonló hangon). – Annál jobb, mert elfelejtettem
megmondani, hogy a másikat soron kívül kell előállítani. A donyesz
kicsikarta a mázolótól, hogy azonnal bemehessen, mihelyt
megérkezik.
Az ánas (mint előbb). – Nem tudom, egyáltalán lehetséges-e?
A íáíkár (ugyancsak piano). – Hogyne! A mázolónál nincs senki;
ujságot olyas. Csak a hatás kedvéért várakoztatja a kompániát.
Az ánas (szintúgy). – No, és hol vezessük be a lammermoori
menyasszonyt? A túlsó oldalról?
A íáíkár (még folyvást csak az inas számára). – Hám’sz án Idé?…
Abban nem volna semmi öröme! Ki kell nyitni neki a műterem nagy
ajtaját, hadd vonuljon be rajta egész kíséretével, hogy az előkelők
meghaljanak a sárga irígységtől!
Az ánas. – Igenis, milord!

(Ezalatt Miska gróf és Elza halkan beszélgetnek.)
Máska (a váróterem falain lévő képeket nézegetve). – Csak nem
jövendőbeli sógorom festette ezeket a képeket?
Eäza. – Dehogy! Ezek a cinquecentó-ból való remekművek.
Máska. – No azért, mert ezek egészen csinosak.
Eäza. – Úgy látszik, maga nem sokat tart Szilassynak a
művészetéről.
Máska. – Hát maga sokat tart róla?
Eäza. – Na, tudja, elhiszem, hogy vannak nagyobb festőink is…
öreg urak, a kik csak a művészetöknek élnek… de a budapesti
fiatalok között… azok között, a kik Budapest társadalmában szerepet
játszanak… a kik nem mentek ki a nagyvilágba, hanem hazafiasan
itthon keresik a sikert…
Máska. – … ez a legokosabb ember.
Eäza. – Mondja, hogy: a legügyesebb festő! Hisz látta a Zelma
portréját! Olyan gyönyörű, hogy bele lehetne bolondulni. Hát,
mondja, nem nagy művész az, a ki még Zelmában is fel tudja
fedezni a szépséget?
Máska. – Ez az, a mit mondok. A mi festőink közül sokan… azok, a
kik, hogy a maga szavait ismételjem, hazafiasan itthon keresik a
sikert, a kik Budapest társadalmában szerepet játszanak, s a
zsúrokon a legerősebb konkurrencziát csinálják a házasulandó fiatal
ügyvédeknek, ha egyébbel nem exczellálhatnak, ha másban nem
főzhetik le rosszul öltözködő, zsúrokra nem járó kollégáikat, azokat,
a kik csak a művészetöknek élnek, egy dologban nagyot tudnak
produkálni: en gros reparálják meg, a mit a természet rosszul
csinált. Szépnek festik, a ki csúnya; fiatalnak, a ki öreg; jóképű
bácsinak a szívtelen vén panamistát, és ha vérebet pingálnak, kék
pántlikát festenek a nyakára. Az emberek bolondok, és ők nem
bolondok, hogy ezt ne vegyék észre. Ebben az iskolában nőtt nagyra

Welcome to our website – the perfect destination for book lovers and
knowledge seekers. We believe that every book holds a new world,
offering opportunities for learning, discovery, and personal growth.
That’s why we are dedicated to bringing you a diverse collection of
books, ranging from classic literature and specialized publications to
self-development guides and children's books.
More than just a book-buying platform, we strive to be a bridge
connecting you with timeless cultural and intellectual values. With an
elegant, user-friendly interface and a smart search system, you can
quickly find the books that best suit your interests. Additionally,
our special promotions and home delivery services help you save time
and fully enjoy the joy of reading.
Join us on a journey of knowledge exploration, passion nurturing, and
personal growth every day!
ebookbell.com

Finitestate Methods And Natural Language Processing 5th International Workshop Fsmnlp 2005 Helsinki Finland September 12 2005 Revised Papers 1st Edition Tero Harju Auth

About This Presentation

Slide Content

Tags

Categories

Download

Quick Actions

Statistics

Related Slideshows

Finitestate Methods And Natural Language Processing 5th International Workshop Fsmnlp 2005 Helsinki Finland September 12 2005 Revised Papers 1st Edition Tero Harju Auth

About This Presentation

Slide Content

Slide 1

Slide 2

Slide 3

Slide 5

Slide 6

Slide 7

Slide 8

Slide 9

Slide 10

Slide 11

Slide 12

Slide 13

Slide 14

Slide 15

Slide 16

Slide 17

Slide 18

Slide 19

Slide 20

Slide 21

Slide 22

Slide 23

Slide 24

Slide 25

Slide 26

Slide 27

Slide 28

Slide 29

Slide 30

Slide 31

Slide 32

Slide 33

Slide 34

Slide 35

Slide 36

Slide 37

Slide 38

Slide 39

Slide 40

Slide 41

Slide 42

Slide 43

Slide 44

Slide 45

Slide 46

Slide 47

Slide 48

Slide 49

Slide 50

Slide 51

Slide 52

Slide 53

Slide 54

Slide 55

Slide 56

Slide 57

Slide 58

Slide 59

Slide 60

Slide 61

Slide 62

Slide 63

Slide 64

Slide 65

Slide 66

Slide 67

Slide 68

Slide 69

Slide 70

Slide 71

Slide 72

Slide 73

Slide 74

Slide 75

Slide 76

Slide 77

Slide 78