Parsing with TAG and LFG - Lecture 5
Syntactic formalisms for natural language parsing
FI MU autumn 2011
Tree Adjoining Grammar (TAG) and Lexical Functional Grammar (LFG)
A) Same goal
● formal system to model human speech
● model the syntactic properties of natural language
● syntactic framework which aims to provide a computationally precise and psychologically realistic representation of language
B) Properties
● Unification-based
● Constraint-based
● Lexicalized grammar
C) Polynomial model
● Meta-grammar (LFG-TAG grammar: Owen, R., Clément, L. & Kinyon, A., 2003-2006)
How to parse a sentence in TAG?
by Joshi, A., Levy, L. and Takahashi, M. in 1975
TAG's basic components
● Representation structure: phrase-structure trees
● Finite set of elementary trees
● Two kinds of elementary trees
– Initial trees (α): trees that can be substituted
– Auxiliary trees (β): trees that can be adjoined
– Lexical trees (derived trees: δ): initial trees corresponding to arguments
● The trees in (X ∪ Z) are called elementary trees.
● An initial tree (α)
– all interior nodes are labeled with non-terminal symbols
– the nodes on the frontier of an initial tree are labeled either with terminal symbols or with non-terminal symbols marked for substitution (↓)
● An auxiliary tree (β)
– one of its frontier nodes must be marked as the foot node (*)
– the foot node must be labeled with a non-terminal symbol identical to the label of the root node
● A derived tree (γ)
– a tree built by composition of two other trees
– TAG uses two composition operations: adjoining and substitution
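The definitions of initial and auxiliary trees can be sketched in Python. This is a minimal illustration only: the Node class, the helper names and the two example trees are assumptions made for this sketch and do not come from any TAG toolkit. The check for auxiliary trees follows the usual convention of exactly one foot node.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    label: str                       # terminal word or non-terminal symbol
    children: List["Node"] = field(default_factory=list)
    terminal: bool = False           # True for word leaves
    subst: bool = False              # True if marked for substitution (↓)
    foot: bool = False               # True if marked as foot node (*)


def frontier(node):
    """Return the leaves of the tree rooted at node, left to right."""
    if not node.children:
        return [node]
    leaves = []
    for child in node.children:
        leaves.extend(frontier(child))
    return leaves


def is_initial(root):
    """Initial tree: every frontier node is a terminal or a substitution node."""
    return all(n.terminal or n.subst for n in frontier(root))


def is_auxiliary(root):
    """Auxiliary tree: one foot node on the frontier, labeled like the root."""
    leaves = frontier(root)
    feet = [n for n in leaves if n.foot]
    rest_ok = all(n.terminal or n.subst or n.foot for n in leaves)
    return rest_ok and len(feet) == 1 and feet[0].label == root.label


# Hypothetical initial tree: S -> NP↓ VP(V(likes) NP↓)
alpha_likes = Node("S", [
    Node("NP", subst=True),
    Node("VP", [Node("V", [Node("likes", terminal=True)]),
                Node("NP", subst=True)]),
])

# Hypothetical auxiliary tree: VP -> VP* Adv(passionately)
beta_passionately = Node("VP", [
    Node("VP", foot=True),
    Node("Adv", [Node("passionately", terminal=True)]),
])

print(is_initial(alpha_likes), is_auxiliary(beta_passionately))
```

Keeping the substitution and foot markers as flags on the node, rather than as separate node classes, keeps both checks to a single pass over the frontier.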
Main operations of combination (1): adjunction
● Sentences of the language of a TAG are derived by composing an α with any number of β using this operation.
– It inserts a complete structure at an interior node of another complete structure.
● Three constraints are possible
– Null adjunction (NA)
– Obligatory adjunction (OA)
– Selective adjunction (SA)
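Adjunction can be illustrated with a short Python sketch. The Node class, the function names and the example trees are hypothetical and only serve this illustration; adjoining constraints are not checked here.

```python
import copy


class Node:
    def __init__(self, label, children=None, foot=False):
        self.label = label
        self.children = children or []
        self.foot = foot          # True for the foot node (*) of an auxiliary tree


def find_foot(node):
    """Return the foot node of an auxiliary tree, or None if there is none."""
    if node.foot:
        return node
    for child in node.children:
        found = find_foot(child)
        if found is not None:
            return found
    return None


def adjoin(target, aux_root):
    """Adjoin a copy of the auxiliary tree at the node `target` (in place)."""
    if target.label != aux_root.label:
        raise ValueError("auxiliary root label must match the adjunction site")
    aux = copy.deepcopy(aux_root)
    foot = find_foot(aux)
    if foot is None:
        raise ValueError("auxiliary tree has no foot node")
    # The subtree below the adjunction site moves under the foot node ...
    foot.children = target.children
    foot.foot = False
    # ... and the auxiliary tree takes the place of the adjunction site.
    target.children = aux.children


def words(node):
    """Yield of the tree: leaf labels, left to right."""
    if not node.children:
        return [node.label]
    return [w for child in node.children for w in words(child)]


vp = Node("VP", [Node("V", [Node("likes")]), Node("NP", [Node("peanuts")])])
s = Node("S", [Node("NP", [Node("Harry")]), vp])
beta = Node("VP", [Node("VP", foot=True), Node("Adv", [Node("passionately")])])

adjoin(vp, beta)
print(" ".join(words(s)))   # Harry likes peanuts passionately
```

The auxiliary tree is deep-copied so that the same elementary tree can be adjoined at several sites without sharing nodes.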
Main operations of combination (2): substitution
● It inserts an initial tree or a lexical tree into an elementary tree.
● One constraint is possible
– Selectional substitution
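Substitution can be sketched in the same spirit. The Node class and the three example trees are assumptions for illustration; the only constraint checked is that the label of the substitution node matches the root label of the inserted tree.

```python
import copy


class Node:
    def __init__(self, label, children=None, subst=False):
        self.label = label
        self.children = children or []
        self.subst = subst        # True if the node is marked for substitution (↓)


def substitution_sites(node):
    """Collect the frontier nodes marked for substitution, left to right."""
    if node.subst and not node.children:
        return [node]
    return [s for child in node.children for s in substitution_sites(child)]


def substitute(site, initial_root):
    """Replace the substitution node `site` by a copy of an initial tree."""
    if not site.subst:
        raise ValueError("node is not marked for substitution")
    if site.label != initial_root.label:
        raise ValueError("root label must match the substitution node")
    tree = copy.deepcopy(initial_root)
    site.children = tree.children
    site.subst = False


def words(node):
    """Yield of the tree: leaf labels, left to right."""
    if not node.children:
        return [node.label]
    return [w for child in node.children for w in words(child)]


alpha_likes = Node("S", [
    Node("NP", subst=True),
    Node("VP", [Node("V", [Node("likes")]), Node("NP", subst=True)]),
])
alpha_harry = Node("NP", [Node("Harry")])
alpha_peanuts = Node("NP", [Node("peanuts")])

subject, obj = substitution_sites(alpha_likes)
substitute(subject, alpha_harry)
substitute(obj, alpha_peanuts)
print(" ".join(words(alpha_likes)))   # Harry likes peanuts
```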
Adjoining constraints
● Selective Adjunction (SA(T)):
only members of a set T ⊆ A can be adjoined on the given node, but the adjunction is not mandatory
● Null Adjunction (NA):
any adjunction is disallowed for the given node (NA = SA(∅))
● Obligatory Adjunction (OA(T)):
an auxiliary tree that is a member of the set T ⊆ A must be adjoined on the given node
for short, OA = OA(A)
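The three constraints can be modeled as one small data type. The sketch below is an assumption-laden illustration: the Constraint class, the helpers and the tree names beta1 and beta2 are hypothetical, and auxiliary trees are identified by name only.

```python
from dataclasses import dataclass
from typing import FrozenSet, Optional


@dataclass(frozen=True)
class Constraint:
    allowed: Optional[FrozenSet[str]]   # None means: any auxiliary tree in A
    obligatory: bool = False


def SA(trees):
    """Selective adjunction: only trees in T may adjoin, none has to."""
    return Constraint(frozenset(trees))


NA = SA([])                             # null adjunction: NA = SA(∅)


def OA(trees=None):
    """Obligatory adjunction: some tree in T must adjoin; OA() means OA(A)."""
    allowed = None if trees is None else frozenset(trees)
    return Constraint(allowed, obligatory=True)


def may_adjoin(constraint, tree_name):
    """Is this auxiliary tree permitted at a node with this constraint?"""
    return constraint.allowed is None or tree_name in constraint.allowed


def satisfied(constraint, adjoined):
    """Check a node given the names of the trees actually adjoined there."""
    if any(not may_adjoin(constraint, t) for t in adjoined):
        return False
    return bool(adjoined) or not constraint.obligatory


print(may_adjoin(NA, "beta1"))             # False
print(satisfied(SA({"beta1"}), []))        # True
print(satisfied(OA({"beta1"}), []))        # False
print(satisfied(OA(), ["beta2"]))          # True
```

Writing NA as SA with the empty set mirrors the identity NA = SA(∅) directly, so only one permission check is needed.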
Example 1: selective adjunction (SA)
● One possible analysis of “send” could involve selective adjunction:
[Figure: trees for “send”, “send away”, “send to”, “send something”]
Example 2: obligatory adjunction
● For when you absolutely must have adjunction at a node:
[Figure: trees for “has”, “is”, “has seen”, “is seen”]
Elementary trees (initial trees and auxiliary trees)
Yesterday a man saw Mary
*: foot node/root node
↓: substitution node
Derivation tree
– Specifies how a derived tree was constructed
– The root node is labeled by an S-type initial tree.
– Other nodes are labeled by auxiliary trees in the case of adjoining or initial trees in the case of substitution.
– A tree address of the parent tree is associated with each node.
● Derivation tree and derived tree α5
[Figure legend, symbols lost in extraction: substitution operation, adjunction operation]
Example 1: Harry likes peanuts passionately
Step 1: elementary trees (α1), (α2), (α3), (β1)
Step 2: substitution
Step 3: adjunction
Derivation tree and derived tree of Harry likes peanuts passionately
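The derivation of “Harry likes peanuts passionately” can be replayed in a few lines of Python. Everything here is a sketch under assumptions: the Node class and helpers are hypothetical, the assignment of the names alpha1, alpha2, alpha3 and beta1 to particular trees is a guess (the original figure is not available), and the addresses in the derivation record assume Gorn-style numbering within alpha1.

```python
import copy


class Node:
    def __init__(self, label, children=None, subst=False, foot=False):
        self.label = label
        self.children = children or []
        self.subst = subst       # marked for substitution (↓)
        self.foot = foot         # foot node (*) of an auxiliary tree


def find(node, pred):
    """Return the first node in pre-order that satisfies pred, else None."""
    if pred(node):
        return node
    for child in node.children:
        hit = find(child, pred)
        if hit is not None:
            return hit
    return None


def substitute(tree, label, initial):
    """Fill the leftmost open substitution node carrying the given label."""
    site = find(tree, lambda n: n.subst and n.label == label)
    new = copy.deepcopy(initial)
    site.children, site.subst = new.children, False


def adjoin(tree, label, aux):
    """Adjoin aux at the first interior node carrying the given label."""
    site = find(tree, lambda n: n.label == label and bool(n.children))
    new = copy.deepcopy(aux)
    foot = find(new, lambda n: n.foot)
    foot.children, foot.foot = site.children, False
    site.children = new.children


def words(node):
    """Yield of the tree: leaf labels, left to right."""
    if not node.children:
        return [node.label]
    return [w for child in node.children for w in words(child)]


alpha1 = Node("S", [Node("NP", subst=True),
                    Node("VP", [Node("V", [Node("likes")]),
                                Node("NP", subst=True)])])
alpha2 = Node("NP", [Node("Harry")])
alpha3 = Node("NP", [Node("peanuts")])
beta1 = Node("VP", [Node("VP", foot=True), Node("Adv", [Node("passionately")])])

derived = copy.deepcopy(alpha1)           # Step 1: start from the S-type tree
substitute(derived, "NP", alpha2)         # Step 2: substitution (subject)
substitute(derived, "NP", alpha3)         # Step 2: substitution (object)
adjoin(derived, "VP", beta1)              # Step 3: adjunction at VP

# Derivation tree: root tree plus (child tree, address in alpha1, operation).
derivation = ("alpha1", [("alpha2", "1", "substitution"),
                         ("alpha3", "2.2", "substitution"),
                         ("beta1", "2", "adjunction")])

print(" ".join(words(derived)))   # Harry likes peanuts passionately
```

The derived tree records the resulting phrase structure, while the derivation record keeps only which elementary tree was attached where and by which operation.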
Two important properties of TAG
● Elementary trees can be of arbitrary size, so the domain of locality is increased
– Extended domain of locality (EDL)
● Small initial trees can have multiple adjunctions inserted within them, so what are normally considered non-local phenomena are treated locally
– Factoring recursion from the domain of dependency (FRD)
Extended domain of locality (EDL): Agreement
● The lexical entry for a verb like “loves” will contain a tree like the following:
With EDL, we can easily state agreement between the subject and the verb in a lexical entry
Factoring recursion from the domain of dependency (FRD): Extraction
The above trees for the sentence “who did John tell Sam that Bill likes?” allow the insertion of the auxiliary tree between the WH-phrase and its extraction site, resulting in a long-distance dependency; yet this is factored out from the domain of locality in TAG.
Variations of TAG
● Feature Structure Based TAG (FTAG: Vijay-Shanker and Joshi, 1988)
– Each node of an elementary tree is associated with two feature structures: top and bottom
– Substitution with features
– Adjoining with features
● Synchronous TAG (STAG: Shieber and Schabes, 1990)
– A pair of TAGs characterizes correspondences between languages
– Semantic interpretation, language generation and translation
● Multi-component TAG (MCTAG: Chen-Main and Joshi, 2007)
– A set of auxiliary trees can be adjoined to a given elementary tree
● Probabilistic TAG (PTAG: Resnik, 1992; Shieber, 2007)
– Associating a probability with each elementary tree
– Compute the probability of a derivation
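The last point can be illustrated with a deliberately simplified Python sketch. It assumes one probability per elementary tree and treats the tree choices in a derivation as independent, so the probability of a derivation is a plain product; actual PTAG models are more refined than this. The tree names and all numbers are made up for the example.

```python
import math

# Hypothetical probabilities of elementary trees (not taken from any corpus).
tree_prob = {"alpha1": 0.2, "alpha2": 0.1, "alpha3": 0.05, "beta1": 0.3}


def derivation_probability(trees):
    """Product of the probabilities of the elementary trees in a derivation."""
    return math.prod(tree_prob[t] for t in trees)


def derivation_log_probability(trees):
    """Same quantity in log space, which avoids underflow on long derivations."""
    return sum(math.log(tree_prob[t]) for t in trees)


derivation = ["alpha1", "alpha2", "alpha3", "beta1"]
p = derivation_probability(derivation)
print(p)
print(math.isclose(math.exp(derivation_log_probability(derivation)), p))
```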
XTAG Project (UPenn, ongoing since 1987)
➢ A long-term project to develop a wide-coverage grammar for English using the Lexicalized Tree-Adjoining Grammar (LTAG) formalism
➢ Provides a grammar engineering platform consisting of a parser, a grammar development interface, and a morphological analyzer
➢ The project extends to variants of the formalism, and languages other than English
XTAG system
Components in XTAG system
➢ Morphological Analyzer & Morph DB: 317K inflected items derived from over 90K stems
➢ POS Tagger & Lex Prob DB: Wall Street Journal-trained 3-gram tagger with N-best POS sequences
➢ Syntactic DB: over 30K entries, each consisting of:
✗ Uninflected form of the word
✗ POS
✗ List of trees or tree-families associated with the word
✗ List of feature equations
➢ Tree DB: 1004 trees, divided into 53 tree families and 221 individual trees
(a) Morphology database (b) syntactic database
Interfaces to the database maintenance tools
Interface to the XTAG system
➢ Parser evaluation in XTAG Project by [Bangalore, S. et al., 1998]
➢ http://www.cis.upenn.edu/~xtag/
How to parse a sentence in LFG?
by Bresnan, J. and Kaplan, R.M. in 1982
Main representation structures
● c-structure: constituent structure
the level where the surface syntactic form, including categorical information, word order and phrasal grouping of constituents, is encoded.
● f-structure: functional structure
the internal structure of language where grammatical relations are represented. It is largely invariant across languages. (e.g. SUBJ, OBJ, OBL, (X)COMP, (X)ADJ)
● a-structure: argument structure
it encodes the number, type and semantic roles of the arguments of a predicate.
Level of structures and their interaction in LFG
● In LFG, the parsing result is grammatically correct only if it satisfies two criteria:
1) the grammar must be able to assign a correct c-structure
2) the grammar must be able to assign a correct, well-formed f-structure
c-structure
➢ The constituent structure represents the organization of overt phrasal syntax
➢ It provides the basis for phonological interpretation
➢ Languages differ greatly at the c-structure level: it reflects external factors that usually vary by language
Properties of c-structure
➔ c-structures are conventional phrase structure trees: they are defined in terms of syntactic categories, terminal nodes, dominance and precedence.
➔ They are determined by a context-free grammar that describes all possible surface strings of the language.
➔ LFG does not reserve constituent structure positions for affixes: all leaves are individual words.
f-structure
● Attribute-Value notation for f-structure
1) it represents the functional structure of a sentence
2) the f-structure must match the c-structure
3) it has to satisfy three formal constraints: consistency, coherence, completeness
4) languages are similar at this level, which allows cross-linguistic properties of phenomena to be explained
Examples of f-structure
Constraint 1: f-structure must be consistent
1) Two paths in the graph structure may designate the same element
- this is called unification or structure-sharing
Ex: John must leave
2) Attributes are functionally unique
- there may not be two arcs with the same attribute from the same f-structure
3) The symbols used for atomic f-structures are distinct
- it is impossible to have two names for a single atomic f-structure (“clash”)
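The consistency condition can be illustrated with a small unification routine over f-structures written as nested Python dicts. This is a sketch under assumptions: the Clash exception and the unify function are hypothetical names, atomic values are plain strings, and structure-sharing between two paths is not modeled (shared values are simply merged into one result).

```python
class Clash(Exception):
    """Raised when two different atomic values meet under one attribute."""


def unify(f, g):
    """Unify two f-structures (nested dicts with atomic string values)."""
    if isinstance(f, dict) and isinstance(g, dict):
        result = dict(f)
        for attr, value in g.items():
            if attr in result:
                # Functional uniqueness: one value per attribute, so the two
                # candidate values must themselves unify.
                result[attr] = unify(result[attr], value)
            else:
                result[attr] = value
        return result
    if f == g:
        return f
    raise Clash(f"{f!r} vs {g!r}")


subject = {"PRED": "John", "NUM": "SG"}
print(unify(subject, {"NUM": "SG", "PERS": "3"}))

try:
    unify(subject, {"NUM": "PL"})
except Clash as err:
    print("clash:", err)
```

Because each attribute keeps a single value and distinct atoms never unify, an inconsistent description fails at the first conflicting attribute instead of producing an ill-formed f-structure.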
Constraint 2: f-structure must be coherent
All argument functions in an f-structure must be selected by the local PRED feature.
Constraint 3: f-structure must be complete
All functions specified in the value of a PRED feature must be present in the f-structure of that PRED.
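Coherence and completeness can both be checked by comparing the functions listed in a PRED value with the argument functions actually present. The sketch below assumes f-structures as nested dicts, PRED values written in the common 'like<SUBJ,OBJ>' notation, and a hypothetical set GOVERNABLE of argument functions; adjuncts are deliberately left out of that set.

```python
import re

# Assumed inventory of argument (governable) functions for this sketch.
GOVERNABLE = {"SUBJ", "OBJ", "OBL", "COMP", "XCOMP"}


def selected_functions(pred):
    """'like<SUBJ,OBJ>' -> {'SUBJ', 'OBJ'}; a PRED without a list -> set()."""
    match = re.search(r"<(.*)>", pred)
    if not match:
        return set()
    return {name.strip() for name in match.group(1).split(",") if name.strip()}


def is_complete(f):
    """Every function selected by PRED is present, recursively."""
    ok = selected_functions(f.get("PRED", "")) <= set(f)
    return ok and all(is_complete(v) for v in f.values() if isinstance(v, dict))


def is_coherent(f):
    """Every argument function present is selected by the local PRED."""
    present = GOVERNABLE & set(f)
    ok = present <= selected_functions(f.get("PRED", ""))
    return ok and all(is_coherent(v) for v in f.values() if isinstance(v, dict))


good = {"PRED": "like<SUBJ,OBJ>", "TENSE": "PRES",
        "SUBJ": {"PRED": "John"}, "OBJ": {"PRED": "Mary"}}
missing_obj = {"PRED": "like<SUBJ,OBJ>", "SUBJ": {"PRED": "John"}}
extra_obj = {"PRED": "sleep<SUBJ>",
             "SUBJ": {"PRED": "John"}, "OBJ": {"PRED": "Mary"}}

print(is_complete(good), is_coherent(good))                  # True True
print(is_complete(missing_obj), is_coherent(missing_obj))    # False True
print(is_complete(extra_obj), is_coherent(extra_obj))        # True False
```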
Correspondence between different levels in LFG
Structural correspondence
➢ c-structures and f-structures represent different properties of an utterance
➢ How can these structures be associated properly with a particular sentence?
➢ Words and their ordering carry information about the linguistic dependencies in the sentence
➢ This is represented by the c-structure (licensed by a CFG)
➢ LFG proposes simple mechanisms that map between elements of one structure and those of another: correspondence functions
➢ A function φ maps c-structure nodes to f-structures
φ: N → F
Mapping the c-structure into the f-structure
● Since there is no isomorphic relationship between structure and function, LFG assumes two separate levels: c-structure and f-structure
● The mapping between c-structure and f-structure is the core of LFG's descriptive power
● The mapping between c-structure and f-structure is located in the grammar (PS) rules
Mapping mechanism: 6 steps
STEP 1: PS rules
➢ Context-free phrase structure rules
➢ Annotated with functional schemata
STEP 2: Lexicon entries
➢ Lexicon entries consist of three parts: representation of the word, syntactic category, list of functional schemata
STEP 3: c-structure
➢ As in the PS rules, each node in the tree is associated with functional schemata
➢ With the functional schemata of the lexical entries at the leaves we obtain a complete c-structure
STEP 4: Co-indexation
➢ An f-structure is assigned to each node of the c-structure
➢ Each of these f-structures obtains a name (f1 – fn)
➢ Nodes in the c-structure and their associated f-structures are co-indexed, i.e. obtain the same name
➢ F-structure names f1 – fn can be chosen freely, but no name may occur twice
STEP 5: Metavariable binding
➢ All metavariables are replaced by the names of the corresponding f-structures
➢ We introduce at this point the notion of functional equation
➢ By listing all functional equations from a c-structure we obtain the functional description, called f-description
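Metavariable binding and the collection of the f-description can be sketched as follows. The sketch uses the standard LFG reading of the metavariables (↑ is the f-structure of the mother node, ↓ the f-structure of the node itself). The tuple encoding of the co-indexed c-structure, the function name and the toy tree for “John sleeps” are assumptions; for brevity the word leaves hang directly under NP and VP.

```python
def instantiate(node, mother=None):
    """Replace ↑ and ↓ in the schemata of a co-indexed c-structure.

    node = (f-structure name, list of functional schemata, list of children)
    Returns the f-description as a list of functional equations.
    """
    name, schemata, children = node
    equations = []
    for schema in schemata:
        equation = schema.replace("↓", name)      # ↓ = this node
        if mother is not None:
            equation = equation.replace("↑", mother)   # ↑ = mother node
        equations.append(equation)
    for child in children:
        equations.extend(instantiate(child, name))
    return equations


# Hypothetical co-indexed c-structure for "John sleeps".
tree = ("f1", [], [
    ("f2", ["(↑ SUBJ)=↓"], [
        ("f3", ["(↑ PRED)='John'", "(↑ NUM)=SG"], []),
    ]),
    ("f4", ["↑=↓"], [
        ("f5", ["(↑ PRED)='sleep<SUBJ>'", "(↑ TENSE)=PRES"], []),
    ]),
])

f_description = instantiate(tree)
for equation in f_description:
    print(equation)
```

For this tree the f-description contains one equation linking f1 to f2 through SUBJ, one equating f1 with f4, and four equations assigning atomic values.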
STEP 6: From f-description to f-structure
➢ Computation of an f-structure is based on the f-description
➢ In deriving f-structures from the f-description, it is important that no information is lost and that no information is added
➢ The derivation is done by applying the functional equations
List of functional equations
a) Simple equations of the form: (fn A) = B
b) f-equations of the form: fn = fm
c) f-equations of the form: (fn A) = fm
→ Functional equations with the same name are grouped into an f-structure of the same name
Application of the functional equation (a): (fn A) = B
Application of the functional equation (b): fn = fm
Application of the functional equation (c): (fn A) = fm
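The three kinds of equations can be applied mechanically. The following Python sketch is one possible way to do it, not the procedure of any particular LFG system: names equated by type (b) are merged with a small union-find, types (a) and (c) add attributes, and conflicting atomic values raise an error. The textual equation format and the solve function are assumptions; cyclic f-structures are not handled and shared values are copied when the result is built.

```python
import re


def solve(equations, root="f1"):
    """Build the f-structure named `root` from a list of functional equations."""
    parent, attrs = {}, {}

    def find(x):
        """Representative of the set of names equated with x."""
        if x not in parent:
            parent[x], attrs[x] = x, {}
        while parent[x] != x:
            x = parent[x]
        return x

    def assign(name, attr, value):
        slot = attrs[find(name)]
        if attr not in slot:
            slot[attr] = value
        elif isinstance(slot[attr], tuple) and isinstance(value, tuple):
            union(slot[attr][1], value[1])    # two f-structures under one attribute
        elif slot[attr] != value:
            raise ValueError(f"clash on {attr}: {slot[attr]} vs {value}")

    def union(a, b):
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[rb] = ra
            for attr, value in attrs.pop(rb).items():
                assign(ra, attr, value)

    for equation in equations:
        match = re.fullmatch(r"\((f\d+) (\w+)\)=(.+)", equation)
        if match:
            name, attr, rhs = match.groups()
            if re.fullmatch(r"f\d+", rhs):
                find(rhs)
                assign(name, attr, ("ref", rhs))    # type (c): (fn A) = fm
            else:
                assign(name, attr, rhs)             # type (a): (fn A) = B
        else:
            left, right = equation.split("=")
            union(left, right)                      # type (b): fn = fm

    def build(name):
        return {attr: build(value[1]) if isinstance(value, tuple) else value
                for attr, value in attrs[find(name)].items()}

    return build(root)


f_description = ["(f1 SUBJ)=f2", "(f2 PRED)='John'", "(f2 NUM)=SG",
                 "f1=f4", "(f4 PRED)='sleep<SUBJ>'", "(f4 TENSE)=PRES"]
print(solve(f_description))
```

In this example f1 and f4 end up as one f-structure, so the PRED and TENSE contributed by the verb appear next to the SUBJ contributed by the NP.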
Parsing an input sentence in LFG
STEP 1: lexical entries
STEP 2: c-structure
STEP 3: f-structure
STEP 4: unification