... Hierarchical Pitman-Yor Process HMM
for Unsupervised Part of Speech Induction
Phil Blunsom
Department of Computer Science
University of Oxford
Phil.Blunsom@cs.ox.ac.uk
Trevor Cohn
Department of Computer ... translation systems.
The HMM ignores orthographic information,
which is often highly indicative of a word’s part-of-speech,
particularly so in morphologica...
... 2: A selection of extracted rules, with ranks
after filtering for the development set. All have X for
their left-hand sides.
5.2 Hierarchical model
We ran the training process of Section 3 on ... represent D as a set of triples
⟨r, i, j⟩, each of which stands for an application of
a grammar rule r to rewrite a nonterminal that spans
$f(D)_i^j$ on the French side.
Then the w...
... intersection of the sets that formed the input.
So it seems natural to embed a partial order of
types X, into a partial order (in fact, a lattice)
of sets Y, , where Y is the power set of some
set ... second partial order, we use for its order
relation and for its join operation. We are especially
interested in a class of partial orders called
meet semilattices, i...
... consists of a sequence of strata, each stratum being
defined by a set of regular-expression patterns
for recognizing phrases. [...] The
output of stratum 0 consists of parts of
speech. ... semantics and part-of-speech used
in the input, as well as the head of each phrase
and the grammatical functions: TIME, SUBJ(ect)
and P-OBJ(ect).
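The stratum-by-stratum recognition described above can be illustrated with a minimal sketch. It assumes that each stratum rewrites the symbol sequence produced by the stratum below it, with stratum 0 being the part-of-speech tags. The patterns, the labels NP and PP, and the names `STRATA`, `run_stratum`, and `cascade` are hypothetical and are not taken from the source.

```python
import re

# Hypothetical patterns, one list per stratum above stratum 0.
# Each pattern matches a run of space-terminated symbols; the lookbehind
# (?<!\S) forces a match to start at a symbol boundary.
STRATA = [
    [("NP", re.compile(r"(?<!\S)(?:DT )?(?:JJ )*(?:NN )+"))],
    [("PP", re.compile(r"(?<!\S)IN NP "))],
]


def run_stratum(symbols, patterns):
    """Rewrite every pattern match in the symbol sequence as its phrase label."""
    text = "".join(symbol + " " for symbol in symbols)
    for label, pattern in patterns:
        text = pattern.sub(label + " ", text)
    return text.split()


def cascade(pos_tags):
    """Return the symbol sequence at every stratum; index 0 is the POS tags."""
    levels = [list(pos_tags)]
    for patterns in STRATA:
        levels.append(run_stratum(levels[-1], patterns))
    return levels


if __name__ == "__main__":
    for depth, level in enumerate(cascade(["DT", "JJ", "NN", "VBD", "IN", "DT", "NN"])):
        print(depth, level)
```

On the tag sequence in the usage example, the first stratum groups the two noun phrases and the second combines the preposition with the following NP, giving `['NP', 'VBD', 'PP']` at the top level. The sketch only tracks category labels; it does not record heads or grammatical functions.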
4 Evaluation
The perform...
... encoded in the distribution of
entities in document P(e), the distribution of
possible names of a specific entity P(s|e), and
the distribution of possible contexts of a
specific entity P(c|e). ... Association for Computational Linguistics
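As a rough illustration of how the three distributions P(e), P(s|e), and P(c|e) might be used together, the sketch below scores each candidate entity by their product. It assumes that the distributions combine multiplicatively and that context words are independent given the entity. The probability tables, the entity names, the smoothing floor, and the functions `score` and `link` are all hypothetical.

```python
import math

# Hypothetical toy distributions, invented for illustration only.
P_E = {"Michael_Jordan_(basketball)": 0.6, "Michael_I._Jordan": 0.4}
P_S_GIVEN_E = {
    "Michael_Jordan_(basketball)": {"Jordan": 0.5, "Michael Jordan": 0.5},
    "Michael_I._Jordan": {"Jordan": 0.5, "Michael I. Jordan": 0.5},
}
P_C_GIVEN_E = {
    "Michael_Jordan_(basketball)": {"basketball": 0.3, "learning": 0.01},
    "Michael_I._Jordan": {"basketball": 0.01, "learning": 0.3},
}
FLOOR = 1e-6  # assumed smoothing value for unseen names and context words


def score(entity, name, context):
    """Log of P(e) * P(s|e) * prod_w P(w|e) under the stated assumptions."""
    log_p = math.log(P_E.get(entity, FLOOR))
    log_p += math.log(P_S_GIVEN_E.get(entity, {}).get(name, FLOOR))
    for word in context:
        log_p += math.log(P_C_GIVEN_E.get(entity, {}).get(word, FLOOR))
    return log_p


def link(name, context):
    """Return the candidate entity with the highest score for this mention."""
    return max(P_E, key=lambda entity: score(entity, name, context))


if __name__ == "__main__":
    print(link("Jordan", ["learning"]))
    print(link("Jordan", ["basketball"]))
```

With these toy numbers the same name "Jordan" links to different entities depending on the context word, which is the behavior the three distributions are meant to capture.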
A Generative Entity-Mention Model for Linking Entities with
Knowledge Base
Xianpei Han Le Sun
Institute of Software, Chinese Academy...
... providing for a
to be on the stack when b is processed.
(2) A2. Some of the executives also signed letters on
behalf of the Clinton program.
B2. Nearly all of them praised the president for
his efforts ... A Hierarchical Account of Referential Accessibility
Nancy IDE
Department of Computer Science
Vassar College
Poughkeepsie, New York 12604-0520 USA
ide@cs.vassar.edu
Dan CRIS...
... manually created document ontology to
model the content of an underlying document collection.
While the primary usage of ontologies is
as a means of organizing and navigating document
collections, ... significant
amount of information about the documents
attached to them, including path-level, statistical,
representations of content, and fine-grained views
on the level of speci...
... observed, most of
them rarely. In particular, for a vocabulary of unbounded
size and for d > 0, the number of unique
words scales as $O(\theta T^d)$, where T is the total number
of words. For d = 0, ... levels of the model.
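The power-law growth in the number of unique words for d > 0 can be checked empirically with a small simulation. The sketch below assumes the standard Chinese restaurant process view of the Pitman-Yor process, in which each new table corresponds to a new word type drawn from an unbounded vocabulary, so that after n words with K types the next word is a new type with probability (θ + dK)/(θ + n). The function name `simulate_unique_words` and the parameter values in the usage example are hypothetical.

```python
import random


def simulate_unique_words(total_words, discount, strength, seed=0):
    """Count word types after drawing total_words tokens.

    Assumes 0 <= discount < 1 and strength > 0. Only the number of types
    is tracked, since that is all the new-type probability depends on.
    """
    rng = random.Random(seed)
    unique = 0
    for seen in range(total_words):
        # Probability that the next token opens a new table (a new word type).
        p_new = (strength + discount * unique) / (strength + seen)
        if rng.random() < p_new:
            unique += 1
    return unique


if __name__ == "__main__":
    for discount in (0.0, 0.5, 0.8):
        counts = [simulate_unique_words(t, discount, strength=1.0)
                  for t in (1_000, 10_000, 100_000)]
        print(f"d={discount}: {counts}")
```

Under these assumptions the counts for d > 0 should grow roughly in proportion to T^d, while the counts for d = 0 grow much more slowly with T.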
4 Hierarchical Chinese Restaurant Processes
We describe a generative procedure analogous
to the Chinese restaurant process of Section 2
for drawing...
... Previous
work mainly focused on the selection of
either the source side of a hierarchical rule
or the target side of a hierarchical rule
rather than considering both of them simultaneously.
This paper ... function words,
part-of-speech (POS) tags, syntactic structure information,
and so on. Our model can be easily incorporated
as an independent feature into the practi...
... shows a
hierarchical TC system has achieved a micro-averaged
$F_1$ value of 86.6, which is comparable
to the performance of state-of-the-art flat
classification systems.
1 Introduction
The task of ... F4 has lowered the performance of the system.
However, adding F3 and F4 to the bag-of-words feature
set has improved the performance of both systems.
Finally, the best performan...