尾上康雅 / Yasumasa Onoe (The University of Texas at Austin)
[Website] Yasumasa Onoe is a PhD student in Computer Science at the University of Texas at Austin, advised by Greg Durrett. His recent research includes entity analysis tasks such as fine-grained entity typing, entity linking (named entity disambiguation), and effective entity representations (e.g., box embeddings). Currently, he is interested in how to teach machines common sense about real-world entities.
Abstract
Neural entity typing models typically represent entity types as vectors in a high-dimensional space, but such spaces are not well-suited to modeling these types' complex interdependencies. We study the ability of box embeddings, which represent entity types as d-dimensional hyperrectangles, to represent hierarchies of fine-grained entity type labels even when these relationships are not defined explicitly in the ontology. Our model represents both types and entity mentions as boxes. Each mention and its context are fed into a BERT-based model to embed that mention in our box space; essentially, this model leverages typological clues present in the surface text to hypothesize a type representation for the mention. Soft box containment can then be used to derive probabilities, both the posterior probability of a mention exhibiting a given type and the conditional probability relations between types themselves. We compare our approach with a strong vector-based typing model, and observe state-of-the-art performance on several entity typing benchmarks. In addition to competitive typing performance, our box-based model shows better performance in prediction consistency (predicting a supertype and a subtype together) and confidence (i.e., calibration), implying that the box-based model captures the latent type hierarchies better than the vector-based model does.
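To make the abstract's "soft box containment" idea concrete, below is a minimal sketch (not the authors' implementation) of how containment probabilities can be derived from box volumes. It assumes a simple softplus-based soft volume, with boxes given by their min/max corners; the `beta` sharpness parameter and the example "person"/"artist" boxes are illustrative choices, not values from the paper.

```python
import numpy as np

def softplus(x, beta=10.0):
    # Smooth approximation of max(x, 0); keeps side lengths (and hence
    # volumes) positive and differentiable even for disjoint boxes.
    return np.log1p(np.exp(beta * x)) / beta

def soft_volume(box_min, box_max):
    # Soft volume of a d-dimensional hyperrectangle:
    # the product of softened side lengths along each dimension.
    return np.prod(softplus(box_max - box_min))

def containment_prob(inner_min, inner_max, outer_min, outer_max):
    # P(outer type | inner box) ~ Vol(inner ∩ outer) / Vol(inner).
    # The intersection of two boxes is itself a box.
    inter_min = np.maximum(inner_min, outer_min)
    inter_max = np.minimum(inner_max, outer_max)
    return soft_volume(inter_min, inter_max) / soft_volume(inner_min, inner_max)

# Hypothetical 2-d boxes: a broad "person" supertype box and a
# smaller "artist" subtype box nested inside it.
person_min, person_max = np.array([0.0, 0.0]), np.array([1.0, 1.0])
artist_min, artist_max = np.array([0.2, 0.2]), np.array([0.4, 0.4])

# A subtype box contained in a supertype box yields P(person | artist) = 1,
# while P(artist | person) is the (soft) volume ratio, which is much smaller.
p_super = containment_prob(artist_min, artist_max, person_min, person_max)
p_sub = containment_prob(person_min, person_max, artist_min, artist_max)
```

In this geometry, the asymmetry of containment directly encodes the hierarchy: the subtype implies the supertype with probability 1, but not conversely, which is the property the abstract credits for the model's improved prediction consistency.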
※ The talk will be given in Japanese.