My Account Log in

2 options

From Discourse Structure to Text Specificity: Studies of Coherence Preferences / Junyi Jessy Li.

Connect to full text Available online

View online

Dissertations & Theses @ University of Pennsylvania Available online

View online
Format:
Book
Thesis/Dissertation
Author/Creator:
Li, Junyi Jessy, author.
Contributor:
Marcus, Mitchell P., degree supervisor.
Nenkova, Ani, degree supervisor.
Webber, Bonnie, degree committee member.
Liberman, Mark, degree committee member.
Eisenstein, Jacob, degree committee member.
Carpuat, Marine, degree committee member.
University of Pennsylvania. Computer and Information Science, degree granting institution.
Language:
English
Subjects (All):
Artificial intelligence.
Computer and Information Science--Penn dissertations.
Penn dissertations--Computer and Information Science.
Local Subjects:
Artificial intelligence.
Computer and Information Science--Penn dissertations.
Penn dissertations--Computer and Information Science.
Genre:
Academic theses.
Physical Description:
1 online resource (154 pages)
Contained In:
Dissertation Abstracts International 79-01B(E).
Place of Publication:
[Philadelphia, Pennsylvania]: University of Pennsylvania ; Ann Arbor : ProQuest Dissertations & Theses, 2017.
Language Note:
English
System Details:
Mode of access: World Wide Web.
text file
Summary:
To successfully communicate through text, a writer needs to organize information into an understandable and well-structured discourse for the targeted audience. This involves deciding when to convey general statements, when to elaborate on details, and gauging how much details to convey, i.e., the level of specificity. This thesis explores the automatic prediction of text specificity, and whether the perception of specificity varies across different audiences.
We characterize text specificity from two aspects: the instantiation discourse relation, and the specificity of sentences and words. We identify characteristics of instantiation that signify a change of specificity between sentences. Features derived from these characteristics substantially improve the detection of the relation. Using instantiation sentences as the basis for training, we propose a semi-supervised system to predict sentence specificity with speed and accuracy. Furthermore, we present insights into the effect of underspecified words and phrases on the comprehension of text, and the prediction of such words.
We show distinct preferences in specificity and discourse structure among different audiences. We investigate these distinctions in both cross-lingual and monolingual context. Cross-lingually, we identify discourse factors that significantly impact the quality of text translated from Chinese to English. Notably, a large portion of Chinese sentences are significantly more specific and need to be translated into multiple English sentences. We introduce a system using rich syntactic features to accurately detect such sentences. We also show that simplified text is more general, and that specific sentences are more likely to need simplification. Finally, we present evidence that the perception of sentence specificity differs among male and female readers.
Notes:
Source: Dissertation Abstracts International, Volume: 79-01(E), Section: B.
Advisors: Ani Nenkova; Mitchell P. Marcus; Committee members: Marine Carpuat; Jacob Eisenstein; Mark Liberman; Bonnie Webber.
Department: Computer and Information Science.
Ph.D. University of Pennsylvania 2017.
Local Notes:
School code: 0175
ISBN:
9780355183580
Access Restriction:
Restricted for use by site license.

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

Find

Home Release notes

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Find catalog Using Articles+ Using your account